top of page
  • Writer's pictureMayowa Taiwo

On-Prem ETL Development using SSIS: Data warehouse Development from OLTP Sources to OLAP Cubes

Updated: Nov 13, 2023


SSIS PACKAGES AS REUSABLE PACKAGES ON SQL SERVER



Background Statement

The modern business landscape places immense value on data and business intelligence, leading to the increasing adoption of tools like Power BI. To gain executive support for such endeavors, demonstrating the practical benefits of migration projects is crucial. This case study explores the creation of a Technical Proof of Concept (PoC) aimed at showcasing how data-driven transformations can align with the organization's goal of data-driven decision-making.


Introduction

Businesses today are in a constant quest for insights that can inform their strategies and improve decision-making. Leveraging data and analytics tools like Power BI is essential in this pursuit. In this project, the Northwind Database serves as a practical demonstration of the efficacy of an automated data pipeline using SQL Server Integration Services (SSIS). The focus is on seamless data integration and transformation, highlighting the significance of maintaining data integrity and coherence for analytical purposes.


Project Goal

The primary goal of this project is to create a Technical Proof of Concept (PoC) that demonstrates the value of data-driven transformation from an Operational Data Source (OLTP) to a Data Warehouse (OLAP). Key objectives include:

  • Showcasing the use of SSIS for data integration and transformation.

  • Highlighting the importance of data integrity and coherence.

  • Preparing for the transition to modern ETL tools like Azure Data Factory (ADF).

Methodology


Data Collection (Queries are given in this Link) Data from diverse sources is collected and extracted using SSIS. Pipelines are set up between source databases and destination Databases in SQL Server. SQL queries were used to create views through union of tables in preparation for Dimensional Modelling.


Data Pre-processing Given the cleanliness of the data, minimal data cleaning is required. Tables are transformed into Fact and Dimensions tables, ready for data modeling through Dimensional Modeling. Different Transformation were done on the Data including Sorting, data type changes etc.


Data Modelling Database tables are meticulously designed to maintain relationships between Fact and Dimensions tables, ensuring data suitability for analysis. One-to-many relationships are established from Dimensions table to fact tables. Fact tables was redone as a new view to include primary keys from each of the dimension tables to ensure proper linking of Fact and dimensions tables.


Data Analysis SSIS is utilized for creating an automated data pipeline that handles data extraction, transformation, and loading, emphasizing its importance in data integration.


Key Insights

Data Integration and Transformation The project effectively showcases the power of SQL Server Integration Services (SSIS) in creating a robust automated data pipeline, underscoring its significance in data integration and transformation.


Cloud Transition Readiness While SSIS remains a valuable tool, the project recognizes the shift towards cloud-based solutions like Azure Data Factory (ADF). The skills acquired in SSIS, including data extraction, transformation, and loading, provide a solid foundation for adapting to modern ETL tools.


Practical Application and use cases

The insights obtained from this case study have practical applications, including:

  • Demonstrating the feasibility of automated data pipelines for organizations considering similar data-driven transformations.

  • Preparing professionals to adapt to modern ETL tools like Azure Data Factory (ADF).

Summary and Conclusion

This case study exemplifies the value of data-driven transformation and the importance of data integration and transformation tools like SSIS. By demonstrating the transition from OLTP to OLAP, it equips organizations with insights and skills to make data-driven decisions and adapt to evolving data management landscapes.

Comentários


bottom of page