SlideShare una empresa de Scribd logo
1 de 13
Descargar para leer sin conexión
SetFocus
Business Intelligence –Master Program


  SQL SERVER INTEGRATION SERVICE




            PORJECT DOCUMENTATION



                         ON

  AllWorks, Inc. – a Ficticious Construction Company




                    Submitted By

                   Yubaraj Khanal
                      (May,2009)


                                                       0
Table of Contents


1.      Introduction: ....................................................................................................................................... 2
2.      Project Requirements: ........................................................................................................................ 2
3.      Project Overview ................................................................................................................................. 2
4.      Create SSIS Packages ........................................................................................................................... 2
4.1. Employee Master Package: ................................................................................................................. 3
4.2. Employee Rate Package ...................................................................................................................... 4
4.3. ClientMaster Package.......................................................................................................................... 4
4.4. ClientGrouping Master Package.......................................................................................................... 5
4.5. Division Master Package ..................................................................................................................... 6
4.6. Client grouping to client xref package ................................................................................................ 7
4.7. Project (Job)Master Package ............................................................................................................... 8
4.8. Project(Job)TimeSheetPackage ........................................................................................................... 9
4.9. Database Backup Package ................................................................................................................. 11
4.10. Database Re index and Shrink Package............................................................................................. 11
4.11. Master Package ................................................................................................................................. 11
4.12. Project Build, Deploy and Execution ................................................................................................. 12




SSIS Student Project Documentation                                                                                                                       1
1. Introduction:

        The project is for a fictitious construction company called AllWorks. The basic requirement of
        the project is to design and crate an ETL solution to update a SQL Server 2005 database from
        excel and flat file sources using SQL Server Integration Service.


2. Project Requirements:

        The major task of the project is to transfer the data from the different raw data sources (xls, csv
        and xml) to the SQL Server Database.

        Different packages are created to read source file and insert them into the AllWorksDBStudent
        SQL dataset. After execution of every package, an email should be generated that will send
        report containing no of rows inserted, updated or error while execution. All packages should be
        executed everyday at midnight and after execution a database should be backed up, re index
        and shrink.


3. Project Overview

        The source and destination of the data is shown in following table:
                               Source                             Destination (AllWorksDBStudent
                 File                        Details                   Database) Table Name
                                 Employees Sheet                 Employee Table
        Employees.xls
                                 Employeerates sheet             EmployeeRates Table
                                 Clientlisting sheet             Client Table
                                 Special grouping sheet          ClientGroupings
        ClientGeographies.xls
                                 Division Definition sheet       Division Table
                                 Special Groupings Sheet         ClientGroupingXCliens Table
        ProjectMaster.xls        ProjectMaster Sheet             Project(“Job”)Master Table
        EmpTime####.csv          Different files with .csv ext.  EmployeeTimeSheet Table




4. Create SSIS Packages

        To integrate external sources of data into SQL server database, altogether (including mater
        package) 11 packages were created. Each data source was read by related component, data
        converted to the proper SQL supported type, check for validation and stored to the related
        tables in the database. Data was converted with the following rules:
              Source Data Type                       Converted Data Type
        Integer                        Four Byte Signed Inter (DT_14)
        Character/Varchar              String(DT_STR)
        Date                           Date(DT_Date) or Database

SSIS Student Project Documentation                                                                       2
TimeStamp(DT_DBTIMESTAMP)
        Currency/General Numbers      Numeric(DT_NUMERIC)
        Logical/Boolean               Boolean(DT_BOOLEAN)




    4.1. Employee Master Package:

    Steps

    1. Data retrieves from excel (file:employees.xls, sheet: employees) using Data Flow Source- Excel
       Source.
    2. Data is converted according to the requirements.
    3. FullName column is added by using derived column.
    4. Lookup Transformation is added to check with employee primary key.
    5. A conditional split is used to determine either the record is new, existing and to be updated or
       will null values.
    6. According to the condition, no of rows were counted and send to the destination.
    7. An email report was generated that contains no of rows inserted, updated and error generated.




SSIS Student Project Documentation                                                                  3
4.2. Employee Rate Package

    Steps

    1. Data retrieves from excel (file:employees.xls, sheet: employeerates) using Data Flow Source-
       Excel Source.
    2. Data is converted according to the requirements.
    3. A lookup Transformation is added to check the valid employeeid.
    4. Another Lookup Transformation is added to check with employee primary key.
    5. A conditional split is used to determine either the record is new, existing and to be updated or
       will null values.
    6. According to the condition, no of rows were counted and send to the destination.
    7. An email report was generated that contains no of rows inserted, updated and error generated.




    4.3. ClientMaster Package

        Steps



SSIS Student Project Documentation                                                                  4
1. Data retrieves from excel (file: clientgeorgraphics.xls, sheet: clientlisting) using Data Flow
           Source- Excel Source.
        2. Data is converted according to the requirements.
        3. Lookup Transformation is added to check with client primary key.
        4. A conditional split is used to determine either the record is new, existing and to be updated
           or will null values.
        5. According to the condition, no of rows were counted and send to the destination.
        6. An email report was generated that contains no of rows inserted, updated and error
           generated.




    4.4. ClientGrouping Master Package

        Steps :

        1. Data retrieves from excel (file:clientgeographics.xls, sheet:specialgroupings) using Data Flow
           Source- Excel Source.
        2. Data is converted according to the requirements.
        3. An Aggregate dataflow task is added and records are grouped by grouping no and grouping
           name.

SSIS Student Project Documentation                                                                    5
4. Lookup Transformation is added to check with group primary key.
        5. A conditional split is used to determine either the record is new, existing and to be updated
           or will null values.
        6. According to the condition, no of rows were counted and send to the destination.
        7. An email report was generated that contains no of rows inserted, updated and error
           generated.




    4.5. Division Master Package

        Steps :

        1. Data retrieves from excel (file:clientgeographics.xls, sheet:divisiondefinitions) using Data
           Flow Source- Excel Source.
        2. Data is converted according to the requirements.
        3. An Aggregate dataflow task is added and records are grouped by division no and division
           descriptions.
        4. Lookup Transformation is added to check with division primary key.
        5. A conditional split is used to determine either the record is new, existing and to be updated
           or will null values.
        6. According to the condition, no of rows were counted and send to the destination.
        7. An email report was generated that contains no of rows inserted, updated and error
           generated.




SSIS Student Project Documentation                                                                   6
4.6. Client grouping to client xref package

        Steps
        1. Data retrieves from excel (file:clientgeographics.xls, sheet:specialgroupings) using Data Flow
            Source- Excel Source.
        2. Data is converted according to the requirements.
        3. A Lookup transformation added to check valid group
        4. Another Lookup Transformation is added to check with client primary key(client table).
        5. A conditional split is used to determine either the record is new, existing and to be updated
            or will null values.
        6. According to the condition, no of rows were counted and send to the destination.
        7. An email report was generated that contains no of rows inserted, updated and error
            generated.




SSIS Student Project Documentation                                                                   7
4.7. Project (Job)Master Package

        1. Data retrieves from excel (file:projectmaster.xls, sheet:projectmaster) using Data Flow
           Source- Excel Source.
        2. Data is converted according to the requirements.
        3. A Lookup Transformation is added to check for valid client (client table).
        4. Another Lookup Transformation is added to check with job master primary key.
        5. A conditional split is used to determine either the record is new, existing and to be updated
           or will null values.
        6. According to the condition, no of rows were counted and send to the destination.
        7. An email report was generated that contains no of rows inserted, updated and error
           generated.




SSIS Student Project Documentation                                                                   8
4.8. Project(Job)TimeSheetPackage

        1. Data retrieves from different .csv files inside time folder. To retrieve and transform records
           from multiple .csv files, a For each loop container with Foreach File Enumerator is used.
        2. Data is converted according to the requirements.
        3. Lookup Transformation is added to check the valid job (Job Master Table).
        4. A conditional split is used to determine the status of the job i.e. either the job is closed or
           active.
        5. For closed jobs, a multicast is used to send the data to excel files and error log table.
        6. For Active jobs, another Lookup transformation is added to check with employeepk,
           jobmasterpk and jobdate.
        7. A conditional split is used to determine either the record is new, existing and to be updated
           or will null values.
        8. According to the condition, no of rows were counted and send to the destination.
        9. An email report was generated that contains no of rows inserted, updated and error
           generated.




SSIS Student Project Documentation                                                                     9
SSIS Student Project Documentation   10
4.9. Database Backup Package

        After completion of all 8 packages, another package is generated to backup the database. For
        this, A Maintenance Plan Task- Backup Database Task control is added and set it to backup the
        database. After backup database a send mail task is added to report the status of database
        backup process.




    4.10. Database Re index and Shrink Package

        In this package, two maintenance controls Rebuilt Index and Shrink Database task are added.
        After execution of the task an email was generated with send mail task to report the progress.




    4.11. Master Package

        Finally, all packages are organized in a Master package. The following screen shows the layout of
        the master package.




SSIS Student Project Documentation                                                                   11
4.12. Project Build, Deploy and Execution

        After preparation of all packages, the packages were building and deployed and installed on SQL
        Server Database. Using SQL Server Agent, master package was scheduled to run on everyday at
        midnight.




SSIS Student Project Documentation                                                                  12

Más contenido relacionado

La actualidad más candente

PostgreSQL Table Partitioning / Sharding
PostgreSQL Table Partitioning / ShardingPostgreSQL Table Partitioning / Sharding
PostgreSQL Table Partitioning / ShardingAmir Reza Hashemi
 
OSA Con 2022 - Apache Iceberg_ An Architectural Look Under the Covers - Alex ...
OSA Con 2022 - Apache Iceberg_ An Architectural Look Under the Covers - Alex ...OSA Con 2022 - Apache Iceberg_ An Architectural Look Under the Covers - Alex ...
OSA Con 2022 - Apache Iceberg_ An Architectural Look Under the Covers - Alex ...Altinity Ltd
 
SAP BusinessObjects BI 4.3
SAP BusinessObjects BI 4.3SAP BusinessObjects BI 4.3
SAP BusinessObjects BI 4.3Wiiisdom
 
data platform on kubernetes
data platform on kubernetesdata platform on kubernetes
data platform on kubernetes창언 정
 
Multiple files single target single interface
Multiple files single target single interfaceMultiple files single target single interface
Multiple files single target single interfaceDharmaraj Borse
 
Hive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep DiveHive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep DiveDataWorks Summit
 
Presentation 1 - SSRS (1)
Presentation 1 - SSRS (1)Presentation 1 - SSRS (1)
Presentation 1 - SSRS (1)Anurag Rana
 
Technical Deck Delta Live Tables.pdf
Technical Deck Delta Live Tables.pdfTechnical Deck Delta Live Tables.pdf
Technical Deck Delta Live Tables.pdfIlham31574
 
Apache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversApache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversScyllaDB
 
Azure Data Factory v2
Azure Data Factory v2Azure Data Factory v2
Azure Data Factory v2inovex GmbH
 
A Complex SSIS Package
A Complex SSIS PackageA Complex SSIS Package
A Complex SSIS PackageNitil Dwivedi
 
Sql server-integration-services-ssis-step-by-step-sample-chapters
Sql server-integration-services-ssis-step-by-step-sample-chaptersSql server-integration-services-ssis-step-by-step-sample-chapters
Sql server-integration-services-ssis-step-by-step-sample-chaptersNadinKa Karimou
 
Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)
Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)
Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)Darshankumar Prajapati
 
Azure data factory
Azure data factoryAzure data factory
Azure data factoryBizTalk360
 
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...Cathrine Wilhelmsen
 
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)Cathrine Wilhelmsen
 

La actualidad más candente (20)

PostgreSQL
PostgreSQLPostgreSQL
PostgreSQL
 
PostgreSQL Table Partitioning / Sharding
PostgreSQL Table Partitioning / ShardingPostgreSQL Table Partitioning / Sharding
PostgreSQL Table Partitioning / Sharding
 
OSA Con 2022 - Apache Iceberg_ An Architectural Look Under the Covers - Alex ...
OSA Con 2022 - Apache Iceberg_ An Architectural Look Under the Covers - Alex ...OSA Con 2022 - Apache Iceberg_ An Architectural Look Under the Covers - Alex ...
OSA Con 2022 - Apache Iceberg_ An Architectural Look Under the Covers - Alex ...
 
Data Vault Overview
Data Vault OverviewData Vault Overview
Data Vault Overview
 
SAP BusinessObjects BI 4.3
SAP BusinessObjects BI 4.3SAP BusinessObjects BI 4.3
SAP BusinessObjects BI 4.3
 
data platform on kubernetes
data platform on kubernetesdata platform on kubernetes
data platform on kubernetes
 
Multiple files single target single interface
Multiple files single target single interfaceMultiple files single target single interface
Multiple files single target single interface
 
Hive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep DiveHive + Tez: A Performance Deep Dive
Hive + Tez: A Performance Deep Dive
 
Presentation 1 - SSRS (1)
Presentation 1 - SSRS (1)Presentation 1 - SSRS (1)
Presentation 1 - SSRS (1)
 
Technical Deck Delta Live Tables.pdf
Technical Deck Delta Live Tables.pdfTechnical Deck Delta Live Tables.pdf
Technical Deck Delta Live Tables.pdf
 
Apache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversApache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the Covers
 
Azure Data Factory v2
Azure Data Factory v2Azure Data Factory v2
Azure Data Factory v2
 
A Complex SSIS Package
A Complex SSIS PackageA Complex SSIS Package
A Complex SSIS Package
 
Oracle Tablespace - Basic
Oracle Tablespace - BasicOracle Tablespace - Basic
Oracle Tablespace - Basic
 
Sql server-integration-services-ssis-step-by-step-sample-chapters
Sql server-integration-services-ssis-step-by-step-sample-chaptersSql server-integration-services-ssis-step-by-step-sample-chapters
Sql server-integration-services-ssis-step-by-step-sample-chapters
 
Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)
Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)
Multiple Flat Files(CSV) to Target Table in ODI12c(12.2.1.0.0)
 
Azure data factory
Azure data factoryAzure data factory
Azure data factory
 
080827 abramson inmon vs kimball
080827 abramson   inmon vs kimball080827 abramson   inmon vs kimball
080827 abramson inmon vs kimball
 
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
 
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
Pipelines and Packages: Introduction to Azure Data Factory (DATA:Scotland 2019)
 

Similar a A-Project Report- SSIS

SSIS Project Profile
SSIS Project ProfileSSIS Project Profile
SSIS Project Profiletthompson0421
 
B Woodward Portfolio
B Woodward PortfolioB Woodward Portfolio
B Woodward Portfoliobwoodward
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence PortfolioYubaraj Khanal
 
MMYERS Portfolio
MMYERS PortfolioMMYERS Portfolio
MMYERS PortfolioMike Myers
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfolioamoffat
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfolioguestc38d4b
 
DBT PU BI Lab Manual for ETL Exercise.pdf
DBT PU BI Lab Manual for ETL Exercise.pdfDBT PU BI Lab Manual for ETL Exercise.pdf
DBT PU BI Lab Manual for ETL Exercise.pdfJanakiramanS13
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfoliogaryt1953
 
David Weston SSIS Portfolio
David Weston SSIS PortfolioDavid Weston SSIS Portfolio
David Weston SSIS Portfoliodlweston
 
RakeshCS_5.4years
RakeshCS_5.4yearsRakeshCS_5.4years
RakeshCS_5.4yearsRakesh CS
 
Bi Ppt Portfolio Elmer Donavan
Bi Ppt Portfolio  Elmer DonavanBi Ppt Portfolio  Elmer Donavan
Bi Ppt Portfolio Elmer DonavanEJDonavan
 
Eugene Wabomnor Bi Portfolio
Eugene Wabomnor Bi PortfolioEugene Wabomnor Bi Portfolio
Eugene Wabomnor Bi Portfolioi661e21
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfolioguest5a8ee60b
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfoliopleeloy
 
Dennis Schmid Portfolio
Dennis Schmid PortfolioDennis Schmid Portfolio
Dennis Schmid PortfolioDennis Schmid
 
Ssis ssas sps_mdx_hong_bingli
Ssis ssas sps_mdx_hong_bingliSsis ssas sps_mdx_hong_bingli
Ssis ssas sps_mdx_hong_bingliHong-Bing Li
 

Similar a A-Project Report- SSIS (20)

SSIS Project Profile
SSIS Project ProfileSSIS Project Profile
SSIS Project Profile
 
B Woodward Portfolio
B Woodward PortfolioB Woodward Portfolio
B Woodward Portfolio
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfolio
 
Aditya_2015
Aditya_2015Aditya_2015
Aditya_2015
 
MMYERS Portfolio
MMYERS PortfolioMMYERS Portfolio
MMYERS Portfolio
 
Shashi.Kiran_CV
Shashi.Kiran_CVShashi.Kiran_CV
Shashi.Kiran_CV
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfolio
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfolio
 
sql resume
sql resumesql resume
sql resume
 
DBT PU BI Lab Manual for ETL Exercise.pdf
DBT PU BI Lab Manual for ETL Exercise.pdfDBT PU BI Lab Manual for ETL Exercise.pdf
DBT PU BI Lab Manual for ETL Exercise.pdf
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfolio
 
David Weston SSIS Portfolio
David Weston SSIS PortfolioDavid Weston SSIS Portfolio
David Weston SSIS Portfolio
 
Ganesh CV
Ganesh CVGanesh CV
Ganesh CV
 
RakeshCS_5.4years
RakeshCS_5.4yearsRakeshCS_5.4years
RakeshCS_5.4years
 
Bi Ppt Portfolio Elmer Donavan
Bi Ppt Portfolio  Elmer DonavanBi Ppt Portfolio  Elmer Donavan
Bi Ppt Portfolio Elmer Donavan
 
Eugene Wabomnor Bi Portfolio
Eugene Wabomnor Bi PortfolioEugene Wabomnor Bi Portfolio
Eugene Wabomnor Bi Portfolio
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfolio
 
Business Intelligence Portfolio
Business Intelligence PortfolioBusiness Intelligence Portfolio
Business Intelligence Portfolio
 
Dennis Schmid Portfolio
Dennis Schmid PortfolioDennis Schmid Portfolio
Dennis Schmid Portfolio
 
Ssis ssas sps_mdx_hong_bingli
Ssis ssas sps_mdx_hong_bingliSsis ssas sps_mdx_hong_bingli
Ssis ssas sps_mdx_hong_bingli
 

Último

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 

Último (20)

Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 

A-Project Report- SSIS

  • 1. SetFocus Business Intelligence –Master Program SQL SERVER INTEGRATION SERVICE PORJECT DOCUMENTATION ON AllWorks, Inc. – a Ficticious Construction Company Submitted By Yubaraj Khanal (May,2009) 0
  • 2. Table of Contents 1. Introduction: ....................................................................................................................................... 2 2. Project Requirements: ........................................................................................................................ 2 3. Project Overview ................................................................................................................................. 2 4. Create SSIS Packages ........................................................................................................................... 2 4.1. Employee Master Package: ................................................................................................................. 3 4.2. Employee Rate Package ...................................................................................................................... 4 4.3. ClientMaster Package.......................................................................................................................... 4 4.4. ClientGrouping Master Package.......................................................................................................... 5 4.5. Division Master Package ..................................................................................................................... 6 4.6. Client grouping to client xref package ................................................................................................ 7 4.7. Project (Job)Master Package ............................................................................................................... 8 4.8. Project(Job)TimeSheetPackage ........................................................................................................... 9 4.9. Database Backup Package ................................................................................................................. 11 4.10. Database Re index and Shrink Package............................................................................................. 11 4.11. Master Package ................................................................................................................................. 11 4.12. Project Build, Deploy and Execution ................................................................................................. 12 SSIS Student Project Documentation 1
  • 3. 1. Introduction: The project is for a fictitious construction company called AllWorks. The basic requirement of the project is to design and crate an ETL solution to update a SQL Server 2005 database from excel and flat file sources using SQL Server Integration Service. 2. Project Requirements: The major task of the project is to transfer the data from the different raw data sources (xls, csv and xml) to the SQL Server Database. Different packages are created to read source file and insert them into the AllWorksDBStudent SQL dataset. After execution of every package, an email should be generated that will send report containing no of rows inserted, updated or error while execution. All packages should be executed everyday at midnight and after execution a database should be backed up, re index and shrink. 3. Project Overview The source and destination of the data is shown in following table: Source Destination (AllWorksDBStudent File Details Database) Table Name Employees Sheet Employee Table Employees.xls Employeerates sheet EmployeeRates Table Clientlisting sheet Client Table Special grouping sheet ClientGroupings ClientGeographies.xls Division Definition sheet Division Table Special Groupings Sheet ClientGroupingXCliens Table ProjectMaster.xls ProjectMaster Sheet Project(“Job”)Master Table EmpTime####.csv Different files with .csv ext. EmployeeTimeSheet Table 4. Create SSIS Packages To integrate external sources of data into SQL server database, altogether (including mater package) 11 packages were created. Each data source was read by related component, data converted to the proper SQL supported type, check for validation and stored to the related tables in the database. Data was converted with the following rules: Source Data Type Converted Data Type Integer Four Byte Signed Inter (DT_14) Character/Varchar String(DT_STR) Date Date(DT_Date) or Database SSIS Student Project Documentation 2
  • 4. TimeStamp(DT_DBTIMESTAMP) Currency/General Numbers Numeric(DT_NUMERIC) Logical/Boolean Boolean(DT_BOOLEAN) 4.1. Employee Master Package: Steps 1. Data retrieves from excel (file:employees.xls, sheet: employees) using Data Flow Source- Excel Source. 2. Data is converted according to the requirements. 3. FullName column is added by using derived column. 4. Lookup Transformation is added to check with employee primary key. 5. A conditional split is used to determine either the record is new, existing and to be updated or will null values. 6. According to the condition, no of rows were counted and send to the destination. 7. An email report was generated that contains no of rows inserted, updated and error generated. SSIS Student Project Documentation 3
  • 5. 4.2. Employee Rate Package Steps 1. Data retrieves from excel (file:employees.xls, sheet: employeerates) using Data Flow Source- Excel Source. 2. Data is converted according to the requirements. 3. A lookup Transformation is added to check the valid employeeid. 4. Another Lookup Transformation is added to check with employee primary key. 5. A conditional split is used to determine either the record is new, existing and to be updated or will null values. 6. According to the condition, no of rows were counted and send to the destination. 7. An email report was generated that contains no of rows inserted, updated and error generated. 4.3. ClientMaster Package Steps SSIS Student Project Documentation 4
  • 6. 1. Data retrieves from excel (file: clientgeorgraphics.xls, sheet: clientlisting) using Data Flow Source- Excel Source. 2. Data is converted according to the requirements. 3. Lookup Transformation is added to check with client primary key. 4. A conditional split is used to determine either the record is new, existing and to be updated or will null values. 5. According to the condition, no of rows were counted and send to the destination. 6. An email report was generated that contains no of rows inserted, updated and error generated. 4.4. ClientGrouping Master Package Steps : 1. Data retrieves from excel (file:clientgeographics.xls, sheet:specialgroupings) using Data Flow Source- Excel Source. 2. Data is converted according to the requirements. 3. An Aggregate dataflow task is added and records are grouped by grouping no and grouping name. SSIS Student Project Documentation 5
  • 7. 4. Lookup Transformation is added to check with group primary key. 5. A conditional split is used to determine either the record is new, existing and to be updated or will null values. 6. According to the condition, no of rows were counted and send to the destination. 7. An email report was generated that contains no of rows inserted, updated and error generated. 4.5. Division Master Package Steps : 1. Data retrieves from excel (file:clientgeographics.xls, sheet:divisiondefinitions) using Data Flow Source- Excel Source. 2. Data is converted according to the requirements. 3. An Aggregate dataflow task is added and records are grouped by division no and division descriptions. 4. Lookup Transformation is added to check with division primary key. 5. A conditional split is used to determine either the record is new, existing and to be updated or will null values. 6. According to the condition, no of rows were counted and send to the destination. 7. An email report was generated that contains no of rows inserted, updated and error generated. SSIS Student Project Documentation 6
  • 8. 4.6. Client grouping to client xref package Steps 1. Data retrieves from excel (file:clientgeographics.xls, sheet:specialgroupings) using Data Flow Source- Excel Source. 2. Data is converted according to the requirements. 3. A Lookup transformation added to check valid group 4. Another Lookup Transformation is added to check with client primary key(client table). 5. A conditional split is used to determine either the record is new, existing and to be updated or will null values. 6. According to the condition, no of rows were counted and send to the destination. 7. An email report was generated that contains no of rows inserted, updated and error generated. SSIS Student Project Documentation 7
  • 9. 4.7. Project (Job)Master Package 1. Data retrieves from excel (file:projectmaster.xls, sheet:projectmaster) using Data Flow Source- Excel Source. 2. Data is converted according to the requirements. 3. A Lookup Transformation is added to check for valid client (client table). 4. Another Lookup Transformation is added to check with job master primary key. 5. A conditional split is used to determine either the record is new, existing and to be updated or will null values. 6. According to the condition, no of rows were counted and send to the destination. 7. An email report was generated that contains no of rows inserted, updated and error generated. SSIS Student Project Documentation 8
  • 10. 4.8. Project(Job)TimeSheetPackage 1. Data retrieves from different .csv files inside time folder. To retrieve and transform records from multiple .csv files, a For each loop container with Foreach File Enumerator is used. 2. Data is converted according to the requirements. 3. Lookup Transformation is added to check the valid job (Job Master Table). 4. A conditional split is used to determine the status of the job i.e. either the job is closed or active. 5. For closed jobs, a multicast is used to send the data to excel files and error log table. 6. For Active jobs, another Lookup transformation is added to check with employeepk, jobmasterpk and jobdate. 7. A conditional split is used to determine either the record is new, existing and to be updated or will null values. 8. According to the condition, no of rows were counted and send to the destination. 9. An email report was generated that contains no of rows inserted, updated and error generated. SSIS Student Project Documentation 9
  • 11. SSIS Student Project Documentation 10
  • 12. 4.9. Database Backup Package After completion of all 8 packages, another package is generated to backup the database. For this, A Maintenance Plan Task- Backup Database Task control is added and set it to backup the database. After backup database a send mail task is added to report the status of database backup process. 4.10. Database Re index and Shrink Package In this package, two maintenance controls Rebuilt Index and Shrink Database task are added. After execution of the task an email was generated with send mail task to report the progress. 4.11. Master Package Finally, all packages are organized in a Master package. The following screen shows the layout of the master package. SSIS Student Project Documentation 11
  • 13. 4.12. Project Build, Deploy and Execution After preparation of all packages, the packages were building and deployed and installed on SQL Server Database. Using SQL Server Agent, master package was scheduled to run on everyday at midnight. SSIS Student Project Documentation 12