Five Steps to Mastering Master Data Management
                                     Ron Lewis
                              November 19, 2009
Presentation Overview

• Introduction
• What is Master Data Management?
• The 5 Steps for Master Data Management:
    • Discovery – finding all of the data sources, who they are used by and how they are used
    • Analysis – identifying authoritative sources, discrepancies, and candidates for consolidation
    • Design – designing the metadata repository
    • Implementation – implementing a metadata repository
    • Establish data governance

• Leveraging Technology to facilitate:
    • Business Process and Data Modeling
    • Data Governance and Discovery
    • Metadata Repository Implementation
    • Metadata Management

• Presentation Focus:           The Discovery and Analysis Phases
19/11/2009                                                                                            2
Master Data Management

• Master Data Management
    • Master Data is: Principal business data essential for conducting business
    • MDM provides an enterprise perspective on the critical Business Processes and the Data necessary to
      support them
    • Bottom line: Improve decision making



• Core Tasks
    • Building the Business Process Models
    • Data Governance (Standardizing data - nomenclature, domains, data quality and consumption rules)
    • Synchronizing related operational systems using the data
    • Integrating/reconciling disparate data silos to provide single enterprise view
    • Building and managing an enterprise metadata repository



• Challenge: Must Shift Thinking to the Enterprise Perspective

11/15/2009                                                                                               3
Discovery Phase

• Step 1 – Discovery
    • Capturing and modeling the essential business processes
    • Mapping processes to the data necessary to complete each process successfully
    • Identifying data sources and gathering appropriate metadata

• Primary Challenges-
    • Cost - It’s Expensive and Disruptive
    • Gaining Executive Leadership Support – (“You mean we don’t have this already?”)

• Solution-
    • Start with what’s most important
    • What’s important should be obvious




Discovery Phase

• Involve your infrastructure and/or security personnel
• Iteration I: Capture existing data and schemas
    • Find your database servers, respective owners and access
    • Reverse engineer your physical data models
    • Build a master data dictionary and catalog

• Iteration II: Profile existing applications to help with business
    • Database Centric: ETL, Stored Procedures, and Triggers
    • Application Source Code and User Behavior

• Tools You’ll Need
    • Infrastructure/security tools (Nessus)
    • Data Modeling and Profiling tools (ER/Studio Data Architect/DBOptimizer)
    • Application Profiling tools (NitroSecurity APM)
    • Repository to manage the metadata byproducts
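
The catalog-building step in Iteration I can be sketched in a few lines. This is only an illustration, assuming a SQLite database as a stand-in for a discovered server; the `customer` table, the `crm` source name, and the `build_data_dictionary` helper are hypothetical, and a modeling tool such as ER/Studio would do this work in practice.

```python
import sqlite3

def build_data_dictionary(conn, source_name):
    """Return one catalog row per column found in a SQLite database."""
    rows = []
    tables = conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    ).fetchall()
    for (table,) in tables:
        # PRAGMA table_info yields (cid, name, type, notnull, dflt_value, pk)
        for _cid, column, col_type, notnull, _default, pk in conn.execute(
            f'PRAGMA table_info("{table}")'
        ):
            rows.append({
                "source": source_name,
                "table": table,
                "column": column,
                "type": col_type,
                "declared_not_null": bool(notnull),
                "primary_key": bool(pk),
            })
    return rows

# Hypothetical source system standing in for a discovered database server.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE customer ("
    "cust_id INTEGER PRIMARY KEY, cust_name TEXT NOT NULL, state TEXT)"
)
catalog = build_data_dictionary(conn, "crm")
for entry in catalog:
    print(entry)
```

Tagging each row with its source system is what later makes it possible to compare like elements across silos.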



Infrastructure / Security Tooling




Use ER/Studio to Reverse Engineer




Reverse Engineer Physical Schemas




Example Reverse Engineered Model




Start Building Master Data Catalog




Exporting Catalog for Sharing




Discovery – Profiling Data Use

• Biggest Challenges We’re Solving:
    • Reconciling and integrating disparate “Data Silos” into a central location
    • Identifying duplicative data elements (or attributes)
    • Laying the foundation for identifying which of the data sources contain the actual “source data”

• High Percentage of Business Logic is encapsulated as Programming Logic
    • Stored Procedures and Trigger code stored in the database
    • Application Source Code
    • Extract, Transform and Load (ETL) Scripts
    • We need visibility into this logic, and we need to be able to store it somewhere

• Tools necessary for this:
    • DSAuditor and DB Optimizer or Performance Center (to capture live data use)
    • Source Code Analyzers (I like Fortify SCA, and Embarcadero JBuilder)
    • Profile ETL using Embarcadero’s MetaWizard (usually convert ETL to XML)
    • Store metadata in ER/Studio Data Architect’s Data Lineage and Transform Rules Support
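
To illustrate what "visibility" into stored procedure or ETL logic can mean, here is a rough sketch that scans SQL text for the tables it reads and writes. It assumes a naive regular expression rather than a real SQL parser, so it will misclassify some statements (for example, `DELETE FROM` counts as a read); the procedure text and the `tables_by_usage` helper are hypothetical and not part of any tool named above.

```python
import re
from collections import defaultdict

# Keyword followed by a (possibly schema-qualified) table name.
TABLE_REF = re.compile(
    r"\b(FROM|JOIN|INTO|UPDATE)\s+([A-Za-z_][\w.]*)", re.IGNORECASE
)

def tables_by_usage(sql_text):
    """Map each referenced table to the kinds of access seen in the text."""
    usage = defaultdict(set)
    for keyword, table in TABLE_REF.findall(sql_text):
        kind = "read" if keyword.upper() in ("FROM", "JOIN") else "write"
        usage[table.lower()].add(kind)
    return {table: sorted(kinds) for table, kinds in usage.items()}

# Hypothetical stored procedure body captured during profiling.
proc = """
CREATE PROCEDURE sync_customers AS
INSERT INTO dw.customer_dim (id, name)
SELECT c.cust_id, c.cust_name FROM crm.customer c
JOIN crm.account a ON a.cust_id = c.cust_id;
"""
print(tables_by_usage(proc))
```

Even a crude read/write inventory like this hints at which source feeds which, which is the question the Analysis phase picks up.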


Profiling Data Use with DBOptimizer




Analysis Phase

• Step 2 – Analysis
    • Identifying authoritative sources, discrepancies, and candidates for consolidation
    • Evaluating Data Flow and Transform Rules
    • Capturing/Defining Synonyms and Assigning Aliases
    • Setting the Foundation for Data Governance

• Primary Challenges-
    • Cost – It’s Time Consuming and is a “Team Effort”
    • Getting ancillary information that teams don’t want to share

• Solution-
    • Start with what’s most important
    • What’s important should be obvious




Analysis Phase

• Iteration I: Evaluate ETL for data lineage and transform rules
    • Start by reverse engineering the ETL, converting it to XML
    • Incorporate it into the repository

• Iteration II: Identify synonymous elements and build alias list
    • Evaluate data domains and transform rules for issues such as state and use
    • Enlist database and development staff to identify aliases and tag the data elements in the master catalog

• Tools You’ll Need
    • Data Modeling tools (ER/Studio and MetaWizard)
    • Repository to manage the metadata byproducts (ER/Studio)




Analysis Phase – Evaluating ETL

• Biggest Challenges We’re Solving:
    • Finding which data sources feed which other data sources
    • Collecting Data Lineage metadata
    • Making it accessible to the right team members

• Convert the ETL to a form that allows manipulation (such as XML)
• Import the metadata into the data modeling tool
• Build, publish and control access to your master data repository
• Start gathering and applying metadata tags
• Tools necessary for this:
    • MetaWizard
    • ER/Studio Data Architect (or the like)
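
Once the ETL is in XML, lineage questions become simple graph walks. The sketch below assumes an invented `<mapping source=… target=… rule=…/>` layout purely for illustration; it is not the format MetaWizard or any other tool actually emits, and the element names are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML rendering of three ETL mappings.
ETL_XML = """
<etl>
  <mapping source="crm.customer.cust_name" target="dw.customer_dim.name" rule="TRIM"/>
  <mapping source="crm.customer.state" target="dw.customer_dim.state_code" rule="UPPER"/>
  <mapping source="dw.customer_dim.name" target="mart.top_customers.name" rule="COPY"/>
</etl>
"""

def load_lineage(xml_text):
    """Return (source, target, rule) tuples for every mapping element."""
    root = ET.fromstring(xml_text)
    return [
        (m.get("source"), m.get("target"), m.get("rule"))
        for m in root.iter("mapping")
    ]

def upstream(lineage, element):
    """Walk the edges backwards to list every element that feeds `element`."""
    feeders = []
    frontier = [element]
    while frontier:
        current = frontier.pop()
        for source, target, _rule in lineage:
            if target == current and source not in feeders:
                feeders.append(source)
                frontier.append(source)
    return feeders

lineage = load_lineage(ETL_XML)
print(upstream(lineage, "mart.top_customers.name"))
```

The element at the end of the upstream chain is a natural candidate for the authoritative source, subject to confirmation by the owning team.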




Data Lineage and Transform Rules




Setting the Foundation for Governance




Analysis Phase – Identifying Synonyms


• Biggest Challenges We’re Solving:
    • Identifying like data elements and candidates for consolidation
    • Building Aliases
    • Establishing the foundation for Data Governance

• Evaluate data nomenclature using tool functions such as Merge and
  Compare to identify the obvious overlaps
• Compare descriptors from database staff
• Compare data use and consumption rules derived from tools such as DB
  Optimizer
• Tools necessary for this:
    • ER/Studio Data Architect (or the like)
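
As a rough picture of how "obvious overlaps" in nomenclature can be surfaced, the sketch below scores column names from different sources by string similarity. It assumes Python's standard `difflib` as a stand-in for a modeling tool's compare function; the element names, the `synonym_candidates` helper, and the 0.75 threshold are hypothetical choices. Name similarity only produces candidates, which database staff still need to confirm.

```python
from difflib import SequenceMatcher
from itertools import combinations

def normalize(name):
    """Lower-case a name and drop separators so naming styles compare equal."""
    return name.lower().replace("_", "").replace("-", "")

def synonym_candidates(elements, threshold=0.75):
    """Return cross-source element pairs whose names look alike."""
    pairs = []
    for (src_a, col_a), (src_b, col_b) in combinations(elements, 2):
        if src_a == src_b:
            continue  # only compare elements from different sources
        score = SequenceMatcher(None, normalize(col_a), normalize(col_b)).ratio()
        if score >= threshold:
            pairs.append((f"{src_a}.{col_a}", f"{src_b}.{col_b}", round(score, 2)))
    return sorted(pairs, key=lambda pair: -pair[2])

# Hypothetical (source, column) entries taken from a master data catalog.
elements = [
    ("crm", "cust_name"),
    ("billing", "CustName"),
    ("billing", "invoice_total"),
    ("erp", "customer_name"),
]
pairs = synonym_candidates(elements)
for pair in pairs:
    print(pair)
```

Each confirmed pair becomes an alias entry in the catalog and a candidate for consolidation.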




Performing Analysis With Compare Utility




Exporting to Excel for Input into Database




Candidates for Consolidation




Step 3 Building the Repository

• Step 3–Building Metadata Repository
    • Populating the Repository with the right metadata
    • Establishing and Controlling Access to the metadata
    • Performing metadata management

• Primary Challenges-
    • Defining who needs access to what metadata
    • Establishing the rules of use

• Suggestions-
    • Implement a change control and auditing tool
    • What’s important should be obvious
    • Understand the value of the metadata on profitability




Step 4 Implementing the repository

• Step 4 - Implementing the repository
    • Mapping the metadata to the requisite business processes
    • Leveraging the metadata to determine candidates for business process re-engineering

• Primary Challenges-
    • Getting the processes down in modeled form
    • Obtaining Middle Level Management and Senior Leadership buy-in to changes identified by metadata

• Suggestions-
    • Leverage a modeling tool that facilitates data to process mapping (integrated metadata)
    • Focus on what’s most important to the business—try not to focus on EVERYTHING
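
The data-to-process mapping can be pictured as a simple index. This is a minimal sketch, assuming an invented set of processes and data elements; in practice the mapping would live in the modeling tool's integrated metadata rather than in a script.

```python
from collections import defaultdict

# Hypothetical business processes and the data elements each one needs.
process_inputs = {
    "order_to_cash": ["customer.name", "customer.credit_limit", "product.price"],
    "customer_onboarding": ["customer.name", "customer.address"],
    "pricing_review": ["product.price"],
}

def processes_by_element(mapping):
    """Invert a process -> elements mapping into element -> processes."""
    index = defaultdict(list)
    for process, elements in mapping.items():
        for element in elements:
            index[element].append(process)
    return dict(index)

index = processes_by_element(process_inputs)

# Elements shared by several processes are where a data change has the
# widest business impact.
shared = {element: procs for element, procs in index.items() if len(procs) > 1}
print(shared)
```

Elements that appear under many processes are a reasonable place to start when deciding what is "most important to the business".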




Step 5 Establishing Data Governance

• Step 5 – Establishing Data Governance
    • All of the above steps lay the foundation for good data governance
    • Get Senior Leadership to stipulate policy enforcing the rules you’ve derived
    • Build a Plan and Standardize Iteratively – (don’t try to fix everything all at once)

• Primary Challenges-
    • Fundamental Opposition to Change
    • Maintaining Momentum

• Suggestions-
    • Find a quick kill – tackle the biggest organizational problem you can handle
    • Focus on what’s most important to the business—and what drives easily visible ROI




Summary

• What We Covered:
    • Defined Master Data and Master Data Management
    • The 5 Steps for Master Data Management:
         • Discovery – finding all of the data sources, who they are used by and how they are used
         • Analysis – identifying authoritative sources, discrepancies, and candidates for consolidation
         • Design – designing the metadata repository
         • Implementation – implementing a metadata repository
         • Establish data governance
    • Demonstrated how to leverage specific technology to facilitate:
         • Business Process and Data Modeling
         • Data Governance and Discovery
         • Metadata Repository Implementation
         • Metadata Management




Questions and Answers

• Tools Discussed:
     • Nessus
     • ER/Studio Data Architect / Business Architect and ER/Studio Repository
     • DBOptimizer
     • Change Manager



• Technologies Discussed:
     • Building the Data Catalog
     • Capturing and Storing Metadata
     • Metadata Analysis



• Contact Info:
•   Ron Lewis, Ron.Lewis@cdotech.com




Troubleshooting Plan Changes with Query Store in SQL Server 2016Troubleshooting Plan Changes with Query Store in SQL Server 2016
Troubleshooting Plan Changes with Query Store in SQL Server 2016Embarcadero Technologies
 

5 Steps To Master Data Management

  • 1. Five Steps to Mastering Master Data Management Ron Lewis November 19, 2009
  • 2. Presentation Overview • Introduction • What is Master Data Management? • The 5 Steps for Master Data Management: • Discovery – finding all of the data sources, who they are used by and how they are used • Analysis – identifying authoritative sources, discrepancies, and candidates for consolidation • Design – designing the metadata repository • Implementation – implementing a metadata repository • Establish data governance • Leveraging Technology to facilitate: • Business Process and Data Modeling • Data Governance and Discovery • Metadata Repository Implementation • Metadata Management • Presentation Focus: The Discovery and Analysis Phases
  • 3. Master Data Management • Master Data Management • Master Data is: Principal business data essential for conducting business • MDM provides an enterprise perspective on the critical Business Processes and the Data necessary to support them • Bottom line: Improve decision making • Core Tasks • Building the Business Process Models • Data Governance (Standardizing data: nomenclature, domains, data quality and consumption rules) • Synchronizing related operational systems using the data • Integrating/reconciling disparate data silos to provide a single enterprise view • Building and managing an enterprise metadata repository • Challenge: Must Shift Thinking to the Enterprise Perspective
  • 4. Discovery Phase • Step 1 – Discovery • Capturing and modeling the essential business processes • Mapping processes to the data necessary to complete each process successfully • Identifying data sources and gathering appropriate metadata • Primary Challenges: • Cost – It’s Expensive and Disruptive • Gaining Executive Leadership Support (“You mean we don’t have this already?”) • Solution: • Start with what’s most important • What’s important should be obvious
  • 5. Discovery Phase • Involve your infrastructure and/or security personnel • Iteration I: Capture existing data and schemas • Find your database servers, their respective owners, and access • Reverse engineer your physical data models • Build a master data dictionary and catalog • Iteration II: Profile existing applications to help with business • Database Centric: ETL, Stored Procedures, and Triggers • Application Source Code and User Behavior • Tools You’ll Need • Infrastructure/security tools (Nessus) • Data Modeling and Profiling tools (ER/Studio Data Architect/DBOptimizer) • Application Profiling tools (NitroSecurity APM) • Repository to manage the metadata byproducts
  • 6. Infrastructure / Security Tooling
  • 7. Use ER Studio to Reverse Engineer
  • 8. Reverse Engineer Physical Schemas
  • 9. Example Reverse Engineered Model
  • 10. Start Building Master Data Catalog
  • 11. Exporting Catalog for Sharing
  • 12. Discovery – Profiling Data Use • Biggest Challenges We’re Solving: • Reconciling and integrating disparate “Data Silos” into a central location • Identifying duplicative data elements (or attributes) • Laying the foundation for identifying which of the data sources contain the actual “source data” • High Percentage of Business Logic is encapsulated as Programming Logic • Stored Procedures and Trigger code stored in the database • Application Source Code • Extract, Transform, and Load (ETL) Scripts • We need visibility into this logic, and we need to be able to store it somewhere • Tools necessary for this: • DSAuditor and DB Optimizer or Performance Center (to capture live data use) • Source Code Analyzers (I like Fortify SCA, and Embarcadero JBuilder) • Profile ETL using Embarcadero’s MetaWizard (usually convert ETL to XML) • Store metadata in ER/Studio Data Architect’s Data Lineage and Transform Rules Support
  • 13. Profiling Data Use with DBOptimizer
  • 14. Analysis Phase • Step 2 – Analysis • Identifying authoritative sources, discrepancies, and candidates for consolidation • Evaluating Data Flow and Transform Rules • Capturing/Defining Synonyms and Assigning Aliases • Setting the Foundation for Data Governance • Primary Challenges: • Cost – It’s Time Consuming and is a “Team Effort” • Getting ancillary information that teams don’t want to share • Solution: • Start with what’s most important • What’s important should be obvious
  • 15. Analysis Phase • Iteration I: Evaluate ETL for data lineage and transform rules • Start by reverse engineering the ETL, converting it to XML • Incorporate it into the repository • Iteration II: Identify synonymous elements and build alias list • Evaluate data domains and transform rules for issues such as state and use • Enlist database and development staff to identify aliases and tag the data elements in the master catalog • Tools You’ll Need • Data Modeling tools (ER/Studio and MetaWizard) • Repository to manage the metadata byproducts (ER/Studio)
  • 16. Analysis Phase – Evaluating ETL • Biggest Challenges We’re Solving: • Finding which data source is feeding what other data sources • Collecting Data Lineage metadata • Making it accessible to the right team members • Convert the ETL to a form that allows manipulation (such as XML) • Importing the metadata into the data modeling tool • Build, publish and control access to your master data repository • Start gathering and applying metadata tags • Tools necessary for this: • MetaWizard • ER/Studio Data Architect (or the like)
  • 17. Data Lineage and Transform Rules
  • 18. Setting the Foundation for Governance
  • 19. Analysis Phase – Identifying Synonyms • Biggest Challenges We’re Solving: • Identifying like data elements and candidates for consolidation • Building Aliases • Establishing the foundation for Data Governance • Evaluate data nomenclature using tool functions such as Merge and Compare to identify the obvious overlaps • Compare descriptors from database staff • Compare data use and consumption rules derived from tools such as DB Optimizer • Tools necessary for this: • ER/Studio Data Architect (or the like)
  • 20. Performing Analysis With Compare Utility
  • 21. Exporting to Excel for Input into Database
  • 23. Step 3 Building the Repository • Step 3 – Building Metadata Repository • Populating the Repository with the right metadata • Establishing and Controlling Access to the metadata • Performing metadata management • Primary Challenges: • Defining who needs access to what metadata • Establishing the rules of use • Suggestions: • Implement change control and auditing tool • What’s important should be obvious • Understand the value of the metadata on profitability
  • 24. Step 4 Implementing the repository • Step 4 – Implementing the repository • Mapping the metadata to the requisite business processes • Leveraging the metadata to determine candidates for business process re-engineering • Primary Challenges: • Getting the processes down in modeled form • Obtaining Middle Level Management and Senior Leadership buy-in to changes identified by metadata • Suggestions: • Leverage a modeling tool that facilitates data to process mapping (integrated metadata) • Focus on what’s most important to the business; try not to focus on EVERYTHING
  • 25. Step 5 Establishing Data Governance • Step 5 – Establishing Data Governance • All of the above steps lay the foundation for good data governance • Get Senior Leadership to stipulate policy enforcing the rules you’ve derived • Build a Plan and Standardize Iteratively (don’t try to fix everything all at once) • Primary Challenges: • Fundamental Opposition to Change • Maintaining Momentum • Suggestions: • Find a quick kill: tackle the biggest organizational problem you can handle • Focus on what’s most important to the business, and on what drives easily visible ROI
  • 26. Summary • What We Covered: • Defined Master Data and Master Data Management • The 5 Steps for Master Data Management: • Discovery – finding all of the data sources, who they are used by and how they are used • Analysis – identifying authoritative sources, discrepancies, and candidates for consolidation • Design – designing the metadata repository • Implementation – implementing a metadata repository • Establish data governance • Demonstrated how to leverage specific technology to facilitate: • Business Process and Data Modeling • Data Governance and Discovery • Metadata Repository Implementation • Metadata Management
  • 27. Questions and Answers • Tools Discussed: • Nessus • ER/Studio Data Architect / Business Architect and ER/Studio Repository • DBOptimizer • Change Manager • Technologies Discussed: • Building the Data Catalog • Capturing and Storing Metadata • Metadata Analysis • Contact Info: • Ron Lewis, Ron.Lewis@cdotech.com
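
The Discovery and Analysis steps in the transcript (reverse engineering physical schemas into a master data catalog, then identifying synonymous elements across sources) can be illustrated with a small sketch. The deck does this with ER/Studio and its Compare utility; the Python below is not that tooling, only a stand-in that reads schema metadata from SQLite. The source names (`crm`, `billing`), the sample tables, and the helper functions `build_catalog`, `normalize`, and `candidate_synonyms` are all hypothetical, and the name-matching rule is deliberately crude.

```python
import sqlite3
from collections import defaultdict


def build_catalog(sources):
    """Return one catalog row per column found in each source database."""
    catalog = []
    for source_name, conn in sources.items():
        tables = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
        for (table,) in tables:
            # PRAGMA table_info yields (cid, name, type, notnull, dflt_value, pk)
            for _cid, column, col_type, _notnull, _default, _pk in conn.execute(
                f'PRAGMA table_info("{table}")'
            ):
                catalog.append(
                    {"source": source_name, "table": table,
                     "column": column, "type": col_type}
                )
    return catalog


def normalize(name):
    """Crude normalization (an assumption): lowercase and drop separators."""
    return name.lower().replace("_", "").replace("-", "")


def candidate_synonyms(catalog):
    """Group columns whose normalized names match across different sources."""
    groups = defaultdict(list)
    for row in catalog:
        groups[normalize(row["column"])].append(row)
    return {
        key: rows for key, rows in groups.items()
        if len({r["source"] for r in rows}) > 1
    }


def demo():
    # Two hypothetical "data silos" held in memory for illustration only.
    crm = sqlite3.connect(":memory:")
    crm.execute("CREATE TABLE customer (cust_id INTEGER PRIMARY KEY, Email TEXT)")
    billing = sqlite3.connect(":memory:")
    billing.execute("CREATE TABLE account (CustId INTEGER, balance REAL)")
    catalog = build_catalog({"crm": crm, "billing": billing})
    return catalog, candidate_synonyms(catalog)


if __name__ == "__main__":
    catalog, synonyms = demo()
    for row in catalog:
        print(row)
    print(sorted(synonyms))
```

Matching on normalized names only surfaces the obvious overlaps. As the slides note, descriptors from database staff and observed data use are still needed to confirm that two elements are genuinely the same before assigning an alias.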