Unlocking value from data with Data Integration Tools
Phil Watt, Principal Integration Architect, HP Business Intelligence Solutions, EMEA
29/04/2010
Outline
  • Introduction
  • Business drivers – why use a DI tool?
      • the challenge
      • private sector
      • public sector
  • Background and history
  • DI tools timeline
  • Emerging features – and value
  • Governance and Best Practice
  • Selecting a tool for your situation
  • Demonstration
  • Summary – followed by hands on session
About me
  • 19 years big data
  • 10 years Data Integration tools
  • High volume
  • Complex business rules
  • Governance and metadata management
  • Clients include: BSkyB, BT, Barclays/Barclaycard, Centrica, Experian, John Lewis Partnership, Microsoft, a major UK political party
  • Strong focus on pragmatic delivery
  • Best practices
  • Design patterns
  • Tool evaluation, selection and implementation
Scope
Glossary
The challenge
Data warehouse example sizes
Public and academic examples
  • Birmingham City Council
      http://www.experian.co.uk/www/pages/about_us/our_clients/
      http://www.qas.co.uk/company/press/new-experian-software-helps-public-sector-to-enhance-single-citizen-view-projects-503.htm
  • University of Toulouse – academic medical research
      http://www.talend.com/open-source-provider/casestudy/CaseStudy_Academic_Medical_Research_EN.php
Benefits of DI tools
Extract, Transform and Load
  • e.g. CRM or ERP system
  • Hub and spoke
  • Shared DW and ETL server
Extract, Load and Transform
  • e.g. CRM or ERP system
  • Shared DW and ETL server
ETL versus ELT
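The difference between the two patterns can be sketched in a few lines. This is a minimal illustration only, not taken from the slides: it uses Python's built-in sqlite3 as a stand-in for the shared DW server, and the rows, table names and function names are all hypothetical. In the ETL path the cleansing happens in the integration layer before loading; in the ELT path the raw rows are loaded first and the target database does the transformation in SQL.

```python
import sqlite3

# Hypothetical source rows, standing in for an extract from a CRM or ERP system.
SOURCE_ROWS = [
    ("  alice ", "120.50"),
    ("BOB", "80.00"),
    ("alice", "30.25"),
]

SUMMARY_SQL = "SELECT customer, SUM(amount) FROM {table} GROUP BY customer ORDER BY customer"


def run_etl(conn):
    """ETL: transform in the integration layer, then load the clean rows."""
    cleaned = [(name.strip().lower(), float(amount)) for name, amount in SOURCE_ROWS]
    conn.execute("CREATE TABLE sales_etl (customer TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales_etl VALUES (?, ?)", cleaned)
    return conn.execute(SUMMARY_SQL.format(table="sales_etl")).fetchall()


def run_elt(conn):
    """ELT: load the raw rows first, then transform with SQL inside the target."""
    conn.execute("CREATE TABLE sales_raw (customer TEXT, amount TEXT)")
    conn.executemany("INSERT INTO sales_raw VALUES (?, ?)", SOURCE_ROWS)
    conn.execute(
        "CREATE TABLE sales_elt AS "
        "SELECT LOWER(TRIM(customer)) AS customer, CAST(amount AS REAL) AS amount "
        "FROM sales_raw"
    )
    return conn.execute(SUMMARY_SQL.format(table="sales_elt")).fetchall()


conn = sqlite3.connect(":memory:")
etl_result = run_etl(conn)
elt_result = run_elt(conn)
print(etl_result)
print(elt_result)
```

Both paths arrive at the same totals; what changes is where the transformation work runs, which is the trade-off the comparison is about.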
Multiple sources and targets
DI Tools Features Timeline 1995 – 2005
DI Tools Features Timeline from 2006
Market features
Gartner Magic Quadrant
  • Taken from research document, ‘Magic Quadrant for Data Integration Tools’
  • Authors: Ted Friedman, Mark A. Beyer, Eric Thoo
  • Full report available by registering at www.talend.com
  • Image removed for web publication as agreed with Gartner
Magic Quadrant Disclaimer
The Magic Quadrant is copyrighted November 25, 2009 by Gartner, Inc. and is reused with permission. The Magic Quadrant is a graphical representation of a marketplace at and for a specific time period. It depicts Gartner's analysis of how certain vendors measure against criteria for that marketplace, as defined by Gartner. Gartner does not endorse any vendor, product or service depicted in the Magic Quadrant, and does not advise technology users to select only those vendors placed in the "Leaders" quadrant. The Magic Quadrant is intended solely as a research tool, and is not meant to be a specific guide to action. Gartner disclaims all warranties, express or implied, with respect to this research, including any warranties of merchantability or fitness for a particular purpose.
Best practices
Worst Practices
Gartner advice
  • Allocate 20 - 30% to mapping and transformation rules
  • Avoid custom-coding or desktop tools
  • Increase business user involvement to improve success
Source: Best Practices Mitigate Data Migration Risks and Challenges – May 2009
Governance and the data integration lifecycle
Best practices
  • Spend 50% of project time doing discovery, analysis, design
  • Get business users involved early and often
  • Use tools to accelerate and compress timescales
  • Pay attention to governance and metadata
So you can:
  • De-risk the project
  • Reduce overall cost and timescales
  • Achieve best possible quality
Qualification matrix (PW ) 29/04/2010 25
Demonstration 29/04/2010 26
29/04/2010 27

Demo metrics
Performance
  • Hardware – dual core 2.0 GHz Intel Centrino, 2.5 GB RAM
  • Environment – WinXP, Oracle Express (DB) + DI tool (Expressor 2.0)
  • 3 data sources
      Source               Size      Records
      Customers            155 MB    1000K
      Today’s orders       112 MB    100K
      Yesterday's orders   0.3 MB    3K
      Total data volume    267 MB    1.1M
  • Execution time: 72 seconds
  • Throughput: 3.7 MB/sec, 41k/sec
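The throughput arithmetic can be re-derived from the listed volumes and execution time. The short sketch below only re-computes the slide's own figures; the variable names are illustrative. The data-volume rate comes out at the stated 3.7 MB/sec. Dividing the listed record counts by 72 seconds gives roughly 15k records per second, so the slide's 41k/sec figure presumably counts records on a different basis that the slide does not state.

```python
# Figures copied from the demo metrics slide; names are illustrative only.
sources = {
    "customers": {"mb": 155.0, "records": 1_000_000},
    "todays_orders": {"mb": 112.0, "records": 100_000},
    "yesterdays_orders": {"mb": 0.3, "records": 3_000},
}
execution_seconds = 72

total_mb = sum(s["mb"] for s in sources.values())
total_records = sum(s["records"] for s in sources.values())

mb_per_sec = total_mb / execution_seconds
records_per_sec = total_records / execution_seconds

print(f"Total volume: {total_mb:.1f} MB, {total_records} records")
print(f"Throughput: {mb_per_sec:.1f} MB/sec, {records_per_sec:.0f} records/sec")
```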
Demo features
  • Developer Productivity
  • Graphical development
  • Semantic Rationalisation and Re-usable Business Rules
  • Demo represents a generic business scenario
  • XML, message queues (MSMQ), database inputs/outputs, joins, aggregations and referential integrity management
  • Similar features to the ATG/Integrated Basket challenges?
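To make the join, aggregation and referential integrity steps concrete, here is a hand-coded sketch of the kind of logic a DI tool would let a developer build graphically. It is not the demo itself: the customer and order rows, field names and the `integrate` function are hypothetical, chosen only to mirror the customers-plus-orders shape of the demo data.

```python
from collections import defaultdict

# Hypothetical sample data shaped like the demo's customers and orders sources.
customers = {1: "Acme Ltd", 2: "Globex"}
orders = [
    {"order_id": 10, "customer_id": 1, "amount": 25.0},
    {"order_id": 11, "customer_id": 2, "amount": 40.0},
    {"order_id": 12, "customer_id": 1, "amount": 15.0},
    {"order_id": 13, "customer_id": 9, "amount": 99.0},  # no matching customer
]


def integrate(customers, orders):
    """Join orders to customers, total the amounts, and reject orphan orders."""
    totals = defaultdict(float)
    rejected = []
    for order in orders:
        name = customers.get(order["customer_id"])  # join on customer_id
        if name is None:
            # Referential integrity: the order points at an unknown customer.
            rejected.append(order["order_id"])
            continue
        totals[name] += order["amount"]  # aggregation per customer
    return dict(totals), rejected


totals, rejected = integrate(customers, orders)
print(totals)
print(rejected)
```

Routing orphan rows to a reject list rather than dropping them silently keeps the integrity failures visible for later investigation.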
Summary
  • Business drivers – why use a DI tool?
      • the challenge
      • private sector
      • public sector
  • Background and history
  • DI tools timeline
  • Emerging features – and value
  • Governance and Best Practice
  • Selecting a tool for your situation
  • Demonstration
References
  • Curt Monash: http://www.dbms2.com/2009/04/30/ebays-two-enormous-data-warehouses/
  • Wired: http://www.wired.com/wired/archive/12.04/grid.html
  • ZDNet: http://blogs.zdnet.com/storage/?p=213
  • Professor Chris Bishop: http://conferences.theiet.org/lectures/turing/
  • Gartner: http://www.gartner.com
  • LHC data (2007): http://www-conf.slac.stanford.edu/xldb07/xldb_lhc.pdf

Editor's notes

  1. eBay: 2 petabytes and 6.5 petabytes; Facebook: 2.5 petabytes; Wal-Mart: 2.5 petabytes; Yahoo: > 10 petabytes planned; LHC (Large Hadron Collider, year 1): 10 petabytes of data per year; National ID Cards (planned estimate): > 2 terabytes
  2. Many tools have claimed this in the past
  3. Two types: engine-based (Informatica, Ab Initio, expressor, etc.) and code generators (ETI, Talend, etc.)
  4. Databases; different character sets (ASCII, EBCDIC, Unicode); international characters (Unicode); queues; web services (SOAP, WSDL, RPC); XML; ODBC/JDBC
  5. Features listed up to 2004 represent minimum marketable features for new entrants to the marketplace
  6. Describe the value of each. Workflow optimisation is the key driver now. Early tools focussed on selling developer features and strengths around complexity, rather than value to the delivery process.
  7. Almost weekly news of M&A
  8. Example of one analyst business’s view of the DI tools marketplace. Gartner’s Magic Quadrant provides a view of eligible vendors in the marketplace. It indicates that this is a mature market, with considerable global interest and healthy competition. It is also notable that HP, for example, does not have a tool in this space. There may be vendors not in the Magic Quadrant that are worth considering; don’t rule out vendors based on inclusion in or exclusion from this report.
  9. Goes much further than illustrated in this slide. Governance must apply structures to manage the quality of data. Enterprises must incentivise people to maintain and improve data quality. You cannot manage what you can’t measure. Metrics must align to personal objectives.
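Note 4 lists differing character sets among the connectivity concerns. As a rough illustration of why this matters, the sketch below converts EBCDIC-encoded bytes to UTF-8 using Python's standard codecs. The choice of code page `cp037` is an assumption made for the example (real mainframe sources may use other EBCDIC code pages), and the function name and sample strings are hypothetical.

```python
def ebcdic_to_utf8(raw: bytes, codepage: str = "cp037") -> bytes:
    """Decode EBCDIC bytes (assumed code page) and re-encode them as UTF-8."""
    text = raw.decode(codepage)
    return text.encode("utf-8")


# The same text has different byte values in EBCDIC and in ASCII.
ebcdic_bytes = "CUSTOMER 42".encode("cp037")
ascii_bytes = "CUSTOMER 42".encode("ascii")
print(ebcdic_bytes != ascii_bytes)

# After conversion the bytes match the ASCII/UTF-8 form.
utf8_bytes = ebcdic_to_utf8(ebcdic_bytes)
print(utf8_bytes)

# International characters survive the round trip when the code page covers them.
name = "Zoë"
round_tripped = ebcdic_to_utf8(name.encode("cp037")).decode("utf-8")
print(round_tripped)
```

Loading the EBCDIC bytes into a target without this kind of conversion would produce unreadable text, which is why character set handling is a baseline feature for DI tools.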