SlideShare una empresa de Scribd logo
1 de 76
Descargar para leer sin conexión
Agile Data Warehousing	





                Data Vault, What is the buzz about	

                                    	

                             TDWI München	

                              June 18, 2012	

                             Ronald Damhof	





R.D.Damhof
“Our highest priority is to satisfy the
                customer through early and continuous
                     delivery of valuable software”
                                                                   Agile Manifesto, 2001
                       Kent Beck, Mike Beedle, Arie van Bennekum, Alistair Cockburn, Ward Cunningham,
                      Martin Fowler, James Grenning, Jim Highsmith, Andrew Hunt, Ron Jeffries, Jon Kern,
                Brian Marick, Robert C. Martin, Steve Mellor, Ken Schwaber, Jeff Sutherland, Dave Thomas




R.D.Damhof
‘Calculating risk


    Source	

                                    ‘Yield modules’	

    Source	


                                      ‘Customer 
                                    segmentation’	

                ‘Semantic gap’	




R.D.Damhof
Everybody mines their own data	

                	

                Everybody enriches their own data	


                Everybody uses their own data	

                	

                User = Developer	


                With his selfmade tools	

                	

                Data quality determined by the individual	

                	

                It’s a grind – limited reusability	

                	

                Leadtimes unpredictable	

                	

                No management 	

                	



R.D.Damhof
Lets ‘order’ an information product	

                	

                And hire a master/expert	

                	

                Separation between user/developer	

                	

                Developer/expert mines the data	

                	

                The information product = custom made	


                Data quality is mostly dependable on the
                developer/expert	


                Leadtimes unpredictable	


                Still not much reusability	


                	



R.D.Damhof
A central department who knows what
                information you need	


                That assembles information products,
                ready to be used for you	


                ‘I now what you want’ – black	


                Efficiency is the name of the game	


                At least I got something, but it does not
                comply - even remotely - to my needs	


                Even worse; the guild-days are still there
                – the expert is now submerged, but
                needed to get the data you actually need.	

                	

                Introduction of management – you want
                something? Please apply in 3-fold…	

                	

R.D.Damhof
Creating information products, the
                                                                 moment they are asked for	

                                                                 	

                                                                 Against quality criteria which are in line
                                                                 with the expectation of the customer	

                                                                 	

                                                                 Empower the customer with skills and
                                                                 facilities to be more self sufficient	

                                                                 	

                                                                 Minimize ‘data’-stock as much as possible	

                                                                 	

                                                                 Embrace new wishes and changes
                                                                 required by the customer 	

                                                                 	

                                                                 The customer is the most important part
                                                                 of the production process	




                Stephen Denning (2011) – Radical Management	

R.D.Damhof
A modern data management environment:	

                    	

                    	

                    	

                    The ‘Supermarket’	





                         The ‘Restaurant’	





                The ‘Do it yourself buffet’	




R.D.Damhof
R.D.Damhof
Push characteristics	

§  Mass production	

§  Known specifications, operational definitions, standards	

§  Repeatable, predictable,  even better; uniform process	

§  Part of the system that needs statistical control	

§  Inventory allowed/necessary	

§  Supply driven	

§  Reliability over flexibility	

                                         Pull characteristics	

                                         § 
                                           Just in time	

                                         § 
                                           Demand driven	

                                         § 
                                           Build to order	

                                         § 
                                           Preferably no inventory	

                                         § 
                                           Flexibility over Reliability	





 R.D.Damhof
Back to the issue at hand……	



§     What: the ‘production process of data’	

§     Where: Coordination - Local versus central 	

§     How: System Engineering - Systematic vs. Opportunistic	

§     What principles guide us - leading principles 	





R.D.Damhof
Local	
  vs	
  Central	
  deployment	
  

Informa.on	
  Delivery	
  Proces	
  
                                                                                                                  Recipient	
  




                                                     Informa.on	
  Delivery	
  process	
  
 4.	
  Generate	
  Informa.on	
  products	
                                                          End-­‐user	
  (Local)	
  




                                                                                                                                                           Data	
  	
  func.on	
  service	
  
                                                                                             4	
        4	
             4	
                4	
     4	
  
     3.	
  Enrich	
  and	
  cleanse	
  data	
  
                                                                                             3	
        3	
             3	
                3	
     3	
  

      2.	
  Register	
  	
  Standardize	
  
                                                                                             2	
        2	
             2	
                2	
     2	
  

                                                                                             1	
        1	
         1	
        1	
       1	
  
      1.	
  Get	
  the	
  raw	
  uncut	
  data	
  
                                                                                                                Generic	
  proces	
  (Central)	
  

                                                                                                             Data	
  sources	
  
                                                                                                         (internal	
  	
  external)	
  




 R.D.Damhof
System Engineering - Systematic vs. Opportunistic	


                    Manoeuvrability
                (opportunistic approach)
                                                           Ad-hoc development proces
                                     Selfservice	

                 Developer=user
                                    Development	

     Self-sufficient/ great degree of freedom
                                                                    Very broad tasks


                                                           Lightweight development process
                                     Delegated 	

     Minimum of specialisation/ distinction of roles
                                    Development	

            Self-sufficient/ limited freedom



                                                        Development line discipline (OTAP)
                                                         Developers at a distance from users
                                   IT Development	

                                                        Mutually dependent/ within frameworks
                                                            Heavy separation of function
               Sustainability
           (Systematic approach)


R.D.Damhof
Leading principles	

                                                                   Compliant
                         Adaptible	



                                        Sustainable	





                                                          Decoupled	


                Centralized	

                                                               Standardized	

                                                                     
                                  Effective	

                 Industrialized	





R.D.Damhof
1	
                              2	
                               3	
                       4	
  
                        Company	
  xxx	
  data	
  management	
  Domain	
  




                                Source	
  store	
  
                                                                                                                BI	
  apps
                                                                                                                Reports	
  
                                                                                                                             	
  



                                                                                 Business	
  View	
  
          Sources	
  
                                                                                                               BI	
  Apps	
  
                                                                                                               Analysis	
  


                                                            Enterprise	
  	
  
                                                         Data	
  Warehouse	
  

                                                                                                              BI	
  Apps
                                                                                                              Ad-­‐hoc	
  
                                                                                                                          	
  

                                                      Data,	
  ‘What’	
                                    Func.on,	
  ‘How’	
       ‘Where’,	
  ‘Whom’	
  


                                                                                                                                                  15	
  
R.D.Damhof
Source	
  to	
     Sourcestore	
  to	
   Sourcestore	
  to	
  
                                                                                      EDW	
  (DV)	
  
                       product	
          product	
             BV	
  


    Adaptable	
  



    Sustainable	
  



    Compliant	
  


    Decoupled	
  



    Effec.ve	
  



    Standardized	
  



     Centralized	
  
                                                                                                        16	
  
R.D.Damhof
1	
                             2	
                       3	
                                    4	
  
                        Company	
  xxx	
  data	
  warehouse	
  	
  Business	
  Intelligence	
  	
  Domain	
  




                                 Source	
  store	
  
                                                                                                                     BI	
  apps
                                                                                                                     Reports	
  
                                                                                                                                  	
  



                                                                                    Business	
  View,	
  	
  
                                                                                    Data	
  feeds	
  
          Sources	
  
                                                                                                                    BI	
  Apps	
  
                                                                                                                    Analysis	
  


                                                             Enterprise	
  	
  
                                                          Data	
  Warehouse	
  

                                                                                                                   BI	
  Apps
                                                                                                                   Ad-­‐hoc	
  
                                                                                                                               	
  

                                                       Data,	
  ‘What’	
                                        Func.on,	
  ‘How’	
       ‘Where’,	
  ‘Whom’	
  


                                                                                                                                                       17	
  
R.D.Damhof
Administra.ve	
  process	
                     Informa.on	
  Delivery	
  Process	
                                         Decision-­‐	
  	
  control	
  

                                                                                             Generate	
              Data	
  	
  Informa.on	
  recipients	
  
                                                                                             Distribute	
  
                                                                             Enrich	
  
                                                    Register	
  	
  
                                                   Standardize	
  
Proces	
                              AXain	
  




                                                                        Why                                                               PDCA	
  

                                                                        DV?	

                                               Compliance	
  repor.ng	
  
                                                                                            Informa.on	
  
                                                                                              products	
  
                                                                                                                               Risk	
  Management	
  
        Push	
  
          Systems	
                               DV	
  based	
                                                   Pull	
  
        (internal	
  	
                            Data	
  	
                                                                     Performance	
  
         external)	
                              Warehouse	
                                                                      Management	
  

                                                                        Business	
                                                  Supply	
  chain	
  
                                   Staging	
                            rules	
                                                     op.miza.on	
  
                                                                         Push	
            Data	
  products	
  
                                                                                                                                 Fraud	
  detec.on	
  

                                                                                                                                  Market	
  basket	
  
                                                                                                                                    analysis	
  

                                                            Control	
  /	
  Metadata	
                                                                      18	
  
  R.D.Damhof
Metamodel	
  driven	
  automa.on	
  
     -­‐ Models	
  (process,	
  rules	
  and	
  data)	
  determine	
  the	
  metadata,	
  the	
  metadata	
  determines	
  the	
  automa.on	
  ar.facts	
  
     -­‐ Aim	
  is	
  to	
  be	
  100%	
  declara.ve	
  
     -­‐ It	
  can	
  not	
  be	
  generated	
  all,	
  specific	
  tailored	
  metadata	
  will	
  remain	
  necessary	
  


                                            Metadata	
  driven	
  automa.on	
  
                                            -­‐	
  Inputs:	
  Source	
  model(s),	
  target	
  model,	
  Template	
  Design,	
  Naming	
  conven.ons	
  
                                            -­‐	
  Advanced	
  inputs:	
  Normaliza.on	
  preferences,	
  Ontologies	
  

                                            Taken	
  from	
  Dan	
  Linstedt’s	
  blog	
  post:	
  hXp://danlinstedt.com/datavaultcat/code-­‐genera.on-­‐for-­‐data-­‐vault-­‐not-­‐as-­‐easy-­‐as-­‐you-­‐think/	
  

                                                            Data	
  Vault	
  
                                                         implementa.ons	
     Template	
  driven	
  automa.on	
  
                                                                                                   -­‐ In	
  the	
  most	
  basic	
  forms;	
  documenta.on	
  	
  -­‐	
  describing	
  a	
  paXern	
  
                                                                                                   -­‐ More	
  advanced;	
  genera.ng	
  XML	
  code	
  for	
  2nd	
  gen.	
  ETL	
  tooling	
  
                                                                                                   -­‐ Vb	
  -­‐	
  hXp://www.grundsatzlich-­‐it.nl/bi-­‐tools-­‐templator.html	
  




                                                                                                                                                                                                                        19	
  
R.D.Damhof
My PoV about (Data Vault) automation Tooling	


      §  Generation is an aid, not a goal in itself	

           Do not accommodate the principles to fit the tool....	

           Look for decoupling	

            	

      §  Truly understand the mechanics - handcraft it first!	

           Invest in proper education and learning	

           Invest in getting ready time	

           Involve your customers from the start	

            	

      §  PoC, PoC, PoC	


      §  Deliver, Deliver, Deliver	

                                                                     20	
  
R.D.Damhof
Agility  Data Vault (1)	



          Why is it that you can build and deploy extremely
          small particles in Data Vault and not in other
          approaches, without having an increase in the
          overhead and coordination of these particles? In
          other words; 'Divide and Conquer to beat the
          Size / Complexity Dynamic’	





R.D.Damhof
Agility  Data Vault (2)	



         Why is it that you can re-engineer your existing model
         and guarantee that the changes remain local?
         Something that is hugely beneficial in data warehouses
         that - by definition - grow over time.	





R.D.Damhof
Agility  Data Vault (3)	



      Why is it that - as your (Data Vault based) data
      warehouse grows - your costs grow ‘merely’ in linear
      fashion initially, and as you approach the end state
      marginal growth in cost decreases exponentially.	





R.D.Damhof
Data Vault as-such is not Agile, it is the development
                 process that needs to be agile, DV merely supports
                            the agile development process.   	

                                          	



                             “Our highest priority is to satisfy the
                           customer through early and continuous
                                delivery of valuable software”

                                                                             Agile Manifesto, 2001
                                 Kent Beck, Mike Beedle, Arie van Bennekum, Alistair Cockburn, Ward Cunningham,
                                Martin Fowler, James Grenning, Jim Highsmith, Andrew Hunt, Ron Jeffries, Jon Kern,
                          Brian Marick, Robert C. Martin, Steve Mellor, Ken Schwaber, Jeff Sutherland, Dave Thomas




R.D.Damhof
Data Model Time Line	

 	

                                           Historic Overview




                © (Linstedt, Graziano,  Hultgren, The New Business Supermodel, The Business of Data Vault Modeling, 2008, p. 36) 	





    §  Created By Dan Linstedt	

    §  Released in 2000	

    §  Formally Introduced in the Netherlands in 2007	

           §  First DV Book: The Business of Data Vault Modeling 2008
                                                                     	

    §  First (Dutch) User group in 2010	

    §  Technical book from Dan Linstedt in 2011	



R.D.Damhof
Application
                Architecture   	




R.D.Damhof
Top Down Approach
                                	





R.D.Damhof
Bottom Up Approach
                                 	





R.D.Damhof
Bottom Up Approach
                                 	





R.D.Damhof
Bottom Up Approach
                                 	





R.D.Damhof
Bottom Up Approach
                                 	





R.D.Damhof
Irony
                    	





R.D.Damhof
Hybrid Approach (Data Vault)
                                           	





R.D.Damhof
R.D.Damhof
Kimball or Inmon ETL
                                                      	

                                   -  Complex ETL	

                                   -  Truth oriented	

                                   -  Business Rules before EDW	





ETL/Load Architecture
                    	

-  100% of the data (within
   scope) 100% of the time	

-  Source driven /Auditable: 	

-  “Fact Oriented”	

-  Template/metadata driven	

-  No Business Rules	





R.D.Damhof	

                                   Pictures: Genesee Academy ©
Classic Data Vault Application Architecture	


        Business	
  
       Transac.on	
  
         System	
  	
  
                                                                                                       Staging	
  
                                                               Data	
  Vault	
                                                 Datasets	
  
                                                                                                         Out	
  
        Business	
  
       Transac.on	
                                            Generic	
  Business	
  Rules	
  
         System	
  	
  
                                                                                   Rule	
  Vault	
  


                            Structure	
  transforma.on	
                                  Business	
  rule	
  execu.on	
  
                            Hub	
  =	
  business	
  keys	
                                Structure	
  and	
  value	
  transforma.on	
  




Adaptable	
       Sustainable	
           Compliant	
              Decoupled	
                Effec.veness	
               Standardized	
   Centralized	
  


                          ?	
                                                                                                      ?	
  
                                                                                                                                                   36	
  
  R.D.Damhof
Data Vault Application Architecture	



      §        Central EDW	

      §        Business rules downstream 	

      §        Incremental/Non destructive Loading	

      §        100% of the data (within scope) 100% of the time	

      §        Auditable/Partly source driven	





R.D.Damhof
Modeling
                       	




R.D.Damhof
R.D.Damhof
R.D.Damhof
R.D.Damhof
R.D.Damhof	

   Pictures: Genesee Academy ©
R.D.Damhof	

   Pictures: Genesee Academy ©
R.D.Damhof	

   Pictures: Genesee Academy ©
R.D.Damhof	

   Pictures: Genesee Academy ©
Data Vault Constructs	





R.D.Damhof	

                              Pictures: Genesee Academy ©
Data Vault Constructs	





R.D.Damhof	

                              Pictures: Genesee Academy ©
Data Vault Constructs	





R.D.Damhof	

                              Pictures: Genesee Academy ©
Core Components
                              	




R.D.Damhof
Data Vault Core Components
                                         	





R.D.Damhof	

                                  Pictures: Genesee Academy ©
Data Vault Core Components
                                         	





R.D.Damhof	

                                  Pictures: Genesee Academy ©
Hubs
                   	





R.D.Damhof	

            Pictures: Genesee Academy ©
Hubs
                   	





R.D.Damhof	

            Pictures: Genesee Academy ©
Hubs
                   	





R.D.Damhof	

            Pictures: Genesee Academy ©
Satellites
                         	





R.D.Damhof	

                  Pictures: Genesee Academy ©
Satellites
                         	





R.D.Damhof	

                  Pictures: Genesee Academy ©
Links
                    	





R.D.Damhof	

             Pictures: Genesee Academy ©
Links
                    	





R.D.Damhof	

             Pictures: Genesee Academy ©
Loading
                      	




R.D.Damhof
HUB load
                       	





R.D.Damhof	

                Pictures: Genesee Academy ©
HUB load
                                        	





INSERT INTO 	

customer_hub (cust#,load_dts,record_src)	

SELECT source.customer#, @load_dts, @record_src	

FROM source_customer AS source 	

  	

WHERE 	

NOT EXISTS 	

  	

(SELECT * FROM customer_hub AS hub WHERE hub.customer#=source.customer#)	



R.D.Damhof	

                                                         Pictures: Genesee Academy ©
Link Load	





                Loading a Link
                             	





R.D.Damhof	

                      Pictures: Genesee Academy ©
Link Load	





    INSERT INTO 	

    custcontact_link(cust_id,contact_id,load_dts, record_src)	

    SELECT source.customer#, @load_dts, @record_src	

            	

FROM source_table AS source 	

    INNER JOIN contact_hub AS contact ON 	

            	

       	

   	

   	

contact. contact#= source.contact#	

    INNER JOIN customer_hub AS cust ON                   	

   	

    	

    	

   	

          	

            	

cust. customer#= source.customer#	

    WHERE 	

    NOT EXISTS 	

    (SELECT * FROM custcontact_link AS link WHERE 	

    link. contact_id= contact.id and link.cust_id= cust.id)	

R.D.Damhof	

                                                                            Pictures: Genesee Academy ©
Satellite Load
                               	





                Loading a Satellite
                                  	





R.D.Damhof	

                           Pictures: Genesee Academy ©
Satellite Load
                                                   	





INSERT INTO 	

customer_sat (hub_id,load_dts, name,record_src)	

SELECT 	

  	

hub.id, @load_dts, source.cust_name, ,@record_src	

FROM source_customer AS source 	

INNER JOIN customer_hub AS hub ON               	

    	

   	

   	

   	

   	

        	

     cust.customer#= source.customer# #	

INNER JOIN customer_sat AS sat ON               	

	

  	

 sat.id= hub.id# AND sat “Is most recent” AND 	

sat.name  source.name	


 R.D.Damhof	

                                                                 Pictures: Genesee Academy ©
Data Vault Loading Paradigm	





R.D.Damhof	

                                    Pictures: Genesee Academy ©
Top 10 Rules for Data Vault Modeling
                                                   	





R.D.Damhof	

                                            Pictures: Genesee Academy ©
Agility  Data Vault - recap (1)	

                Why is it that you can build and deploy extremely small particles in
                Data Vault and not in other approaches, without having an increase
                in the overhead and coordination of these particles? In other
                words; 'Divide and Conquer to beat the Size / Complexity Dynamic’	




                            Why is it that you can re-engineer your existing
                            model and guarantee that the changes remain local?
                            Something that is hugely beneficial in data
                            warehouses that - by definition - grow over time.	




    Why is it that - as your (Data Vault based) data
    warehouse grows - your costs grow ‘merely’ in linear
    fashion initially, and as you approach the end state
    marginal growth in cost decreases exponentially.	


R.D.Damhof
Agility  Data Vault - recap (2)	


                Remember the Push characteristics	

                ➡  Mass production	

                                           Data Vault	



                ➡  Known specifications, operational definitions, standards	

    Data Vault	


                ➡  Repeatable, predictable,  even better; uniform process	

   Data Vault	


                ➡  Part of the system that needs statistical control	

         Data Vault	


                ➡  Inventory allowed/necessary	

                               Data Vault	


                ➡  Mainly supply driven	

                                      Data Vault	


                ➡  Reliability over flexibility	

                               Data Vault	




   Automation of a Data Vault ‘production process’ is just common sense	


R.D.Damhof
Bonus Slides
                                                    	

                Forks and mutations in DV ‘evolution’




R.D.Damhof
Type 1 - Classic Data Vault	


        Business	
  
       Transac.on	
  
         System	
  	
  
                                                                                                       Staging	
  
                                                               Data	
  Vault	
                                                 Datasets	
  
                                                                                                         Out	
  
        Business	
  
       Transac.on	
                                            Generic	
  Business	
  Rules	
  
         System	
  	
  
                                                                                   Rule	
  Vault	
  


                            Structure	
  transforma.on	
                                  Business	
  rule	
  execu.on	
  
                            Hub	
  =	
  business	
  keys	
                                Structure	
  and	
  value	
  transforma.on	
  




Adaptable	
       Sustainable	
           Compliant	
              Decoupled	
                Effec.veness	
               Standardized	
   Centralized	
  


                          ?	
                                                                                                      ?	
  
                                                                                                                                                   71	
  
  R.D.Damhof
Type 2 - Source Data Vault	

              Business	
  
             Transac.on	
                          Staging	
  Vault	
  
               System	
  	
  
                                                                                                 Business	
  	
                    Data	
  Marts	
  
                                                                                                Data	
  Vault	
  
              Business	
  
             Transac.on	
                          Staging	
  Vault	
  
               System	
  	
  


                     Structure	
  transforma.on	
                         Business	
  rule	
  execu.on	
            Structure	
  transforma.on	
  
                     No	
  integra.on,	
  Hub=surrogate	
  keys	
         Integra.on	
  
                     Persis.ng	
  staging	
  in	
  DV	
  format	
         DV	
  modelled	
  	
  




Adaptable	
       Sustainable	
           Compliant	
                 Decoupled	
           Effec.veness	
             Standardized	
   Centralized	
  


     ?	
                ?	
                                                 ?	
  
                                                                                                                                                       72	
  
  R.D.Damhof
Source	
  


                Source	
  



                                                   	
  100%	
  Seman.c	
  gap	
  


                Source	
                Staging	
  DV	
  
                                                                             Business	
  DV	
  
                Source	
                Staging	
  DV	
  




                                                                   100%	
  Seman.c	
  gap	
  



                     S.ll	
  the	
  source	
  

                                                    Integra.on,	
  cleansing,	
  consolida.on	
  
                                                    Business	
  rule	
  execu.on	
  upstream	
  ??	
  
                                                    DV	
  modelled	
  	
  
                                                                                                         73	
  
R.D.Damhof
Source	
  


                Source	
  



                                                 	
  100%	
  Seman.c	
  gap	
  


                Source	
   Source	
  Staging	
  DV	
  
                                                                       Business	
  DV	
  
                                                                     Data	
  Warehouse	
  
                Source	
   Source	
  Staging	
  DV	
  




                                                                100%	
  Seman.c	
  gap	
  



                     S.ll	
  the	
  source	
  

                                                 Integra.on,	
  cleansing,	
  consolida.on	
  
                                                 Business	
  rule	
  execu.on	
  upstream	
  ??	
  
                                                 DV	
  modelled	
  	
  
                                                                                                      74	
  
R.D.Damhof
Wanna know more?
                               	

      §  Training  certification: www.geneseeacademy.com	


      §  Books: ‘Super Charge Your Data Warehouse: Invaluable Data
          Modeling Rules to Implement Your Data Vault’ – D.Linstedt /
          K.Graziano	


      §  Linkedin: Data Vault Discussions (approx. 800 members)	


      §  Niche non-commercial conferences; www.dwhautomation.com	


      §  Many blogs, articles, presentations on the World Wide Web	


      §  The best way to learn; try it, make some code, experience, engage	




R.D.Damhof
Thank You	

                Drs.	
  Ronald	
  D.	
  Damhof	
  

                Blog	
                                                          hXp://prudenza.typepad.com/                                  	
  
                                                                                hXp://www.b-­‐eye-­‐network.com/blogs/damhof/	
  	
  
                Linkedin	
                                                      hXp://nl.linkedin.com/in/ronalddamhof	
  

                Email	
                                                         ronald.damhof@prudenza.nl	
  

                TwiXer	
                                                        RonaldDamhof	
  

                Skype	
                                                         Ronald.Damhof	
  

                Mobile	
                                                        +31(0)6	
  269	
  67	
  184	
  

                Others	
                                                        Informa.on	
  Quality	
  Cer.fied	
  Professional	
  (IQCP)	
  
                                                                                Data	
  Vault	
  Cer.fied	
  Grand	
  Master           	
  
                                                                                Cer.fied	
  Scrum	
  Master           	
  
                                                                                Member	
  of	
  the	
  Boulder	
  BI	
  Brain	
  Trust	
  (#BBBT)	
  

                Ronald	
  Damhof	
  is	
  an	
  independent	
  prac..oner	
  in	
  the	
  field	
  of	
  data	
  management	
  and	
  decision	
  support.	
  Graduated	
  in	
  1995	
  in	
  the	
  
                study	
  of	
  Economics.	
  Since	
  1995	
  he	
  worked	
  as	
  a	
  prac..oner	
  into	
  the	
  field	
  of	
  Informa.on	
  Management	
  with	
  a	
  focus	
  on	
  decision	
  
                support	
  and	
  data	
  management,	
  trying	
  hard	
  to	
  enhance	
  the	
  rigor	
  and	
  relevance	
  in	
  these	
  fields	
  by	
  combining	
  scien.fic	
  research	
  
                with	
  the	
  everyday	
  challenges	
  of	
  the	
  prac..oner.	
  Ronald	
  is	
  mainly	
  hired	
  by	
  customers	
  in	
  the	
  role	
  of	
  business/IT	
  architect,	
  
                auditor,	
  coach	
  	
  trainer.	
  He	
  blogs	
  on	
  B-­‐Eye-­‐Network.com	
  as	
  well	
  as	
  his	
  own	
  blog,	
  is	
  a	
  member	
  of	
  the	
  pres.gious	
  BBBT,	
  wrote	
  
                several	
  ar.cles	
  regarding	
  decision	
  support	
  architectures	
  and	
  is	
  a	
  researcher	
  in	
  the	
  field	
  of	
  Informa.on	
  Management.	
  	
  
                	
  
                Although	
  Ronald	
  likes	
  to	
  work	
  with	
  theore.cal	
  grounded	
  research	
  and	
  proven	
  prac.ces,	
  Ronald	
  is	
  not	
  a	
  'white	
  paper'	
  architect;	
  
                put	
  your	
  money	
  where	
  your	
  mouth	
  is,	
  is	
  his	
  moXo.	
  He	
  likes	
  to	
  see	
  architectures	
  'live'	
  in	
  enterprises,	
  not	
  just	
  write	
  about	
  it.	
  In	
  
                most	
  organiza.ons	
  his	
  role	
  extends	
  architecture	
  onen.	
  In	
  truely	
  agile	
  spirit	
  the	
  roles	
  he	
  plays	
  depend	
  on	
  the	
  context	
  of	
  the	
  
                client;	
  he	
  can	
  be	
  a	
  missionary	
  (selling	
  the	
  value),	
  a	
  project	
  manager	
  (geong	
  it	
  done),	
  a	
  scrum	
  master	
  (removing	
  impediments),	
  
                specialist	
  (educa.ng	
  hardware	
  peeps,	
  data	
  architects,	
  data	
  logis.cs	
  etc.)	
  or	
  a	
  leader.	
  




                                                                                                                                                                                                                             76	
  
R.D.Damhof

Más contenido relacionado

La actualidad más candente

Print and beyond insights - Transforming transactions to engage customers
Print and beyond insights - Transforming transactions to engage customersPrint and beyond insights - Transforming transactions to engage customers
Print and beyond insights - Transforming transactions to engage customersRoger Christiansen
 
IGC Solutions for IBM ECM
IGC Solutions for IBM ECMIGC Solutions for IBM ECM
IGC Solutions for IBM ECMkaterogersbrown
 
It infrastructure cost reduction vision v5 customer
It infrastructure cost reduction vision v5   customerIt infrastructure cost reduction vision v5   customer
It infrastructure cost reduction vision v5 customerddeschenes99
 
Novell Success Stories: Collaboration in Travel and Hospitality
Novell Success Stories: Collaboration in Travel and HospitalityNovell Success Stories: Collaboration in Travel and Hospitality
Novell Success Stories: Collaboration in Travel and HospitalityNovell
 
Congress 2012: Enterprise Cloud Adoption – an Evolution from Infrastructure ...
Congress 2012:  Enterprise Cloud Adoption – an Evolution from Infrastructure ...Congress 2012:  Enterprise Cloud Adoption – an Evolution from Infrastructure ...
Congress 2012: Enterprise Cloud Adoption – an Evolution from Infrastructure ...eurocloud
 
1st day 3 - agility vs risk
1st day   3 - agility vs risk1st day   3 - agility vs risk
1st day 3 - agility vs riskLilian Schaffer
 
IBM Smart Business Desktop Cloud - How to optimise the ROI from your desktop ...
IBM Smart Business Desktop Cloud - How to optimise the ROI from your desktop ...IBM Smart Business Desktop Cloud - How to optimise the ROI from your desktop ...
IBM Smart Business Desktop Cloud - How to optimise the ROI from your desktop ...Vincent Kwon
 
L+L Printing Company Customer Profile
L+L Printing Company Customer ProfileL+L Printing Company Customer Profile
L+L Printing Company Customer ProfileSmartDraw Software
 
Desktop Transformation Success - The 5 Secrets to Delivering User Satisfactio...
Desktop Transformation Success - The 5 Secrets to Delivering User Satisfactio...Desktop Transformation Success - The 5 Secrets to Delivering User Satisfactio...
Desktop Transformation Success - The 5 Secrets to Delivering User Satisfactio...eG Innovations
 
Ergo - Managed Desktop Services Brochure
Ergo - Managed Desktop Services BrochureErgo - Managed Desktop Services Brochure
Ergo - Managed Desktop Services Brochureffurlong
 
Business Infrastructure & IT Pain points
Business Infrastructure & IT Pain pointsBusiness Infrastructure & IT Pain points
Business Infrastructure & IT Pain pointsRichardson Eyres
 
Video Conferencing
Video ConferencingVideo Conferencing
Video ConferencingVideoguy
 
Electronic Software Delivery at IOM
Electronic Software Delivery at IOMElectronic Software Delivery at IOM
Electronic Software Delivery at IOMFlexera
 
Newgen Solutions for Telecom
Newgen Solutions for TelecomNewgen Solutions for Telecom
Newgen Solutions for Telecomnewgenpartners
 
Smart Solutions Presentation 2009
Smart Solutions Presentation 2009Smart Solutions Presentation 2009
Smart Solutions Presentation 2009bmagown
 

La actualidad más candente (20)

Print and beyond insights - Transforming transactions to engage customers
Print and beyond insights - Transforming transactions to engage customersPrint and beyond insights - Transforming transactions to engage customers
Print and beyond insights - Transforming transactions to engage customers
 
IGC Solutions for IBM ECM
IGC Solutions for IBM ECMIGC Solutions for IBM ECM
IGC Solutions for IBM ECM
 
It infrastructure cost reduction vision v5 customer
It infrastructure cost reduction vision v5   customerIt infrastructure cost reduction vision v5   customer
It infrastructure cost reduction vision v5 customer
 
Novell Success Stories: Collaboration in Travel and Hospitality
Novell Success Stories: Collaboration in Travel and HospitalityNovell Success Stories: Collaboration in Travel and Hospitality
Novell Success Stories: Collaboration in Travel and Hospitality
 
121211 improve your productivity
121211 improve your productivity121211 improve your productivity
121211 improve your productivity
 
Congress 2012: Enterprise Cloud Adoption – an Evolution from Infrastructure ...
Congress 2012:  Enterprise Cloud Adoption – an Evolution from Infrastructure ...Congress 2012:  Enterprise Cloud Adoption – an Evolution from Infrastructure ...
Congress 2012: Enterprise Cloud Adoption – an Evolution from Infrastructure ...
 
variable data publishing
variable data publishingvariable data publishing
variable data publishing
 
1st day 3 - agility vs risk
1st day   3 - agility vs risk1st day   3 - agility vs risk
1st day 3 - agility vs risk
 
IBM Smart Business Desktop Cloud - How to optimise the ROI from your desktop ...
IBM Smart Business Desktop Cloud - How to optimise the ROI from your desktop ...IBM Smart Business Desktop Cloud - How to optimise the ROI from your desktop ...
IBM Smart Business Desktop Cloud - How to optimise the ROI from your desktop ...
 
L+L Printing Company Customer Profile
L+L Printing Company Customer ProfileL+L Printing Company Customer Profile
L+L Printing Company Customer Profile
 
Desktop Transformation Success - The 5 Secrets to Delivering User Satisfactio...
Desktop Transformation Success - The 5 Secrets to Delivering User Satisfactio...Desktop Transformation Success - The 5 Secrets to Delivering User Satisfactio...
Desktop Transformation Success - The 5 Secrets to Delivering User Satisfactio...
 
Ergo - Managed Desktop Services Brochure
Ergo - Managed Desktop Services BrochureErgo - Managed Desktop Services Brochure
Ergo - Managed Desktop Services Brochure
 
Business Infrastructure & IT Pain points
Business Infrastructure & IT Pain pointsBusiness Infrastructure & IT Pain points
Business Infrastructure & IT Pain points
 
Video Conferencing
Video ConferencingVideo Conferencing
Video Conferencing
 
SAP on Cloud - An Innovation from Wharfedale Technologies
SAP on Cloud - An Innovation from Wharfedale TechnologiesSAP on Cloud - An Innovation from Wharfedale Technologies
SAP on Cloud - An Innovation from Wharfedale Technologies
 
Electronic Software Delivery at IOM
Electronic Software Delivery at IOMElectronic Software Delivery at IOM
Electronic Software Delivery at IOM
 
E dms
E dmsE dms
E dms
 
Data protection in cloud
Data protection in cloudData protection in cloud
Data protection in cloud
 
Newgen Solutions for Telecom
Newgen Solutions for TelecomNewgen Solutions for Telecom
Newgen Solutions for Telecom
 
Smart Solutions Presentation 2009
Smart Solutions Presentation 2009Smart Solutions Presentation 2009
Smart Solutions Presentation 2009
 

Destacado

Sas insight sessie data management - Data Quadrant Model
Sas insight sessie data management - Data Quadrant ModelSas insight sessie data management - Data Quadrant Model
Sas insight sessie data management - Data Quadrant ModelPrudenza B.V
 
Tdwi solution spotlight presentation slides
Tdwi solution spotlight   presentation slidesTdwi solution spotlight   presentation slides
Tdwi solution spotlight presentation slidesWilliam Lam
 
TDWI Roundtable: The HANA EDW
TDWI Roundtable: The HANA EDWTDWI Roundtable: The HANA EDW
TDWI Roundtable: The HANA EDWukc4
 
Эволюция Big Data и Information Management. Reference Architecture.
Эволюция Big Data и Information Management. Reference Architecture.Эволюция Big Data и Information Management. Reference Architecture.
Эволюция Big Data и Information Management. Reference Architecture.Andrey Akulov
 
Which data should you move to Hadoop?
Which data should you move to Hadoop?Which data should you move to Hadoop?
Which data should you move to Hadoop?Attunity
 
Executive BI, Analytics, Modeling and Insights Strategy Framework Practices
Executive BI, Analytics, Modeling and Insights Strategy Framework PracticesExecutive BI, Analytics, Modeling and Insights Strategy Framework Practices
Executive BI, Analytics, Modeling and Insights Strategy Framework PracticesInsightSlides
 
Gartner: The BI, Analytics and Performance Management Framework
Gartner: The BI, Analytics and Performance Management FrameworkGartner: The BI, Analytics and Performance Management Framework
Gartner: The BI, Analytics and Performance Management FrameworkGartner
 
Tdwi march 2015 presentation
Tdwi march 2015 presentationTdwi march 2015 presentation
Tdwi march 2015 presentationAlison Macfie
 

Destacado (9)

Sas insight sessie data management - Data Quadrant Model
Sas insight sessie data management - Data Quadrant ModelSas insight sessie data management - Data Quadrant Model
Sas insight sessie data management - Data Quadrant Model
 
Tdwi solution spotlight presentation slides
Tdwi solution spotlight   presentation slidesTdwi solution spotlight   presentation slides
Tdwi solution spotlight presentation slides
 
TDWI Roundtable: The HANA EDW
TDWI Roundtable: The HANA EDWTDWI Roundtable: The HANA EDW
TDWI Roundtable: The HANA EDW
 
Эволюция Big Data и Information Management. Reference Architecture.
Эволюция Big Data и Information Management. Reference Architecture.Эволюция Big Data и Information Management. Reference Architecture.
Эволюция Big Data и Information Management. Reference Architecture.
 
Which data should you move to Hadoop?
Which data should you move to Hadoop?Which data should you move to Hadoop?
Which data should you move to Hadoop?
 
Executive BI, Analytics, Modeling and Insights Strategy Framework Practices
Executive BI, Analytics, Modeling and Insights Strategy Framework PracticesExecutive BI, Analytics, Modeling and Insights Strategy Framework Practices
Executive BI, Analytics, Modeling and Insights Strategy Framework Practices
 
Going MAD: A Framework For Delivering Pervasive BI Solutions
Going MAD: A Framework For Delivering Pervasive BI SolutionsGoing MAD: A Framework For Delivering Pervasive BI Solutions
Going MAD: A Framework For Delivering Pervasive BI Solutions
 
Gartner: The BI, Analytics and Performance Management Framework
Gartner: The BI, Analytics and Performance Management FrameworkGartner: The BI, Analytics and Performance Management Framework
Gartner: The BI, Analytics and Performance Management Framework
 
Tdwi march 2015 presentation
Tdwi march 2015 presentationTdwi march 2015 presentation
Tdwi march 2015 presentation
 

Similar a Tdwi agile data warehouse - dv, what is the buzz about

Peter Coffee CIO Forum 20100406
Peter Coffee CIO Forum 20100406Peter Coffee CIO Forum 20100406
Peter Coffee CIO Forum 20100406Peter Coffee
 
Aberdeen ppt-iam integrated-db-06 20120412
Aberdeen ppt-iam integrated-db-06 20120412Aberdeen ppt-iam integrated-db-06 20120412
Aberdeen ppt-iam integrated-db-06 20120412OracleIDM
 
Top 10 Data Center Success Criteria
Top 10 Data Center Success CriteriaTop 10 Data Center Success Criteria
Top 10 Data Center Success CriteriaInternap
 
Webinar- Simple and Cost-Effective Disaster Recovery in the Cloud - 7-19-12
Webinar- Simple and Cost-Effective Disaster Recovery in the Cloud - 7-19-12Webinar- Simple and Cost-Effective Disaster Recovery in the Cloud - 7-19-12
Webinar- Simple and Cost-Effective Disaster Recovery in the Cloud - 7-19-12peak10marketing
 
Cloudy with a chance of downtime
Cloudy with a chance of downtimeCloudy with a chance of downtime
Cloudy with a chance of downtimeAFCOM
 
IT Compliance and Governance with DLP Controls and Vulnerability Scanning Sof...
IT Compliance and Governance with DLP Controls and Vulnerability Scanning Sof...IT Compliance and Governance with DLP Controls and Vulnerability Scanning Sof...
IT Compliance and Governance with DLP Controls and Vulnerability Scanning Sof...Skoda Minotti
 
4 Ways To Save Big Money in Your Data Center and Private Cloud
4 Ways To Save Big Money in Your Data Center and Private Cloud4 Ways To Save Big Money in Your Data Center and Private Cloud
4 Ways To Save Big Money in Your Data Center and Private Cloudtervela
 
Ibm big data ibm marriage of hadoop and data warehousing
Ibm big dataibm marriage of hadoop and data warehousingIbm big dataibm marriage of hadoop and data warehousing
Ibm big data ibm marriage of hadoop and data warehousing DataWorks Summit
 
Requirements for Public Sector Cloud Computing
Requirements for Public Sector Cloud ComputingRequirements for Public Sector Cloud Computing
Requirements for Public Sector Cloud ComputingPeter Coffee
 
Paperless PeopleSoft: Electronic Forms for HCM & Financials
Paperless PeopleSoft: Electronic Forms for HCM & FinancialsPaperless PeopleSoft: Electronic Forms for HCM & Financials
Paperless PeopleSoft: Electronic Forms for HCM & FinancialsSmart ERP Solutions, Inc.
 
Unlock the power_of_personal_media_in_enterprise
Unlock the power_of_personal_media_in_enterpriseUnlock the power_of_personal_media_in_enterprise
Unlock the power_of_personal_media_in_enterpriseBecky_Hayman
 
Workforce Insight - A Case Study (cfactor and DeVry)
Workforce Insight - A Case Study (cfactor and DeVry)Workforce Insight - A Case Study (cfactor and DeVry)
Workforce Insight - A Case Study (cfactor and DeVry)cfactor Works Inc.
 
Retain Talent and Improve Employee Satisfaction
Retain Talent and Improve Employee SatisfactionRetain Talent and Improve Employee Satisfaction
Retain Talent and Improve Employee SatisfactionHuman Capital Media
 
Master agile development and testing
Master agile development and testingMaster agile development and testing
Master agile development and testingvmglover
 
Understanding the Value of the Cloud - Centare Lunch & Learn - June 2, 2011
Understanding the Value of the Cloud - Centare Lunch & Learn - June 2, 2011Understanding the Value of the Cloud - Centare Lunch & Learn - June 2, 2011
Understanding the Value of the Cloud - Centare Lunch & Learn - June 2, 2011Eric D. Boyd
 
Zenith Infotech Mirror Cloud Presentation. 112211
Zenith Infotech    Mirror Cloud Presentation. 112211Zenith Infotech    Mirror Cloud Presentation. 112211
Zenith Infotech Mirror Cloud Presentation. 112211hdmchughgmailcom
 
Dynamo Systems - QCon SF 2012 Presentation
Dynamo Systems - QCon SF 2012 PresentationDynamo Systems - QCon SF 2012 Presentation
Dynamo Systems - QCon SF 2012 PresentationShanley Kane
 
Customer Day 18th May 2012
Customer Day 18th May 2012Customer Day 18th May 2012
Customer Day 18th May 2012ctrlsblog
 
Dr. Michael Valivullah, NASS/USDA - Cloud Computing
Dr. Michael Valivullah, NASS/USDA - Cloud ComputingDr. Michael Valivullah, NASS/USDA - Cloud Computing
Dr. Michael Valivullah, NASS/USDA - Cloud Computingikanow
 

Similar a Tdwi agile data warehouse - dv, what is the buzz about (20)

Data Flux
Data FluxData Flux
Data Flux
 
Peter Coffee CIO Forum 20100406
Peter Coffee CIO Forum 20100406Peter Coffee CIO Forum 20100406
Peter Coffee CIO Forum 20100406
 
Aberdeen ppt-iam integrated-db-06 20120412
Aberdeen ppt-iam integrated-db-06 20120412Aberdeen ppt-iam integrated-db-06 20120412
Aberdeen ppt-iam integrated-db-06 20120412
 
Top 10 Data Center Success Criteria
Top 10 Data Center Success CriteriaTop 10 Data Center Success Criteria
Top 10 Data Center Success Criteria
 
Webinar- Simple and Cost-Effective Disaster Recovery in the Cloud - 7-19-12
Webinar- Simple and Cost-Effective Disaster Recovery in the Cloud - 7-19-12Webinar- Simple and Cost-Effective Disaster Recovery in the Cloud - 7-19-12
Webinar- Simple and Cost-Effective Disaster Recovery in the Cloud - 7-19-12
 
Cloudy with a chance of downtime
Cloudy with a chance of downtimeCloudy with a chance of downtime
Cloudy with a chance of downtime
 
IT Compliance and Governance with DLP Controls and Vulnerability Scanning Sof...
IT Compliance and Governance with DLP Controls and Vulnerability Scanning Sof...IT Compliance and Governance with DLP Controls and Vulnerability Scanning Sof...
IT Compliance and Governance with DLP Controls and Vulnerability Scanning Sof...
 
4 Ways To Save Big Money in Your Data Center and Private Cloud
4 Ways To Save Big Money in Your Data Center and Private Cloud4 Ways To Save Big Money in Your Data Center and Private Cloud
4 Ways To Save Big Money in Your Data Center and Private Cloud
 
Ibm big data ibm marriage of hadoop and data warehousing
Ibm big dataibm marriage of hadoop and data warehousingIbm big dataibm marriage of hadoop and data warehousing
Ibm big data ibm marriage of hadoop and data warehousing
 
Requirements for Public Sector Cloud Computing
Requirements for Public Sector Cloud ComputingRequirements for Public Sector Cloud Computing
Requirements for Public Sector Cloud Computing
 
Paperless PeopleSoft: Electronic Forms for HCM & Financials
Paperless PeopleSoft: Electronic Forms for HCM & FinancialsPaperless PeopleSoft: Electronic Forms for HCM & Financials
Paperless PeopleSoft: Electronic Forms for HCM & Financials
 
Unlock the power_of_personal_media_in_enterprise
Unlock the power_of_personal_media_in_enterpriseUnlock the power_of_personal_media_in_enterprise
Unlock the power_of_personal_media_in_enterprise
 
Workforce Insight - A Case Study (cfactor and DeVry)
Workforce Insight - A Case Study (cfactor and DeVry)Workforce Insight - A Case Study (cfactor and DeVry)
Workforce Insight - A Case Study (cfactor and DeVry)
 
Retain Talent and Improve Employee Satisfaction
Retain Talent and Improve Employee SatisfactionRetain Talent and Improve Employee Satisfaction
Retain Talent and Improve Employee Satisfaction
 
Master agile development and testing
Master agile development and testingMaster agile development and testing
Master agile development and testing
 
Understanding the Value of the Cloud - Centare Lunch & Learn - June 2, 2011
Understanding the Value of the Cloud - Centare Lunch & Learn - June 2, 2011Understanding the Value of the Cloud - Centare Lunch & Learn - June 2, 2011
Understanding the Value of the Cloud - Centare Lunch & Learn - June 2, 2011
 
Zenith Infotech Mirror Cloud Presentation. 112211
Zenith Infotech    Mirror Cloud Presentation. 112211Zenith Infotech    Mirror Cloud Presentation. 112211
Zenith Infotech Mirror Cloud Presentation. 112211
 
Dynamo Systems - QCon SF 2012 Presentation
Dynamo Systems - QCon SF 2012 PresentationDynamo Systems - QCon SF 2012 Presentation
Dynamo Systems - QCon SF 2012 Presentation
 
Customer Day 18th May 2012
Customer Day 18th May 2012Customer Day 18th May 2012
Customer Day 18th May 2012
 
Dr. Michael Valivullah, NASS/USDA - Cloud Computing
Dr. Michael Valivullah, NASS/USDA - Cloud ComputingDr. Michael Valivullah, NASS/USDA - Cloud Computing
Dr. Michael Valivullah, NASS/USDA - Cloud Computing
 

Más de Prudenza B.V

[Dutch] Data: Van Innovatie naar Waarde
[Dutch] Data: Van Innovatie naar Waarde[Dutch] Data: Van Innovatie naar Waarde
[Dutch] Data: Van Innovatie naar WaardePrudenza B.V
 
FB 24-31 Ronald Damhof_FR
FB 24-31 Ronald Damhof_FRFB 24-31 Ronald Damhof_FR
FB 24-31 Ronald Damhof_FRPrudenza B.V
 
FB 24-31 Ronald Damhof
FB 24-31 Ronald Damhof FB 24-31 Ronald Damhof
FB 24-31 Ronald Damhof Prudenza B.V
 
FB_24-31_Ronald Damhof
FB_24-31_Ronald DamhofFB_24-31_Ronald Damhof
FB_24-31_Ronald DamhofPrudenza B.V
 
Idq summit2014 ronald damhof - it's all about the data
Idq summit2014   ronald damhof - it's all about the dataIdq summit2014   ronald damhof - it's all about the data
Idq summit2014 ronald damhof - it's all about the dataPrudenza B.V
 
Keynote 5 juni 2014 - dutch data vault masters - shu-ha-ri
Keynote   5 juni 2014 - dutch data vault masters - shu-ha-riKeynote   5 juni 2014 - dutch data vault masters - shu-ha-ri
Keynote 5 juni 2014 - dutch data vault masters - shu-ha-riPrudenza B.V
 
Keynote 22 mei 2014 - dwh automation - 4 Quadrant
Keynote   22 mei 2014 - dwh automation - 4 QuadrantKeynote   22 mei 2014 - dwh automation - 4 Quadrant
Keynote 22 mei 2014 - dwh automation - 4 QuadrantPrudenza B.V
 
20130527 jill dyche - im ronald [Dutch]
20130527   jill dyche - im ronald [Dutch]20130527   jill dyche - im ronald [Dutch]
20130527 jill dyche - im ronald [Dutch]Prudenza B.V
 
20130527 jill dyche - im ronald
20130527   jill dyche - im ronald20130527   jill dyche - im ronald
20130527 jill dyche - im ronaldPrudenza B.V
 
[Dutch] Data Health Care: hoe je data in goede gezondheid te krijgen
[Dutch] Data Health Care: hoe je data in goede gezondheid te krijgen[Dutch] Data Health Care: hoe je data in goede gezondheid te krijgen
[Dutch] Data Health Care: hoe je data in goede gezondheid te krijgenPrudenza B.V
 
[Dutch] Analytics is waarde-loos
[Dutch] Analytics is waarde-loos[Dutch] Analytics is waarde-loos
[Dutch] Analytics is waarde-loosPrudenza B.V
 
Data Vault automation conference - all presentations
Data Vault automation conference - all presentationsData Vault automation conference - all presentations
Data Vault automation conference - all presentationsPrudenza B.V
 
Keynote Ronald Damhof Data Vault Automation
Keynote Ronald Damhof Data Vault Automation Keynote Ronald Damhof Data Vault Automation
Keynote Ronald Damhof Data Vault Automation Prudenza B.V
 

Más de Prudenza B.V (13)

[Dutch] Data: Van Innovatie naar Waarde
[Dutch] Data: Van Innovatie naar Waarde[Dutch] Data: Van Innovatie naar Waarde
[Dutch] Data: Van Innovatie naar Waarde
 
FB 24-31 Ronald Damhof_FR
FB 24-31 Ronald Damhof_FRFB 24-31 Ronald Damhof_FR
FB 24-31 Ronald Damhof_FR
 
FB 24-31 Ronald Damhof
FB 24-31 Ronald Damhof FB 24-31 Ronald Damhof
FB 24-31 Ronald Damhof
 
FB_24-31_Ronald Damhof
FB_24-31_Ronald DamhofFB_24-31_Ronald Damhof
FB_24-31_Ronald Damhof
 
Idq summit2014 ronald damhof - it's all about the data
Idq summit2014   ronald damhof - it's all about the dataIdq summit2014   ronald damhof - it's all about the data
Idq summit2014 ronald damhof - it's all about the data
 
Keynote 5 juni 2014 - dutch data vault masters - shu-ha-ri
Keynote   5 juni 2014 - dutch data vault masters - shu-ha-riKeynote   5 juni 2014 - dutch data vault masters - shu-ha-ri
Keynote 5 juni 2014 - dutch data vault masters - shu-ha-ri
 
Keynote 22 mei 2014 - dwh automation - 4 Quadrant
Keynote   22 mei 2014 - dwh automation - 4 QuadrantKeynote   22 mei 2014 - dwh automation - 4 Quadrant
Keynote 22 mei 2014 - dwh automation - 4 Quadrant
 
20130527 jill dyche - im ronald [Dutch]
20130527   jill dyche - im ronald [Dutch]20130527   jill dyche - im ronald [Dutch]
20130527 jill dyche - im ronald [Dutch]
 
20130527 jill dyche - im ronald
20130527   jill dyche - im ronald20130527   jill dyche - im ronald
20130527 jill dyche - im ronald
 
[Dutch] Data Health Care: hoe je data in goede gezondheid te krijgen
[Dutch] Data Health Care: hoe je data in goede gezondheid te krijgen[Dutch] Data Health Care: hoe je data in goede gezondheid te krijgen
[Dutch] Data Health Care: hoe je data in goede gezondheid te krijgen
 
[Dutch] Analytics is waarde-loos
[Dutch] Analytics is waarde-loos[Dutch] Analytics is waarde-loos
[Dutch] Analytics is waarde-loos
 
Data Vault automation conference - all presentations
Data Vault automation conference - all presentationsData Vault automation conference - all presentations
Data Vault automation conference - all presentations
 
Keynote Ronald Damhof Data Vault Automation
Keynote Ronald Damhof Data Vault Automation Keynote Ronald Damhof Data Vault Automation
Keynote Ronald Damhof Data Vault Automation
 

Último

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 

Último (20)

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 

Tdwi agile data warehouse - dv, what is the buzz about

  • 1. Agile Data Warehousing Data Vault, What is the buzz about TDWI München June 18, 2012 Ronald Damhof R.D.Damhof
  • 2. “Our highest priority is to satisfy the customer through early and continuous delivery of valuable software” Agile Manifesto, 2001 Kent Beck, Mike Beedle, Arie van Bennekum, Alistair Cockburn, Ward Cunningham, Martin Fowler, James Grenning, Jim Highsmith, Andrew Hunt, Ron Jeffries, Jon Kern, Brian Marick, Robert C. Martin, Steve Mellor, Ken Schwaber, Jeff Sutherland, Dave Thomas R.D.Damhof
  • 3. ‘Calculating risk Source ‘Yield modules’ Source ‘Customer segmentation’ ‘Semantic gap’ R.D.Damhof
  • 4. Everybody mines their own data Everybody enriches their own data Everybody uses their own data User = Developer With his selfmade tools Data quality determined by the individual It’s a grind – limited reusability Leadtimes unpredictable No management R.D.Damhof
  • 5. Lets ‘order’ an information product And hire a master/expert Separation between user/developer Developer/expert mines the data The information product = custom made Data quality is mostly dependable on the developer/expert Leadtimes unpredictable Still not much reusability R.D.Damhof
  • 6. A central department who knows what information you need That assembles information products, ready to be used for you ‘I now what you want’ – black Efficiency is the name of the game At least I got something, but it does not comply - even remotely - to my needs Even worse; the guild-days are still there – the expert is now submerged, but needed to get the data you actually need. Introduction of management – you want something? Please apply in 3-fold… R.D.Damhof
  • 7. Creating information products, the moment they are asked for Against quality criteria which are in line with the expectation of the customer Empower the customer with skills and facilities to be more self sufficient Minimize ‘data’-stock as much as possible Embrace new wishes and changes required by the customer The customer is the most important part of the production process Stephen Denning (2011) – Radical Management R.D.Damhof
  • 8. A modern data management environment: The ‘Supermarket’ The ‘Restaurant’ The ‘Do it yourself buffet’ R.D.Damhof
  • 10. Push characteristics §  Mass production §  Known specifications, operational definitions, standards §  Repeatable, predictable, even better; uniform process §  Part of the system that needs statistical control §  Inventory allowed/necessary §  Supply driven §  Reliability over flexibility Pull characteristics §  Just in time §  Demand driven §  Build to order §  Preferably no inventory §  Flexibility over Reliability R.D.Damhof
  • 11. Back to the issue at hand…… §  What: the ‘production process of data’ §  Where: Coordination - Local versus central §  How: System Engineering - Systematic vs. Opportunistic §  What principles guide us - leading principles R.D.Damhof
  • 12. Local  vs  Central  deployment   Informa.on  Delivery  Proces   Recipient   Informa.on  Delivery  process   4.  Generate  Informa.on  products   End-­‐user  (Local)   Data    func.on  service   4   4   4   4   4   3.  Enrich  and  cleanse  data   3   3   3   3   3   2.  Register    Standardize   2   2   2   2   2   1   1   1   1   1   1.  Get  the  raw  uncut  data   Generic  proces  (Central)   Data  sources   (internal    external)   R.D.Damhof
  • 13. System Engineering - Systematic vs. Opportunistic Manoeuvrability (opportunistic approach) Ad-hoc development proces Selfservice Developer=user Development Self-sufficient/ great degree of freedom Very broad tasks Lightweight development process Delegated Minimum of specialisation/ distinction of roles Development Self-sufficient/ limited freedom Development line discipline (OTAP) Developers at a distance from users IT Development Mutually dependent/ within frameworks Heavy separation of function Sustainability (Systematic approach) R.D.Damhof
  • 14. Leading principles Compliant Adaptible Sustainable Decoupled Centralized Standardized Effective Industrialized R.D.Damhof
  • 15. 1   2   3   4   Company  xxx  data  management  Domain   Source  store   BI  apps Reports     Business  View   Sources   BI  Apps   Analysis   Enterprise     Data  Warehouse   BI  Apps Ad-­‐hoc     Data,  ‘What’   Func.on,  ‘How’   ‘Where’,  ‘Whom’   15   R.D.Damhof
  • 16. Source  to   Sourcestore  to   Sourcestore  to   EDW  (DV)   product   product   BV   Adaptable   Sustainable   Compliant   Decoupled   Effec.ve   Standardized   Centralized   16   R.D.Damhof
  • 17. 1   2   3   4   Company  xxx  data  warehouse    Business  Intelligence    Domain   Source  store   BI  apps Reports     Business  View,     Data  feeds   Sources   BI  Apps   Analysis   Enterprise     Data  Warehouse   BI  Apps Ad-­‐hoc     Data,  ‘What’   Func.on,  ‘How’   ‘Where’,  ‘Whom’   17   R.D.Damhof
  • 18. Administra.ve  process   Informa.on  Delivery  Process   Decision-­‐    control   Generate   Data    Informa.on  recipients   Distribute   Enrich   Register     Standardize   Proces   AXain   Why PDCA   DV? Compliance  repor.ng   Informa.on   products   Risk  Management   Push   Systems   DV  based   Pull   (internal     Data     Performance   external)   Warehouse   Management   Business   Supply  chain   Staging   rules   op.miza.on   Push   Data  products   Fraud  detec.on   Market  basket   analysis   Control  /  Metadata   18   R.D.Damhof
  • 19. Metamodel  driven  automa.on   -­‐ Models  (process,  rules  and  data)  determine  the  metadata,  the  metadata  determines  the  automa.on  ar.facts   -­‐ Aim  is  to  be  100%  declara.ve   -­‐ It  can  not  be  generated  all,  specific  tailored  metadata  will  remain  necessary   Metadata  driven  automa.on   -­‐  Inputs:  Source  model(s),  target  model,  Template  Design,  Naming  conven.ons   -­‐  Advanced  inputs:  Normaliza.on  preferences,  Ontologies   Taken  from  Dan  Linstedt’s  blog  post:  hXp://danlinstedt.com/datavaultcat/code-­‐genera.on-­‐for-­‐data-­‐vault-­‐not-­‐as-­‐easy-­‐as-­‐you-­‐think/   Data  Vault   implementa.ons   Template  driven  automa.on   -­‐ In  the  most  basic  forms;  documenta.on    -­‐  describing  a  paXern   -­‐ More  advanced;  genera.ng  XML  code  for  2nd  gen.  ETL  tooling   -­‐ Vb  -­‐  hXp://www.grundsatzlich-­‐it.nl/bi-­‐tools-­‐templator.html   19   R.D.Damhof
  • 20. My PoV about (Data Vault) automation Tooling §  Generation is an aid, not a goal in itself Do not accommodate the principles to fit the tool.... Look for decoupling §  Truly understand the mechanics - handcraft it first! Invest in proper education and learning Invest in getting ready time Involve your customers from the start §  PoC, PoC, PoC §  Deliver, Deliver, Deliver 20   R.D.Damhof
  • 21. Agility Data Vault (1) Why is it that you can build and deploy extremely small particles in Data Vault and not in other approaches, without having an increase in the overhead and coordination of these particles? In other words; 'Divide and Conquer to beat the Size / Complexity Dynamic’ R.D.Damhof
  • 22. Agility Data Vault (2) Why is it that you can re-engineer your existing model and guarantee that the changes remain local? Something that is hugely beneficial in data warehouses that - by definition - grow over time. R.D.Damhof
  • 23. Agility Data Vault (3) Why is it that - as your (Data Vault based) data warehouse grows - your costs grow ‘merely’ in linear fashion initially, and as you approach the end state marginal growth in cost decreases exponentially. R.D.Damhof
  • 24. Data Vault as-such is not Agile, it is the development process that needs to be agile, DV merely supports the agile development process. “Our highest priority is to satisfy the customer through early and continuous delivery of valuable software” Agile Manifesto, 2001 Kent Beck, Mike Beedle, Arie van Bennekum, Alistair Cockburn, Ward Cunningham, Martin Fowler, James Grenning, Jim Highsmith, Andrew Hunt, Ron Jeffries, Jon Kern, Brian Marick, Robert C. Martin, Steve Mellor, Ken Schwaber, Jeff Sutherland, Dave Thomas R.D.Damhof
  • 25. Data Model Time Line Historic Overview © (Linstedt, Graziano, Hultgren, The New Business Supermodel, The Business of Data Vault Modeling, 2008, p. 36) §  Created By Dan Linstedt §  Released in 2000 §  Formally Introduced in the Netherlands in 2007 §  First DV Book: The Business of Data Vault Modeling 2008 §  First (Dutch) User group in 2010 §  Technical book from Dan Linstedt in 2011 R.D.Damhof
  • 26. Application Architecture R.D.Damhof
  • 27. Top Down Approach R.D.Damhof
  • 28. Bottom Up Approach R.D.Damhof
  • 29. Bottom Up Approach R.D.Damhof
  • 30. Bottom Up Approach R.D.Damhof
  • 31. Bottom Up Approach R.D.Damhof
  • 32. Irony R.D.Damhof
  • 33. Hybrid Approach (Data Vault) R.D.Damhof
  • 35. Kimball or Inmon ETL -  Complex ETL -  Truth oriented -  Business Rules before EDW ETL/Load Architecture -  100% of the data (within scope) 100% of the time -  Source driven /Auditable: -  “Fact Oriented” -  Template/metadata driven -  No Business Rules R.D.Damhof Pictures: Genesee Academy ©
  • 36. Classic Data Vault Application Architecture Business   Transac.on   System     Staging   Data  Vault   Datasets   Out   Business   Transac.on   Generic  Business  Rules   System     Rule  Vault   Structure  transforma.on   Business  rule  execu.on   Hub  =  business  keys   Structure  and  value  transforma.on   Adaptable   Sustainable   Compliant   Decoupled   Effec.veness   Standardized   Centralized   ?   ?   36   R.D.Damhof
  • 37. Data Vault Application Architecture §  Central EDW §  Business rules downstream §  Incremental/Non destructive Loading §  100% of the data (within scope) 100% of the time §  Auditable/Partly source driven R.D.Damhof
  • 38. Modeling R.D.Damhof
  • 42. R.D.Damhof Pictures: Genesee Academy ©
  • 43. R.D.Damhof Pictures: Genesee Academy ©
  • 44. R.D.Damhof Pictures: Genesee Academy ©
  • 45. R.D.Damhof Pictures: Genesee Academy ©
  • 46. Data Vault Constructs R.D.Damhof Pictures: Genesee Academy ©
  • 47. Data Vault Constructs R.D.Damhof Pictures: Genesee Academy ©
  • 48. Data Vault Constructs R.D.Damhof Pictures: Genesee Academy ©
  • 49. Core Components R.D.Damhof
  • 50. Data Vault Core Components R.D.Damhof Pictures: Genesee Academy ©
  • 51. Data Vault Core Components R.D.Damhof Pictures: Genesee Academy ©
  • 52. Hubs R.D.Damhof Pictures: Genesee Academy ©
  • 53. Hubs R.D.Damhof Pictures: Genesee Academy ©
  • 54. Hubs R.D.Damhof Pictures: Genesee Academy ©
  • 55. Satellites R.D.Damhof Pictures: Genesee Academy ©
  • 56. Satellites R.D.Damhof Pictures: Genesee Academy ©
  • 57. Links R.D.Damhof Pictures: Genesee Academy ©
  • 58. Links R.D.Damhof Pictures: Genesee Academy ©
  • 59. Loading R.D.Damhof
  • 60. HUB load R.D.Damhof Pictures: Genesee Academy ©
  • 61. HUB load INSERT INTO customer_hub (cust#,load_dts,record_src) SELECT source.customer#, @load_dts, @record_src FROM source_customer AS source WHERE NOT EXISTS (SELECT * FROM customer_hub AS hub WHERE hub.customer#=source.customer#) R.D.Damhof Pictures: Genesee Academy ©
  • 62. Link Load Loading a Link R.D.Damhof Pictures: Genesee Academy ©
  • 63. Link Load INSERT INTO custcontact_link(cust_id,contact_id,load_dts, record_src) SELECT source.customer#, @load_dts, @record_src FROM source_table AS source INNER JOIN contact_hub AS contact ON contact. contact#= source.contact# INNER JOIN customer_hub AS cust ON cust. customer#= source.customer# WHERE NOT EXISTS (SELECT * FROM custcontact_link AS link WHERE link. contact_id= contact.id and link.cust_id= cust.id) R.D.Damhof Pictures: Genesee Academy ©
  • 64. Satellite Load Loading a Satellite R.D.Damhof Pictures: Genesee Academy ©
  • 65. Satellite Load INSERT INTO customer_sat (hub_id,load_dts, name,record_src) SELECT hub.id, @load_dts, source.cust_name, ,@record_src FROM source_customer AS source INNER JOIN customer_hub AS hub ON cust.customer#= source.customer# # INNER JOIN customer_sat AS sat ON sat.id= hub.id# AND sat “Is most recent” AND sat.name source.name R.D.Damhof Pictures: Genesee Academy ©
  • 66. Data Vault Loading Paradigm R.D.Damhof Pictures: Genesee Academy ©
  • 67. Top 10 Rules for Data Vault Modeling R.D.Damhof Pictures: Genesee Academy ©
  • 68. Agility Data Vault - recap (1) Why is it that you can build and deploy extremely small particles in Data Vault and not in other approaches, without having an increase in the overhead and coordination of these particles? In other words; 'Divide and Conquer to beat the Size / Complexity Dynamic’ Why is it that you can re-engineer your existing model and guarantee that the changes remain local? Something that is hugely beneficial in data warehouses that - by definition - grow over time. Why is it that - as your (Data Vault based) data warehouse grows - your costs grow ‘merely’ in linear fashion initially, and as you approach the end state marginal growth in cost decreases exponentially. R.D.Damhof
  • 69. Agility Data Vault - recap (2) Remember the Push characteristics ➡  Mass production Data Vault ➡  Known specifications, operational definitions, standards Data Vault ➡  Repeatable, predictable, even better; uniform process Data Vault ➡  Part of the system that needs statistical control Data Vault ➡  Inventory allowed/necessary Data Vault ➡  Mainly supply driven Data Vault ➡  Reliability over flexibility Data Vault Automation of a Data Vault ‘production process’ is just common sense R.D.Damhof
  • 70. Bonus Slides Forks and mutations in DV ‘evolution’ R.D.Damhof
  • 71. Type 1 - Classic Data Vault Business   Transac.on   System     Staging   Data  Vault   Datasets   Out   Business   Transac.on   Generic  Business  Rules   System     Rule  Vault   Structure  transforma.on   Business  rule  execu.on   Hub  =  business  keys   Structure  and  value  transforma.on   Adaptable   Sustainable   Compliant   Decoupled   Effec.veness   Standardized   Centralized   ?   ?   71   R.D.Damhof
  • 72. Type 2 - Source Data Vault Business   Transac.on   Staging  Vault   System     Business     Data  Marts   Data  Vault   Business   Transac.on   Staging  Vault   System     Structure  transforma.on   Business  rule  execu.on   Structure  transforma.on   No  integra.on,  Hub=surrogate  keys   Integra.on   Persis.ng  staging  in  DV  format   DV  modelled     Adaptable   Sustainable   Compliant   Decoupled   Effec.veness   Standardized   Centralized   ?   ?   ?   72   R.D.Damhof
  • 73. Source   Source    100%  Seman.c  gap   Source   Staging  DV   Business  DV   Source   Staging  DV   100%  Seman.c  gap   S.ll  the  source   Integra.on,  cleansing,  consolida.on   Business  rule  execu.on  upstream  ??   DV  modelled     73   R.D.Damhof
  • 74. Source   Source    100%  Seman.c  gap   Source   Source  Staging  DV   Business  DV   Data  Warehouse   Source   Source  Staging  DV   100%  Seman.c  gap   S.ll  the  source   Integra.on,  cleansing,  consolida.on   Business  rule  execu.on  upstream  ??   DV  modelled     74   R.D.Damhof
  • 75. Wanna know more? §  Training certification: www.geneseeacademy.com §  Books: ‘Super Charge Your Data Warehouse: Invaluable Data Modeling Rules to Implement Your Data Vault’ – D.Linstedt / K.Graziano §  Linkedin: Data Vault Discussions (approx. 800 members) §  Niche non-commercial conferences; www.dwhautomation.com §  Many blogs, articles, presentations on the World Wide Web §  The best way to learn; try it, make some code, experience, engage R.D.Damhof
  • 76. Thank You Drs.  Ronald  D.  Damhof   Blog   hXp://prudenza.typepad.com/   hXp://www.b-­‐eye-­‐network.com/blogs/damhof/     Linkedin   hXp://nl.linkedin.com/in/ronalddamhof   Email   ronald.damhof@prudenza.nl   TwiXer   RonaldDamhof   Skype   Ronald.Damhof   Mobile   +31(0)6  269  67  184   Others   Informa.on  Quality  Cer.fied  Professional  (IQCP)   Data  Vault  Cer.fied  Grand  Master   Cer.fied  Scrum  Master   Member  of  the  Boulder  BI  Brain  Trust  (#BBBT)   Ronald  Damhof  is  an  independent  prac..oner  in  the  field  of  data  management  and  decision  support.  Graduated  in  1995  in  the   study  of  Economics.  Since  1995  he  worked  as  a  prac..oner  into  the  field  of  Informa.on  Management  with  a  focus  on  decision   support  and  data  management,  trying  hard  to  enhance  the  rigor  and  relevance  in  these  fields  by  combining  scien.fic  research   with  the  everyday  challenges  of  the  prac..oner.  Ronald  is  mainly  hired  by  customers  in  the  role  of  business/IT  architect,   auditor,  coach    trainer.  He  blogs  on  B-­‐Eye-­‐Network.com  as  well  as  his  own  blog,  is  a  member  of  the  pres.gious  BBBT,  wrote   several  ar.cles  regarding  decision  support  architectures  and  is  a  researcher  in  the  field  of  Informa.on  Management.       Although  Ronald  likes  to  work  with  theore.cal  grounded  research  and  proven  prac.ces,  Ronald  is  not  a  'white  paper'  architect;   put  your  money  where  your  mouth  is,  is  his  moXo.  He  likes  to  see  architectures  'live'  in  enterprises,  not  just  write  about  it.  In   most  organiza.ons  his  role  extends  architecture  onen.  In  truely  agile  spirit  the  roles  he  plays  depend  on  the  context  of  the   client;  he  can  be  a  missionary  (selling  the  value),  a  project  manager  (geong  it  done),  a  scrum  master  (removing  impediments),   specialist  (educa.ng  hardware  peeps,  data  architects,  data  logis.cs  etc.)  or  a  leader.   76   R.D.Damhof