SlideShare una empresa de Scribd logo
1 de 48
KENT GRAZIANO
@KentGraziano | kentgraziano.com
AGILE METHODS AND DATA
WAREHOUSING:
HOW TO DELIVER FASTER
Agenda
 My Bio
 Why Agile & DW
 Agile Manifesto
 12 Agile Principles
 Agile Concepts
 Two week iterations?
 Becoming Agile
 Agile DevOps
 Agile Data Modeling
 Conclusion
© Data Warrior LLC
My Bio
› Senior Technical Evangelist, Snowflake Computing
› Blogger: The Data Warrior
› Certified Data Vault Master and DV 2.0 Practitioner
› Oracle ACE Director (BI/DW)
› Data Modeling, Data Architecture and Data Warehouse
Specialist
› 30+ years in IT
› 25+ years of Oracle-related work
› 20+ years of data warehousing experience
› Former-Member: Boulder BI Brain Trust
(http://www.boulderbibraintrust.org/)
› Author & Co-Author of a bunch of books
› Past-President of Oracle Development Tools User
Group and Rocky Mountain Oracle User Group
Why Agile & DW?
 Perceptions
● DW too slow to produce results
● DW projects “fail”
● DW projects fail to adapt to business changes
 Goal
● Change perceptions
● Deliver value
● Be more adaptable and flexible
© Data Warrior LLC
Objectives
 Understand what is meant by “agile”
 Try to apply some agile ideas to data warehouse and business
intelligence efforts
 Answer the question – can we deliver results faster?
Manifesto for Agile Software Development
http://agilemanifesto.org
We are uncovering better ways of developing
software by doing it and helping others do it.
Through this work we have come to value:
Individuals and interactions over processes and
tools
Working software over comprehensive
documentation
Customer collaboration over contract negotiation
Responding to change over following a plan
That is, while there is value in the items on
the right, we value the items on the left more
Kent Beck
Mike Beedle
Arie van Bennekum
Alistair Cockburn
Ward Cunningham
Martin Fowler
James Grenning
Jim Highsmith
Andrew Hunt
Ron Jeffries
Jon Kern
Brian Marick
Robert C. Martin
Steve Mellor
Ken Schwaber
Jeff Sutherland
Dave Thomas
© 2001, the above authors
this declaration may be freely copied in any form,
but only in its entirety through this notice.
THE PRINCIPLES OF AGILE
Principle #1
 Our highest priority is to satisfy the
customer through early and continuous
delivery of valuable software.
● Who is the customer?
● What is “valuable software” in data
warehousing?
● BI reports
● Dashboard interface
● Working ETL code?
● In the context of the customer!
● Once we start – keep going!
© Data Warrior LLC
Principle #2
 Welcome changing requirements, even
late in development. Agile processes
harness change for the customer's
competitive advantage.
● Must be flexible and adaptable in thinking and
design
● User Stories
● Use code generators
● Agile data engineering techniques
● Flexible and performant data platform
© Data Warrior LLC
Principle #3
 Deliver working software frequently, from
a couple of weeks to a couple of months,
with a preference to the shorter
timescale.
●Need good scope control!
● Prioritized Backlog
●One subject area at a time
● What is a subject area?
●Think Data Vault
© Data Warrior LLC
Principle #4
 Business people and developers
must work together daily throughout
the project.
● DW MUST have the business involved
● One of the Top 10 reasons for failure
● This applies for BI reports
● Daily interaction would be great!
● But – politics and priorities may interfere!
● At HP GBI/EDW – we used “war” room
● At MSH – daily standups with Biz
© Data Warrior LLC
Principle #5
 Build projects around motivated
individuals. Give them the environment
and support they need, and trust them to
get the job done.
● Need people who WANT to be on the project
● Lose the dead weight
● Get training if needed
● Keep units of work small to create an
atmosphere of success
● Don’t try a Big Bang EDW
© Data Warrior LLC
Principle #6
 The most efficient and effective method of
conveying information to and within a
development team is face-to-face
conversation.
● Daily team huddles
● Co-located work space
● While face-to-face is efficient, still need some
documentation (or meta-data) for later
● Use a tool like JIRA, VersionOne, Rally, LeanKit
© Data Warrior LLC
Principle #7
 Working software is the primary measure
of progress.
●Applied to DW:
● What is “working software?”
● BI reports?
● Tables definitions and working ETL
code?
●Think more broadly – it is not just a data
entry screen
© Data Warrior LLC
Principle #8
 Agile processes promote sustainable development. The
sponsors, developers, and users should be able to
maintain a constant pace indefinitely.
● DW Programs last a long time – don’t burn the team
out with unreasonable deadlines
● See P#5 – Motivated individuals
● Good planning and scope control
● No all nighters!
● Smallest valuable unit of work possible (MVP)
● Keep it moving like a production line
● Pick (or develop) a standard, repeatable
methodology
● Study the Agile methods and adopt what works for your team
● Data Vault Modeling Methodology
© Data Warrior LLC
Principle #9
 Continuous attention to technical
excellence and good design enhances
agility.
● Bad design + bad architecture = trouble
● Symptom: can’t build a requested data mart
● Frequent design reviews a must
● Improves team skills – provides cross training
● Over time – better designs, shorter review cycles
● Faster delivery
© Data Warrior LLC
Principle #10
 Simplicity--the art of maximizing the
amount of work not done--is essential.
● KISS – Keep it Simple Stupid
● Write less code by hand
● Use code generators! (No syntax errors – ever)
● Oracle SDDM, ERWin, Vertabelo
● Oracle Data Integrator, SSIS
● AnalytxDS, WhereScape RED
● Modifications are easier – just regenerate the code
● Virtualize initial reporting layers
● Agile DevOps!
© Data Warrior LLC
Principle #11
 The best architectures, requirements, and
designs emerge from self-organizing teams.
● Team of smart, motivated people = success
● We succeed (or fail) as a TEAM
● Don’t micro manage or pigeon-hole staff
● Encourage team work and team thinking
● Staff will gravitate to roles based on skills, interest,
and personality
● Then they have more buy-in to the process
● Eliminates delays and bottlenecks by having
shared responsibilities (no single point of failure)
© Data Warrior LLC
Principle #12
 At regular intervals, the team reflects on how
to become more effective, then tunes and
adjusts its behavior accordingly.
● The Decision Model
● Debate Mode
● Check Points
● Related to self-organizing teams
● Make finding the solution to a problem the team’s
problem
● More buy-in to the solution
● Retrospectives are a MUST!
© Data Warrior LLC
Decision Model in Action
Plan
Debate Decision

Check
Point
Questions
?
?
?
?
Answers
Mini-Debate
(Cause a
Slight Change
in Direction)
Iterate
Courtesy of Dr. Ed Freeman, CIO/CTO, Denver Public Schools
© Data Warrior LLC
USING AGILE CONCEPTS
Agile Concepts for DW
 Team Huddles (Morning Scrum)
 Extreme Programming
 Pair Programming
© Data Warrior LLC
Team Huddles
 Daily Standup Meeting
● AKA Scum
● Morning Roll Call (FDD)
 Short meeting (< 15 minutes)
● Every morning, mandatory attendance
● Review assignments, accomplishments, backlogs,
blockers
● 3 Questions
 Immediate feedback and assistance
● Keeps team motivated and on track (P #5)
● Identifies constraints and bottlenecks early in the process
● Eliminates backlogs more quickly via re-assignments
 Improves team work
 Supports self-organizing teams (P #11)
© Data Warrior LLC
Extreme Programming (XP)
 Programmer works directly with the end user
● At MSH used conference room or devloper’s cube
● At HP used Virtual Classroom or NetMeeting
● At DFA used WebEx
 In DW:
● Best with developing BI reports
● DW or data mart must already be populated
© Data Warrior LLC
Extreme Programming (XP)
 Reports developed using BI tool
● With the user in the room (or virtual)
● With constant user reviews and input using a web
reporting tool (via email even!)
 Also applies to developing a dashboard or portal
interface
 Works for ETL as well!
● Used war room with business to get near instant
validation of ETL changes
© Data Warrior LLC
Pair/Mob Programming
 Part of XP
 Programmers work side-by-side
● One terminal
● One codes, the other reviews
● Two terminals, one cube
● One programming, one documenting
● Could also be done virtually!
 In DW:
● Writing ETL Code
● Pair data modeling
● Report Development
© Data Warrior LLC
Two week iterations?
 Goal is really a few weeks to a few months
(see P#3)
 What is the deliverable?
● A fact table for a star schema
● A dimension table
● A complete star (fact and all dimensions)
● One piece of ETL code that populates a fact
table
● A function needed by the ETL code
● A new report or query
 Who is the customer?
● BI programmer?
● Knowledge worker?
● ETL programmer?© Data Warrior LLC
REAL WORLD METRICS
HP EDW Examples
 Business found missing report elements
 Solution: modify 3 tables to add 5 new columns in
reporting model (star schema)
 Tasks:
● Document requirements and ETL specs
● Modify Logical & Physical model (w/peer review)
● Rebuild tables in development
● Develop and test ETL
● MTI (Move To Integration) tables and code
● Execute and test ETL
● Modify report in UAT environment & test
 Result: Revised report ready in 18 hours, 44 minutes
● Less than 1 business day
 2nd case: 6 tables, 16 new columns
● Ready for UAT in 72 hours
 How? War room with Biz & IT Team
© Data Warrior LLC
BECOMING AGILE
Getting to Agile/RAD
 “better than average expertise”
● Expert consulting and mentoring
● Do the work (OTJ)
 At Denver Public Schools
● Took two years before we could try being
more “agile”
● Needed experience in Data Warehousing,
Oracle Designer, OWB, and the “process” of
building, deploying, and maintaining an
Oracle DW
© Data Warrior LLC
Getting to Agile/RAD
 At HP GBI/EDW it took about a year
● Needed the right team and the right
management support
● Also the right project with willing business
users
 At McKesson over a year setting
standards and training staff
● Built templates in JIRA for repeatable tasks
● Then 3 week sprints
 At DFA – Day 1!
● Experienced team with agile BI/DW coach
● Used modified KanBan
© Data Warrior LLC
Getting to Agile
 Cannot instantly become “agile”
 Must develop and agile mindset & culture
 Implement agile tools, processes, and methods
 Get an Agile Coach if necessary
● Specfically an agile BI/DW or Data Coach
© Data Warrior LLC
AGILE DEVOPS
“the art of maximizing the amount of work not done”
 Remember Principle #10!
 Need a dev setup that let’s you be agile
● Painless & fast setup and configuration
● Easy to add more space and compute on demand
● Minimal resource contention
● Automated query optimization
● Easy cloning of data sets
 Automated Regression Testing!
● DBFit, iCEDQ, AnalytixDS, WhereScape
● Roll your own – basic SQL scripts
© Data Warrior LLC
AGILE DATA MODELING
Data Vault – How I did it
 Data modeling technique for enterprise data
warehouse design
● See Data Vault white papers at kentgraziano.com
● Data Vault related books on Amazon (author Linstedt)
 Allows modeling EDW in small chunks
● Develop model, build tables, build ETL, populate,
repeat (often)
● Key: prioritize the data requirements
● Think User Stories and Backlog ala SCRUM
© Data Warrior LLC
Data Vault Components
Copyright 2011 Dan Linstedt, used by permission
Data Vault Model Flexibility (Agility)
 Goes beyond standard 3NF
• Highly normalized
● Hubs and Links only hold keys and meta data
● Satellites split by rate of change and/or source
• Enables Agile data modeling
● Easy to add to model without having to change existing
structures and load routines
• Relationships (links) can be dropped and created on-demand.
● No more reloading history because of a missed requirement
 Based on natural business keys
• Not system surrogate keys
• Allows for integrating data across functions and source
systems more easily
● All data relationships are key driven.
© Data Warrior LLC
Data Vault Extensibility
Adding new components to
the EDW has NEAR ZERO
impact to:
• Existing Loading
Processes
• Existing Data Model
• Existing Reporting & BI
Functions
• Existing Source Systems
• Existing Star Schemas
and Data Marts
(C) LearnDataVault.com
Sprint 1
Sprint 2
Sprint 3
How to be Agile using DV
 Model iteratively
● Use Data Vault data modeling technique
● Create basic components, then add over time
 Virtualize the Access Layer
● Don’t waste time building facts and dimensions
up front (Principle #10)
● ETL and testing takes too long
● “Project” objects using pattern-based DV model
with database views (or BI meta layer)
● Users see real reports with real data
 Can always build out for performance in
another iteration
© Data Warrior LLC
SHAMELESS PLUG: My Intro Book
Available on
Amazon.com
http://www.amazon.com/Better-
Data-Modeling-Introduction-
Engineering-
ebook/dp/B018BREV1C/
New DV 2.0 Book
› Available on Amazon:
http://www.amazon.com/Building-
Scalable-Data-Warehouse-
Vault/dp/0128025107/
Register at wwdvc.com
Other References
 Agile Management for Software Engineering:
Applying the Theory of Constraints for Business
Results by David J. Anderson
 CASE Method Fast-track: A RAD Approach by
Richard Barker & Dai Clegg
 The Goal by Eliyahu M. Goldratt
 The Business of Data Vault Modeling by Dan
Linstedt, Kent Graziano, & Hans Hultgren
 Super Charge Your Data Warehouse by Dan
Linstedt
 Test Automation by Lynn Winterboer
http://www.slideshare.net/AgileDenver/lynn-
winterboer-test-automation
© Data Warrior LLC
Conclusion
 Agile concepts can be applied to data
warehouse and BI projects
● Not a purist definition!
● Try to apply the principles – be creative
 Suggested approaches
● Data Vault 2.0!
● Use Snowflake Elastic DW!
● Use team huddles
● Use pair programming to increase quality and cross
training
● Use code generators
● Read about Agile Methods
● Read Oracle CASE Method Fast-Track
 Be flexible and give it a try
© Data Warrior LLC
QUESTIONS?
DISCUSSION?
CONTACT
INFORMATION
KENT GRAZIANO
Snowflake Computing
Snowflake.net
kent.graziano@snowflake.net
@KentGraziano
http://kentgraziano.com

Más contenido relacionado

La actualidad más candente

Introduction to Data Vault Modeling
Introduction to Data Vault ModelingIntroduction to Data Vault Modeling
Introduction to Data Vault ModelingKent Graziano
 
Worst Practices in Data Warehouse Design
Worst Practices in Data Warehouse DesignWorst Practices in Data Warehouse Design
Worst Practices in Data Warehouse DesignKent Graziano
 
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...Kent Graziano
 
Making Sense of Schema on Read
Making Sense of Schema on ReadMaking Sense of Schema on Read
Making Sense of Schema on ReadKent Graziano
 
Cloud Data Warehousing presentation by Rogier Werschkull, including tips, bes...
Cloud Data Warehousing presentation by Rogier Werschkull, including tips, bes...Cloud Data Warehousing presentation by Rogier Werschkull, including tips, bes...
Cloud Data Warehousing presentation by Rogier Werschkull, including tips, bes...Patrick Van Renterghem
 
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachUsing OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachKent Graziano
 
Extreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 DimensionsExtreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 DimensionsKent Graziano
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseDatabricks
 
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data ModelingAgile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data ModelingKent Graziano
 
Dataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra SolutionsDataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra SolutionsQuontra Solutions
 
Data Vault Automation at the Bijenkorf
Data Vault Automation at the BijenkorfData Vault Automation at the Bijenkorf
Data Vault Automation at the BijenkorfRob Winters
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWKent Graziano
 
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...Amr Awadallah
 
Kb 40 kevin_klineukug_reading20070717[1]
Kb 40 kevin_klineukug_reading20070717[1]Kb 40 kevin_klineukug_reading20070717[1]
Kb 40 kevin_klineukug_reading20070717[1]shuwutong
 
Top Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data ModelerTop Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data ModelerKent Graziano
 
Self-serve analytics journey at Celtra: Snowflake, Spark, and Databricks
Self-serve analytics journey at Celtra: Snowflake, Spark, and DatabricksSelf-serve analytics journey at Celtra: Snowflake, Spark, and Databricks
Self-serve analytics journey at Celtra: Snowflake, Spark, and DatabricksGrega Kespret
 
Demystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceDemystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceSnowflake Computing
 
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Cloudera, Inc.
 

La actualidad más candente (20)

Introduction to Data Vault Modeling
Introduction to Data Vault ModelingIntroduction to Data Vault Modeling
Introduction to Data Vault Modeling
 
Worst Practices in Data Warehouse Design
Worst Practices in Data Warehouse DesignWorst Practices in Data Warehouse Design
Worst Practices in Data Warehouse Design
 
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...HOW TO SAVE  PILEs of $$$BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
HOW TO SAVE PILEs of $$$ BY CREATING THE BEST DATA MODEL THE FIRST TIME (Ksc...
 
Making Sense of Schema on Read
Making Sense of Schema on ReadMaking Sense of Schema on Read
Making Sense of Schema on Read
 
Cloud Data Warehousing presentation by Rogier Werschkull, including tips, bes...
Cloud Data Warehousing presentation by Rogier Werschkull, including tips, bes...Cloud Data Warehousing presentation by Rogier Werschkull, including tips, bes...
Cloud Data Warehousing presentation by Rogier Werschkull, including tips, bes...
 
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile ApproachUsing OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
Using OBIEE and Data Vault to Virtualize Your BI Environment: An Agile Approach
 
Extreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 DimensionsExtreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
Extreme BI: Creating Virtualized Hybrid Type 1+2 Dimensions
 
Free Training: How to Build a Lakehouse
Free Training: How to Build a LakehouseFree Training: How to Build a Lakehouse
Free Training: How to Build a Lakehouse
 
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data ModelingAgile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
 
Dataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra SolutionsDataware house Introduction By Quontra Solutions
Dataware house Introduction By Quontra Solutions
 
Data Vault Automation at the Bijenkorf
Data Vault Automation at the BijenkorfData Vault Automation at the Bijenkorf
Data Vault Automation at the Bijenkorf
 
Visual Data Vault
Visual Data VaultVisual Data Vault
Visual Data Vault
 
Demystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFWDemystifying Data Warehousing as a Service - DFW
Demystifying Data Warehousing as a Service - DFW
 
Data vault what's Next: Part 2
Data vault what's Next: Part 2Data vault what's Next: Part 2
Data vault what's Next: Part 2
 
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
 
Kb 40 kevin_klineukug_reading20070717[1]
Kb 40 kevin_klineukug_reading20070717[1]Kb 40 kevin_klineukug_reading20070717[1]
Kb 40 kevin_klineukug_reading20070717[1]
 
Top Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data ModelerTop Five Cool Features in Oracle SQL Developer Data Modeler
Top Five Cool Features in Oracle SQL Developer Data Modeler
 
Self-serve analytics journey at Celtra: Snowflake, Spark, and Databricks
Self-serve analytics journey at Celtra: Snowflake, Spark, and DatabricksSelf-serve analytics journey at Celtra: Snowflake, Spark, and Databricks
Self-serve analytics journey at Celtra: Snowflake, Spark, and Databricks
 
Demystifying Data Warehouse as a Service
Demystifying Data Warehouse as a ServiceDemystifying Data Warehouse as a Service
Demystifying Data Warehouse as a Service
 
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
Hadoop in the Enterprise - Dr. Amr Awadallah @ Microstrategy World 2011
 

Destacado

Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)Kent Graziano
 
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureData Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureKent Graziano
 
Guru4Pro Data Vault Best Practices
Guru4Pro Data Vault Best PracticesGuru4Pro Data Vault Best Practices
Guru4Pro Data Vault Best PracticesCGI
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016Kent Graziano
 
Effective Test Driven Database Development
Effective Test Driven Database DevelopmentEffective Test Driven Database Development
Effective Test Driven Database Developmentelliando dias
 
Testing regression
Testing regressionTesting regression
Testing regressionRichie Lee
 
Hands on training on DbFit Part-II
Hands on training on DbFit Part-IIHands on training on DbFit Part-II
Hands on training on DbFit Part-IIBabul Mirdha
 
Deliberate Practice (Agile Slovenia 2015)
Deliberate Practice (Agile Slovenia 2015)Deliberate Practice (Agile Slovenia 2015)
Deliberate Practice (Agile Slovenia 2015)Peter Kofler
 
Test-Driven Development with DbFit and Oracle database, BGOUG Conference, 201...
Test-Driven Development with DbFit and Oracle database, BGOUG Conference, 201...Test-Driven Development with DbFit and Oracle database, BGOUG Conference, 201...
Test-Driven Development with DbFit and Oracle database, BGOUG Conference, 201...Yavor Nikolov
 
見守りサービスMiimaミーマ 資料
見守りサービスMiimaミーマ 資料見守りサービスMiimaミーマ 資料
見守りサービスMiimaミーマ 資料kaisya_account
 
Open Source BI Overview
Open Source BI Overview Open Source BI Overview
Open Source BI Overview Alex Meadows
 
Trivadis TechEvent 2016 A few thoughts on the subject Continuous integration ...
Trivadis TechEvent 2016 A few thoughts on the subject Continuous integration ...Trivadis TechEvent 2016 A few thoughts on the subject Continuous integration ...
Trivadis TechEvent 2016 A few thoughts on the subject Continuous integration ...Trivadis
 
Hands on training on DbFit Part-I
Hands on training on DbFit Part-IHands on training on DbFit Part-I
Hands on training on DbFit Part-IBabul Mirdha
 
Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)Michael Olschimke
 

Destacado (17)

Why Data Vault?
Why Data Vault? Why Data Vault?
Why Data Vault?
 
Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)
 
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureData Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
 
Guru4Pro Data Vault Best Practices
Guru4Pro Data Vault Best PracticesGuru4Pro Data Vault Best Practices
Guru4Pro Data Vault Best Practices
 
Data Warehousing 2016
Data Warehousing 2016Data Warehousing 2016
Data Warehousing 2016
 
Effective Test Driven Database Development
Effective Test Driven Database DevelopmentEffective Test Driven Database Development
Effective Test Driven Database Development
 
Testing regression
Testing regressionTesting regression
Testing regression
 
Hands on training on DbFit Part-II
Hands on training on DbFit Part-IIHands on training on DbFit Part-II
Hands on training on DbFit Part-II
 
Deliberate Practice (Agile Slovenia 2015)
Deliberate Practice (Agile Slovenia 2015)Deliberate Practice (Agile Slovenia 2015)
Deliberate Practice (Agile Slovenia 2015)
 
Test-Driven Development with DbFit and Oracle database, BGOUG Conference, 201...
Test-Driven Development with DbFit and Oracle database, BGOUG Conference, 201...Test-Driven Development with DbFit and Oracle database, BGOUG Conference, 201...
Test-Driven Development with DbFit and Oracle database, BGOUG Conference, 201...
 
見守りサービスMiimaミーマ 資料
見守りサービスMiimaミーマ 資料見守りサービスMiimaミーマ 資料
見守りサービスMiimaミーマ 資料
 
Open Source BI Overview
Open Source BI Overview Open Source BI Overview
Open Source BI Overview
 
Trivadis TechEvent 2016 A few thoughts on the subject Continuous integration ...
Trivadis TechEvent 2016 A few thoughts on the subject Continuous integration ...Trivadis TechEvent 2016 A few thoughts on the subject Continuous integration ...
Trivadis TechEvent 2016 A few thoughts on the subject Continuous integration ...
 
Sigist Presentation 091208 V2.0
Sigist Presentation 091208 V2.0Sigist Presentation 091208 V2.0
Sigist Presentation 091208 V2.0
 
Agile WTF
Agile WTFAgile WTF
Agile WTF
 
Hands on training on DbFit Part-I
Hands on training on DbFit Part-IHands on training on DbFit Part-I
Hands on training on DbFit Part-I
 
Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)
 

Similar a Agile Methods and Data Warehousing (2016 update)

Agile methods and dw mha
Agile methods and dw mhaAgile methods and dw mha
Agile methods and dw mhaAgileDenver
 
Transition to a modern data platform
Transition to a modern data platform Transition to a modern data platform
Transition to a modern data platform Michael Ghen
 
Managing software projects & teams effectively
Managing software projects & teams effectivelyManaging software projects & teams effectively
Managing software projects & teams effectivelyAshutosh Agarwal
 
Excalibur: best practices for virtual desktop operations leveraging Citrix Di...
Excalibur: best practices for virtual desktop operations leveraging Citrix Di...Excalibur: best practices for virtual desktop operations leveraging Citrix Di...
Excalibur: best practices for virtual desktop operations leveraging Citrix Di...Citrix
 
Mohd_Shaukath_5_Exp_Datastage
Mohd_Shaukath_5_Exp_DatastageMohd_Shaukath_5_Exp_Datastage
Mohd_Shaukath_5_Exp_DatastageMohammed Shaukath
 
Measuring the Productivity of Your Engineering Organisation - the Good, the B...
Measuring the Productivity of Your Engineering Organisation - the Good, the B...Measuring the Productivity of Your Engineering Organisation - the Good, the B...
Measuring the Productivity of Your Engineering Organisation - the Good, the B...Marin Dimitrov
 
Architecting for analytics
Architecting for analyticsArchitecting for analytics
Architecting for analyticsRob Winters
 
Big Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil GamesBig Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil GamesRob Winters
 
Embedding a Shift Left Culture in your Enterprise
Embedding a Shift Left Culture in your EnterpriseEmbedding a Shift Left Culture in your Enterprise
Embedding a Shift Left Culture in your EnterpriseGerald Bachlmayr
 
Why Agile? Back to Basics.
Why Agile? Back to Basics.Why Agile? Back to Basics.
Why Agile? Back to Basics.Lucas Hendrich
 
Webinar: Demonstrating Business Value for DevOps & Continuous Delivery
Webinar: Demonstrating Business Value for DevOps & Continuous DeliveryWebinar: Demonstrating Business Value for DevOps & Continuous Delivery
Webinar: Demonstrating Business Value for DevOps & Continuous DeliveryXebiaLabs
 
Designing salesforce solutions for reuse - Josh Dennis
Designing salesforce solutions for reuse - Josh DennisDesigning salesforce solutions for reuse - Josh Dennis
Designing salesforce solutions for reuse - Josh DennisSakthivel Madesh
 
Agile software development compfest 13
Agile software development compfest 13Agile software development compfest 13
Agile software development compfest 13Panji Gautama
 
Vishwanath_M_CV_NL
Vishwanath_M_CV_NLVishwanath_M_CV_NL
Vishwanath_M_CV_NLVishwanath M
 
Moving from BI to AI : For decision makers
Moving from BI to AI : For decision makersMoving from BI to AI : For decision makers
Moving from BI to AI : For decision makerszekeLabs Technologies
 
Hemanth Kumar - Drupal Architect
Hemanth Kumar - Drupal ArchitectHemanth Kumar - Drupal Architect
Hemanth Kumar - Drupal ArchitectHemanth Kumar
 

Similar a Agile Methods and Data Warehousing (2016 update) (20)

Agile methods and dw mha
Agile methods and dw mhaAgile methods and dw mha
Agile methods and dw mha
 
Transition to a modern data platform
Transition to a modern data platform Transition to a modern data platform
Transition to a modern data platform
 
Managing software projects & teams effectively
Managing software projects & teams effectivelyManaging software projects & teams effectively
Managing software projects & teams effectively
 
Excalibur: best practices for virtual desktop operations leveraging Citrix Di...
Excalibur: best practices for virtual desktop operations leveraging Citrix Di...Excalibur: best practices for virtual desktop operations leveraging Citrix Di...
Excalibur: best practices for virtual desktop operations leveraging Citrix Di...
 
Mohd_Shaukath_5_Exp_Datastage
Mohd_Shaukath_5_Exp_DatastageMohd_Shaukath_5_Exp_Datastage
Mohd_Shaukath_5_Exp_Datastage
 
Measuring the Productivity of Your Engineering Organisation - the Good, the B...
Measuring the Productivity of Your Engineering Organisation - the Good, the B...Measuring the Productivity of Your Engineering Organisation - the Good, the B...
Measuring the Productivity of Your Engineering Organisation - the Good, the B...
 
Architecting for analytics
Architecting for analyticsArchitecting for analytics
Architecting for analytics
 
Big Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil GamesBig Data at a Gaming Company: Spil Games
Big Data at a Gaming Company: Spil Games
 
Embedding a Shift Left Culture in your Enterprise
Embedding a Shift Left Culture in your EnterpriseEmbedding a Shift Left Culture in your Enterprise
Embedding a Shift Left Culture in your Enterprise
 
Why Agile? Back to Basics.
Why Agile? Back to Basics.Why Agile? Back to Basics.
Why Agile? Back to Basics.
 
Webinar: Demonstrating Business Value for DevOps & Continuous Delivery
Webinar: Demonstrating Business Value for DevOps & Continuous DeliveryWebinar: Demonstrating Business Value for DevOps & Continuous Delivery
Webinar: Demonstrating Business Value for DevOps & Continuous Delivery
 
Designing salesforce solutions for reuse - Josh Dennis
Designing salesforce solutions for reuse - Josh DennisDesigning salesforce solutions for reuse - Josh Dennis
Designing salesforce solutions for reuse - Josh Dennis
 
Agile software development compfest 13
Agile software development compfest 13Agile software development compfest 13
Agile software development compfest 13
 
Manoranjan OBIEE CV-2017
Manoranjan OBIEE CV-2017Manoranjan OBIEE CV-2017
Manoranjan OBIEE CV-2017
 
Amit_Resume
Amit_ResumeAmit_Resume
Amit_Resume
 
KarthikSNOW_CV
KarthikSNOW_CVKarthikSNOW_CV
KarthikSNOW_CV
 
Vishwanath_M_CV_NL
Vishwanath_M_CV_NLVishwanath_M_CV_NL
Vishwanath_M_CV_NL
 
Moving from BI to AI : For decision makers
Moving from BI to AI : For decision makersMoving from BI to AI : For decision makers
Moving from BI to AI : For decision makers
 
Hemanth Kumar - Drupal Architect
Hemanth Kumar - Drupal ArchitectHemanth Kumar - Drupal Architect
Hemanth Kumar - Drupal Architect
 
Resume
ResumeResume
Resume
 

Último

Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Hiroshi SHIBATA
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 

Último (20)

Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024Long journey of Ruby standard library at RubyConf AU 2024
Long journey of Ruby standard library at RubyConf AU 2024
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 

Agile Methods and Data Warehousing (2016 update)

  • 1. KENT GRAZIANO @KentGraziano | kentgraziano.com AGILE METHODS AND DATA WAREHOUSING: HOW TO DELIVER FASTER
  • 2. Agenda  My Bio  Why Agile & DW  Agile Manifesto  12 Agile Principles  Agile Concepts  Two week iterations?  Becoming Agile  Agile DevOps  Agile Data Modeling  Conclusion © Data Warrior LLC
  • 3. My Bio › Senior Technical Evangelist, Snowflake Computing › Blogger: The Data Warrior › Certified Data Vault Master and DV 2.0 Practitioner › Oracle ACE Director (BI/DW) › Data Modeling, Data Architecture and Data Warehouse Specialist › 30+ years in IT › 25+ years of Oracle-related work › 20+ years of data warehousing experience › Former-Member: Boulder BI Brain Trust (http://www.boulderbibraintrust.org/) › Author & Co-Author of a bunch of books › Past-President of Oracle Development Tools User Group and Rocky Mountain Oracle User Group
  • 4. Why Agile & DW?  Perceptions ● DW too slow to produce results ● DW projects “fail” ● DW projects fail to adapt to business changes  Goal ● Change perceptions ● Deliver value ● Be more adaptable and flexible © Data Warrior LLC
  • 5. Objectives  Understand what is meant by “agile”  Try to apply some agile ideas to data warehouse and business intelligence efforts  Answer the question – can we deliver results faster?
  • 6. Manifesto for Agile Software Development http://agilemanifesto.org We are uncovering better ways of developing software by doing it and helping others do it. Through this work we have come to value: Individuals and interactions over processes and tools Working software over comprehensive documentation Customer collaboration over contract negotiation Responding to change over following a plan That is, while there is value in the items on the right, we value the items on the left more Kent Beck Mike Beedle Arie van Bennekum Alistair Cockburn Ward Cunningham Martin Fowler James Grenning Jim Highsmith Andrew Hunt Ron Jeffries Jon Kern Brian Marick Robert C. Martin Steve Mellor Ken Schwaber Jeff Sutherland Dave Thomas © 2001, the above authors this declaration may be freely copied in any form, but only in its entirety through this notice.
  • 8. Principle #1  Our highest priority is to satisfy the customer through early and continuous delivery of valuable software. ● Who is the customer? ● What is “valuable software” in data warehousing? ● BI reports ● Dashboard interface ● Working ETL code? ● In the context of the customer! ● Once we start – keep going! © Data Warrior LLC
  • 9. Principle #2  Welcome changing requirements, even late in development. Agile processes harness change for the customer's competitive advantage. ● Must be flexible and adaptable in thinking and design ● User Stories ● Use code generators ● Agile data engineering techniques ● Flexible and performant data platform © Data Warrior LLC
  • 10. Principle #3  Deliver working software frequently, from a couple of weeks to a couple of months, with a preference to the shorter timescale. ●Need good scope control! ● Prioritized Backlog ●One subject area at a time ● What is a subject area? ●Think Data Vault © Data Warrior LLC
  • 11. Principle #4  Business people and developers must work together daily throughout the project. ● DW MUST have the business involved ● One of the Top 10 reasons for failure ● This applies for BI reports ● Daily interaction would be great! ● But – politics and priorities may interfere! ● At HP GBI/EDW – we used “war” room ● At MSH – daily standups with Biz © Data Warrior LLC
  • 12. Principle #5  Build projects around motivated individuals. Give them the environment and support they need, and trust them to get the job done. ● Need people who WANT to be on the project ● Lose the dead weight ● Get training if needed ● Keep units of work small to create an atmosphere of success ● Don’t try a Big Bang EDW © Data Warrior LLC
  • 13. Principle #6  The most efficient and effective method of conveying information to and within a development team is face-to-face conversation. ● Daily team huddles ● Co-located work space ● While face-to-face is efficient, still need some documentation (or meta-data) for later ● Use a tool like JIRA, VersionOne, Rally, LeanKit © Data Warrior LLC
  • 14. Principle #7  Working software is the primary measure of progress. ●Applied to DW: ● What is “working software?” ● BI reports? ● Tables definitions and working ETL code? ●Think more broadly – it is not just a data entry screen © Data Warrior LLC
  • 15. Principle #8  Agile processes promote sustainable development. The sponsors, developers, and users should be able to maintain a constant pace indefinitely. ● DW Programs last a long time – don’t burn the team out with unreasonable deadlines ● See P#5 – Motivated individuals ● Good planning and scope control ● No all nighters! ● Smallest valuable unit of work possible (MVP) ● Keep it moving like a production line ● Pick (or develop) a standard, repeatable methodology ● Study the Agile methods and adopt what works for your team ● Data Vault Modeling Methodology © Data Warrior LLC
  • 16. Principle #9  Continuous attention to technical excellence and good design enhances agility. ● Bad design + bad architecture = trouble ● Symptom: can’t build a requested data mart ● Frequent design reviews a must ● Improves team skills – provides cross training ● Over time – better designs, shorter review cycles ● Faster delivery © Data Warrior LLC
  • 17. Principle #10  Simplicity--the art of maximizing the amount of work not done--is essential. ● KISS – Keep it Simple Stupid ● Write less code by hand ● Use code generators! (No syntax errors – ever) ● Oracle SDDM, ERWin, Vertabelo ● Oracle Data Integrator, SSIS ● AnalytxDS, WhereScape RED ● Modifications are easier – just regenerate the code ● Virtualize initial reporting layers ● Agile DevOps! © Data Warrior LLC
  • 18. Principle #11  The best architectures, requirements, and designs emerge from self-organizing teams. ● Team of smart, motivated people = success ● We succeed (or fail) as a TEAM ● Don’t micro manage or pigeon-hole staff ● Encourage team work and team thinking ● Staff will gravitate to roles based on skills, interest, and personality ● Then they have more buy-in to the process ● Eliminates delays and bottlenecks by having shared responsibilities (no single point of failure) © Data Warrior LLC
  • 19. Principle #12  At regular intervals, the team reflects on how to become more effective, then tunes and adjusts its behavior accordingly. ● The Decision Model ● Debate Mode ● Check Points ● Related to self-organizing teams ● Make finding the solution to a problem the team’s problem ● More buy-in to the solution ● Retrospectives are a MUST! © Data Warrior LLC
  • 20. Decision Model in Action Plan Debate Decision  Check Point Questions ? ? ? ? Answers Mini-Debate (Cause a Slight Change in Direction) Iterate Courtesy of Dr. Ed Freeman, CIO/CTO, Denver Public Schools © Data Warrior LLC
  • 22. Agile Concepts for DW  Team Huddles (Morning Scrum)  Extreme Programming  Pair Programming © Data Warrior LLC
  • 23. Team Huddles  Daily Standup Meeting ● AKA Scum ● Morning Roll Call (FDD)  Short meeting (< 15 minutes) ● Every morning, mandatory attendance ● Review assignments, accomplishments, backlogs, blockers ● 3 Questions  Immediate feedback and assistance ● Keeps team motivated and on track (P #5) ● Identifies constraints and bottlenecks early in the process ● Eliminates backlogs more quickly via re-assignments  Improves team work  Supports self-organizing teams (P #11) © Data Warrior LLC
  • 24. Extreme Programming (XP)  Programmer works directly with the end user ● At MSH used conference room or devloper’s cube ● At HP used Virtual Classroom or NetMeeting ● At DFA used WebEx  In DW: ● Best with developing BI reports ● DW or data mart must already be populated © Data Warrior LLC
  • 25. Extreme Programming (XP)  Reports developed using BI tool ● With the user in the room (or virtual) ● With constant user reviews and input using a web reporting tool (via email even!)  Also applies to developing a dashboard or portal interface  Works for ETL as well! ● Used war room with business to get near instant validation of ETL changes © Data Warrior LLC
  • 26. Pair/Mob Programming  Part of XP  Programmers work side-by-side ● One terminal ● One codes, the other reviews ● Two terminals, one cube ● One programming, one documenting ● Could also be done virtually!  In DW: ● Writing ETL Code ● Pair data modeling ● Report Development © Data Warrior LLC
  • 27. Two week iterations?  Goal is really a few weeks to a few months (see P#3)  What is the deliverable? ● A fact table for a star schema ● A dimension table ● A complete star (fact and all dimensions) ● One piece of ETL code that populates a fact table ● A function needed by the ETL code ● A new report or query  Who is the customer? ● BI programmer? ● Knowledge worker? ● ETL programmer?© Data Warrior LLC
  • 29. HP EDW Examples  Business found missing report elements  Solution: modify 3 tables to add 5 new columns in reporting model (star schema)  Tasks: ● Document requirements and ETL specs ● Modify Logical & Physical model (w/peer review) ● Rebuild tables in development ● Develop and test ETL ● MTI (Move To Integration) tables and code ● Execute and test ETL ● Modify report in UAT environment & test  Result: Revised report ready in 18 hours, 44 minutes ● Less than 1 business day  2nd case: 6 tables, 16 new columns ● Ready for UAT in 72 hours  How? War room with Biz & IT Team © Data Warrior LLC
  • 31. Getting to Agile/RAD  “better than average expertise” ● Expert consulting and mentoring ● Do the work (OTJ)  At Denver Public Schools ● Took two years before we could try being more “agile” ● Needed experience in Data Warehousing, Oracle Designer, OWB, and the “process” of building, deploying, and maintaining an Oracle DW © Data Warrior LLC
  • 32. Getting to Agile/RAD  At HP GBI/EDW it took about a year ● Needed the right team and the right management support ● Also the right project with willing business users  At McKesson over a year setting standards and training staff ● Built templates in JIRA for repeatable tasks ● Then 3 week sprints  At DFA – Day 1! ● Experienced team with agile BI/DW coach ● Used modified KanBan © Data Warrior LLC
  • 33. Getting to Agile  Cannot instantly become “agile”  Must develop and agile mindset & culture  Implement agile tools, processes, and methods  Get an Agile Coach if necessary ● Specfically an agile BI/DW or Data Coach © Data Warrior LLC
  • 35. “the art of maximizing the amount of work not done”  Remember Principle #10!  Need a dev setup that let’s you be agile ● Painless & fast setup and configuration ● Easy to add more space and compute on demand ● Minimal resource contention ● Automated query optimization ● Easy cloning of data sets  Automated Regression Testing! ● DBFit, iCEDQ, AnalytixDS, WhereScape ● Roll your own – basic SQL scripts © Data Warrior LLC
  • 37. Data Vault – How I did it  Data modeling technique for enterprise data warehouse design ● See Data Vault white papers at kentgraziano.com ● Data Vault related books on Amazon (author Linstedt)  Allows modeling EDW in small chunks ● Develop model, build tables, build ETL, populate, repeat (often) ● Key: prioritize the data requirements ● Think User Stories and Backlog ala SCRUM © Data Warrior LLC
  • 38. Data Vault Components Copyright 2011 Dan Linstedt, used by permission
  • 39. Data Vault Model Flexibility (Agility)  Goes beyond standard 3NF • Highly normalized ● Hubs and Links only hold keys and meta data ● Satellites split by rate of change and/or source • Enables Agile data modeling ● Easy to add to model without having to change existing structures and load routines • Relationships (links) can be dropped and created on-demand. ● No more reloading history because of a missed requirement  Based on natural business keys • Not system surrogate keys • Allows for integrating data across functions and source systems more easily ● All data relationships are key driven. © Data Warrior LLC
  • 40. Data Vault Extensibility Adding new components to the EDW has NEAR ZERO impact to: • Existing Loading Processes • Existing Data Model • Existing Reporting & BI Functions • Existing Source Systems • Existing Star Schemas and Data Marts (C) LearnDataVault.com Sprint 1 Sprint 2 Sprint 3
  • 41. How to be Agile using DV  Model iteratively ● Use Data Vault data modeling technique ● Create basic components, then add over time  Virtualize the Access Layer ● Don’t waste time building facts and dimensions up front (Principle #10) ● ETL and testing takes too long ● “Project” objects using pattern-based DV model with database views (or BI meta layer) ● Users see real reports with real data  Can always build out for performance in another iteration © Data Warrior LLC
  • 42. SHAMELESS PLUG: My Intro Book Available on Amazon.com http://www.amazon.com/Better- Data-Modeling-Introduction- Engineering- ebook/dp/B018BREV1C/
  • 43. New DV 2.0 Book › Available on Amazon: http://www.amazon.com/Building- Scalable-Data-Warehouse- Vault/dp/0128025107/
  • 45. Other References  Agile Management for Software Engineering: Applying the Theory of Constraints for Business Results by David J. Anderson  CASE Method Fast-track: A RAD Approach by Richard Barker & Dai Clegg  The Goal by Eliyahu M. Goldratt  The Business of Data Vault Modeling by Dan Linstedt, Kent Graziano, & Hans Hultgren  Super Charge Your Data Warehouse by Dan Linstedt  Test Automation by Lynn Winterboer http://www.slideshare.net/AgileDenver/lynn- winterboer-test-automation © Data Warrior LLC
  • 46. Conclusion  Agile concepts can be applied to data warehouse and BI projects ● Not a purist definition! ● Try to apply the principles – be creative  Suggested approaches ● Data Vault 2.0! ● Use Snowflake Elastic DW! ● Use team huddles ● Use pair programming to increase quality and cross training ● Use code generators ● Read about Agile Methods ● Read Oracle CASE Method Fast-Track  Be flexible and give it a try © Data Warrior LLC