One of the first Dutch Data Vault implementations, presented at the seminar 'ETL uitgediept' in 2004. It covers the Data Vault modeling techniques and the automation efforts used to speed up development.
2. 2
Agenda
Introduction SNS Bank
ICT strategy
DWH framework & tooling
Data Vault™ modeling
ETL-procedure
Modeling deliverables & procedure
Advantages
Further developments
3. 3
Introduction SNS Bank
Create financial resources for individuals and companies
Focus on innovative products and services
Mortgages
Asset accrual and protection
Key figures 2003
Employees: 3,400
Profit: € 129 million
Balance sheet total: € 38.5 billion
5. 5
ICT strategy
Standardisation
Development environments
Infrastructure components
Application-integration
Solutions
Re-use -- Buy -- Build
Open system architecture / open standards
No vendor lock-in
Business-ICT alignment (developments in ICT & business translate to changes in standards & directions)
6. 6
DWH framework
[Figure: DWH framework diagram. Layers: data extraction, data storage, data exploitation. Components shown: source systems and an external source, interfaces, data staging, operational data store, data warehouse, data marts 1 to 4, and the outputs analysis, standard report and ad-hoc questions, at operational, tactical and strategic level. Supporting functions: metadata management, maintenance, infrastructure, organisation.]
7. 7
Tooling
[Figure: the DWH framework diagram (data extraction, data storage, data exploitation) repeated as the backdrop for the tooling overview; no tool names appear in the text.]
10. 10
Data Vault™ DWH model (1)
• 3NF + timestamps (1980s)
• Dimensional modelling (1990s)
• Data Vault™ (since 2000)
  A detail-oriented, historical-tracking and uniquely linked set of normalized tables.
  Hybrid approach encompassing the best of breed between 3NF and dimensional modelling.
Dan Linstedt: www.danlinstedt.com
11. 11
Data Vault™ DWH model (2)
Hub
Table carrying a unique list of business keys, representing a business component.
Link
Represents the relation or transaction between two or more business components.
Satellite
Context information about hubs or links. This information is subject to change over time.
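The three building blocks above can be sketched in a few lines of Python. This is only an in-memory illustration, not the SNS implementation: the function names, the dictionary layout and the sample keys are all assumptions made for the example.

```python
from datetime import date

def get_or_create_hub_id(hub, business_key):
    """Return the surrogate id for a business key, adding the key if it is new."""
    if business_key not in hub:
        hub[business_key] = len(hub) + 1
    return hub[business_key]

def record_satellite(satellite, parent_id, attributes, load_date):
    """Append a satellite row only when the context actually changed."""
    current = next(
        (r for r in satellite if r["parent_id"] == parent_id and r["end"] is None),
        None,
    )
    if current is not None:
        if current["attributes"] == attributes:
            return  # nothing changed, keep the current row open
        current["end"] = load_date  # close the old version
    satellite.append(
        {"parent_id": parent_id, "start": load_date, "end": None,
         "attributes": dict(attributes)}
    )

# Hypothetical sample data: two hubs, one link, one satellite.
account_hub, office_hub = {}, {}
account_office_link = set()
account_info = []

acc = get_or_create_hub_id(account_hub, "1234567890")
off = get_or_create_hub_id(office_hub, ("bank-1", 17))
account_office_link.add((acc, off))  # link rows hold surrogate ids only

record_satellite(account_info, acc, {"credit_limit": 1000}, date(2004, 1, 1))
record_satellite(account_info, acc, {"credit_limit": 1000}, date(2004, 2, 1))
record_satellite(account_info, acc, {"credit_limit": 2500}, date(2004, 3, 1))
```

The hub never changes once a key is registered, the link stores nothing but ids, and all history ends up in the satellite: the unchanged February load adds no row, while the March load closes the first row and opens a second.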
12. 12
Data Vault™ DWH model (3)
Complementary choices at SNS
Satellites
State satellite. Validation of hubs & links.
Information satellite. Specific properties of hubs and links, stored in attributes.
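One way to read this split is that the state satellite says whether a hub or link is valid at a given moment, while the information satellite carries its properties. The sketch below assumes that reading. It also assumes that an indicator of 1 means "valid", that dates are integers in YYYYMMDD form, and that intervals are half-open (begin inclusive, end exclusive); none of these conventions is stated on the slides.

```python
def row_on(rows, parent_id, day):
    """Return the row whose [begin, end) interval contains the given day."""
    for row in rows:
        if row["id"] == parent_id and row["begin"] <= day < row["end"]:
            return row
    return None

def info_if_valid(state_rows, info_rows, parent_id, day):
    """Read the information satellite only where the state satellite marks the key valid."""
    state = row_on(state_rows, parent_id, day)
    if state is None or state["indicator"] != 1:
        return None
    return row_on(info_rows, parent_id, day)

# Hypothetical rows for one hub key (id 1).
state = [
    {"id": 1, "indicator": 1, "begin": 20040101, "end": 20040601},
    {"id": 1, "indicator": 0, "begin": 20040601, "end": 99991231},
]
info = [
    {"id": 1, "begin": 20040101, "end": 99991231, "naam": "Spaarrekening"},
]

valid_row = info_if_valid(state, info, 1, 20040301)
invalid_row = info_if_valid(state, info, 1, 20040701)
```

Keeping validity apart from the descriptive attributes means a key that disappears from the source only touches the small state satellite, and the information rows stay as they were loaded.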
13. 13
Data Vault™ DWH model (4)
[Figure: example Data Vault model for accounts (rekening), account types (rekeningsoort) and offices (kantoor). Tables, columns and data types as shown in the diagram:]

Entities shown without columns:
  Rekening status
  Rekening kantoor van opening status
  Rekening kantoor van inschrijving status
  Rekening rekeningsoort status

Rekeningsoort (account type)
  ID Rekeningsoort              INTEGER
  Rekeningsoortnummer           INTEGER
  Banknummer Bank               INTEGER

Rekeningsoort Status
  ID Rekeningsoort              INTEGER
  Rekeningsoort indicator       SMALLINT
  Begindatum rekeningsoort      INTEGER
  Einddatum rekeningsoort       INTEGER

Rekeningsoort Informatie
  ID Rekeningsoort              INTEGER
  Begindatum rekeningsoort      INTEGER
  Einddatum rekeningsoort       INTEGER
  Indicator zakelijk product    SMALLINT
  Indicator lening              SMALLINT
  Methode maximum debetstand    SMALLINT
  WTK product codering          SMALLINT
  Naam rekeningsoort            VARCHAR(40)
  Korte naam rekeningsoort      VARCHAR(40)

Link Rekeningsoort Rekening
  ID Rekeningsoort              INTEGER
  ID Rekening                   INTEGER

Rekening (account)
  ID Rekening                   INTEGER
  Rekeningnummer                DECIMAL(10)

Rekening informatie
  ID Rekening                   INTEGER
  Begindatum rekening           INTEGER
  Einddatum rekening            INTEGER
  Bedrag Kredietlimiet          DECIMAL(11,2)
  Saldocompensatie              SMALLINT
  Blokkeringsindicator          SMALLINT
  WTK codering                  SMALLINT
  Datum opening rekening        INTEGER
  Datum afbetaald               INTEGER
  Datum herleefd                INTEGER

Rekening saldo
  ID Rekening                   INTEGER
  Begindatum rekening           INTEGER
  Einddatum rekening            INTEGER
  Saldo rekening                DECIMAL(11,2)

Rekening compensatielimiet
  ID Rekening                   INTEGER
  Agenderingsnummer             INTEGER
  Begindatum rekening           INTEGER
  Einddatum rekening            INTEGER
  Compensatielimietbedrag       DECIMAL(11,2)

Rekening betaalruimte
  ID Rekening                   INTEGER
  Agenderingsnummer             INTEGER
  Begindatum rekening           INTEGER
  Einddatum rekening            INTEGER
  Betaalruimte                  DECIMAL(11,2)

Kantoren (offices)
  ID Kantoor                    INTEGER
  Banknummer Bank               INTEGER
  Kantoornummer                 INTEGER

Link rekening kantoor van opening
  ID Rekening                   INTEGER
  ID Kantoor                    INTEGER

Link rekening kantoor van inschrijving
  ID Rekening                   INTEGER
  ID Kantoor                    INTEGER
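To show how such a model might be queried, the sketch below builds the Rekening hub and the Rekening saldo satellite in an in-memory SQLite database and looks up a balance as of a given day. The snake_case names are derived from the diagram. The rest is assumed for the example: dates encoded as YYYYMMDD integers, 99991231 as the open end date, half-open intervals, the keys and constraints, and the sample rows.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE rekening (
    id_rekening     INTEGER PRIMARY KEY,
    rekeningnummer  DECIMAL(10) NOT NULL UNIQUE
);
CREATE TABLE rekening_saldo (
    id_rekening          INTEGER NOT NULL REFERENCES rekening(id_rekening),
    begindatum_rekening  INTEGER NOT NULL,
    einddatum_rekening   INTEGER NOT NULL,
    saldo_rekening       DECIMAL(11,2) NOT NULL,
    PRIMARY KEY (id_rekening, begindatum_rekening)
);
""")

# Hypothetical sample rows: one account with two balance versions.
conn.execute("INSERT INTO rekening VALUES (1, 1234567890)")
conn.executemany(
    "INSERT INTO rekening_saldo VALUES (?, ?, ?, ?)",
    [(1, 20040101, 20040201, 100.0), (1, 20040201, 99991231, 250.5)],
)

def saldo_on(conn, rekeningnummer, day):
    """Return the balance valid on the given day, or None if no row covers it."""
    row = conn.execute(
        """
        SELECT s.saldo_rekening
        FROM rekening r
        JOIN rekening_saldo s ON s.id_rekening = r.id_rekening
        WHERE r.rekeningnummer = ?
          AND s.begindatum_rekening <= ?
          AND ? < s.einddatum_rekening
        """,
        (rekeningnummer, day, day),
    ).fetchone()
    return row[0] if row else None
```

A call such as `saldo_on(conn, 1234567890, 20040115)` enters through the business key in the hub and picks the satellite row whose interval contains the requested day. The other satellites in the diagram can be queried the same way.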
29. 29
Further developments
1. Generating PowerCenter workflows/mappings
2. Adding Data Mart functionality
   a. Type 1 functionality
   b. Type 3 functionality
3. Generating the DV DWH model directly from the source system specification
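As a rough idea of what model generation from a specification could look like, the sketch below turns a small description of an entity and its business keys into hub DDL. The specification format, the function name and the naming rules are all hypothetical; the slides do not describe how SNS planned to do this.

```python
import sqlite3

def hub_ddl(entity, business_keys):
    """Build a CREATE TABLE statement for a hub from (column, type) pairs."""
    table = entity.lower()
    columns = [f"    id_{table} INTEGER PRIMARY KEY"]
    columns += [f"    {name} {sql_type} NOT NULL" for name, sql_type in business_keys]
    key_list = ", ".join(name for name, _ in business_keys)
    columns.append(f"    UNIQUE ({key_list})")  # one row per business key
    return f"CREATE TABLE {table} (\n" + ",\n".join(columns) + "\n);"

# Hypothetical specification for the office hub.
ddl = hub_ddl("Kantoor", [("banknummer_bank", "INTEGER"), ("kantoornummer", "INTEGER")])

# Check that the generated statement is at least valid SQL for SQLite.
check = sqlite3.connect(":memory:")
check.execute(ddl)
```

Links and satellites could be generated by the same kind of template, since each follows a fixed structure once the business keys and attributes are known.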