SlideShare una empresa de Scribd logo
1 de 35
VISUAL DATA VAULT
[MODELING LANGUAGE]
MichaelOlschimke
World-Wide DataVault Consortium, St.Albans,Vermont
Introduction
Goals
Basic Entities
Query AssistantTables
ReferenceTables
BusinessVault
Remarks
AGENDA
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 2
INTRODUCTION
Visual DataVault [Modeling Language]
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 3
• Mid-size consulting firm in Germany
• Consulting, training, implementation
• Focus on BI
• Also: relational
databases, mainframe, software
development
• Industries:
• Automotive
• Banking
• Consumer
• Pharmaceutical
• Telecommunications
• Insurance
• Partners:
INTRODUCTION (1/2)
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 4
• BIConsultant Dörffler + Partner GmbH
• Specialized on DataVault, data mining, CRM, ETL, project
management
• DataVault 2.0 Certified Individual
• Sectors: automotive, commerce, public, non-profits
• Academic research on neural networks, text
classification, information retrieval
• Located in Germany
INTRODUCTION (2/2)
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 5
GOALS
Visual DataVault [Modeling Language]
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 6
Visually
express
DataVault
models
Generate
DDL from
DataVault
models
Microsoft
Office
support
GOALS
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 7
3. BASIC ENTITIES
Visual DataVault [Modeling Language]
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 8
• A list of business keys
• Business keys are attached to hub
• Composite key is modeled by
adding multiple business keys to
hub
• Business keys might have data
types
3.1 HUBS
CustomerCustomer Country
Customer No.
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 9
Customer Country
Customer No.
Customer Country: varchar(2)
Customer No.:
integer
• Smart Keys are keys with some
logical structure
• Not a composite key
• Do not model check sums
• Do not model smart key if format
is unclear or multiple format
definitions are possible
• Possible to integrate in composite
key
• Composite key might consist of
multiple smart keys
3.1.2 SMART KEYS
Vehicle
Vehicle
Identification
Number
Vehicle Descriptor
Section
World Manufacturer
Identifier
Vehicle Identifier
Section
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 10
Vehicle
Vehicle
Identification
Number
Vehicle Descriptor
Section
World Manufacturer
Identifier
Vehicle Identifier
Section
Brand
Vehicle
Vehicle
Identification
Number
Vehicle Descriptor
Section
World Manufacturer
Identifier
Vehicle Identifier
Section
Vehicle Bar Code Stock Number
Parking Lot Number
• Links connect hubs
• Relationships or transactions
• Read: „Stock used by StockTrade“
• Check comments inVisio stencil
• Link reference might be
overwritten (add name to
connector)
• Important for multiple references
of the same hub in one link
• Possible to add attributes to links
(e.g., degenerated fields)
3.2 LINKS
Stock TradeStock Account
Customer
Account
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 11
Stock TradeStock Account
Customer
Account
Diverted Flight
Airport
Source
Airport
Destination
Airport
Diverted Flight
Airport
Diversion Number
Source
Airport
Destination
Airport
• Special form of link
• Data cannot legally change
• Notice the annotation in the icon
• Transactional satellites are
discussed later
3.2.1TRANSACTIONAL LINKS
T
Sales
T
SalesProduct Customer
T
Sales Information
Sales Status
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 12
• Link-to-Link structures can be
modeled as well
• However: not recommended
because of load dependencies
• Load dependencies complicate
the automated loading
3.2.2 LINK-TO-LINK
Supplier
Sales Person
Territory
Product Product
Sales Person
Territory
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 13
• Satellites store descriptive data
• Usually historized
• Data is stored in attributes
• Attached to hubs or links
3.3 SATELLITES (1/2)
Shipping AddressShipping Address City
Address Line 2
State
Address Line 1
Zip Code
Shipping Address City
Address Line 2
State
Address Line 1
Zip Code
Customer
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 14
• Links and hubs might have
multiple satellites
• Small bug in MSVisio stencil
3.3 SATELLITES (2/2)
Audit Information
Quantities
Stock Trade
Turbulence
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 15
• Usually data comes from multiple
sources
• Record tracking satellites track
the availability of keys and
associations in source systems
• Special satellite variant
• Normalized or de-normalized
version is not indicated (physical
features are not covered by the
modeling language)
3.3.1 RECORDTRACKING SATELLITES (1/2)
Customer
Customers from
CRM
Customers from
Invoicing
Customers from
Web Shop
R Customer Tracking
Satellite
Customer
Customers from
CRM
Customers from
Invoicing
Customers from
Web Shop
R Customer Tracking
Satellite
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 16
• Link version of record tracking
satellite
• Follows the hub version (record
tracking satellite can be added to
hub or link)
3.3.1 RECORDTRACKING SATELLITES (2/2)
Sale
Sale Information
from CRM
Sale Information
from Analytics
Sale Information
from Web Shop
Sale
Sale Information
from CRM
Sale Information
from Analytics
Sale Information
from Web Shop
R Sale Tracking
Satellite
Turbulence
Fasten Your
Seatbelt
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 17
• Attached to hub or link
• Follows general satellite structure
• There is always a Status attribute
3.3.2 STATUSTRACKING SATELLITES
Customer Customer Status StatusCustomer Customer Status StatusCustomer Customer Status Status
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 18
• Alternative to transactional links
• Transactional satellites are
attached to transactional links
• They store no history
• Attributes are added to the
satellite structure
• Introduced to allow automated
generation of DDL from such
models
3.3.3TRANSACTIONAL SATELLITES
Product Customer
T
Sales TransactionProduct Customer
T
Sales Transaction
T Sales Transaction
Data
Quantity
Item PriceTotal Price
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 19
Product Customer
T
Sales Transaction
T Sales Transaction
Data
Quantity
Item PriceTotal Price
4. QUERY ASSISTANT TABLES
Visual DataVault [Modeling Language]
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 20
• PIT table spans the satellites of
one hub or link
• Implemented as a ribbon that is
attached to the hub or link
symbol
• All satellites are affected by the
PIT
4.1 POINT-IN-TIME (PIT)TABLES
Contact
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 21
Contact
CRM Leads
Newsletter ContactsArticle Reviewers
• Bridges improve join performance
between hubs and links
• Hub or link is “used by” bridge
4.2 BRIDGES (1/2)
Bridge
Product
Parts
Customer
Bill of Material
T
Sale
Bridge
Marketplace
Shop
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 22
• Also possible to overwrite the
reference name
4.2 BRIDGES (2/2)
Product Customer
Bridge
Lead
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 23
5. REFERENCE TABLES
Visual DataVault [Modeling Language]
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 24
• Reference tables are lookup
tables that store descriptive data
• Have at least one business key
• Have multiple attributes
• Business key might be a smart
key
• Business key might be composite
key
• No history (flat structure)
5.1 NO-HISTORY REFERENCETABLES
ColorColor Color Code
Short Description
Long Description
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 25
Color Color Code
Short Description
Long Description
Color
Detailed Color
Identifier
Short Description
Long Description
Color Code
Main Color
Identifier
Color
Detailed Color
Identifier
Short Description
Long Description
Color Code
Main Color
Identifier
Product
• Similar to no-history reference
table
• Has business key in table
• Satellite stores attributes with
history-tracking
• Satellite follows standard rules for
satellites
5.2 HISTORY-BASED REFERENCETABLES
Category Code
Short Description
Long Description
Category
Descriptions
Category Code
Short Description
Long Description
Category
Descriptions
Category Code
Short Description
Long Description
Category
Descriptions
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 26
• Master code table for commonly
used codes and their descriptions
• Reference table contains two
business keys (Code & Group)
• History-based Satellite for the
descriptive attributes
5.3 CODE AND DESCRIPTIONS
Master Code Table Code
Short Description
Long Description
Master Code
Attributes
Group
Master Code Table Code
Short Description
Long Description
Master Code
Attributes
Group
Master Code Table Code
Short Description
Long Description
Master Code
Attributes
Group
Master Code Table Code
Short Description
Long Description
Master Code
Attributes
Group
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 27
6. BUSINESS VAULT
Visual DataVault [Modeling Language]
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 28
• Computed satellites describe a
hub or link with computed
descriptive attributes
• Added to the hub or link in the
same way as standard satellites
• Computed attributes are added to
the satellite
• Might contain non-computed
attributes (e.g. attributes that are
duplicated from another satellite
for convenience)
6.1 COMPUTED SATELLITES
Invoice Totals
Sales
Invoice Total
Grant Total
Tax Rate
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 29
Invoice Totals
Sales
Invoice Total
Grant Total
Tax Rate
Invoice Totals
Sales
Invoice Total
Grant Total
Tax Rate
Invoice Totals
Sales
Invoice Total
Grant Total
Invoice Totals
Sales
Invoice Total
Grant Total
Tax Rate
• Concept is similar to a bridge
• Changes the grain of a link
• Aggregates values and adds them
as computed attributes to the link
6.2 COMPUTED AGGREGATE LINKS
Sales per Shop and
Customer
SaleCustomer
Product
Shop
Total Sales
Sales per Shop and
Customer
SaleCustomer
Product
Shop
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 30
Sales per Shop and
Customer
SaleCustomer
Product
Shop
Total Sales
• These links are not available in
source systems
• Added artificially to the Data
Vault for exploration purposes
• Connects hubs that are not
directly connected in source
systems
• Basket Analysis
6.3 EXPLORATION LINKS
Customer Store
Product
T
Sale
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 31
Offerings
Customer Store
Product
T
Sale
• BusinessVault tables have no
prescribed format
• Have business keys and attributes
• Might have computed attributes
• Might have computed satellites
attached
• Can be added to the Raw Data
Vault by ordinary links that
reference the primary key of the
BusinessVault table
6.4 BUSINESSVAULTTABLES
Customer
First Name
Last Name
Customer Number
Customer
First Name
Last Name
Customer Number
Customer
First Name
Last Name
Customer Number
City
Address 1
Zip Code
Computed Customer
Attributes
Life-Time Value of
Customer
Birth Date
Customer
First Name
Last Name
Customer Number
City
Address 1
Zip Code
Computed Customer
Attributes
Life-Time Value of
Customer
Birth Date
Customer
Last Name
First Name
Customer Number
City
Address 1
Zip Code
SalesProduct
Computed Customer
Attributes
Life-Time Value of
Customer
Birth Date
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 32
SOME REMARKS
Visual DataVault [Modeling Language]
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 33
 Logical modeling, no physical features
 VisioThemes are not supported (yet)
 More features to come:
 Inline attributes
 Validation rules (prevent hub on hub, etc.)
 What else?
 Don’t copy fromVisio and paste intoWord or PowerPoint
 Instead: export toWMF for better quality
 Vendor support package available
 Check out www.datavault.de for German assets on DataVault
REMARKS
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 34
March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 35
Give us Feedback
http://tinyurl.com/doerffler-wwdvc
Source: vasilijonline.com

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Power BI for Big Data and the New Look of Big Data Solutions
Power BI for Big Data and the New Look of Big Data SolutionsPower BI for Big Data and the New Look of Big Data Solutions
Power BI for Big Data and the New Look of Big Data Solutions
 
DI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data Warehouse
 
Difference between fact tables and dimension tables
Difference between fact tables and dimension tablesDifference between fact tables and dimension tables
Difference between fact tables and dimension tables
 
The Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the SameThe Future of Data Warehousing: ETL Will Never be the Same
The Future of Data Warehousing: ETL Will Never be the Same
 
Modern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform SystemModern Data Warehousing with the Microsoft Analytics Platform System
Modern Data Warehousing with the Microsoft Analytics Platform System
 
Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3
 
Five Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data GovernanceFive Things to Consider About Data Mesh and Data Governance
Five Things to Consider About Data Mesh and Data Governance
 
Snowflake Overview
Snowflake OverviewSnowflake Overview
Snowflake Overview
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerBuilding Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics Primer
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesPutting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
 
Rethinking Trust in Data
Rethinking Trust in Data Rethinking Trust in Data
Rethinking Trust in Data
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data Lake5 Steps for Architecting a Data Lake
5 Steps for Architecting a Data Lake
 
Introduction to Data Vault Modeling
Introduction to Data Vault ModelingIntroduction to Data Vault Modeling
Introduction to Data Vault Modeling
 
Data Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation CriteriaData Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation Criteria
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital Transformation
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 
Traditional data warehouse vs data lake
Traditional data warehouse vs data lakeTraditional data warehouse vs data lake
Traditional data warehouse vs data lake
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks Delta
 

Destacado

Heli data modeler wildcard2013
Heli data modeler wildcard2013Heli data modeler wildcard2013
Heli data modeler wildcard2013
Andrejs Vorobjovs
 
Your favorite data modeling tool, your partner in crime for Data Warehouse Au...
Your favorite data modeling tool, your partner in crime for Data Warehouse Au...Your favorite data modeling tool, your partner in crime for Data Warehouse Au...
Your favorite data modeling tool, your partner in crime for Data Warehouse Au...
FrederikN
 

Destacado (20)

DWH-Modellierung mit Data Vault
DWH-Modellierung mit Data VaultDWH-Modellierung mit Data Vault
DWH-Modellierung mit Data Vault
 
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data ModelingAgile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
 
Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)Agile Data Engineering - Intro to Data Vault Modeling (2016)
Agile Data Engineering - Intro to Data Vault Modeling (2016)
 
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureData Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
 
PDI data vault framework #pcmams 2012
PDI data vault framework #pcmams 2012PDI data vault framework #pcmams 2012
PDI data vault framework #pcmams 2012
 
Why Data Vault?
Why Data Vault? Why Data Vault?
Why Data Vault?
 
Data Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes AgileData Vault: Data Warehouse Design Goes Agile
Data Vault: Data Warehouse Design Goes Agile
 
Visualization 101 BA4All
Visualization 101 BA4AllVisualization 101 BA4All
Visualization 101 BA4All
 
Data Vault Overview
Data Vault OverviewData Vault Overview
Data Vault Overview
 
Data Vault and DW2.0
Data Vault and DW2.0Data Vault and DW2.0
Data Vault and DW2.0
 
Agiles Data Mining mit Data Vault 2.0
Agiles Data Mining mit Data Vault 2.0Agiles Data Mining mit Data Vault 2.0
Agiles Data Mining mit Data Vault 2.0
 
Ethische Entscheidungskompetenz
Ethische EntscheidungskompetenzEthische Entscheidungskompetenz
Ethische Entscheidungskompetenz
 
catfx Datasheet_v1
catfx Datasheet_v1catfx Datasheet_v1
catfx Datasheet_v1
 
Is it sensible to use Data Vault at all? Conclusions from a project.
Is it sensible to use Data Vault at all? Conclusions from a project.Is it sensible to use Data Vault at all? Conclusions from a project.
Is it sensible to use Data Vault at all? Conclusions from a project.
 
IRM UK - 2009: DV Modeling And Methodology
IRM UK - 2009: DV Modeling And MethodologyIRM UK - 2009: DV Modeling And Methodology
IRM UK - 2009: DV Modeling And Methodology
 
Data vault: What's Next
Data vault: What's NextData vault: What's Next
Data vault: What's Next
 
Data vault what's Next: Part 2
Data vault what's Next: Part 2Data vault what's Next: Part 2
Data vault what's Next: Part 2
 
Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)Agile Data Mining with Data Vault 2.0 (english)
Agile Data Mining with Data Vault 2.0 (english)
 
Heli data modeler wildcard2013
Heli data modeler wildcard2013Heli data modeler wildcard2013
Heli data modeler wildcard2013
 
Your favorite data modeling tool, your partner in crime for Data Warehouse Au...
Your favorite data modeling tool, your partner in crime for Data Warehouse Au...Your favorite data modeling tool, your partner in crime for Data Warehouse Au...
Your favorite data modeling tool, your partner in crime for Data Warehouse Au...
 

Similar a Visual Data Vault

Designing a Future-proof API Program
Designing a Future-proof API ProgramDesigning a Future-proof API Program
Designing a Future-proof API Program
Pronovix
 
Experiences from a Data Vault Pilot Exploiting the Internet of Things
Experiences from a Data Vault Pilot Exploiting the Internet of ThingsExperiences from a Data Vault Pilot Exploiting the Internet of Things
Experiences from a Data Vault Pilot Exploiting the Internet of Things
GuyVanderSande
 
Master Data Management using WSO2 Platform
Master Data Management using WSO2 PlatformMaster Data Management using WSO2 Platform
Master Data Management using WSO2 Platform
WSO2
 

Similar a Visual Data Vault (20)

The Warranty Data Lake – After, Inc.
The Warranty Data Lake – After, Inc.The Warranty Data Lake – After, Inc.
The Warranty Data Lake – After, Inc.
 
Salesforce Multitenant Architecture: How We Do the Magic We Do
Salesforce Multitenant Architecture: How We Do the Magic We DoSalesforce Multitenant Architecture: How We Do the Magic We Do
Salesforce Multitenant Architecture: How We Do the Magic We Do
 
Designing a Future-proof API Program
Designing a Future-proof API ProgramDesigning a Future-proof API Program
Designing a Future-proof API Program
 
Partner Webinar: Why Is Open Source the Smartest Choice for Hybrid Integration?
Partner Webinar: Why Is Open Source the Smartest Choice for Hybrid Integration?Partner Webinar: Why Is Open Source the Smartest Choice for Hybrid Integration?
Partner Webinar: Why Is Open Source the Smartest Choice for Hybrid Integration?
 
Hadoop @ LifeWay
Hadoop @ LifeWayHadoop @ LifeWay
Hadoop @ LifeWay
 
Sample - Data Warehouse Requirements
Sample -  Data Warehouse RequirementsSample -  Data Warehouse Requirements
Sample - Data Warehouse Requirements
 
SNS practice: Generating ETL
SNS practice: Generating ETLSNS practice: Generating ETL
SNS practice: Generating ETL
 
DWH_Session_1.pptx
DWH_Session_1.pptxDWH_Session_1.pptx
DWH_Session_1.pptx
 
Info sphere overview
Info sphere overviewInfo sphere overview
Info sphere overview
 
Introduction to BizTalk for Beginners
Introduction to BizTalk for BeginnersIntroduction to BizTalk for Beginners
Introduction to BizTalk for Beginners
 
Overview of Information Framework
Overview of Information FrameworkOverview of Information Framework
Overview of Information Framework
 
Experiences from a Data Vault Pilot Exploiting the Internet of Things
Experiences from a Data Vault Pilot Exploiting the Internet of ThingsExperiences from a Data Vault Pilot Exploiting the Internet of Things
Experiences from a Data Vault Pilot Exploiting the Internet of Things
 
Experiences from a Data Vault Pilot Exploiting the Internet of Things
Experiences from a Data Vault Pilot Exploiting the Internet of ThingsExperiences from a Data Vault Pilot Exploiting the Internet of Things
Experiences from a Data Vault Pilot Exploiting the Internet of Things
 
Where to look at and where to start?: Richard Spithoven b.lay ITAM Review UK ...
Where to look at and where to start?: Richard Spithoven b.lay ITAM Review UK ...Where to look at and where to start?: Richard Spithoven b.lay ITAM Review UK ...
Where to look at and where to start?: Richard Spithoven b.lay ITAM Review UK ...
 
Data Vault 2.0: Big Data Meets Data Warehousing
Data Vault 2.0: Big Data Meets Data WarehousingData Vault 2.0: Big Data Meets Data Warehousing
Data Vault 2.0: Big Data Meets Data Warehousing
 
Land O' Lakes: Harnessing Big Data Variety
Land O' Lakes: Harnessing Big Data VarietyLand O' Lakes: Harnessing Big Data Variety
Land O' Lakes: Harnessing Big Data Variety
 
How to build streaming data applications - evaluating the top contenders
How to build streaming data applications - evaluating the top contendersHow to build streaming data applications - evaluating the top contenders
How to build streaming data applications - evaluating the top contenders
 
eccenca CorporateMemory - Semantically integrated Enterprise Data Lakes
eccenca CorporateMemory - Semantically integrated Enterprise Data Lakeseccenca CorporateMemory - Semantically integrated Enterprise Data Lakes
eccenca CorporateMemory - Semantically integrated Enterprise Data Lakes
 
Vertica Analytics Database general overview
Vertica Analytics Database general overviewVertica Analytics Database general overview
Vertica Analytics Database general overview
 
Master Data Management using WSO2 Platform
Master Data Management using WSO2 PlatformMaster Data Management using WSO2 Platform
Master Data Management using WSO2 Platform
 

Más de Michael Olschimke (6)

Introduction to Salesforce CRM Reporting
Introduction to Salesforce CRM ReportingIntroduction to Salesforce CRM Reporting
Introduction to Salesforce CRM Reporting
 
Introduction to Google Analytics
Introduction to Google AnalyticsIntroduction to Google Analytics
Introduction to Google Analytics
 
Introduction to Piwik
Introduction to PiwikIntroduction to Piwik
Introduction to Piwik
 
Business Concepts for Mobile Applications
Business Concepts for Mobile ApplicationsBusiness Concepts for Mobile Applications
Business Concepts for Mobile Applications
 
Technology Concepts for Mobile Applications
Technology Concepts for Mobile ApplicationsTechnology Concepts for Mobile Applications
Technology Concepts for Mobile Applications
 
Data Modeling Zone 2013
Data Modeling Zone 2013Data Modeling Zone 2013
Data Modeling Zone 2013
 

Último

+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
Health
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
mohitmore19
 

Último (20)

Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) SolutionIntroducing Microsoft’s new Enterprise Work Management (EWM) Solution
Introducing Microsoft’s new Enterprise Work Management (EWM) Solution
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdfThe Top App Development Trends Shaping the Industry in 2024-25 .pdf
The Top App Development Trends Shaping the Industry in 2024-25 .pdf
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfontein
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
Architecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the pastArchitecture decision records - How not to get lost in the past
Architecture decision records - How not to get lost in the past
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
Generic or specific? Making sensible software design decisions
Generic or specific? Making sensible software design decisionsGeneric or specific? Making sensible software design decisions
Generic or specific? Making sensible software design decisions
 
Define the academic and professional writing..pdf
Define the academic and professional writing..pdfDefine the academic and professional writing..pdf
Define the academic and professional writing..pdf
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Software Quality Assurance Interview Questions
Software Quality Assurance Interview QuestionsSoftware Quality Assurance Interview Questions
Software Quality Assurance Interview Questions
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
SHRMPro HRMS Software Solutions Presentation
SHRMPro HRMS Software Solutions PresentationSHRMPro HRMS Software Solutions Presentation
SHRMPro HRMS Software Solutions Presentation
 
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
 
TECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service providerTECUNIQUE: Success Stories: IT Service provider
TECUNIQUE: Success Stories: IT Service provider
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 

Visual Data Vault

  • 1. VISUAL DATA VAULT [MODELING LANGUAGE] MichaelOlschimke World-Wide DataVault Consortium, St.Albans,Vermont
  • 3. INTRODUCTION Visual DataVault [Modeling Language] March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 3
  • 4. • Mid-size consulting firm in Germany • Consulting, training, implementation • Focus on BI • Also: relational databases, mainframe, software development • Industries: • Automotive • Banking • Consumer • Pharmaceutical • Telecommunications • Insurance • Partners: INTRODUCTION (1/2) March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 4
  • 5. • BIConsultant Dörffler + Partner GmbH • Specialized on DataVault, data mining, CRM, ETL, project management • DataVault 2.0 Certified Individual • Sectors: automotive, commerce, public, non-profits • Academic research on neural networks, text classification, information retrieval • Located in Germany INTRODUCTION (2/2) March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 5
  • 6. GOALS Visual DataVault [Modeling Language] March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 6
  • 8. 3. BASIC ENTITIES Visual DataVault [Modeling Language] March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 8
  • 9. • A list of business keys • Business keys are attached to hub • Composite key is modeled by adding multiple business keys to hub • Business keys might have data types 3.1 HUBS CustomerCustomer Country Customer No. March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 9 Customer Country Customer No. Customer Country: varchar(2) Customer No.: integer
  • 10. • Smart Keys are keys with some logical structure • Not a composite key • Do not model check sums • Do not model smart key if format is unclear or multiple format definitions are possible • Possible to integrate in composite key • Composite key might consist of multiple smart keys 3.1.2 SMART KEYS Vehicle Vehicle Identification Number Vehicle Descriptor Section World Manufacturer Identifier Vehicle Identifier Section March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 10 Vehicle Vehicle Identification Number Vehicle Descriptor Section World Manufacturer Identifier Vehicle Identifier Section Brand Vehicle Vehicle Identification Number Vehicle Descriptor Section World Manufacturer Identifier Vehicle Identifier Section Vehicle Bar Code Stock Number Parking Lot Number
  • 11. • Links connect hubs • Relationships or transactions • Read: „Stock used by StockTrade“ • Check comments inVisio stencil • Link reference might be overwritten (add name to connector) • Important for multiple references of the same hub in one link • Possible to add attributes to links (e.g., degenerated fields) 3.2 LINKS Stock TradeStock Account Customer Account March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 11 Stock TradeStock Account Customer Account Diverted Flight Airport Source Airport Destination Airport Diverted Flight Airport Diversion Number Source Airport Destination Airport
  • 12. • Special form of link • Data cannot legally change • Notice the annotation in the icon • Transactional satellites are discussed later 3.2.1TRANSACTIONAL LINKS T Sales T SalesProduct Customer T Sales Information Sales Status March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 12
  • 13. • Link-to-Link structures can be modeled as well • However: not recommended because of load dependencies • Load dependencies complicate the automated loading 3.2.2 LINK-TO-LINK Supplier Sales Person Territory Product Product Sales Person Territory March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 13
  • 14. • Satellites store descriptive data • Usually historized • Data is stored in attributes • Attached to hubs or links 3.3 SATELLITES (1/2) Shipping AddressShipping Address City Address Line 2 State Address Line 1 Zip Code Shipping Address City Address Line 2 State Address Line 1 Zip Code Customer March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 14
  • 15. • Links and hubs might have multiple satellites • Small bug in MSVisio stencil 3.3 SATELLITES (2/2) Audit Information Quantities Stock Trade Turbulence March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 15
  • 16. • Usually data comes from multiple sources • Record tracking satellites track the availability of keys and associations in source systems • Special satellite variant • Normalized or de-normalized version is not indicated (physical features are not covered by the modeling language) 3.3.1 RECORDTRACKING SATELLITES (1/2) Customer Customers from CRM Customers from Invoicing Customers from Web Shop R Customer Tracking Satellite Customer Customers from CRM Customers from Invoicing Customers from Web Shop R Customer Tracking Satellite March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 16
  • 17. • Link version of record tracking satellite • Follows the hub version (record tracking satellite can be added to hub or link) 3.3.1 RECORDTRACKING SATELLITES (2/2) Sale Sale Information from CRM Sale Information from Analytics Sale Information from Web Shop Sale Sale Information from CRM Sale Information from Analytics Sale Information from Web Shop R Sale Tracking Satellite Turbulence Fasten Your Seatbelt March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 17
  • 18. • Attached to hub or link • Follows general satellite structure • There is always a Status attribute 3.3.2 STATUSTRACKING SATELLITES Customer Customer Status StatusCustomer Customer Status StatusCustomer Customer Status Status March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 18
  • 19. • Alternative to transactional links • Transactional satellites are attached to transactional links • They store no history • Attributes are added to the satellite structure • Introduced to allow automated generation of DDL from such models 3.3.3TRANSACTIONAL SATELLITES Product Customer T Sales TransactionProduct Customer T Sales Transaction T Sales Transaction Data Quantity Item PriceTotal Price March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 19 Product Customer T Sales Transaction T Sales Transaction Data Quantity Item PriceTotal Price
  • 20. 4. QUERY ASSISTANT TABLES Visual DataVault [Modeling Language] March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 20
  • 21. • PIT table spans the satellites of one hub or link • Implemented as a ribbon that is attached to the hub or link symbol • All satellites are affected by the PIT 4.1 POINT-IN-TIME (PIT)TABLES Contact March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 21 Contact CRM Leads Newsletter ContactsArticle Reviewers
  • 22. • Bridges improve join performance between hubs and links • Hub or link is “used by” bridge 4.2 BRIDGES (1/2) Bridge Product Parts Customer Bill of Material T Sale Bridge Marketplace Shop March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 22
  • 23. • Also possible to overwrite the reference name 4.2 BRIDGES (2/2) Product Customer Bridge Lead March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 23
  • 24. 5. REFERENCE TABLES Visual DataVault [Modeling Language] March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 24
  • 25. • Reference tables are lookup tables that store descriptive data • Have at least one business key • Have multiple attributes • Business key might be a smart key • Business key might be composite key • No history (flat structure) 5.1 NO-HISTORY REFERENCETABLES ColorColor Color Code Short Description Long Description March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 25 Color Color Code Short Description Long Description Color Detailed Color Identifier Short Description Long Description Color Code Main Color Identifier Color Detailed Color Identifier Short Description Long Description Color Code Main Color Identifier Product
  • 26. • Similar to no-history reference table • Has business key in table • Satellite stores attributes with history-tracking • Satellite follows standard rules for satellites 5.2 HISTORY-BASED REFERENCETABLES Category Code Short Description Long Description Category Descriptions Category Code Short Description Long Description Category Descriptions Category Code Short Description Long Description Category Descriptions March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 26
  • 27. • Master code table for commonly used codes and their descriptions • Reference table contains two business keys (Code & Group) • History-based Satellite for the descriptive attributes 5.3 CODE AND DESCRIPTIONS Master Code Table Code Short Description Long Description Master Code Attributes Group Master Code Table Code Short Description Long Description Master Code Attributes Group Master Code Table Code Short Description Long Description Master Code Attributes Group Master Code Table Code Short Description Long Description Master Code Attributes Group March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 27
  • 28. 6. BUSINESS VAULT Visual DataVault [Modeling Language] March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 28
  • 29. • Computed satellites describe a hub or link with computed descriptive attributes • Added to the hub or link in the same way as standard satellites • Computed attributes are added to the satellite • Might contain non-computed attributes (e.g. attributes that are duplicated from another satellite for convenience) 6.1 COMPUTED SATELLITES Invoice Totals Sales Invoice Total Grant Total Tax Rate March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 29 Invoice Totals Sales Invoice Total Grant Total Tax Rate Invoice Totals Sales Invoice Total Grant Total Tax Rate Invoice Totals Sales Invoice Total Grant Total Invoice Totals Sales Invoice Total Grant Total Tax Rate
  • 30. • Concept is similar to a bridge • Changes the grain of a link • Aggregates values and adds them as computed attributes to the link 6.2 COMPUTED AGGREGATE LINKS Sales per Shop and Customer SaleCustomer Product Shop Total Sales Sales per Shop and Customer SaleCustomer Product Shop March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 30 Sales per Shop and Customer SaleCustomer Product Shop Total Sales
  • 31. • These links are not available in source systems • Added artificially to the Data Vault for exploration purposes • Connects hubs that are not directly connected in source systems • Basket Analysis 6.3 EXPLORATION LINKS Customer Store Product T Sale March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 31 Offerings Customer Store Product T Sale
  • 32. • BusinessVault tables have no prescribed format • Have business keys and attributes • Might have computed attributes • Might have computed satellites attached • Can be added to the Raw Data Vault by ordinary links that reference the primary key of the BusinessVault table 6.4 BUSINESSVAULTTABLES Customer First Name Last Name Customer Number Customer First Name Last Name Customer Number Customer First Name Last Name Customer Number City Address 1 Zip Code Computed Customer Attributes Life-Time Value of Customer Birth Date Customer First Name Last Name Customer Number City Address 1 Zip Code Computed Customer Attributes Life-Time Value of Customer Birth Date Customer Last Name First Name Customer Number City Address 1 Zip Code SalesProduct Computed Customer Attributes Life-Time Value of Customer Birth Date March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 32
  • 33. SOME REMARKS Visual DataVault [Modeling Language] March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 33
  • 34.  Logical modeling, no physical features  VisioThemes are not supported (yet)  More features to come:  Inline attributes  Validation rules (prevent hub on hub, etc.)  What else?  Don’t copy fromVisio and paste intoWord or PowerPoint  Instead: export toWMF for better quality  Vendor support package available  Check out www.datavault.de for German assets on DataVault REMARKS March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 34
  • 35. March 20, 2014 World-Wide Data Vault Consortium, St. Albans, Vermont 35 Give us Feedback http://tinyurl.com/doerffler-wwdvc Source: vasilijonline.com