AGILE DATA MINING 
WITH DATA VAULT 2.0 
Timo Cirkel, Michael Olschimke 
Dörffler & Partner GmbH
AGENDA
Introduction
Background
Example
Conclusion
12.02.2014 Agile Data Mining with Data Vault 2.0
INTRODUCTION 
TIMO CIRKEL 
BI-Consultant 
Certified Data Vault 2.0 Practitioner 
Analysis Of Policyholders 
Specialized in CRM, software development, DWH automation
Industries: Insurance, Energy
B.Sc. Business Informatics
MICHAEL OLSCHIMKE 
Senior BI-Consultant 
Certified Data Vault 2.0 Practitioner 
Official Data Vault 2.0 Trainer in Europe 
Associate Teacher, University of Hannover
Specializing in Data Vault 2.0, data mining, CRM, project management
Industries: Insurance, Automotive, Retail, Public Sector, Non-Profits
DÖRFFLER & PARTNER GMBH
• Medium-sized consulting firm
• Official partner of Dan Linstedt in Europe
• Consulting, training, implementation
• Industries:
  • Insurance
  • Automotive
  • Banks
  • Trade
  • Pharmaceuticals
  • Telecommunications
BACKGROUND 
DATA MINING PROJECT AT VGH
Motor insurance
Customer segmentation
A first data mining pilot, therefore:
No specific requirements
Vision is developed during the project
Agile project methodology
Close cooperation with the business
DATA MINING PROJECTS
• Extracting information and patterns from existing data
• Four (large) categories:
  • Segmentation
  • Classification
  • Prediction
  • Association
• Wide range of available algorithms and methods
"The term Data Mining ... describes the extraction of implicitly existing, non-trivial and useful knowledge from large, dynamic, relatively complexly structured data."
[Diagram labels: Database, Application, User, Data mining techniques, Statements, rules & information, Data dictionary, Domain knowledge]
DATA VAULT 2.0 MODELING
[Diagram labels: Surrogate key, Business keys, Foreign keys, Descriptors]
Own representation in accordance with Linstedt, 2014
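The four roles labeled on the modeling slide can be illustrated with a small sketch. It assumes the surrogate key is an MD5 hash of the normalized business key, a convention often used with Data Vault 2.0 but not stated on the slide. The names `hash_key`, `HubCustomer`, `SatCustomer` and `load_customer` are hypothetical.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timezone


def hash_key(*business_keys: str) -> str:
    """Derive a surrogate key from one or more business keys (assumed convention)."""
    normalized = "|".join(key.strip().upper() for key in business_keys)
    return hashlib.md5(normalized.encode("utf-8")).hexdigest()


@dataclass(frozen=True)
class HubCustomer:
    customer_hk: str       # surrogate key
    customer_key: str      # business key
    load_date: datetime
    record_source: str


@dataclass(frozen=True)
class SatCustomer:
    customer_hk: str       # foreign key pointing at the hub
    load_date: datetime
    gender: str            # descriptor
    marital_status: str    # descriptor


def load_customer(customer_key: str, gender: str, marital_status: str,
                  record_source: str = "demo"):
    """Build one hub row and one satellite row that share the same key."""
    now = datetime.now(timezone.utc)
    hk = hash_key(customer_key)
    hub = HubCustomer(hk, customer_key.strip(), now, record_source)
    sat = SatCustomer(hk, now, gender, marital_status)
    return hub, sat


hub, sat = load_customer("AW00011000", "M", "S")
```

Because the key is derived from the business key alone, the hub and every satellite can compute it independently, which is one reason the pattern suits parallel, incremental loading.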
DATA VAULT 2.0 METHODOLOGY
[Diagram labels: Data Vault 2.0 Methodology; Six Sigma, TQM, Scrum, CMMI, PMP, SDLC]
DATA VAULT 2.0 METHODOLOGY FOR DATA MINING
Advantages
• Agile project management for DWH projects
• Automation and generation
• Rapid adaptation to changes in the model
• Incremental build-out = incremental cost control
• Targeted delivery = two-week sprints
• Predictable and measurable results
Disadvantages
• Focus on loading raw data and producing information
• Few data mining references
• Many concepts in the methodology are not applicable to data mining projects
• Team sizes are difficult to scale in data mining projects
CRISP-DM 
Own representation in accordance with Chapman et al., 2000
PROCESS MODEL
Process model: VGH customer segmentation
Swimlanes: ivv, KTC, D & P
Flowchart steps:
• Start
• Develop quality function
• Formula ok? (Yes / No)
• Determine relevant policyholder attributes
• Store data in the Data Vault model
• Create SQL query
• Extract data
• Select algorithm
• Run segmentation
• Result achieved? (Yes / No)
• Research algorithms
• Suitable algorithm found? (Yes / No)
• Present result
• Result ok? (Yes / No)
• End
RAPIDMINER
• Java-based data mining software
• One of the most widely used data mining tools
• Offers
  • Environment for control flow
  • Large number of algorithms
  • Large choice of data sources
[Chart groups: Overall, Corporate, Consultants, Academics, NGO / Gov't. © 2012 Rexer Analytics]
EXAMPLE
• AdventureWorks database
• Scenario:
  • Advertising campaign for a new bike
  • Identification of the target group
• Solution:
  • Decision tree
  • Identify relevant attributes in several iterations
Lachev, 2005, p. 238ff
Simple example
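Identifying relevant attributes for a decision tree can be illustrated with information gain, one common split criterion. This is a sketch only: the deck does not say which criterion was configured in RapidMiner, and the records, the `bike_buyer` target and the function names below are hypothetical toy values, not AdventureWorks data.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(rows, attribute, target):
    """Entropy reduction achieved by splitting rows on one attribute."""
    base = entropy([row[target] for row in rows])
    groups = {}
    for row in rows:
        groups.setdefault(row[attribute], []).append(row[target])
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return base - remainder


# Hypothetical toy records: car ownership separates buyers, gender does not.
customers = [
    {"cars": 0, "gender": "M", "bike_buyer": 1},
    {"cars": 0, "gender": "F", "bike_buyer": 1},
    {"cars": 2, "gender": "M", "bike_buyer": 0},
    {"cars": 2, "gender": "F", "bike_buyer": 0},
]

# Rank candidate attributes by how much they tell us about the target.
ranking = sorted(
    ["cars", "gender"],
    key=lambda a: information_gain(customers, a, "bike_buyer"),
    reverse=True,
)
```

In an iterative setting, attributes added in a later iteration can be scored the same way before deciding whether they are worth keeping in the model.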
10066 records
Attributes:
• Marital Status
• Gender
• Yearly Income
• Total Children
• Education
• Number Cars Owned
• Commute Distance
• Occupation
• House Owner Flag
• Age
ITERATION 1: DATA VAULT 2.0 MODEL
Hub Customer: Customer Key
Sat Customer: English Education, Number Cars Owned, Gender, Marital Status, Commute Distance, Age, House Owner Flag, English Occupation
Sat Category: Product Category
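To feed a mining tool, the hub and its satellite have to be flattened into one row per customer. The sketch below shows one way this could look with an in-memory SQLite database. Table names, column names, the `load_date` column and the sample rows are assumptions made for illustration, and Sat Category is left out to keep the example short.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE hub_customer (
    customer_hk   TEXT PRIMARY KEY,   -- surrogate key
    customer_key  TEXT NOT NULL,      -- business key
    load_date     TEXT NOT NULL
);
CREATE TABLE sat_customer (
    customer_hk        TEXT NOT NULL REFERENCES hub_customer(customer_hk),
    load_date          TEXT NOT NULL,
    gender             TEXT,
    marital_status     TEXT,
    english_education  TEXT,
    english_occupation TEXT,
    number_cars_owned  INTEGER,
    commute_distance   TEXT,
    house_owner_flag   INTEGER,
    age                INTEGER,
    PRIMARY KEY (customer_hk, load_date)
);
""")

# Hypothetical sample data: one customer with two satellite versions.
conn.execute("INSERT INTO hub_customer VALUES ('hk1', 'C-1', '2014-01-01')")
conn.executemany(
    "INSERT INTO sat_customer VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    [
        ("hk1", "2014-01-01", "M", "S", "Bachelors", "Professional", 1, "2-5 Miles", 1, 40),
        ("hk1", "2014-02-01", "M", "M", "Bachelors", "Professional", 2, "2-5 Miles", 1, 40),
    ],
)

# Flatten hub and satellite, keeping only the most recent satellite row.
FLAT_QUERY = """
SELECT h.customer_key, s.gender, s.marital_status, s.english_education,
       s.english_occupation, s.number_cars_owned, s.commute_distance,
       s.house_owner_flag, s.age
FROM hub_customer AS h
JOIN sat_customer AS s ON s.customer_hk = h.customer_hk
WHERE s.load_date = (SELECT MAX(load_date) FROM sat_customer
                     WHERE customer_hk = h.customer_hk)
"""
rows = conn.execute(FLAT_QUERY).fetchall()
```

The result set has the flat, one-row-per-customer shape that a data gathering step in a mining process typically expects.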
ITERATION 1: RAPIDMINER PROCESS 
Data Gathering 
Data preparation 
Modeling 
ITERATION 1: DECISION TREE MODEL
ITERATION 1: RESULTS 
ITERATION 2: DATA VAULT 2.0 MODEL
Hub Customer: Customer Key
Sat Customer: English Education, Number Cars Owned, Gender, Marital Status, Commute Distance, Age, House Owner Flag, English Occupation
Sat Customer Income: Yearly Income
Sat Customer Children: Total Children
Sat Category: Product Category
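The second iteration adds attributes as new satellites rather than as new columns. A minimal sketch of that idea follows, again with in-memory SQLite. The table and column names and the sample rows are hypothetical, and load dates are omitted for brevity.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# State after iteration 1 (reduced to a single descriptor for brevity).
conn.executescript("""
CREATE TABLE hub_customer (customer_hk TEXT PRIMARY KEY, customer_key TEXT NOT NULL);
CREATE TABLE sat_customer (customer_hk TEXT PRIMARY KEY, gender TEXT);
INSERT INTO hub_customer VALUES ('hk1', 'C-1'), ('hk2', 'C-2');
INSERT INTO sat_customer VALUES ('hk1', 'M'), ('hk2', 'F');
""")

SCHEMA_SQL = "SELECT sql FROM sqlite_master WHERE type = 'table' AND name = 'sat_customer'"
schema_before = conn.execute(SCHEMA_SQL).fetchone()[0]

# Iteration 2: the new attributes arrive as two additional satellites.
conn.executescript("""
CREATE TABLE sat_customer_income (customer_hk TEXT PRIMARY KEY, yearly_income REAL);
CREATE TABLE sat_customer_children (customer_hk TEXT PRIMARY KEY, total_children INTEGER);
INSERT INTO sat_customer_income VALUES ('hk1', 60000.0);
INSERT INTO sat_customer_children VALUES ('hk1', 2), ('hk2', 0);
""")
schema_after = conn.execute(SCHEMA_SQL).fetchone()[0]

# The extraction query grows by one LEFT JOIN per new satellite.
rows = conn.execute("""
SELECT h.customer_key, s.gender, i.yearly_income, c.total_children
FROM hub_customer AS h
JOIN sat_customer AS s ON s.customer_hk = h.customer_hk
LEFT JOIN sat_customer_income AS i ON i.customer_hk = h.customer_hk
LEFT JOIN sat_customer_children AS c ON c.customer_hk = h.customer_hk
ORDER BY h.customer_key
""").fetchall()
```

The existing satellite is not altered by the extension, and customers without a row in a new satellite simply show up with a missing value.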
ITERATION 2: RAPIDMINER PROCESS 
Data Gathering 
Preparation
Modeling
ITERATION 2: RESULTS 
+4.01% 
ITERATION 3: DATA VAULT 2.0 MODEL
Hub Customer: Customer Key
Sat Customer: English Education, Number Cars Owned, Gender, Marital Status, Commute Distance, Age, House Owner Flag, English Occupation
Sat Customer Income: Yearly Income
Sat Customer Children: Total Children
Sat Category: Product Category
CSat Customer Distance: Commute Distance Miles
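The computed satellite in this iteration holds a value derived from raw data. How the authors derived "Commute Distance Miles" is not shown in the deck. The sketch below assumes the raw attribute is a textual band such as "2-5 Miles" and uses a made-up rule (midpoint of a closed band, lower bound of an open one); `commute_distance_miles` and the sample rows are hypothetical.

```python
import re


def commute_distance_miles(band: str) -> float:
    """Map a textual distance band to a representative number of miles.

    Assumed rule: midpoint of a closed band, lower bound of an open band.
    """
    numbers = [float(n) for n in re.findall(r"\d+(?:\.\d+)?", band)]
    if not numbers:
        raise ValueError(f"no distance found in {band!r}")
    if len(numbers) == 1:
        return numbers[0]
    return (numbers[0] + numbers[1]) / 2


# Raw satellite rows stay untouched; the derived value goes into its own structure.
raw_satellite = [
    {"customer_hk": "hk1", "commute_distance": "2-5 Miles"},
    {"customer_hk": "hk2", "commute_distance": "10+ Miles"},
]
computed_satellite = [
    {
        "customer_hk": row["customer_hk"],
        "commute_distance_miles": commute_distance_miles(row["commute_distance"]),
    }
    for row in raw_satellite
]
```

Keeping the rule in a separate computed structure leaves the raw data unchanged, so the derivation can be revised in a later iteration without reloading anything.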
ITERATION 3: RAPIDMINER PROCESS 
Data Gathering 
Preparation
Modeling
ITERATION 3: RESULTS 
+0.12% 
CONCLUSIONS
• Data Vault is a flexible data model with good support for agile project methodology
• Data Vault is not an additional hurdle in data mining projects
• Additional attributes can be added at any time during the project, in an incremental fashion
• Business Vault: transparent data processing
FURTHER INFORMATION
www.doerffler.com
www.datavault.de
www.learndatavault.com
[Publication labels: Appears 2015, Available, Appears 2015]
Give us feedback
http://goo.gl/LGO4ze
Source: Vasilijonline.com

Más contenido relacionado

La actualidad más candente

Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookJames Serra
 
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data ModelingAgile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data ModelingKent Graziano
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)James Serra
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)James Serra
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Empowered Holdings, LLC
 
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?Is the traditional data warehouse dead?
Is the traditional data warehouse dead?James Serra
 
Data Staging Strategy
Data Staging StrategyData Staging Strategy
Data Staging StrategyMilind Zodge
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouseJames Serra
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeKent Graziano
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshJeffrey T. Pollock
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lakeJames Serra
 
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothThe Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothAdaryl "Bob" Wakefield, MBA
 
Data Warehouse Design and Best Practices
Data Warehouse Design and Best PracticesData Warehouse Design and Best Practices
Data Warehouse Design and Best PracticesIvo Andreev
 
Emerging Trends in Data Engineering
Emerging Trends in Data EngineeringEmerging Trends in Data Engineering
Emerging Trends in Data EngineeringAnanth PackkilDurai
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...DataScienceConferenc1
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault ModelingKent Graziano
 
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureData Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureKent Graziano
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureJames Serra
 

La actualidad más candente (20)

Why Data Vault?
Why Data Vault? Why Data Vault?
Why Data Vault?
 
Data Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future OutlookData Warehousing Trends, Best Practices, and Future Outlook
Data Warehousing Trends, Best Practices, and Future Outlook
 
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data ModelingAgile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
Agile Data Warehouse Modeling: Introduction to Data Vault Data Modeling
 
Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)Data Lakehouse, Data Mesh, and Data Fabric (r2)
Data Lakehouse, Data Mesh, and Data Fabric (r2)
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012Introduction To Data Vault - DAMA Oregon 2012
Introduction To Data Vault - DAMA Oregon 2012
 
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?Is the traditional data warehouse dead?
Is the traditional data warehouse dead?
 
Data Staging Strategy
Data Staging StrategyData Staging Strategy
Data Staging Strategy
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on Snowflake
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothThe Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
 
Data Warehouse Design and Best Practices
Data Warehouse Design and Best PracticesData Warehouse Design and Best Practices
Data Warehouse Design and Best Practices
 
Emerging Trends in Data Engineering
Emerging Trends in Data EngineeringEmerging Trends in Data Engineering
Emerging Trends in Data Engineering
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
 
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
(OTW13) Agile Data Warehousing: Introduction to Data Vault Modeling
 
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data CaptureData Vault 2.0: Using MD5 Hashes for Change Data Capture
Data Vault 2.0: Using MD5 Hashes for Change Data Capture
 
Architecting a datalake
Architecting a datalakeArchitecting a datalake
Architecting a datalake
 
Building an Effective Data Warehouse Architecture
Building an Effective Data Warehouse ArchitectureBuilding an Effective Data Warehouse Architecture
Building an Effective Data Warehouse Architecture
 

Similar a Agile Data Mining with Data Vault 2.0 (english)

Building Resiliency and Agility with Data Virtualization for the New Normal
Building Resiliency and Agility with Data Virtualization for the New NormalBuilding Resiliency and Agility with Data Virtualization for the New Normal
Building Resiliency and Agility with Data Virtualization for the New NormalDenodo
 
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Yellowfin
 
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...Denodo
 
By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...
By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...
By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...IngridBuenaventura
 
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?Denodo
 
A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)Denodo
 
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)Denodo
 
Slides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-CloudSlides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-CloudDATAVERSITY
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB
 
Multi-Cloud Data Integration with Data Virtualization (APAC)
Multi-Cloud Data Integration with Data Virtualization (APAC)Multi-Cloud Data Integration with Data Virtualization (APAC)
Multi-Cloud Data Integration with Data Virtualization (APAC)Denodo
 
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Denodo
 
Trends for Modernizing Analytics and Data Warehousing in 2019
Trends for Modernizing Analytics and Data Warehousing in 2019Trends for Modernizing Analytics and Data Warehousing in 2019
Trends for Modernizing Analytics and Data Warehousing in 2019Arcadia Data
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItDenodo
 
451 Research + NuoDB: What It Means to be a Container-Native SQL Database
451 Research + NuoDB: What It Means to be a Container-Native SQL Database451 Research + NuoDB: What It Means to be a Container-Native SQL Database
451 Research + NuoDB: What It Means to be a Container-Native SQL DatabaseNuoDB
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsDenodo
 
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)Denodo
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
TechEvent DWH Modernization
TechEvent DWH ModernizationTechEvent DWH Modernization
TechEvent DWH ModernizationTrivadis
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An IntroductionDenodo
 

Similar a Agile Data Mining with Data Vault 2.0 (english) (20)

Building Resiliency and Agility with Data Virtualization for the New Normal
Building Resiliency and Agility with Data Virtualization for the New NormalBuilding Resiliency and Agility with Data Virtualization for the New Normal
Building Resiliency and Agility with Data Virtualization for the New Normal
 
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
 
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
Innovative Data Strategies for Advanced Analytics Solutions and the Role of D...
 
By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...
By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...
By Thoughtworks | Building data as a product: The key to unlocking Data Mesh'...
 
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
Denodo DataFest 2016: Data Science: Operationalizing Analytical Models in Rea...
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)A Key to Real-time Insights in a Post-COVID World (ASEAN)
A Key to Real-time Insights in a Post-COVID World (ASEAN)
 
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
Big Data with Data Virtualization (session 3 from Packed Lunch Webinar Series)
 
Slides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-CloudSlides: Success Stories for Data-to-Cloud
Slides: Success Stories for Data-to-Cloud
 
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, ClouderaMongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
MongoDB IoT City Tour STUTTGART: Hadoop and future data management. By, Cloudera
 
Multi-Cloud Data Integration with Data Virtualization (APAC)
Multi-Cloud Data Integration with Data Virtualization (APAC)Multi-Cloud Data Integration with Data Virtualization (APAC)
Multi-Cloud Data Integration with Data Virtualization (APAC)
 
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
 
Trends for Modernizing Analytics and Data Warehousing in 2019
Trends for Modernizing Analytics and Data Warehousing in 2019Trends for Modernizing Analytics and Data Warehousing in 2019
Trends for Modernizing Analytics and Data Warehousing in 2019
 
Bridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need ItBridging the Last Mile: Getting Data to the People Who Need It
Bridging the Last Mile: Getting Data to the People Who Need It
 
451 Research + NuoDB: What It Means to be a Container-Native SQL Database
451 Research + NuoDB: What It Means to be a Container-Native SQL Database451 Research + NuoDB: What It Means to be a Container-Native SQL Database
451 Research + NuoDB: What It Means to be a Container-Native SQL Database
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
 
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
Your Data is Waiting. What are the Top 5 Trends for Data in 2022? (ASEAN)
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
TechEvent DWH Modernization
TechEvent DWH ModernizationTechEvent DWH Modernization
TechEvent DWH Modernization
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 

Más de Michael Olschimke

Agiles Data Mining mit Data Vault 2.0
Agiles Data Mining mit Data Vault 2.0Agiles Data Mining mit Data Vault 2.0
Agiles Data Mining mit Data Vault 2.0Michael Olschimke
 
Introduction to Salesforce CRM Reporting
Introduction to Salesforce CRM ReportingIntroduction to Salesforce CRM Reporting
Introduction to Salesforce CRM ReportingMichael Olschimke
 
Introduction to Google Analytics
Introduction to Google AnalyticsIntroduction to Google Analytics
Introduction to Google AnalyticsMichael Olschimke
 
Business Concepts for Mobile Applications
Business Concepts for Mobile ApplicationsBusiness Concepts for Mobile Applications
Business Concepts for Mobile ApplicationsMichael Olschimke
 
Technology Concepts for Mobile Applications
Technology Concepts for Mobile ApplicationsTechnology Concepts for Mobile Applications
Technology Concepts for Mobile ApplicationsMichael Olschimke
 
Ethische Entscheidungskompetenz
Ethische EntscheidungskompetenzEthische Entscheidungskompetenz
Ethische EntscheidungskompetenzMichael Olschimke
 

Más de Michael Olschimke (9)

Agiles Data Mining mit Data Vault 2.0
Agiles Data Mining mit Data Vault 2.0Agiles Data Mining mit Data Vault 2.0
Agiles Data Mining mit Data Vault 2.0
 
Introduction to Salesforce CRM Reporting
Introduction to Salesforce CRM ReportingIntroduction to Salesforce CRM Reporting
Introduction to Salesforce CRM Reporting
 
Introduction to Google Analytics
Introduction to Google AnalyticsIntroduction to Google Analytics
Introduction to Google Analytics
 
Visual Data Vault
Visual Data VaultVisual Data Vault
Visual Data Vault
 
Introduction to Piwik
Introduction to PiwikIntroduction to Piwik
Introduction to Piwik
 
Business Concepts for Mobile Applications
Business Concepts for Mobile ApplicationsBusiness Concepts for Mobile Applications
Business Concepts for Mobile Applications
 
Technology Concepts for Mobile Applications
Technology Concepts for Mobile ApplicationsTechnology Concepts for Mobile Applications
Technology Concepts for Mobile Applications
 
Ethische Entscheidungskompetenz
Ethische EntscheidungskompetenzEthische Entscheidungskompetenz
Ethische Entscheidungskompetenz
 
Data Modeling Zone 2013
Data Modeling Zone 2013Data Modeling Zone 2013
Data Modeling Zone 2013
 

Último

Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 


Agile Data Mining with Data Vault 2.0 (english)

  • 1. AGILE DATA MINING WITH DATA VAULT 2.0 Timo Cirkel, Michael Olschimke Dörffler & Partner GmbH
  • 2. AGENDA Introduction Background Example Conclusion 12.02.2014 Agile Data Mining with Data Vault 2.0
  • 3. INTRODUCTION Agile Data Mining with Data Vault 2.0
  • 4. TIMO CIRKEL BI-Consultant Certified Data Vault 2.0 Practitioner Analysis Of Policyholders Specialized in CRM, Software Development, DWH Automation Industries: Insurance, Energy B. Sc. Business Informatics
  • 5. MICHAEL OLSCHIMKE Senior BI-Consultant Certified Data Vault 2.0 Practitioner Official Data Vault 2.0 Trainer in Europe Associate Teacher University of Hannover Specializing in Data Vault 2.0, Data Mining, CRM, project management Industries: Insurance, Automotive, Retail, Public Sector, Non-Profits
  • 6. DÖRFFLER & PARTNER GMBH • Medium-sized consulting firm • Official Partner of Dan Linstedt in Europe • Consulting, Training, Implementation • Industries: • Insurance • Automotive • Banks • Trade • Pharmaceuticals • Telecommunications
  • 7. BACKGROUND Agile Data Mining with Data Vault 2.0
  • 8. DATA MINING PROJECT IN THE VGH Motor insurance Customer segmentation A first data mining pilot, therefore: No specific requirements Vision is developed during project Agile Project Methodology Close co-operation with business
  • 9. DATA MINING PROJECTS • Extracting information from existing data and patterns • Four (large) categories: • Segmentation • Classification • Prediction • Association • Wide range of available algorithms and methods "The term Data Mining ... describes the extraction of implicitly existing, non-trivial and useful knowledge from large, dynamic, relatively complex structured data." [Diagram labels: Database, Application, User, Data mining techniques, Statements, rules & information, Data dictionary, Domain knowledge]
  • 10. DATA VAULT 2.0 MODELING Surrogate Key Business Keys Foreign Keys Descriptors Own representation in accordance with Linstedt, 2014
  • 11. DATA VAULT 2.0 METHODOLOGY Data Vault 2.0 Methodology Six Sigma TQM Scrum CMMI PMP SDLC
  • 12. DATA VAULT 2.0 METHODOLOGY FOR DATA MINING Advantages • Agile project management for DWH projects • Automation and generation • Rapid adaptation to changes in the model • Incremental build-out = incremental cost control • Targeted delivery = two week sprints • Predictable and measurable results Disadvantages • Focus on loading of raw data and the production of information • Not many data mining references • Many concepts in the methodology are not applicable for data mining projects • Difficult scaling of team sizes in data mining projects
  • 13. CRISP-DM Own representation in accordance with Chapman et al., 2000
  • 14. PROCESS MODEL Process model: VGH customer segmentation [Flowchart; labels translated from German: ivv, KTC, D & P; Start; Store data in Data Vault model; Extract data; Select algorithm; Run segmentation; Result achieved? (Yes/No); Present result; Result OK? (Yes/No); End; Develop quality function; Create SQL query; Determine relevant policyholder (VN) attributes; Formula OK? (Yes/No); Research algorithms; Suitable algorithm found? (Yes/No)]
  • 15. RAPIDMINER • Java-based data mining software • One of the most widely used data mining tools • Offers • Environment for control flow • Large number of algorithms • Large choice of data sources [Chart legend: Overall, Corporate, Consultants, Academics, NGO / Gov't. © 2012 Rexer Analytics]
  • 16. EXAMPLE Agile Data Mining with Data Vault 2.0
  • 17. EXAMPLE Simple Example • AdventureWorks database • Scenario: • Advertising campaign for a new bike • Identification of the target group • Solution: • Decision tree • Identify relevant attributes in several iterations Lachev, 2005, p. 238ff
  • 18. 10066 Records. Attributes: Marital Status, Gender, Yearly Income, Total Children, Education, Number Cars Owned, Commute Distance, Occupation, House Owner Flag, Age
  • 19. ITERATION 1: DATA VAULT 2.0 MODEL English Education Number Cars Owned Gender Marital Status Sat Customer Hub Customer Customer Key Commute Distance Age House Owner Flag English Occupation Sat Category Product Category
  • 20. ITERATION 1: RAPIDMINER PROCESS Data Gathering Data Preparation Modeling
  • 21. ITERATION 1: DECISION TREE MODEL
  • 22. ITERATION 1: RESULTS
  • 23. ITERATION 2: DATA VAULT 2.0 MODEL English Education Number Cars Owned Gender Marital Status Sat Customer Hub Customer Sat Customer Income Customer Key Commute Distance Age House Owner Flag English Occupation Sat Customer Children Sat Category Total Children Yearly Income Product Category
  • 24. ITERATION 2: RAPIDMINER PROCESS Data Gathering Preparation Modeling
  • 25. ITERATION 2: RESULTS +4.01%
  • 26. ITERATION 3: DATA VAULT 2.0 MODEL English Education Number Cars Owned Gender Marital Status Sat Customer Hub Customer Sat Customer Income Customer Key Commute Distance Age House Owner Flag English Occupation Sat Customer Children Sat Category Total Children Yearly Income Product Category Commute Distance Miles CSat Customer Distance
  • 27. ITERATION 3: RAPIDMINER PROCESS Data Gathering Preparation Modeling
  • 28. ITERATION 3: RESULTS +0.12%
  • 29. CONCLUSIONS Agile Data Mining with Data Vault 2.0
  • 30. CONCLUSIONS • Data Vault is a flexible data model, with good support for agile project methodology • Data Vault is not an additional hurdle in data mining projects • Additional attributes can be added at any time during the project, in an incremental fashion • Business Vault: transparent data processing
  • 31. FURTHER INFORMATION Appears 2015 Available www.doerffler.com www.datavault.de www.learndatavault.com Appears 2015
  • 32. Give us feedback http://goo.gl/LGO4ze Source: Vasilijonline.com
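
The incremental modeling shown in iterations 1 and 2 (slides 19 and 23) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the presenters' implementation: the table and column names (`hub_customer`, `customer_hk`, `sat_customer`, `sat_customer_income`, `load_date`), the sample rows, and the use of SQLite are all hypothetical. The point it tries to show is that a new attribute arrives as a new satellite, while the hub and the existing satellite stay untouched.

```python
import sqlite3


def build_iteration_1(conn):
    """Create the hub and a first satellite, then load two sample customers."""
    conn.executescript("""
        CREATE TABLE hub_customer (
            customer_hk  TEXT PRIMARY KEY,     -- surrogate key (hypothetical name)
            customer_key TEXT NOT NULL UNIQUE  -- business key
        );
        CREATE TABLE sat_customer (
            customer_hk    TEXT NOT NULL REFERENCES hub_customer(customer_hk),
            load_date      TEXT NOT NULL,
            gender         TEXT,
            marital_status TEXT,
            age            INTEGER,
            PRIMARY KEY (customer_hk, load_date)
        );
    """)
    conn.executemany("INSERT INTO hub_customer VALUES (?, ?)",
                     [("hk1", "C-1"), ("hk2", "C-2")])
    conn.executemany("INSERT INTO sat_customer VALUES (?, ?, ?, ?, ?)",
                     [("hk1", "2014-01-01", "F", "M", 41),
                      ("hk2", "2014-01-01", "M", "S", 29)])


def add_income_satellite(conn):
    """Iteration 2: add a new satellite; existing tables are not altered."""
    conn.executescript("""
        CREATE TABLE sat_customer_income (
            customer_hk   TEXT NOT NULL REFERENCES hub_customer(customer_hk),
            load_date     TEXT NOT NULL,
            yearly_income INTEGER,
            PRIMARY KEY (customer_hk, load_date)
        );
    """)
    # Only one customer has income data so far (made-up sample value).
    conn.execute("INSERT INTO sat_customer_income VALUES (?, ?, ?)",
                 ("hk1", "2014-01-15", 60000))


def mining_rows(conn):
    """Flatten hub and satellites into one row per customer for the mining tool."""
    return conn.execute("""
        SELECT h.customer_key, s.gender, s.marital_status, s.age, i.yearly_income
        FROM hub_customer h
        JOIN sat_customer s ON s.customer_hk = h.customer_hk
        LEFT JOIN sat_customer_income i ON i.customer_hk = h.customer_hk
        ORDER BY h.customer_key
    """).fetchall()


conn = sqlite3.connect(":memory:")
build_iteration_1(conn)
add_income_satellite(conn)
rows = mining_rows(conn)
```

The sketch loads a single satellite row per customer. With several load dates per customer, the flattening query would additionally have to pick the current row of each satellite.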

Editor's notes

  1. On these slides, only replace the logos. We have no time to try out or discuss a new design.
  2. Comment briefly on the data mining project at VGH. Point out the BI Spectrum article. Objectives of the project, tools used, CRISP-DM was used, etc. If necessary, open further slides. Name the insurance company? No specific requirements. Attributes evolve over time. "Customer" was not precisely defined at first: only private clients, or companies too? Policyholders or vehicle owners? What kinds of contracts? What makes a "good" customer?
  3. Briefly explain hubs, links, and satellites, with VDV. Take a look in the Sources folder; there is material there that you can use.
  4. We cannot present any data or findings from VGH, so we fall back on AdventureWorks. The setup was taken from the book.
  5. Comment briefly on the AdventureWorks DW. Background information. Model of the relevant tables. 25 attributes, 500k records.
  6. Comment on the first DV model.
  7. Demo in RapidMiner. Also comment on measures (accuracy, or precision/recall). Ideally, show them graphically in RapidMiner.
  8. Scatter matrix. Confusion matrix (performance matrix).
  9. Comment on the changes to the DV model. Show what it looks like afterwards. Make the changes comprehensible (via animations).
  10. Demo in RapidMiner. Also comment on measures (accuracy, or precision/recall). Ideally, show them graphically in RapidMiner.
  11. Comment on the changes to the DV model. Show what it looks like afterwards. Make the changes comprehensible (via animations).
  12. Demo in RapidMiner. Also comment on measures (accuracy, or precision/recall). Ideally, show them graphically in RapidMiner.
  13. What are the benefits of the approach? Refer to the VGH project, but also to the demo.
  14. TBC: Revise link (I'll do it).
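
The measures named in the notes above (accuracy, precision, recall, confusion matrix) can be illustrated with a small sketch. It assumes a binary target ("bike" buyer vs. "other"); the labels and predictions are made up for illustration, and the function names are hypothetical and not part of RapidMiner.

```python
from collections import Counter


def confusion_counts(actual, predicted, positive):
    """Count true/false positives and negatives for one positive class."""
    counts = Counter()
    for a, p in zip(actual, predicted):
        if p == positive:
            counts["tp" if a == positive else "fp"] += 1
        else:
            counts["fn" if a == positive else "tn"] += 1
    return counts


def metrics(counts):
    """Derive accuracy, precision and recall from the confusion counts."""
    tp, fp, fn, tn = (counts[k] for k in ("tp", "fp", "fn", "tn"))
    total = tp + fp + fn + tn
    return {
        "accuracy": (tp + tn) / total if total else 0.0,
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


# Made-up example data: actual class vs. class predicted by a model.
actual = ["bike", "bike", "other", "other", "bike", "other"]
predicted = ["bike", "other", "other", "bike", "bike", "other"]

counts = confusion_counts(actual, predicted, positive="bike")
result = metrics(counts)
```

Comparing these values between iterations is one way to read improvements such as those reported on the results slides, although the slides do not state which measure the percentages refer to.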