What is Big Data? Big Data Stack
Companies Using Big Data
• Churn Reduction and Customer Retention
• Natural Language Processing and Sentiment Analysis
• Targeted Advertising and Marketing Optimisation
• Personal Recommendation
• Fraud Detection and Prevention
• Social Media and Game Analytics
• Risk and Exposure Analysis
• Real time Insights and Reactive Processing
Industry Use Cases
Enterprise Data Lake
Big Data Vision
Centralised High Speed Analytics Hub
Periodic Analytics
Real-time Insight
Stakeholder Dashboard
N2
N4
N1
N3
Multiple Data Sources
DIVIDE CONQUER INSIGHT
DATA DROPBOX
Split Data into Blocks
Replicate and Store
Petabytes of Resilience
DATA EXPLORE
1000s of Parallel Threads
Explore Every Path
Machine Learning
DATA INSIGHT
Real Time Action
Periodic Dashboards
Iterative Evolution
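
The "Split Data into Blocks" and "Replicate and Store" steps can be pictured with a small Python sketch. It is only an illustration: the node names N1 to N4 are borrowed from the data lake diagram, while the block size, the replication factor of 3, the round-robin placement, and all function names are assumptions that the slides do not specify.

```python
def split_into_blocks(data: bytes, block_size: int) -> list[bytes]:
    """Cut the input into fixed-size blocks (the last block may be shorter)."""
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    return [data[i:i + block_size] for i in range(0, len(data), block_size)]


def place_replicas(num_blocks: int, nodes: list[str], replication: int = 3) -> dict[int, list[str]]:
    """Assign each block to `replication` distinct nodes, round-robin (assumed policy)."""
    if replication > len(nodes):
        raise ValueError("replication cannot exceed the number of nodes")
    return {
        block: [nodes[(block + r) % len(nodes)] for r in range(replication)]
        for block in range(num_blocks)
    }


def readable_after_failure(placement: dict[int, list[str]], failed_node: str) -> bool:
    """True if every block still has at least one replica on a surviving node."""
    return all(
        any(node != failed_node for node in replicas)
        for replicas in placement.values()
    )


if __name__ == "__main__":
    data = b"transactions, interactions, observations"
    blocks = split_into_blocks(data, block_size=8)
    placement = place_replicas(len(blocks), ["N1", "N2", "N3", "N4"])
    for block_id, replicas in placement.items():
        print(block_id, blocks[block_id], replicas)
    print("survives loss of N2:", readable_after_failure(placement, "N2"))
```

Because every block lives on several nodes, the loss of a single node leaves all data readable, which is the sense of "resilience" on the slide.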
ENTERPRISE BIG DATA LAKE
REFINE EXPLORE ENRICH
BATCH INTERACTIVE ONLINE
OPERATIONAL DATA SOURCES
Transactions, Interactions, Observations
time between load and access of data
INSIGHT
Enterprise Big Data Usage Patterns
DATASOURCES
Traditional Sources
(RDBMS, OLTP, OLAP)
New Sources
(weblogs, email, social media, forum)
DATASYSTEMS
RDBMS EDW MPP
TRADITIONAL REPOS
ENTERPRISE
BIG DATA
PLATFORM
APPLICATIONS
Business
Analytics
Custom
Applications
Enterprise
Applications
Incumbent Enterprise Data Warehouse
1
2
3
Traditional enterprise data warehousing
“Schema first, data last” approach to
loading data
1 Extract, Transform & Load
2 Schema and Join
3 Deliver
REFINE EXPLORE ENRICH
DATASOURCES
Traditional Sources
(RDBMS, OLTP, OLAP)
New Sources
(weblogs, email, social media, forum)
DATASYSTEMS
RDBMS EDW MPP
TRADITIONAL REPOS
ENTERPRISE
BIG DATA
PLATFORM
APPLICATIONS
Business
Analytics
Custom
Applications
Enterprise
Applications
Operational Data Reservoir
REFINE EXPLORE ENRICH
1
2
3
Transform & refine ALL sources of data
“Data first, schema last” approach to
loading data.
Schema created on demand based on case
1 Capture
2 Process
3 Distribute & Retain
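
The contrast between "schema first, data last" and "data first, schema last" can be sketched in a few lines of Python. This is a toy illustration under stated assumptions: the sample records, field names, and function names are all hypothetical and do not come from the slides.

```python
import json

# Hypothetical raw events; the field names are assumptions for illustration.
RAW_EVENTS = [
    '{"user": "a1", "amount": "19.99", "channel": "web"}',
    '{"user": "b2", "amount": "5.00"}',
    '{"user": "c3", "comment": "great service"}',
]


def load_schema_first(lines, required=("user", "amount")):
    """Schema first: records must fit the fixed schema when they are loaded."""
    table, rejected = [], []
    for line in lines:
        record = json.loads(line)
        if all(key in record for key in required):
            table.append({"user": record["user"], "amount": float(record["amount"])})
        else:
            rejected.append(line)  # data that does not fit never reaches the table
    return table, rejected


def load_data_first(lines):
    """Data first: keep every raw record exactly as it arrived."""
    return list(lines)


def read_with_schema(raw_lines, fields):
    """Schema last: project the fields a given use case needs at read time."""
    for line in raw_lines:
        record = json.loads(line)
        yield {field: record.get(field) for field in fields}


if __name__ == "__main__":
    table, rejected = load_schema_first(RAW_EVENTS)
    print(len(table), "rows loaded,", len(rejected), "rejected")

    lake = load_data_first(RAW_EVENTS)
    for row in read_with_schema(lake, ("user", "comment")):
        print(row)
```

In the schema-first load, the record without an amount is dropped at load time; in the data-first load it is retained, and a later use case can still read its comment field.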
DATASOURCES
Traditional Sources
(RDBMS, OLTP, OLAP)
New Sources
(weblogs, email, social media, forum)
DATASYSTEMS
RDBMS EDW MPP
TRADITIONAL REPOS
ENTERPRISE
BIG DATA
PLATFORM
APPLICATIONS
Business
Analytics
Custom
Applications
Enterprise
Applications
Transformational Data Refactory
REFINE EXPLORE ENRICH
1
2
3
Leverage “data lake” to perform iterative
investigation for value
“Direct to data” approach to access the data
from applications
1 Capture
2 Process
3 Explore & Visualise
DATASOURCES
Traditional Sources
(RDBMS, OLTP, OLAP)
New Sources
(weblogs, email, social media, forum)
DATASYSTEMS
RDBMS EDW MPP
TRADITIONAL REPOS
ENTERPRISE
BIG DATA
PLATFORM
APPLICATIONS
Business
Analytics
Custom
Applications
Enterprise
Applications
Low Latency Reactive Data
REFINE EXPLORE ENRICH
1
2
3
Create intelligent applications
Collect data, create analytical models and
deliver to online applications
“Reactive Data” or “Active Data” approach
1 Capture
2 Process & Compute
3 Deliver in Real Time
NOSQL
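
The capture, compute, and deliver steps of the reactive pattern might look roughly like the Python sketch below. Everything here is an assumption made for illustration: a plain dictionary stands in for the NoSQL store, the "model" is just an average spend per user, and the function names are hypothetical.

```python
from collections import defaultdict


def capture(stream):
    """Step 1: collect (user, amount) events from an incoming stream."""
    return list(stream)


def compute_model(events):
    """Step 2: build a simple analytical model, here the average spend per user."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for user, amount in events:
        totals[user] += amount
        counts[user] += 1
    return {user: totals[user] / counts[user] for user in totals}


def publish(model, store):
    """Push the computed model into the serving store (a dict standing in for NoSQL)."""
    store.update(model)


def deliver(store, user, default=None):
    """Step 3: answer an online application with a single key lookup."""
    return store.get(user, default)


if __name__ == "__main__":
    serving_store = {}
    events = capture([("a1", 10.0), ("a1", 20.0), ("b2", 5.0)])
    publish(compute_model(events), serving_store)
    print(deliver(serving_store, "a1"))
    print(deliver(serving_store, "unknown-user"))
```

The design point is that the expensive computation happens ahead of time, so the online application only performs a fast key lookup when it needs an answer.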
DATASOURCES
Traditional Sources
(RDBMS, OLTP, OLAP)
New Sources
(weblogs, email, social media, forum)
DATASYSTEMS
ENTERPRISE
BIG DATA
PLATFORM
APPLICATIONS
Tool Integration
OPERATIONAL TOOLS
DEV & DATA TOOLS
understand customer preferences
embrace diversity and complexity react in real-time
1
3
2
Harness your Data
drive strategic business direction
create data value
improve customer experience
STAY AHEAD
& INNOVATE

Data Warehouse to Data Science

  • 1. What is Big Data? Big Data Stack Companies Using Big Data • Churn Reduction and Customer Retention • Natural Language Processing and Sentiment Analysis • Targeted Advertising and Marketing Optimisation • Personal Recommendation • Fraud Detection and Prevention • Social Media and Game Analytics • Risk and Exposure Analysis • Real time Insights and Reactive Processing Industry Use Cases
  • 2. Enterprise Data Lake Big Data Vision Centralised High Speed Analytics Hub Periodic Analytics Real-time Insight Stakeholder Dashboard N2 N4 N1 N3 Multiple Data Sources
  • 3. DIVIDE CONQUER INSIGHT DATA DROPBOX Split Data into Blocks Replicate and Store Petabytes of Resilience DATA EXPLORE 1000s of Parallel Threads Explore Every Path Machine Learning DATA INSIGHT Real Time Action Periodic Dashboards Iterative Evolution
  • 4. ENTERPRISE BIG DATA LAKE REFINE EXPLORE ENRICH BATCH INTERACTIVE ONLINE OPERATIONAL DATA SOURCES Transactions, Interactions, Observations time between load and access of data INSIGHT Enterprise Big Data Usage Patterns
  • 5. DATASOURCES Traditional Sources (RDBMS, OLTP, OLAP) New Sources (weblogs, email, social media, forum) DATASYSTEMS RDBMS EDW MPP TRADITIONAL REPOS ENTERPRISE BIG DATA PLATFORM APPLICATIONS Business Analytics Custom Applications Enterprise Applications Incumbent Enterprise Data Warehouse 1 2 3 Traditional enterprise data warehousing “Schema first, data last” approach to loading data 1 Extract, Transform & Load 2 Schema and Join 3 Deliver REFINE EXPLORE ENRICH
  • 6. DATASOURCES Traditional Sources (RDBMS, OLTP, OLAP) New Sources (weblogs, email, social media, forum) DATASYSTEMS RDBMS EDW MPP TRADITIONAL REPOS ENTERPRISE BIG DATA PLATFORM APPLICATIONS Business Analytics Custom Applications Enterprise Applications Operational Data Reservoir REFINE EXPLORE ENRICH 1 2 3 Transform & refine ALL sources of data “Data first, schema last” approach to loading data. Schema created on demand based on case 1 Capture 2 Process 3 Distribute & Retain
  • 7. DATASOURCES Traditional Sources (RDBMS, OLTP, OLAP) New Sources (weblogs, email, social media, forum) DATASYSTEMS RDBMS EDW MPP TRADITIONAL REPOS ENTERPRISE BIG DATA PLATFORM APPLICATIONS Business Analytics Custom Applications Enterprise Applications Transformational Data Refactory REFINE EXPLORE ENRICH 1 2 3 Leverage “data lake” to perform iterative investigation for value “Direct to data” approach to access the data from applications 1 Capture 2 Process 3 Explore & Visualise
  • 8. DATASOURCES Traditional Sources (RDBMS, OLTP, OLAP) New Sources (weblogs, email, social media, forum) DATASYSTEMS RDBMS EDW MPP TRADITIONAL REPOS ENTERPRISE BIG DATA PLATFORM APPLICATIONS Business Analytics Custom Applications Enterprise Applications Low Latency Reactive Data REFINE EXPLORE ENRICH 1 2 3 Create intelligent applications Collect data, create analytical models and deliver to online applications “Reactive Data” or “Active Data” approach 1 Capture 2 Process & Compute 3 Deliver in Real Time NOSQL
  • 9. DATASOURCES Traditional Sources (RDBMS, OLTP, OLAP) New Sources (weblogs, email, social media, forum) DATASYSTEMS ENTERPRISE BIG DATA PLATFORM APPLICATIONS Tool Integration OPERATIONAL TOOLS DEV & DATA TOOLS
  • 10. understand customer preferences embrace diversity and complexity react in real-time 1 3 2 Harness your Data drive strategic business direction create data value improve customer experience STAY AHEAD & INNOVATE

Editor's notes

  1. Real-time insights, real-time platform. Chandan to explain the process of the data hub.