SlideShare una empresa de Scribd logo
1 de 25
Descargar para leer sin conexión
1© Copyright 2012 EMC Corporation. All rights reserved.
Create Your Big Data
Vision And Hadoop-ify
Your Data Warehouse
Jeff Kelly, Principal Research Contributor
The Wikibon Project
Bill Schmarzo, CTO
EIMA Practice, EMC Professional
Services
2© Copyright 2012 EMC Corporation. All rights reserved.
Agenda
Ÿ  Current Market Observations
Ÿ  The Big Data Business Maturity Index
and How to Identify Your Best Use Case
Ÿ  Get Started With Hadoop and Other New
Technologies
Ÿ  What Should You Look For in a Vendor?
Ÿ  Q&A
3© Copyright 2012 EMC Corporation. All rights reserved.
Current Market
Observations
Jeff Kelly
4© Copyright 2012 EMC Corporation. All rights reserved.
Big Data Market Size
2012
$11.4b
2013
$18.2b
2017
$48b
ü  59% Growth Y-o-Y 2011 to
2012
ü  Forecast 60%+ Growth in
2013
ü  31% CAGR Forecast 2012
through 2017
2014
$28b
2015
$37.9b
2016
$43.7b
Source: Wikibon Big Data Vendor Revenue and Market Forecast, 2012-2017
5© Copyright 2012 EMC Corporation. All rights reserved.
Big Data Market Segmentation, 2012
Services Leading the Way
Professional
Services
$3,784m
34%
Cloud and SaaS
$608m
5%
Pro. Services
Compute
Storage
Networking
Database
Applications
Data mgt.
Cloud
n = $11,400m
6© Copyright 2012 EMC Corporation. All rights reserved.
Big Data Growth Drivers
ü  Increased Awareness and Investments
By Large Enterprises Beyond the Web
ü  Retailers like Sears leveraging Big Data for
price optimization.
ü  Financial services firms, including JPMC, Morgan
Stanley and BoA, conduct fraud analysis, risk
profiling and more.
ü  Pharmaceutical including Bristol Myers Squibb
makers use Big Data to support drug
development.
ü  Continued Investment by Web Pioneers and Three Letter Agencies
ü  Google alone spent $1b+ on infrastructure in Q4 2012.
ü  “Everything we do is a Big Data problem.” – Jay Parikh, VP of Engineering, Facebook
ü  CIA CTO Ira Hunt: Our mission is to “collect everything and hang on to it forever.”
7© Copyright 2012 EMC Corporation. All rights reserved.
Big Data Growth Drivers, Cont.
ü  Increasingly Sophisticated Professional Services
ü  Professional services building on experience of assisting early adopters.
ü  Some (but not all) are vendor and product agnostic.
ü  Focusing on identifying use cases, improving communication, and leveraging
existing assets.
ü  Technology Maturation
ü  Open source community and vendors making
Hadoop enterprise-ready, easier to use.
ü  Better integration between Big Data and
existing IT infrastructure.
ü  Extending Big Data accessibility to business
users via BI and data visualization tools.
Consulting
Training &
Educations
Integration
8© Copyright 2012 EMC Corporation. All rights reserved.
Big Data Growth Inhibitors
ü  Lack of Data Scientists and Big Data
Practitioners
ü  Big Data Technology Still Complex, Difficult to
Manage/Use
ü  Organizational Resistance to Data-Driven
Decision Making
ü  Confusion Due to Vendor Marketing and “Big
Data Washing”
Big Data [Your Product Name Here]
9© Copyright 2012 EMC Corporation. All rights reserved.
The Big Data Business
Maturity Index and How to
Identify Your Best Use Case
Bill Schmarzo
10© Copyright 2012 EMC Corporation. All rights reserved.
Business
Metamorphosis
Data
Monetization
Business
Optimization
Business
Insights
Business
Monitoring
Monitoring business
performance to flag
areas of interest
Big Data Business Model Maturation Index
Integrate insights &
recommendations
into existing
business processes
Embed analytics to
optimize select
business processes
Leverage insights to
identify new revenue
opportunities
Transform customer and
product insights to
move into new markets
Measures the degree to which the organization has
integrated big data and advanced analytics into
their business model
11© Copyright 2012 EMC Corporation. All rights reserved.
How to Identify Your Best Use Case
The Big Data strategy document ensures a tight linkage between your
organization’s business initiatives and your big data strategy
•  Big	
  data	
  business	
  cases,	
  ROI	
  and	
  
analy4c	
  requirements	
  
•  Key	
  Performance	
  Indicators	
  and	
  
leading	
  metrics	
  	
  
•  Business	
  ques4ons	
  with	
  metrics,	
  
dimensions,	
  hierarchies	
  
•  Business	
  decisions,	
  decision	
  flow/
process	
  and	
  UEX	
  requirements	
  
•  Analy4c	
  algorithms	
  and	
  modeling	
  
requirements	
  
•  Required	
  data	
  sources	
  
Business Strategy: Provide Unique Starbucks Customer Experience
Business Initiatives:
•  Increase number of “Gold Card” customers
•  Increase “Gold Card” customer revenue & engagement (store visits, spend per visit, advocacy)
Mobile App
• 
• 
Social Media
• 
• 
Store Sales
• 
• 
Customer Loyalty
• 
• 
Collect customer engagement information through multiple channels (store, web, mobile)
Profile and micro-segment customers to improve marketing and offers effectiveness
Analyze social media data to identify and monitor brand advocates
Monitor and adjust customer engagement effectiveness (visits, revenue, margin, advocacy)
Tasks
Develop intimate knowledge of “Gold Card”
customers life stage, behaviors and interests
Act upon intimate knowledge of “Gold Card”
customers to increase store revenues
•  Expand customer data collection points
•  Leverage “gold card” member transactions, feedback (surveys) and social data
•  Integrate customer-specific insights back into operational, management and loyalty systems
Outcomes & CSF’s
12© Copyright 2012 EMC Corporation. All rights reserved.
Get Started With Hadoop and
Other New Technologies
Bill Schmarzo
13© Copyright 2012 EMC Corporation. All rights reserved.
A Playbook For Modernizing Your Data Warehouse With
New Big Data Technologies And Capabilities
#1) Enhance data warehouse with new unstructured data metrics
#2) Data virtualization to extend existing data warehouse environment
#3) MPP RDBMS to increase data platform scalability and agility
#4) In-database analytics to accelerate analytic development
#5) Hadoop to create the next generation Operational Data Store
14© Copyright 2012 EMC Corporation. All rights reserved.
#1) Enhance Data Warehouse With New Unstructured Data Metrics
Leverage HDFS to provide a single platform that supports your traditional SQL-
based BI environment plus your growing unstructured data needs at scale
HDFS
HBase
Pig, Hive,
Mahout
Map Reduce
Sqoop Flume
Resource
Management
& Workflow
Yarn
Zookeeper
Apache
Pivotal HD
Configure,
Deploy,
Monitor,
Manage
Command
Center
Hadoop Virtualization (HVE)
DataLoader
Xtension
Framework
Catalog
Services
Query
Optimizer
Dynamic Pipelining
ANSI SQL + Analytics
HAWQ – Advanced
Database Services
15© Copyright 2012 EMC Corporation. All rights reserved.
ETL
Cached
Streaming Data
Unified Data Platform
Data Source
Real-Time Visualization
Advanced Analytics and Modeling
Data Source
CEP/
Workflow
Data Federation Tool
Semantic Master
Data
Discovery /
Data Mapping
Data
Source
Data
Source
#2) Extend Existing Data Warehouse Via Data Virtualization
Leverage data federation tools to speed data discovery and analysis via virtual, on-
demand access to data sources outside your EDW
16© Copyright 2012 EMC Corporation. All rights reserved.
•  Massively Parallel
Processing (MPP), scale-
out architectures provide
cost effective options for
managing and analyzing
massive data volumes
•  MPP data warehouses
provide linear scalability on
general purpose,
commodity systems (e.g.,
fault-tolerant scale out
environment; automatic
parallelization; I/O
optimized)
#3) Massively Parallel Processing (MPP) Relational Databases
17© Copyright 2012 EMC Corporation. All rights reserved.
#4) In-Database Processing And Analytics
Conventional: A Data Scientist needs to move 1 TB of data from a 5-
processor database server to the analytical server at 1 gigabytes per
second (Gbs)
In-Database: A Data Scientist leaves the 1 TB data in the 5-processor
database server and runs the same algorithm directly in the database
0 20 40 60 80 100 120 140 180160 200
Data Movement Time = (1TB x 8) / 1Gbs / 60 s = 133.3 minutes Processing Time = 60 minutes
12
minutes
Total Time = 193.3 minutes
Time
(minutes)
Conventional
In-Database
18© Copyright 2012 EMC Corporation. All rights reserved.
Hadoop
Data
Store Analytics Environment
Data Preparation
and Enrichment
ALL data fed
into Hadoop
Data Store
EDWETL
Analytic
Sandbox
BI Environment
•  Production
•  Predictable load
•  SLA-drive
•  Standard tools
•  Exploratory, Ad Hoc
•  Unpredictable load
•  Experimentation
•  Best tool for the job
#5) Next Gen Operational Data Store/Data Prep With Hadoop
Feeds production BI and Enterprise
Data Warehouse environment and high-
velocity Analytics Sandbox
19© Copyright 2012 EMC Corporation. All rights reserved.
How To Get Started
20© Copyright 2012 EMC Corporation. All rights reserved.
EMC Big Data Analytics Strategy And Implementation Services
Analytics
Operationalization
Identify current state, determine required
state and conduct gap analysis to develop
analytics implementation roadmap
Analytics
Lab
Deploy analytics sandbox
to quantify the business
case
Vision
Workshop
Identify big data
analytics business
use cases
Repeat the process for
identified business cases
21© Copyright 2012 EMC Corporation. All rights reserved.
What Should You Look
For in a Vendor?
Jeff Kelly
22© Copyright 2012 EMC Corporation. All rights reserved.
Advice for Selecting Big Data Vendors
ü  Balance short-term goals with long-term vision.
ü  Objectives are:
ü  Quick, demonstrable ROI.
ü  Sustainable Big Data practice.
ü  Don’t get hung up on “speeds and feeds” or feature-by-feature comparisons.
ü  Focus on substance, flexibility, commitment and experience.
23© Copyright 2012 EMC Corporation. All rights reserved.
Selecting Big Data Vendors, Cont.
ü  Evaluate products portfolios based on:
ü  Ability to monetize existing and future data assets.
ü  Ability to integrate with and compliment existing data management technology.
ü  Accessibility to power users and business users alike (depending on use case).
ü  Ability to apply information governance and security best practices.
ü  Select service providers with track records of assisting enterprises adopt data-
driven culture as well as technology.
24© Copyright 2012 EMC Corporation. All rights reserved.
To type a question via WebEx, click on the Q&A tab
Please select “Ask: All Panelists”
to ensure your questions reach us. Thank you!
Questions and Answers
25© Copyright 2012 EMC Corporation. All rights reserved.
Learn More…
Ÿ  See us at…
–  EMC World, May 5-9 www.emc.world.com
Ÿ  Contact Jeff Kelly
–  Email: jeff.kelly@wikibon.org
–  LinkedIn: http://www.linkedin.com/in/jeffreyfkelly/
–  Twitter: @jeffreyfkelly
–  Research: http://www.wikibon.org/bigdata
Ÿ  Contact Bill Schmarzo
–  Email: william.schmarzo@emc.com
–  LinkedIn: http://www.linkedin.com/in/schmarzo
–  Twitter: @schmarzo
–  Blog: http://infocus.emc.com/author/william_schmarzo/

Más contenido relacionado

La actualidad más candente

How Cloud BI Powers Today's Agile Enterprise
How Cloud BI Powers Today's Agile EnterpriseHow Cloud BI Powers Today's Agile Enterprise
How Cloud BI Powers Today's Agile EnterpriseGoodData
 
Reveal the Intelligence in your Data with Talend Data Fabric
Reveal the Intelligence in your Data with Talend Data FabricReveal the Intelligence in your Data with Talend Data Fabric
Reveal the Intelligence in your Data with Talend Data FabricJean-Michel Franco
 
Making big data work
Making big data work Making big data work
Making big data work Ed Thewlis
 
Accelerating Self-Service Analytics with Denodo and Tableau (Singapore)
Accelerating Self-Service Analytics with Denodo and Tableau (Singapore)Accelerating Self-Service Analytics with Denodo and Tableau (Singapore)
Accelerating Self-Service Analytics with Denodo and Tableau (Singapore)Denodo
 
Big Data LDN 2017: Data Integration & Big Data Management
Big Data LDN 2017: Data Integration & Big Data ManagementBig Data LDN 2017: Data Integration & Big Data Management
Big Data LDN 2017: Data Integration & Big Data ManagementMatt Stubbs
 
Réinventez le Data Management avec la Data Virtualization de Denodo
Réinventez le Data Management avec la Data Virtualization de DenodoRéinventez le Data Management avec la Data Virtualization de Denodo
Réinventez le Data Management avec la Data Virtualization de DenodoDenodo
 
Liberating data with Talend Data Catalog
Liberating data with Talend Data CatalogLiberating data with Talend Data Catalog
Liberating data with Talend Data CatalogJean-Michel Franco
 
Pieter den Hamer Alliander
Pieter den Hamer Alliander Pieter den Hamer Alliander
Pieter den Hamer Alliander BigDataExpo
 
Big Data – From Strategy to Production
Big Data – From Strategy to ProductionBig Data – From Strategy to Production
Big Data – From Strategy to ProductionSemantic Web Company
 
Big Data LDN 2017: Big Impact with Big Data
Big Data LDN 2017: Big Impact with Big DataBig Data LDN 2017: Big Impact with Big Data
Big Data LDN 2017: Big Impact with Big DataMatt Stubbs
 
An Overview of the Neo4j Cloud Strategy and the Future of Graph Databases in ...
An Overview of the Neo4j Cloud Strategy and the Future of Graph Databases in ...An Overview of the Neo4j Cloud Strategy and the Future of Graph Databases in ...
An Overview of the Neo4j Cloud Strategy and the Future of Graph Databases in ...Neo4j
 
Big Data LDN 2017: How to leverage the cloud for Business Solutions
Big Data LDN 2017: How to leverage the cloud for Business SolutionsBig Data LDN 2017: How to leverage the cloud for Business Solutions
Big Data LDN 2017: How to leverage the cloud for Business SolutionsMatt Stubbs
 
David Waxman Keynote
David Waxman KeynoteDavid Waxman Keynote
David Waxman KeynoteData Con LA
 
Reinventing Payments at HSBC with a Unified Platform for Data and AI in the C...
Reinventing Payments at HSBC with a Unified Platform for Data and AI in the C...Reinventing Payments at HSBC with a Unified Platform for Data and AI in the C...
Reinventing Payments at HSBC with a Unified Platform for Data and AI in the C...Databricks
 
Neo4j Aura Enterprise
Neo4j Aura EnterpriseNeo4j Aura Enterprise
Neo4j Aura EnterpriseNeo4j
 
Big Data LDN 2017: Pervasive Intelligence: the Future of Big Data, Machine Le...
Big Data LDN 2017: Pervasive Intelligence: the Future of Big Data, Machine Le...Big Data LDN 2017: Pervasive Intelligence: the Future of Big Data, Machine Le...
Big Data LDN 2017: Pervasive Intelligence: the Future of Big Data, Machine Le...Matt Stubbs
 
Eneco Ronald Root
Eneco Ronald RootEneco Ronald Root
Eneco Ronald RootBigDataExpo
 
Revolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleRevolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleBardess Group
 
Delivering data you can trust for data privacy
Delivering data you can trust for data privacy Delivering data you can trust for data privacy
Delivering data you can trust for data privacy Jean-Michel Franco
 

La actualidad más candente (20)

How Cloud BI Powers Today's Agile Enterprise
How Cloud BI Powers Today's Agile EnterpriseHow Cloud BI Powers Today's Agile Enterprise
How Cloud BI Powers Today's Agile Enterprise
 
Reveal the Intelligence in your Data with Talend Data Fabric
Reveal the Intelligence in your Data with Talend Data FabricReveal the Intelligence in your Data with Talend Data Fabric
Reveal the Intelligence in your Data with Talend Data Fabric
 
Making big data work
Making big data work Making big data work
Making big data work
 
Accelerating Self-Service Analytics with Denodo and Tableau (Singapore)
Accelerating Self-Service Analytics with Denodo and Tableau (Singapore)Accelerating Self-Service Analytics with Denodo and Tableau (Singapore)
Accelerating Self-Service Analytics with Denodo and Tableau (Singapore)
 
Big Data LDN 2017: Data Integration & Big Data Management
Big Data LDN 2017: Data Integration & Big Data ManagementBig Data LDN 2017: Data Integration & Big Data Management
Big Data LDN 2017: Data Integration & Big Data Management
 
Réinventez le Data Management avec la Data Virtualization de Denodo
Réinventez le Data Management avec la Data Virtualization de DenodoRéinventez le Data Management avec la Data Virtualization de Denodo
Réinventez le Data Management avec la Data Virtualization de Denodo
 
Liberating data with Talend Data Catalog
Liberating data with Talend Data CatalogLiberating data with Talend Data Catalog
Liberating data with Talend Data Catalog
 
Pieter den Hamer Alliander
Pieter den Hamer Alliander Pieter den Hamer Alliander
Pieter den Hamer Alliander
 
Big Data – From Strategy to Production
Big Data – From Strategy to ProductionBig Data – From Strategy to Production
Big Data – From Strategy to Production
 
Big Data LDN 2017: Big Impact with Big Data
Big Data LDN 2017: Big Impact with Big DataBig Data LDN 2017: Big Impact with Big Data
Big Data LDN 2017: Big Impact with Big Data
 
An Overview of the Neo4j Cloud Strategy and the Future of Graph Databases in ...
An Overview of the Neo4j Cloud Strategy and the Future of Graph Databases in ...An Overview of the Neo4j Cloud Strategy and the Future of Graph Databases in ...
An Overview of the Neo4j Cloud Strategy and the Future of Graph Databases in ...
 
Big Data LDN 2017: How to leverage the cloud for Business Solutions
Big Data LDN 2017: How to leverage the cloud for Business SolutionsBig Data LDN 2017: How to leverage the cloud for Business Solutions
Big Data LDN 2017: How to leverage the cloud for Business Solutions
 
David Waxman Keynote
David Waxman KeynoteDavid Waxman Keynote
David Waxman Keynote
 
Reinventing Payments at HSBC with a Unified Platform for Data and AI in the C...
Reinventing Payments at HSBC with a Unified Platform for Data and AI in the C...Reinventing Payments at HSBC with a Unified Platform for Data and AI in the C...
Reinventing Payments at HSBC with a Unified Platform for Data and AI in the C...
 
Neo4j Aura Enterprise
Neo4j Aura EnterpriseNeo4j Aura Enterprise
Neo4j Aura Enterprise
 
Big Data LDN 2017: Pervasive Intelligence: the Future of Big Data, Machine Le...
Big Data LDN 2017: Pervasive Intelligence: the Future of Big Data, Machine Le...Big Data LDN 2017: Pervasive Intelligence: the Future of Big Data, Machine Le...
Big Data LDN 2017: Pervasive Intelligence: the Future of Big Data, Machine Le...
 
Eneco Ronald Root
Eneco Ronald RootEneco Ronald Root
Eneco Ronald Root
 
Big data&DaaS
Big data&DaaSBig data&DaaS
Big data&DaaS
 
Revolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleRevolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus Example
 
Delivering data you can trust for data privacy
Delivering data you can trust for data privacy Delivering data you can trust for data privacy
Delivering data you can trust for data privacy
 

Destacado

Lecture on Data Science in a Data-Driven Culture
Lecture on Data Science in a Data-Driven Culture Lecture on Data Science in a Data-Driven Culture
Lecture on Data Science in a Data-Driven Culture Johan Himberg
 
How to reach a Data Driven culture
How to reach a Data Driven cultureHow to reach a Data Driven culture
How to reach a Data Driven cultureMark Beekman
 
Wikibon Big Data Capital Markets Day 2014
Wikibon Big Data Capital Markets Day 2014Wikibon Big Data Capital Markets Day 2014
Wikibon Big Data Capital Markets Day 2014Jeff Kelly
 
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...Harbinger Systems - HRTech Builder of Choice
 
Steps towards a Data Value Chain
Steps towards a Data Value ChainSteps towards a Data Value Chain
Steps towards a Data Value ChainPRELIDA Project
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataMohammed Guller
 
Becoming a Data Driven Organisation
Becoming a Data Driven OrganisationBecoming a Data Driven Organisation
Becoming a Data Driven OrganisationWizdee
 
EMC Big Data Solutions Overview
EMC Big Data Solutions OverviewEMC Big Data Solutions Overview
EMC Big Data Solutions Overviewwalshe1
 
Big Data Day LA 2016/ Data Science Track - Backstage to a Data Driven Culture...
Big Data Day LA 2016/ Data Science Track - Backstage to a Data Driven Culture...Big Data Day LA 2016/ Data Science Track - Backstage to a Data Driven Culture...
Big Data Day LA 2016/ Data Science Track - Backstage to a Data Driven Culture...Data Con LA
 
The big data value chain r1-31 oct13
The big data value chain r1-31 oct13The big data value chain r1-31 oct13
The big data value chain r1-31 oct13Rei Lynn Hayashi
 
"Using Data Science to Design Effective Precision Preventative Behavioral Med...
"Using Data Science to Design Effective Precision Preventative Behavioral Med..."Using Data Science to Design Effective Precision Preventative Behavioral Med...
"Using Data Science to Design Effective Precision Preventative Behavioral Med...Hyper Wellbeing
 
Big Data Industry Insights 2015
Big Data Industry Insights 2015 Big Data Industry Insights 2015
Big Data Industry Insights 2015 Den Reymer
 

Destacado (13)

Lecture on Data Science in a Data-Driven Culture
Lecture on Data Science in a Data-Driven Culture Lecture on Data Science in a Data-Driven Culture
Lecture on Data Science in a Data-Driven Culture
 
How to reach a Data Driven culture
How to reach a Data Driven cultureHow to reach a Data Driven culture
How to reach a Data Driven culture
 
Wikibon Big Data Capital Markets Day 2014
Wikibon Big Data Capital Markets Day 2014Wikibon Big Data Capital Markets Day 2014
Wikibon Big Data Capital Markets Day 2014
 
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
Application of Data Science in Government Services – IPMA Forum 2016 Speaker ...
 
Steps towards a Data Value Chain
Steps towards a Data Value ChainSteps towards a Data Value Chain
Steps towards a Data Value Chain
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Becoming a Data Driven Organisation
Becoming a Data Driven OrganisationBecoming a Data Driven Organisation
Becoming a Data Driven Organisation
 
#BigDataCanarias: "Big Data & Career Paths"
#BigDataCanarias: "Big Data & Career Paths"#BigDataCanarias: "Big Data & Career Paths"
#BigDataCanarias: "Big Data & Career Paths"
 
EMC Big Data Solutions Overview
EMC Big Data Solutions OverviewEMC Big Data Solutions Overview
EMC Big Data Solutions Overview
 
Big Data Day LA 2016/ Data Science Track - Backstage to a Data Driven Culture...
Big Data Day LA 2016/ Data Science Track - Backstage to a Data Driven Culture...Big Data Day LA 2016/ Data Science Track - Backstage to a Data Driven Culture...
Big Data Day LA 2016/ Data Science Track - Backstage to a Data Driven Culture...
 
The big data value chain r1-31 oct13
The big data value chain r1-31 oct13The big data value chain r1-31 oct13
The big data value chain r1-31 oct13
 
"Using Data Science to Design Effective Precision Preventative Behavioral Med...
"Using Data Science to Design Effective Precision Preventative Behavioral Med..."Using Data Science to Design Effective Precision Preventative Behavioral Med...
"Using Data Science to Design Effective Precision Preventative Behavioral Med...
 
Big Data Industry Insights 2015
Big Data Industry Insights 2015 Big Data Industry Insights 2015
Big Data Industry Insights 2015
 

Similar a Create your Big Data vision and Hadoop-ify your data warehouse

2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoptionHortonworks
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsDenodo
 
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti
 
Getting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersGetting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersDatameer
 
Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyDataWorks Summit
 
How to implement Hadoop successfully
How to implement Hadoop successfullyHow to implement Hadoop successfully
How to implement Hadoop successfullyAdir Sharabi
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Cloudera, Inc.
 
How to implement hadoop successfuly
How to implement hadoop successfulyHow to implement hadoop successfuly
How to implement hadoop successfulyAdir Sharabi
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalHortonworks
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalHortonworks
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoopDr. Wilfred Lin (Ph.D.)
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopHortonworks
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopHortonworks
 
Big Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBig Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBigDataExpo
 
Rob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopRob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopGhassan Al-Yafie
 
A Big Data Journey: Bringing Open Source to Finance
A Big Data Journey: Bringing Open Source to FinanceA Big Data Journey: Bringing Open Source to Finance
A Big Data Journey: Bringing Open Source to FinanceSlim Baltagi
 
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...EMC
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesDataWorks Summit
 

Similar a Create your Big Data vision and Hadoop-ify your data warehouse (20)

2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
 
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to Production
 
Getting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersGetting Started with Big Data for Business Managers
Getting Started with Big Data for Business Managers
 
Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata Company
 
How to implement Hadoop successfully
How to implement Hadoop successfullyHow to implement Hadoop successfully
How to implement Hadoop successfully
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
How to implement hadoop successfuly
How to implement hadoop successfulyHow to implement hadoop successfuly
How to implement hadoop successfuly
 
Big data for Telco: opportunity or threat?
Big data for Telco: opportunity or threat?Big data for Telco: opportunity or threat?
Big data for Telco: opportunity or threat?
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_final
 
Webinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_finalWebinar turbo charging_data_science_hawq_on_hdp_final
Webinar turbo charging_data_science_hawq_on_hdp_final
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
 
Eliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside HadoopEliminating the Challenges of Big Data Management Inside Hadoop
Eliminating the Challenges of Big Data Management Inside Hadoop
 
Big Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of AnalyticsBig Data Expo 2015 - Pentaho The Future of Analytics
Big Data Expo 2015 - Pentaho The Future of Analytics
 
Rob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoopRob peglar introduction_analytics _big data_hadoop
Rob peglar introduction_analytics _big data_hadoop
 
A Big Data Journey: Bringing Open Source to Finance
A Big Data Journey: Bringing Open Source to FinanceA Big Data Journey: Bringing Open Source to Finance
A Big Data Journey: Bringing Open Source to Finance
 
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
Pivotal the new_pivotal_big_data_suite_-_revolutionary_foundation_to_leverage...
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management Challenges
 
Big Data Proof of Concept
Big Data Proof of ConceptBig Data Proof of Concept
Big Data Proof of Concept
 

Más de Jeff Kelly

CCPA Compliance for Analytics and Data Science Use Cases with Databricks and ...
CCPA Compliance for Analytics and Data Science Use Cases with Databricks and ...CCPA Compliance for Analytics and Data Science Use Cases with Databricks and ...
CCPA Compliance for Analytics and Data Science Use Cases with Databricks and ...Jeff Kelly
 
Wikibon Barclays Disruptive Tech Call - November 2014
Wikibon Barclays Disruptive Tech Call - November 2014Wikibon Barclays Disruptive Tech Call - November 2014
Wikibon Barclays Disruptive Tech Call - November 2014Jeff Kelly
 
Democratizing Big Data (Updated)
Democratizing Big Data (Updated)Democratizing Big Data (Updated)
Democratizing Big Data (Updated)Jeff Kelly
 
The business value of Big Data
The business value of Big DataThe business value of Big Data
The business value of Big DataJeff Kelly
 
Big Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use casesBig Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use casesJeff Kelly
 
Democratizing Big Data
Democratizing Big DataDemocratizing Big Data
Democratizing Big DataJeff Kelly
 

Más de Jeff Kelly (6)

CCPA Compliance for Analytics and Data Science Use Cases with Databricks and ...
CCPA Compliance for Analytics and Data Science Use Cases with Databricks and ...CCPA Compliance for Analytics and Data Science Use Cases with Databricks and ...
CCPA Compliance for Analytics and Data Science Use Cases with Databricks and ...
 
Wikibon Barclays Disruptive Tech Call - November 2014
Wikibon Barclays Disruptive Tech Call - November 2014Wikibon Barclays Disruptive Tech Call - November 2014
Wikibon Barclays Disruptive Tech Call - November 2014
 
Democratizing Big Data (Updated)
Democratizing Big Data (Updated)Democratizing Big Data (Updated)
Democratizing Big Data (Updated)
 
The business value of Big Data
The business value of Big DataThe business value of Big Data
The business value of Big Data
 
Big Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use casesBig Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use cases
 
Democratizing Big Data
Democratizing Big DataDemocratizing Big Data
Democratizing Big Data
 

Último

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 

Último (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 

Create your Big Data vision and Hadoop-ify your data warehouse

  • 1. 1© Copyright 2012 EMC Corporation. All rights reserved. Create Your Big Data Vision And Hadoop-ify Your Data Warehouse Jeff Kelly, Principal Research Contributor The Wikibon Project Bill Schmarzo, CTO EIMA Practice, EMC Professional Services
  • 2. 2© Copyright 2012 EMC Corporation. All rights reserved. Agenda Ÿ  Current Market Observations Ÿ  The Big Data Business Maturity Index and How to Identify Your Best Use Case Ÿ  Get Started With Hadoop and Other New Technologies Ÿ  What Should You Look For in a Vendor? Ÿ  Q&A
  • 3. 3© Copyright 2012 EMC Corporation. All rights reserved. Current Market Observations Jeff Kelly
  • 4. 4© Copyright 2012 EMC Corporation. All rights reserved. Big Data Market Size 2012 $11.4b 2013 $18.2b 2017 $48b ü  59% Growth Y-o-Y 2011 to 2012 ü  Forecast 60%+ Growth in 2013 ü  31% CAGR Forecast 2012 through 2017 2014 $28b 2015 $37.9b 2016 $43.7b Source: Wikibon Big Data Vendor Revenue and Market Forecast, 2012-2017
  • 5. 5© Copyright 2012 EMC Corporation. All rights reserved. Big Data Market Segmentation, 2012 Services Leading the Way Professional Services $3,784m 34% Cloud and SaaS $608m 5% Pro. Services Compute Storage Networking Database Applications Data mgt. Cloud n = $11,400m
  • 6. 6© Copyright 2012 EMC Corporation. All rights reserved. Big Data Growth Drivers ü  Increased Awareness and Investments By Large Enterprises Beyond the Web ü  Retailers like Sears leveraging Big Data for price optimization. ü  Financial services firms, including JPMC, Morgan Stanley and BoA, conduct fraud analysis, risk profiling and more. ü  Pharmaceutical including Bristol Myers Squibb makers use Big Data to support drug development. ü  Continued Investment by Web Pioneers and Three Letter Agencies ü  Google alone spent $1b+ on infrastructure in Q4 2012. ü  “Everything we do is a Big Data problem.” – Jay Parikh, VP of Engineering, Facebook ü  CIA CTO Ira Hunt: Our mission is to “collect everything and hang on to it forever.”
  • 7. 7© Copyright 2012 EMC Corporation. All rights reserved. Big Data Growth Drivers, Cont. ü  Increasingly Sophisticated Professional Services ü  Professional services building on experience of assisting early adopters. ü  Some (but not all) are vendor and product agnostic. ü  Focusing on identifying use cases, improving communication, and leveraging existing assets. ü  Technology Maturation ü  Open source community and vendors making Hadoop enterprise-ready, easier to use. ü  Better integration between Big Data and existing IT infrastructure. ü  Extending Big Data accessibility to business users via BI and data visualization tools. Consulting Training & Educations Integration
  • 8. 8© Copyright 2012 EMC Corporation. All rights reserved. Big Data Growth Inhibitors ü  Lack of Data Scientists and Big Data Practitioners ü  Big Data Technology Still Complex, Difficult to Manage/Use ü  Organizational Resistance to Data-Driven Decision Making ü  Confusion Due to Vendor Marketing and “Big Data Washing” Big Data [Your Product Name Here]
  • 9. 9© Copyright 2012 EMC Corporation. All rights reserved. The Big Data Business Maturity Index and How to Identify Your Best Use Case Bill Schmarzo
  • 10. 10© Copyright 2012 EMC Corporation. All rights reserved. Business Metamorphosis Data Monetization Business Optimization Business Insights Business Monitoring Monitoring business performance to flag areas of interest Big Data Business Model Maturation Index Integrate insights & recommendations into existing business processes Embed analytics to optimize select business processes Leverage insights to identify new revenue opportunities Transform customer and product insights to move into new markets Measures the degree to which the organization has integrated big data and advanced analytics into their business model
  • 11. 11© Copyright 2012 EMC Corporation. All rights reserved. How to Identify Your Best Use Case The Big Data strategy document ensures a tight linkage between your organization’s business initiatives and your big data strategy •  Big  data  business  cases,  ROI  and   analy4c  requirements   •  Key  Performance  Indicators  and   leading  metrics     •  Business  ques4ons  with  metrics,   dimensions,  hierarchies   •  Business  decisions,  decision  flow/ process  and  UEX  requirements   •  Analy4c  algorithms  and  modeling   requirements   •  Required  data  sources   Business Strategy: Provide Unique Starbucks Customer Experience Business Initiatives: •  Increase number of “Gold Card” customers •  Increase “Gold Card” customer revenue & engagement (store visits, spend per visit, advocacy) Mobile App •  •  Social Media •  •  Store Sales •  •  Customer Loyalty •  •  Collect customer engagement information through multiple channels (store, web, mobile) Profile and micro-segment customers to improve marketing and offers effectiveness Analyze social media data to identify and monitor brand advocates Monitor and adjust customer engagement effectiveness (visits, revenue, margin, advocacy) Tasks Develop intimate knowledge of “Gold Card” customers life stage, behaviors and interests Act upon intimate knowledge of “Gold Card” customers to increase store revenues •  Expand customer data collection points •  Leverage “gold card” member transactions, feedback (surveys) and social data •  Integrate customer-specific insights back into operational, management and loyalty systems Outcomes & CSF’s
  • 12. 12© Copyright 2012 EMC Corporation. All rights reserved. Get Started With Hadoop and Other New Technologies Bill Schmarzo
  • 13. 13© Copyright 2012 EMC Corporation. All rights reserved. A Playbook For Modernizing Your Data Warehouse With New Big Data Technologies And Capabilities #1) Enhance data warehouse with new unstructured data metrics #2) Data virtualization to extend existing data warehouse environment #3) MPP RDBMS to increase data platform scalability and agility #4) In-database analytics to accelerate analytic development #5) Hadoop to create the next generation Operational Data Store
  • 14. 14© Copyright 2012 EMC Corporation. All rights reserved. #1) Enhance Data Warehouse With New Unstructured Data Metrics Leverage HDFS to provide a single platform that supports your traditional SQL- based BI environment plus your growing unstructured data needs at scale HDFS HBase Pig, Hive, Mahout Map Reduce Sqoop Flume Resource Management & Workflow Yarn Zookeeper Apache Pivotal HD Configure, Deploy, Monitor, Manage Command Center Hadoop Virtualization (HVE) DataLoader Xtension Framework Catalog Services Query Optimizer Dynamic Pipelining ANSI SQL + Analytics HAWQ – Advanced Database Services
  • 15. 15© Copyright 2012 EMC Corporation. All rights reserved. ETL Cached Streaming Data Unified Data Platform Data Source Real-Time Visualization Advanced Analytics and Modeling Data Source CEP/ Workflow Data Federation Tool Semantic Master Data Discovery / Data Mapping Data Source Data Source #2) Extend Existing Data Warehouse Via Data Virtualization Leverage data federation tools to speed data discovery and analysis via virtual, on- demand access to data sources outside your EDW
  • 16. 16© Copyright 2012 EMC Corporation. All rights reserved. •  Massively Parallel Processing (MPP), scale- out architectures provide cost effective options for managing and analyzing massive data volumes •  MPP data warehouses provide linear scalability on general purpose, commodity systems (e.g., fault-tolerant scale out environment; automatic parallelization; I/O optimized) #3) Massively Parallel Processing (MPP) Relational Databases
  • 17. 17© Copyright 2012 EMC Corporation. All rights reserved. #4) In-Database Processing And Analytics Conventional: A Data Scientist needs to move 1 TB of data from a 5- processor database server to the analytical server at 1 gigabytes per second (Gbs) In-Database: A Data Scientist leaves the 1 TB data in the 5-processor database server and runs the same algorithm directly in the database 0 20 40 60 80 100 120 140 180160 200 Data Movement Time = (1TB x 8) / 1Gbs / 60 s = 133.3 minutes Processing Time = 60 minutes 12 minutes Total Time = 193.3 minutes Time (minutes) Conventional In-Database
  • 18. 18© Copyright 2012 EMC Corporation. All rights reserved. Hadoop Data Store Analytics Environment Data Preparation and Enrichment ALL data fed into Hadoop Data Store EDWETL Analytic Sandbox BI Environment •  Production •  Predictable load •  SLA-drive •  Standard tools •  Exploratory, Ad Hoc •  Unpredictable load •  Experimentation •  Best tool for the job #5) Next Gen Operational Data Store/Data Prep With Hadoop Feeds production BI and Enterprise Data Warehouse environment and high- velocity Analytics Sandbox
  • 19. 19© Copyright 2012 EMC Corporation. All rights reserved. How To Get Started
  • 20. 20© Copyright 2012 EMC Corporation. All rights reserved. EMC Big Data Analytics Strategy And Implementation Services Analytics Operationalization Identify current state, determine required state and conduct gap analysis to develop analytics implementation roadmap Analytics Lab Deploy analytics sandbox to quantify the business case Vision Workshop Identify big data analytics business use cases Repeat the process for identified business cases
  • 21. 21© Copyright 2012 EMC Corporation. All rights reserved. What Should You Look For in a Vendor? Jeff Kelly
  • 22. 22© Copyright 2012 EMC Corporation. All rights reserved. Advice for Selecting Big Data Vendors ü  Balance short-term goals with long-term vision. ü  Objectives are: ü  Quick, demonstrable ROI. ü  Sustainable Big Data practice. ü  Don’t get hung up on “speeds and feeds” or feature-by-feature comparisons. ü  Focus on substance, flexibility, commitment and experience.
  • 23. 23© Copyright 2012 EMC Corporation. All rights reserved. Selecting Big Data Vendors, Cont. ü  Evaluate products portfolios based on: ü  Ability to monetize existing and future data assets. ü  Ability to integrate with and compliment existing data management technology. ü  Accessibility to power users and business users alike (depending on use case). ü  Ability to apply information governance and security best practices. ü  Select service providers with track records of assisting enterprises adopt data- driven culture as well as technology.
  • 24. 24© Copyright 2012 EMC Corporation. All rights reserved. To type a question via WebEx, click on the Q&A tab Please select “Ask: All Panelists” to ensure your questions reach us. Thank you! Questions and Answers
  • 25. 25© Copyright 2012 EMC Corporation. All rights reserved. Learn More… Ÿ  See us at… –  EMC World, May 5-9 www.emc.world.com Ÿ  Contact Jeff Kelly –  Email: jeff.kelly@wikibon.org –  LinkedIn: http://www.linkedin.com/in/jeffreyfkelly/ –  Twitter: @jeffreyfkelly –  Research: http://www.wikibon.org/bigdata Ÿ  Contact Bill Schmarzo –  Email: william.schmarzo@emc.com –  LinkedIn: http://www.linkedin.com/in/schmarzo –  Twitter: @schmarzo –  Blog: http://infocus.emc.com/author/william_schmarzo/