By Thanuja Seneviratne
 Part I Recap
 Big Data Market
› Data Growth
› Market Growth
› Market Drivers
› Adoption Cycle
› Forrester Market Report Findings
 Big Data Products
› Enterprise Data Warehouses (EDW) – non-canonical, traditional
› Big Data Products Offering
› Hadoop and its Distros
› MapR and Others
› Big Data Products Stack
 Future of Big Data
 Data Science vs Traditional Analytics
 Traditional Analytics – Decide what data is relevant, create a static data model, visualize the data
 Data Science – Assemble all possible data, create a predictive model, operationalize the
model (visualize, feed to another system)
 Three types of data stores/data management systems
› Relational vs non-relational [MSSQL, Oracle, MySQL vs NoSQL products]
› Relational “big data” offering called EDW (mostly packaged as MPP appliances)
› Each of the three types has merits in certain use cases and will continue to be used in
the industry
› Why EDW is not enough for new “big data” scenarios
 The three Vs (volume, velocity, variety) become too heavy to handle
 Time to market is delayed
 High cost
 Write-first schema (schema-on-write) is unnecessary
 Importance of Individualized experience
› Another sample case: $1,000 is found in front of a bank. Will the person return it to the bank or
run away with it?
› Multiple business cases and multiple use cases
 Hadoop as the premier open source “big data” offering and its distros
 Other Hadoop-like “big data” offerings
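
The "write-first schema unnecessary" point in the recap can be illustrated with a minimal sketch of schema-on-read. It assumes raw events are stored as JSON lines exactly as they arrive and that each analysis applies its own schema only when it reads them. The sample records, the `read_with_schema` helper, and the field names are hypothetical and chosen for illustration only; they are not taken from the slides or from any Hadoop API.

```python
import json

# Raw events stored as-is; no schema was enforced when they were written.
# (Hypothetical sample data for illustration.)
RAW_EVENTS = [
    '{"user": "alice", "amount": "12.50", "channel": "web"}',
    '{"user": "bob", "amount": "7"}',
    '{"user": "carol", "amount": "3.25", "coupon": "SPRING"}',
]


def read_with_schema(lines, schema):
    """Apply a schema at read time.

    `schema` maps field name -> cast function. Fields missing from a
    record come back as None; fields not in the schema are ignored.
    """
    for line in lines:
        record = json.loads(line)
        yield {
            field: (cast(record[field]) if field in record else None)
            for field, cast in schema.items()
        }


# Two different questions, two different schemas, same raw data.
spend_schema = {"user": str, "amount": float}
channel_schema = {"user": str, "channel": str}

spend_rows = list(read_with_schema(RAW_EVENTS, spend_schema))
channel_rows = list(read_with_schema(RAW_EVENTS, channel_schema))

total_spend = sum(row["amount"] for row in spend_rows)
print(total_spend)
print(channel_rows)
```

With schema-on-write, the `coupon` and `channel` fields would have needed a table change before the data could be loaded. Here the raw records are kept unchanged and each reader decides which fields matter.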
 Data Growth
 Market Growth
› will be the largest market overtaking ERP by 2020
 Adoption Cycle
 Market Drivers
› Business Drivers
 Proactive analytics instead of reactive analytics
 Insights generated for competitive advantage
 Rise of Data-First enterprise
› Technical Drivers
 Data growing exponentially to petabyte scale
 Data is everywhere with variety of formats
› Financial Drivers
 Cost of IT continues to grow
 Commodity hardware instead of enterprise hardware
 Forrester Market Report Findings
› Unstoppable Hadoop momentum in the market
› More and more enterprises want to run proofs of concept (POCs)
› Open source is the key
› Many Big Data products – a fair number of products to choose
from, but no dominant market leader yet.
 Hadoop distributions
 Other products including MapR
› Enterprise Hadoop and partnerships with large vendors
 IBM, Teradata, Pivotal, Microsoft
› Hadoop in the cloud
› Hadoop Ecosystem
 Enterprise Data Warehouses (EDWs)
› Traditional big data offering
› Non-canonical or original way of storing large data sets
› Refer to Part I slides
 Big Data Products Offering
 Hadoop and its distros
› History of Hadoop
› Hadoop as a Platform
 Hortonworks Data Platform (HDP)
 Cloudera's Distribution Including Apache Hadoop (CDH)
 Big Vendors
› IBM’s BigInsights – this is a Hadoop distro through Cloudera’s CDH
› Microsoft’s HDInsight on Azure – this is a Hadoop distro through
Hortonworks’ HDP
› SAP’s HANA – an in-memory database rather than a Hadoop distro itself; SAP pairs it with Hadoop through Hortonworks’ HDP
 MapR and Others
› Instead of HDFS, MapR uses its own file system (MapR-FS), which can be accessed over Network File System (NFS)
› MapR Distros
 Free M3 edition in the Amazon cloud
 Premium M5 in Amazon Cloud
 MapR distro on Google
› Others
 Amazon EMR
› A Hadoop distro on Amazon EC2 clusters in the Amazon cloud
› Exposes a web service for managing the clusters
› Most popular and cost-effective distro apart from Cloudera and
Hortonworks
 Hybrids
› Converging SQL Enterprise Data Warehouses (especially MPP
products) with Big Data
› Investments made in long-running contracts with EDW vendors
are protected
› Existing SQL/DW knowledge and skill sets can be reused
› Following are popular products:
 Big Data Products Stack
 Market leader by 2020
 Many products and alternatives are coming our way
 5Vs-driven ecosystem instead of 3Vs
 Skill sets in demand around Big Data technologies
› Enterprise Hadoop,
› Hadoop Distros,
› MapR and its Distros,
› Hadoop stack,
› Application Frameworks and languages
 “R” language and frameworks
 Scala language and frameworks
 Subjective evolution instead of objective evolution
› Improvements to Big Data Infrastructure (BDI)
› Improvements to Big Data Life Cycle (BDLC)
› Evolve to All-Data processing
Big Data - Part II


Editor's notes

  1. Sample non-“Big Data” scenario: http://www.gloria.de/Pages/Home.aspx. A small informational web site with a small data set and no expected growth; a relational data model is sufficient.