Building Data Start-ups: Fast, Big, and Focused. Michael E. Driscoll, CTO, Metamarkets (@medriscoll). O’Reilly Strata Online | May 25, 2011
The Big Data Opportunity
The Attack of the Exponentials
The Attack of the Exponentials
The Intersection of Three Forces Yields Higher Volume & Velocity of Data: exponential economics, sensor networks, cloud computing
Data Value Must Exceed Data Cost
Data Value Must Exceed Data Cost ... New Classes of Data are Now Valuable
Success on the Data Stack: Services, Analytics, Data
Success on the Data Stack (Fast): Services, Analytics, Fast Data
Success on the Data Stack (Fast, Big): Services, Big Analytics, Fast Data
Success on the Data Stack (Fast, Big, and Focused): Focused Services, Big Analytics, Fast Data
#1: Fast
Success on the Data Stack: Fast Data. [Chart: data stores plotted by speed (batch to real-time) against scale (megabytes to petabytes), each marked as free/open-source or commercial. Products shown: Kdb, Netezza, Esper, Vertica, MongoDB, InfoBright, Aster, MySQL, MapR, Greenplum, Postgres, Hadoop.]
Fast Data With Cheap Memory. 1964: Univac 2k, $51 million/MB. 2011: DDR 1GB, 1 cent/MB. Data sources: http://www.sharkyextreme.com and http://www.webservicessummit.com/Trends/TechTrends1/img11.html, plotted with ggplot2
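To make the scale of that price drop concrete, here is a minimal arithmetic sketch in Python. Only the two per-megabyte prices and the two years come from the slide; the variable names, the 1 TB working-set size, and the assumption of a smooth exponential decline (used to estimate a halving time) are illustrative choices, not part of the talk.

```python
import math

# Figures quoted on the slide.
COST_PER_MB_1964_USD = 51_000_000.0  # Univac, 1964
COST_PER_MB_2011_USD = 0.01          # DDR, 2011

# How many times cheaper a megabyte of memory became.
fold_drop = COST_PER_MB_1964_USD / COST_PER_MB_2011_USD

# Assumption: the decline was a smooth exponential, so the average
# time for the price to halve is (elapsed years) / (number of halvings).
elapsed_years = 2011 - 1964
halving_time_years = elapsed_years / math.log2(fold_drop)

# Hypothetical example: cost of holding a 1 TB data set entirely in RAM.
ONE_TB_IN_MB = 1024 * 1024
cost_1tb_2011_usd = ONE_TB_IN_MB * COST_PER_MB_2011_USD

print(f"Price per MB fell by a factor of about {fold_drop:.1e}")
print(f"Average halving time: about {halving_time_years:.2f} years")
print(f"1 TB in memory at the 2011 price: about ${cost_1tb_2011_usd:,.0f}")
```

At the quoted 2011 price, keeping a terabyte-scale working set in memory costs on the order of ten thousand dollars, which is the economic basis for the in-memory "fast data" layer.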
#2: Big
Success on the Data Stack: Big Analytics. [Chart: analytics tools plotted by speed (batch to real-time) against scale (megabytes to petabytes), each marked as free/open-source or commercial. Entries shown: custom (hardware), Revolution R, R, custom distributed, SAP, SAS, SciPy, SPSS.]
The Promise of Analytics: DATA → (extract) → FEATURES → (learn) → MODELS → (predict). “More data usually beats better algorithms.”
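The extract, learn, predict flow on this slide can be sketched in a few lines of Python. This is an illustration only: the function names, the `"x,y"` input format, and the choice of a one-variable least-squares line as the model are all assumptions, not anything shown in the talk.

```python
from statistics import mean

def extract(raw_rows):
    """DATA -> FEATURES: parse hypothetical 'x,y' strings into numeric pairs."""
    return [tuple(float(v) for v in row.split(",")) for row in raw_rows]

def learn(features):
    """FEATURES -> MODEL: ordinary least-squares fit of y = slope * x + intercept."""
    xs = [x for x, _ in features]
    ys = [y for _, y in features]
    x_bar, y_bar = mean(xs), mean(ys)
    covariance = sum((x - x_bar) * (y - y_bar) for x, y in features)
    variance = sum((x - x_bar) ** 2 for x in xs)
    slope = covariance / variance
    intercept = y_bar - slope * x_bar
    return slope, intercept

def predict(model, x):
    """MODEL -> prediction for a new input x."""
    slope, intercept = model
    return slope * x + intercept

# Made-up raw data for the example.
raw = ["1,2", "2,4", "3,6", "4,8"]
model = learn(extract(raw))
print(predict(model, 5.0))
```

Each stage takes only the previous stage's output, so a larger data set can be fed through the same pipeline without changing the algorithm, which is the sense of the quote on the slide.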
#3: Focused
Success on the Data Stack (Focused): Focused Services, Analytics, Data
“Real-time, large-scale analytics in a focused vertical.” Credit: Joe Reisinger, Metamarkets
Success on the Data Stack (Fast, Big, and Focused): Focused Services, Big Analytics, Fast Data
Thank You. Questions? Michael E. Driscoll, CTO, Metamarkets (@medriscoll). O’Reilly Strata Online | May 25, 2011

Editor's notes

  1. I want to first thank O’Reilly for putting together this event, and all of you for tuning in from around the globe.
     The Data Opportunity in 2 parts:
     I. The Opportunity: Why now, what forces are driving the data explosion?
     II. The Technology Stack: What does the Big Data technology stack look like, and where are the opportunities and risks?
     Data is heavy.