Building Data Start-Ups: Fast, Big, and Focused

•

73 recomendaciones•29,035 vistas

====================================================== 1. Building Data Start-ups: Fast, Big, and Focused ====================================================== * 2 parts today: (i) forces behind big data opportunity (ii) big data stack and how to compete with in * building a data start-up is a bit like Sumo Wrestling * data is heavy, has weight - we need agile strategies to succeed * today: talk about opportunities for data, strategies for success * in a nutshell: data start-ups must be fast, big, and focused ================================================ 2. The Big Data Opportunity ================================================ * it's a cliche by now: there is a mountain of data in this world * understanding these forces is critical to data start-up's strategy <transition>: what are some of the tectonic forces at work? ================================================ 3-4. Attack of the Exponentials ================================================ * these are something that i call 'attack of exponentials' * VCs like curves like [transition] * in the past few decades, the cost of storage, CPU, and bandwidth has been exponentially dropping, while network access has shot up * in 1980, a terabyte of storage cost $14 MILLION - today it's $47 dollars <transition>: exponential economics, together with two other forces ================================================ 5. Intersection of Three Forces ================================================ * ... form the inputs to this massive increase in data, the data singularity * sensor networks the phones, GPS devices, laptops, and instrumented spimes * cloud computing has democratized and made computing power & storage a utility ( "even if it turns out that the cloud is actually just some place in Virginia.") ================================================ 6-7. Data Value Must Exceed Data Cost ================================================ * the laws of economics have not changed: value must exceed cost * the upper left side of this graph shows data whose value exceeded its cost of collecting, storing, and computing over a decade ago * the human genome data cost $3 billion (in 2000) [shift slide] * but as the tide shifts, new classes of data are revealed as being valuable * the dog genome cost only $30 million (in 2005) * web log data used to be tossed; now it's cheap enough to collect, store, and compute over * i encourage all of you, think of a data source that was previously not collected, or not kept around, and mull the possibilities <transition>: with that, i would like to now talk about the emerging stack, and the strategies for being successful within it ================================================ 8-9, 10-11. Success on the Data Stack ================================================ * here is my vision of the emerging big data stack * at bottom is data - persistence layer - databases - the brawn * in the middle is analytics - the intelligence layer * at the top - services, what you all the brains and brawn [ transitions in quite succession ] * I argue that data start-ups, to succeed, must have == FAST data, BIG analytics, and FOCUSED services == * let's take each of these in turn, exploring the competitive axes at each layer starting from the bottom of the stack, data ================================================ 12. FAST ================================================ * as I said before, data is heavy * being able to move big data quickly is key * let's pull the data layer out of the stack & examine it ================================================ 13. Fast Data ================================================ * so we have the two competitive axes on the data layer * the first axis is scale: for data, the scaling issue has been solved. * Hadoop

Tecnología

Building Data Start-ups: Fast, Big, and Focused Michael E. Driscoll, CTO, Metamarkets @medriscoll O’Reilly Strata Online | May 25, 2011

The Intersection of Three Forces Yields Higher Volume & Velocity of Data exponential economics sensor networks cloud computing

Data Value Must Exceed Data Cost ... New Classes of Data are Now Valuable

Success on the Data Stack Services Analytics Data

Success on the Data Stack Fast Services Analytics Fast Data

Success on the Data Stack Fast, Big Services Big Analytics Fast Data

Success on the Data Stack Fast, Big, and Focused Focused Services Big Analytics Fast Data

Success on the Data Stack Fast Data real-time Kdb Netezza Esper Vertica MongoDB speed InfoBright Aster MySQL MapR Greenplum Postgres batch Hadoop Services megabytes petabytes scale Analytics free, open-source Data commercial

Fast Data With Cheap Memory 1964 – Univac 2k $51 million/MB 2011 – DDR 1GB 1 cent/MB data sources: http://www.sharkyextreme.com & http://www.webservicessummit.com/Trends/TechTrends1/img11.html, plotted with ggplot2

Success on the Data Stack Big Analytics custom (hardware) real-time speed Revolution R R custom distributed SAP SAS SciPy SPSS batch Services megabytes petabytes scale Analytics free, open-source Data commercial

The Promise ofAnalytics extract learn predict DATA FEATURES MODELS “More data usually beats better algorithms.”

Success on the Data Stack Focused Services Focused Services Analytics Data

“Real-time, large-scale analytics in a focused vertical.” credit: Joe Reisinger, Metamarkets

Thank You. Questions? Michael E. Driscoll, CTO, Metamarkets @medriscoll O’Reilly Strata Online | May 25, 2011

Más contenido relacionado

Destacado

Standardizing +113 million Merchant Names in Financial Services with Greenplu...Data Science London

Complex Analytics with NoSQL Data Store in Real TimeNati Shalom

Real-Time Queries in Hadoop w/ Cloudera ImpalaData Science London

Open Stack Days israel Keynote 2017Nati Shalom

The Storyteller's Secret: 3 Keys to Mastering Storytelling to Win Hearts and ...Carmine Gallo

How to Become a Data Scientistryanorban

Destacado (6)

Standardizing +113 million Merchant Names in Financial Services with Greenplu...

Complex Analytics with NoSQL Data Store in Real Time

Real-Time Queries in Hadoop w/ Cloudera Impala

Open Stack Days israel Keynote 2017

The Storyteller's Secret: 3 Keys to Mastering Storytelling to Win Hearts and ...

How to Become a Data Scientist

Último

Manulife - Insurer Transformation Award 2024The Digital Insurer

Why Teams call analytics are critical to your entire businesspanagenda

TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc

Real Time Object Detection Using Open CVKhem

How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays

FWD Group - Insurer Innovation Award 2024The Digital Insurer

Architecting Cloud Native ApplicationsWSO2

MINDCTI Revenue Release Quarter One 2024MIND CTI

Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo

DBX First Quarter 2024 Investor PresentationDropbox

ICT role in 21st century education and its challengesrafiqahmad00786416

Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun

Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood

presentation ICT roal in 21st century educationjfdjdjcjdnsjd

AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays

2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong

Building Data Start-Ups: Fast, Big, and Focused

1. Building Data Start-ups: Fast, Big, and Focused Michael E. Driscoll, CTO, Metamarkets @medriscoll O’Reilly Strata Online | May 25, 2011

2. The Big Data Opportunity

3. The Attack of the Exponentials

4. The Attack of the Exponentials

5. The Intersection of Three Forces Yields Higher Volume & Velocity of Data exponential economics sensor networks cloud computing

6. Data Value Must Exceed Data Cost

7. Data Value Must Exceed Data Cost ... New Classes of Data are Now Valuable

8. Success on the Data Stack Services Analytics Data

9. Success on the Data Stack Fast Services Analytics Fast Data

10. Success on the Data Stack Fast, Big Services Big Analytics Fast Data

11. Success on the Data Stack Fast, Big, and Focused Focused Services Big Analytics Fast Data

12. #1: Fast

13. Success on the Data Stack Fast Data real-time Kdb Netezza Esper Vertica MongoDB speed InfoBright Aster MySQL MapR Greenplum Postgres batch Hadoop Services megabytes petabytes scale Analytics free, open-source Data commercial

14. Fast Data With Cheap Memory 1964 – Univac 2k $51 million/MB 2011 – DDR 1GB 1 cent/MB data sources: http://www.sharkyextreme.com & http://www.webservicessummit.com/Trends/TechTrends1/img11.html, plotted with ggplot2

15. #2: Big

16. Success on the Data Stack Big Analytics custom (hardware) real-time speed Revolution R R custom distributed SAP SAS SciPy SPSS batch Services megabytes petabytes scale Analytics free, open-source Data commercial

17. The Promise ofAnalytics extract learn predict DATA FEATURES MODELS “More data usually beats better algorithms.”

18. #3: Focused

19. Success on the Data Stack Focused Services Focused Services Analytics Data

20. “Real-time, large-scale analytics in a focused vertical.” credit: Joe Reisinger, Metamarkets

21. Success on the Data Stack Fast, Big, and Focused Focused Services Big Analytics Fast Data

22. Thank You. Questions? Michael E. Driscoll, CTO, Metamarkets @medriscoll O’Reilly Strata Online | May 25, 2011

Notas del editor

I want to first thank O’Reilly for putting together this event, and all of you for tuning in from around the globe.The Data Opportunity in 2 parts:I. The Opportunity: Why now, what forces are driving the data explosionII. The Technology Stack: What does the Big Data technology stack look like – where are the opportunities and risks?Data is heavy.

Building Data Start-Ups: Fast, Big, and Focused

Recomendados

Recomendados

Más contenido relacionado

Destacado

Destacado (6)

Último

Último (20)

Building Data Start-Ups: Fast, Big, and Focused

Notas del editor