SlideShare a Scribd company logo
1 of 35
WELCOME
Conference Highlights

    • Four exciting keynotes
    • Lots networking opportunities
    • Sixty educational sessions




2                  ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                  Reproduction or redistribution without written permission is
                                          prohibited.
Thank You Sponsors
      PLATINUM SPONSORS                                                             GOLD SPONSORS




       SILVER SPONSORS                                                           BRONZE SPONSORS




3                         ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                         Reproduction or redistribution without written permission is
                                                 prohibited.
Housekeeping Items

    • Connecting to the internet
      – Wireless network = Sheraton Meeting
      – Code = Vertica
    • Hashtag = #hw2011
    • Take the surveys
      – Breakout sessions
      – Overall survey



4                   ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                   Reproduction or redistribution without written permission is
                                           prohibited.
Mike Olson
Chief Executive Officer
Cloudera
Three Years Ago…

    We said: Hadoop is going to be huge.


This year’s conference:
    • 1,400 people from 580 companies in
      27 countries and 40 of the United States
    • 75.7% attending Hadoop World for the
      first time
    • 71.9% using Hadoop
    • 66.5% engineers, developers and
      architects, 33.5% non-technical
      business roles
    • Just over 50 of you are “data scientists”




6                                  ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                                  Reproduction or redistribution without written permission is
                                                          prohibited.
Three Years Ago…

    We said: Hadoop is going to be huge.


Your Hadoop usage:
    • Less than one year: 36.8%
    • One to two years: 32.3%
    • Two to three years: 16.8%
    • More than three years: 12%
    • Average usage is 17.4 months this year,
      versus 8.76 months at last year’s
      Hadoop World




7                                ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                                Reproduction or redistribution without written permission is
                                                        prohibited.
Three Years Ago…

    We said: Hadoop is going to be huge.


Your clusters:
    • Average size is 120 nodes, up from
      66 last year
    • 44% between 10 and 100 nodes, 52%
      between 100 and 1,000 nodes
    • Total of 202 petabytes under management
      (60 last year)
    • Largest cluster bigger than 20PB
    • 13.1% bigger than 100TB
    • 12.8% bigger than 1PB
                                                                                               2010   2011


8                                ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                                Reproduction or redistribution without written permission is
                                                        prohibited.
Two Years Ago…
We said: Hadoop is at the center of a new platform for big data.                           • Hadoop
                                                                                           • HBase
                                                                                           • Pig
                                                                                           • Zookeeper
                                                                                           • Mahout
                                                                                           • Hive
                                                                                           • Avro
                                                                                           • Whirr
                                                                                           • Sqoop
                                                                                           • Hcatalog
                                                                                           • MRUnit
                                                                                           • Bigtop
                                                                                           • Oozie


9                            ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                            Reproduction or redistribution without written permission is
                                                    prohibited.
Two Years Ago…

  We said: Hadoop is at the center of a new platform for big data.
              100%             100%

Core
                                                                                         58%
Hadoop                                                     37%                                            37%                31%
as % of
New
Contribs
             2006             2007                       2008                          2009              2010               2011
            • Core Hadoop   • Core Hadoop            •   Core Hadoop              •   Core Hadoop   •   Core Hadoop   •   Core Hadoop
                                                     •   HBase                    •   HBase         •   HBase         •   HBase
                                                     •   Zookeeper                •   Pig           •   Pig           •   Pig
                                                     •   Mahout                   •   Zookeeper     •   Zookeeper     •   Zookeeper
Relevant                                                                          •   Mahout        •   Mahout        •   Mahout
Projects                                                                          •   Hive          •   Hive          •   Hive
                                                                                                    •   Avro          •   Avro
                                                                                                    •   Whirr         •   Whirr
                                                                                                    •   Sqoop         •   Sqoop
                                                                                                                      •   Bigtop
                                                                                                                      •   …



   10                               ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                                   Reproduction or redistribution without written permission is
                                                           prohibited.
Last Year…
                                                      We said : Hadoop must integrate with
                                                      data center infrastructure and tools.
                                                         •     Enterprises need software and
                                                               support that de-risk and simplify the
                                                               operation of Hadoop in production

                                                         •     Must build on the open source
                                                               platform to deliver all the innovation
        Hadoop                                                 and value created by the global Apache
       Operations                                              Hadoop ecosystem




11                   ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                    Reproduction or redistribution without written permission is
                                            prohibited.
12    ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
     Reproduction or redistribution without written permission is
                             prohibited.
13    ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
     Reproduction or redistribution without written permission is
                             prohibited.
14    ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
     Reproduction or redistribution without written permission is
                             prohibited.
15    ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
     Reproduction or redistribution without written permission is
                             prohibited.
16    ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
     Reproduction or redistribution without written permission is
                             prohibited.
17    ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
     Reproduction or redistribution without written permission is
                             prohibited.
18    ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
     Reproduction or redistribution without written permission is
                             prohibited.
Last Year…
                                                               We said : Hadoop must integrate with
                                                               data center infrastructure and tools.

     OPERATORS                                   ENGINEERS                  ANALYSTS             BUSINESS USERS




     Management                                                                                     Enterprise
                                                    IDE’s                 BI / Analytics
        Tools                                                                                       Reporting




                                                                                                                  CUSTOMERS
                                                                                            Enterprise Data
                                                                                             Warehouse


                                                                                                                    Web
                                                                                                                  Application



                                                 Relational
        Logs      Files   Web Data
                                                 Databases




19                            ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                             Reproduction or redistribution without written permission is
                                                     prohibited.
This Year…

We’re talking about the future.




20                       ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                        Reproduction or redistribution without written permission is
                                                prohibited.
Building Applications
                                        Develop personalized
                                        applications on Hadoop
                                        and HBase
                                        Get it at:
                                        http://fonedoktor.com

                                          Learn more about
                                          Today, 3:30PM,
                                          Architecture Track
  Battery Analysis   Mapping Features     Aaron Kimball and
 Available Today…     Coming Soon!        Garrett Wu

www.wibidata.com, @wibidata
Data Analysis and Visualization                                     INSTANT INTELLIGENCE




  Demand for Online
    App Analytics
• Real-time, interactive &
  visual analytics
• Auto-discover data trends
• User behavior analytics with
  data clustering
• Investigative and root cause
  analytics
• Simplify data modeling &
  custom functions for Hadoop
  data

Empower business users, data scientists without-of-the-box analytics


 www.cetas.net, @CetasAnalytics
Powerful Statistical Tools

• Why Hadoop and R?
  •   Need to do more than simple statistics
  •   Analyze all of the data

• Integration
  •   Make it easy to write MapReduce programs in R
  •   Keep the statisticians focused on the analysis

  Usage
  •   Fraud and Risk Analysis
  •   Portfolio Optimization
  •   Anything you can model in R!




 www.revolutionanalytics.com, @RevolutionR
Complex Data Exploration

                                      Automatic extraction of facts,
                        Who
                                     connections, associations, etc.
                      Relationship
                                      Who

  Association
                                      Connections

                                      Aliases                Entity
                                                               :
                          Alias       Where                   AIG

                                      When
   Location
                                      What

                       Time
                                        Synthesys Knowledge
                                               Base
                    What did..

                                                                       Connection discovered from AIG to
                                                                           Metlife Equity in Wikipedia:
   Unstructured Data                                                    AIG sells Allco to Metlife Equity
                                                                                    for $6.8B
                Synthesys automatically surfaces critical
                      facts in unstructured data


www.digitalreasoning.com, @dreasoning
Business Analytics
• Metrics Management and Reporting
• Strategic, Financial, and Operational Planning, Budgeting, and Forecasting
• Profitability Modeling



           USABLE


           UNIFIED


        ACTIONABLE

 Enterprise Performance Management
             for the Cloud


www.tidemark.net, @TidemarkEPM
An Exploding, Diverse Ecosystem




26           ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
            Reproduction or redistribution without written permission is
                                    prohibited.
| BE FIRST



Big Data Fund
Hadoop World — November 2011
Big Data Fund
• $100MM dedicated to fund entrepreneurs globally in building disruptive, Big
  Data companies
• Funding innovation across every layer of the “Big Data Stack”:
                                 Infrastructure                     •                Applications
                                                                        Business Intelligence
                 •   Automation                                     •   Collaboration
                 •   Data Management                                •   Data Analysis/Visualization
                 •   Identity & Access                              •   Mobile
                 •   Security                                       •   Vertical Applications
                 •   Storage                                        •   …
                 •   …


• Partnering with thought leaders to foster community and drive innovation:




  Doug Cutting       Gil Elbaz      Jeff Hammerbacher   Jeff Heer          Hilary Mason        Jay Parikh   Kenny Van Zant
      Hadoop          Factual            Cloudera       Stanford               Bit.ly          Facebook       Solarwinds


Accel Partners                                                                                                             28
Who We Are

 Three decades of technology investing with over $6B of capital in US, Europe,
 China and India
           • Partner with category-defining entrepreneurs
           • Invest at every stage of technology lifecycle – seed, venture and growth capital
           • Focus deeply on technology innovations in software, infrastructure and internet

     Big Data consistently drives innovation across our portfolio companies today


                   Data Generators                                 Data Solutions




Accel Partners                                                                                  29
Time is Now!

                          The Big Data Wave                       Data is exploding

                                                                  “New” data types are
                                                                   breaking legacy data
     Data Growth




                                                                   platforms

                                                                  Big Data platforms such
                                                                   as Hadoop are becoming
                                                                   mainstream

                   1980     1990                2000    2010      “Native” Big Data
                          Traditional Data   Big Data
                                                                   applications and services
                                                                   will quickly emerge



     Big Data continues to revolutionize data centers across all industries, opening
                  up a massive market for entrepreneurial activity.
Accel Partners                                                                            30
Funding the Big Data Ecosystem
Big Data will drive the next-generation of multi-billion dollar software companies

                               1980 - 2010                          2010 and beyond

                                                               Analytics              Security
                                                                                      Business
Applications




                                                             Collaboration           Intelligence

                                                                Mobile                  CRM

                                                              Vertical Apps: Fin Tech, Healthcare


                                                                    Big Data Platforms
                     Traditional Data Platforms
Data




                    Relational Database Management Systems
Infrastructure




                 Traditional Infrastructure Platforms             Private & Public Cloud
                         Mainframe, Client-Server, Web             Platform and Services


Accel Partners                                                                                      31
Big Data Fund Contact Info
Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel
Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners
                                                                Contact Us
▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big
Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data
Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪
                                                              accel.com/bigdata
Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel
Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners
▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel bigdatafund@accel.com
                                                              Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big
Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data
Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪
                                                              @bigdatafund
Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel
Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners
▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big
Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data
Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪
                                           Big Data Conference - Spring 2012
Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel
Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners
                                                              Want to attend or speak?
▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big
                                                            BigData2012@accel.com
Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data
Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪
                                    Stay on top of the latest big data news from Accel Partners by finding us on
Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel
Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners
                                                               facebook.com/Accel
▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big
Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data
                                                                 @Accel_Partners
Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪
Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel
      Accel Partners                                                                                                                 32
The Next-Generation Data Center

                                                 Systems
               Web                                Logs                                   Real-time
              Servers                                                                     Feeds
 Trading
 Systems                                                                                             Sensors




Enterprise                                                                                            Sales
  Data                                                                                               Systems
Warehouse                                                                                             People




             Document
             Repository                      ERP System                                    CRM




33                         ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                          Reproduction or redistribution without written permission is
                                                  prohibited.
The Future

                   Tackling Critical Business Issues
                                                                                                   Better targeted
                          Better and deeper                                                        medicines with fewer
                          understanding of risk                                                    complications and
                          to avoid credit crisis.                                                  side effects.
     Financial Services                                                  Life Sciences


                                                                                                   A personal experience
                    More reliable                                                                  with products and offers
                    networks where we                                                              that are just what
                    can predict and                                                                you need.
 Telecommunications                                                             Retail
                    prevent failure.

                          More content that is                                                     Government services
                          lined up with your                                                       that are based on hard
                          personal preferences.                                                    data, not just gut.
          Media                                                           Government




34                                   ©2011 Cloudera, Inc. All Rights Reserved. Confidential.
                                    Reproduction or redistribution without written permission is
                                                            prohibited.
Thank You
    Thanks you

More Related Content

What's hot

Hadoop 101
Hadoop 101Hadoop 101
Hadoop 101EMC
 
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on Demand
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on DemandApachecon Euro 2012: Elastic, Multi-tenant Hadoop on Demand
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on DemandRichard McDougall
 
Cloudera Impala: A modern SQL Query Engine for Hadoop
Cloudera Impala: A modern SQL Query Engine for HadoopCloudera Impala: A modern SQL Query Engine for Hadoop
Cloudera Impala: A modern SQL Query Engine for HadoopCloudera, Inc.
 
Deploying Grid Services Using Hadoop
Deploying Grid Services Using HadoopDeploying Grid Services Using Hadoop
Deploying Grid Services Using HadoopGeorge Ang
 
The power of hadoop in cloud computing
The power of hadoop in cloud computingThe power of hadoop in cloud computing
The power of hadoop in cloud computingJoey Echeverria
 
Etu L2 Training - Hadoop 企業應用實作
Etu L2 Training - Hadoop 企業應用實作Etu L2 Training - Hadoop 企業應用實作
Etu L2 Training - Hadoop 企業應用實作James Chen
 
Introduction to Hadoop - ACCU2010
Introduction to Hadoop - ACCU2010Introduction to Hadoop - ACCU2010
Introduction to Hadoop - ACCU2010Gavin Heavyside
 
App cap2956v2-121001194956-phpapp01 (1)
App cap2956v2-121001194956-phpapp01 (1)App cap2956v2-121001194956-phpapp01 (1)
App cap2956v2-121001194956-phpapp01 (1)outstanding59
 
Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)
Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)
Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)Eric Baldeschwieler
 
Hadoop Successes and Failures to Drive Deployment Evolution
Hadoop Successes and Failures to Drive Deployment EvolutionHadoop Successes and Failures to Drive Deployment Evolution
Hadoop Successes and Failures to Drive Deployment EvolutionBenoit Perroud
 
Hdp r-google charttools-webinar-3-5-2013 (2)
Hdp r-google charttools-webinar-3-5-2013 (2)Hdp r-google charttools-webinar-3-5-2013 (2)
Hdp r-google charttools-webinar-3-5-2013 (2)Hortonworks
 
Introduction to hadoop and hdfs
Introduction to hadoop and hdfsIntroduction to hadoop and hdfs
Introduction to hadoop and hdfsTrendProgContest13
 
Storage Infrastructure Behind Facebook Messages
Storage Infrastructure Behind Facebook MessagesStorage Infrastructure Behind Facebook Messages
Storage Infrastructure Behind Facebook Messagesyarapavan
 
Innovations in Apache Hadoop MapReduce, Pig and Hive for improving query perf...
Innovations in Apache Hadoop MapReduce, Pig and Hive for improving query perf...Innovations in Apache Hadoop MapReduce, Pig and Hive for improving query perf...
Innovations in Apache Hadoop MapReduce, Pig and Hive for improving query perf...Vinod Kumar Vavilapalli
 
ROMA User-Customizable NoSQL Database in Ruby
ROMA User-Customizable NoSQL Database in RubyROMA User-Customizable NoSQL Database in Ruby
ROMA User-Customizable NoSQL Database in RubyRakuten Group, Inc.
 
Hadoop Performance at LinkedIn
Hadoop Performance at LinkedInHadoop Performance at LinkedIn
Hadoop Performance at LinkedInAllen Wittenauer
 
Scalable vertical search engine with hadoop
Scalable vertical search engine with hadoopScalable vertical search engine with hadoop
Scalable vertical search engine with hadoopdatasalt
 

What's hot (20)

Introduction to h base
Introduction to h baseIntroduction to h base
Introduction to h base
 
Hadoop 101
Hadoop 101Hadoop 101
Hadoop 101
 
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on Demand
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on DemandApachecon Euro 2012: Elastic, Multi-tenant Hadoop on Demand
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on Demand
 
Cloudera Impala: A modern SQL Query Engine for Hadoop
Cloudera Impala: A modern SQL Query Engine for HadoopCloudera Impala: A modern SQL Query Engine for Hadoop
Cloudera Impala: A modern SQL Query Engine for Hadoop
 
Deploying Grid Services Using Hadoop
Deploying Grid Services Using HadoopDeploying Grid Services Using Hadoop
Deploying Grid Services Using Hadoop
 
The power of hadoop in cloud computing
The power of hadoop in cloud computingThe power of hadoop in cloud computing
The power of hadoop in cloud computing
 
Etu L2 Training - Hadoop 企業應用實作
Etu L2 Training - Hadoop 企業應用實作Etu L2 Training - Hadoop 企業應用實作
Etu L2 Training - Hadoop 企業應用實作
 
Hadoop on VMware
Hadoop on VMwareHadoop on VMware
Hadoop on VMware
 
Introduction to Hadoop - ACCU2010
Introduction to Hadoop - ACCU2010Introduction to Hadoop - ACCU2010
Introduction to Hadoop - ACCU2010
 
App cap2956v2-121001194956-phpapp01 (1)
App cap2956v2-121001194956-phpapp01 (1)App cap2956v2-121001194956-phpapp01 (1)
App cap2956v2-121001194956-phpapp01 (1)
 
Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)
Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)
Hadoop - Where did it come from and what's next? (Pasadena Sept 2014)
 
Hadoop Successes and Failures to Drive Deployment Evolution
Hadoop Successes and Failures to Drive Deployment EvolutionHadoop Successes and Failures to Drive Deployment Evolution
Hadoop Successes and Failures to Drive Deployment Evolution
 
Hdp r-google charttools-webinar-3-5-2013 (2)
Hdp r-google charttools-webinar-3-5-2013 (2)Hdp r-google charttools-webinar-3-5-2013 (2)
Hdp r-google charttools-webinar-3-5-2013 (2)
 
Introduction to hadoop and hdfs
Introduction to hadoop and hdfsIntroduction to hadoop and hdfs
Introduction to hadoop and hdfs
 
Storage Infrastructure Behind Facebook Messages
Storage Infrastructure Behind Facebook MessagesStorage Infrastructure Behind Facebook Messages
Storage Infrastructure Behind Facebook Messages
 
Innovations in Apache Hadoop MapReduce, Pig and Hive for improving query perf...
Innovations in Apache Hadoop MapReduce, Pig and Hive for improving query perf...Innovations in Apache Hadoop MapReduce, Pig and Hive for improving query perf...
Innovations in Apache Hadoop MapReduce, Pig and Hive for improving query perf...
 
ROMA User-Customizable NoSQL Database in Ruby
ROMA User-Customizable NoSQL Database in RubyROMA User-Customizable NoSQL Database in Ruby
ROMA User-Customizable NoSQL Database in Ruby
 
Hadoop Performance at LinkedIn
Hadoop Performance at LinkedInHadoop Performance at LinkedIn
Hadoop Performance at LinkedIn
 
Hadoop at Rakuten, 2011/07/06
Hadoop at Rakuten, 2011/07/06Hadoop at Rakuten, 2011/07/06
Hadoop at Rakuten, 2011/07/06
 
Scalable vertical search engine with hadoop
Scalable vertical search engine with hadoopScalable vertical search engine with hadoop
Scalable vertical search engine with hadoop
 

Viewers also liked

Open Data Fueling Innovation - Kristen Honey
Open Data Fueling Innovation - Kristen HoneyOpen Data Fueling Innovation - Kristen Honey
Open Data Fueling Innovation - Kristen Honeyscoopnewsgroup
 
HHS: Opening Data, Influencing Innovation - Damon Davis
HHS: Opening Data, Influencing Innovation - Damon DavisHHS: Opening Data, Influencing Innovation - Damon Davis
HHS: Opening Data, Influencing Innovation - Damon Davisscoopnewsgroup
 
Intro to Apache Kudu (short) - Big Data Application Meetup
Intro to Apache Kudu (short) - Big Data Application MeetupIntro to Apache Kudu (short) - Big Data Application Meetup
Intro to Apache Kudu (short) - Big Data Application MeetupMike Percy
 
February 2016 HUG: Apache Kudu (incubating): New Apache Hadoop Storage for Fa...
February 2016 HUG: Apache Kudu (incubating): New Apache Hadoop Storage for Fa...February 2016 HUG: Apache Kudu (incubating): New Apache Hadoop Storage for Fa...
February 2016 HUG: Apache Kudu (incubating): New Apache Hadoop Storage for Fa...Yahoo Developer Network
 
Using a Data Lake at the core of a Life Assurance business
Using a Data Lake at the core of a Life Assurance businessUsing a Data Lake at the core of a Life Assurance business
Using a Data Lake at the core of a Life Assurance businessDataWorks Summit/Hadoop Summit
 
Implementing the Business Catalog in the Modern Enterprise: Bridging Traditio...
Implementing the Business Catalog in the Modern Enterprise: Bridging Traditio...Implementing the Business Catalog in the Modern Enterprise: Bridging Traditio...
Implementing the Business Catalog in the Modern Enterprise: Bridging Traditio...DataWorks Summit/Hadoop Summit
 
GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: 인터넷 쇼핑몰의 실시간 분석 플랫폼 구축 사례
GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: 인터넷 쇼핑몰의 실시간 분석 플랫폼 구축 사례GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: 인터넷 쇼핑몰의 실시간 분석 플랫폼 구축 사례
GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: 인터넷 쇼핑몰의 실시간 분석 플랫폼 구축 사례Gruter
 
GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: GRUTER의 빅데이터 플랫폼 및 전략 소개
GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: GRUTER의 빅데이터 플랫폼 및 전략 소개GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: GRUTER의 빅데이터 플랫폼 및 전략 소개
GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: GRUTER의 빅데이터 플랫폼 및 전략 소개Gruter
 
Kudu: New Hadoop Storage for Fast Analytics on Fast Data
Kudu: New Hadoop Storage for Fast Analytics on Fast DataKudu: New Hadoop Storage for Fast Analytics on Fast Data
Kudu: New Hadoop Storage for Fast Analytics on Fast DataCloudera, Inc.
 

Viewers also liked (17)

Open Data Fueling Innovation - Kristen Honey
Open Data Fueling Innovation - Kristen HoneyOpen Data Fueling Innovation - Kristen Honey
Open Data Fueling Innovation - Kristen Honey
 
HHS: Opening Data, Influencing Innovation - Damon Davis
HHS: Opening Data, Influencing Innovation - Damon DavisHHS: Opening Data, Influencing Innovation - Damon Davis
HHS: Opening Data, Influencing Innovation - Damon Davis
 
Intro to Apache Kudu (short) - Big Data Application Meetup
Intro to Apache Kudu (short) - Big Data Application MeetupIntro to Apache Kudu (short) - Big Data Application Meetup
Intro to Apache Kudu (short) - Big Data Application Meetup
 
LinkedIn
LinkedInLinkedIn
LinkedIn
 
February 2016 HUG: Apache Kudu (incubating): New Apache Hadoop Storage for Fa...
February 2016 HUG: Apache Kudu (incubating): New Apache Hadoop Storage for Fa...February 2016 HUG: Apache Kudu (incubating): New Apache Hadoop Storage for Fa...
February 2016 HUG: Apache Kudu (incubating): New Apache Hadoop Storage for Fa...
 
NLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-TextNLP Structured Data Investigation on Non-Text
NLP Structured Data Investigation on Non-Text
 
Securing Hadoop in an Enterprise Context
Securing Hadoop in an Enterprise ContextSecuring Hadoop in an Enterprise Context
Securing Hadoop in an Enterprise Context
 
Using a Data Lake at the core of a Life Assurance business
Using a Data Lake at the core of a Life Assurance businessUsing a Data Lake at the core of a Life Assurance business
Using a Data Lake at the core of a Life Assurance business
 
Implementing the Business Catalog in the Modern Enterprise: Bridging Traditio...
Implementing the Business Catalog in the Modern Enterprise: Bridging Traditio...Implementing the Business Catalog in the Modern Enterprise: Bridging Traditio...
Implementing the Business Catalog in the Modern Enterprise: Bridging Traditio...
 
GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: 인터넷 쇼핑몰의 실시간 분석 플랫폼 구축 사례
GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: 인터넷 쇼핑몰의 실시간 분석 플랫폼 구축 사례GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: 인터넷 쇼핑몰의 실시간 분석 플랫폼 구축 사례
GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: 인터넷 쇼핑몰의 실시간 분석 플랫폼 구축 사례
 
Smart data for a predictive bank
Smart data for a predictive bankSmart data for a predictive bank
Smart data for a predictive bank
 
Apache Hive on ACID
Apache Hive on ACIDApache Hive on ACID
Apache Hive on ACID
 
GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: GRUTER의 빅데이터 플랫폼 및 전략 소개
GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: GRUTER의 빅데이터 플랫폼 및 전략 소개GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: GRUTER의 빅데이터 플랫폼 및 전략 소개
GRUTER가 들려주는 Big Data Platform 구축 전략과 적용 사례: GRUTER의 빅데이터 플랫폼 및 전략 소개
 
Kudu: New Hadoop Storage for Fast Analytics on Fast Data
Kudu: New Hadoop Storage for Fast Analytics on Fast DataKudu: New Hadoop Storage for Fast Analytics on Fast Data
Kudu: New Hadoop Storage for Fast Analytics on Fast Data
 
Hive Does ACID
Hive Does ACIDHive Does ACID
Hive Does ACID
 
Apache kudu
Apache kuduApache kudu
Apache kudu
 
On Demand HDP Clusters using Cloudbreak and Ambari
On Demand HDP Clusters using Cloudbreak and AmbariOn Demand HDP Clusters using Cloudbreak and Ambari
On Demand HDP Clusters using Cloudbreak and Ambari
 

Similar to Hadoop World 2011: Mike Olson Keynote Presentation

Hortonworks Presentation at Big Data London
Hortonworks Presentation at Big Data LondonHortonworks Presentation at Big Data London
Hortonworks Presentation at Big Data LondonHortonworks
 
The Future of DSpace: Making it Personal (Making it Social)
The Future of DSpace: Making it Personal (Making it Social)The Future of DSpace: Making it Personal (Making it Social)
The Future of DSpace: Making it Personal (Making it Social)Rensselaer Polytechnic Institute
 
2.0 Adoption in the Enterprise - The After
2.0 Adoption in the Enterprise - The After2.0 Adoption in the Enterprise - The After
2.0 Adoption in the Enterprise - The AfterSoCo Partners
 
Hadoop's Impact on the Future of Data Management | Amr Awadallah
Hadoop's Impact on the Future of Data Management | Amr AwadallahHadoop's Impact on the Future of Data Management | Amr Awadallah
Hadoop's Impact on the Future of Data Management | Amr AwadallahCloudera, Inc.
 
Social media class 2 v2
Social media class 2 v2Social media class 2 v2
Social media class 2 v2Novell
 
Social media class 2
Social media class 2Social media class 2
Social media class 2Novell
 
Metadata is a Love Note to the Future
Metadata is a Love Note to the FutureMetadata is a Love Note to the Future
Metadata is a Love Note to the FutureRachel Lovinger
 
The Evolution and Future of Hadoop Storage (Hadoop Conference Japan 2016キーノート...
The Evolution and Future of Hadoop Storage (Hadoop Conference Japan 2016キーノート...The Evolution and Future of Hadoop Storage (Hadoop Conference Japan 2016キーノート...
The Evolution and Future of Hadoop Storage (Hadoop Conference Japan 2016キーノート...Hadoop / Spark Conference Japan
 
MapReduce Best Practices and Lessons Learned Applied to Enterprise Datasets -...
MapReduce Best Practices and Lessons Learned Applied to Enterprise Datasets -...MapReduce Best Practices and Lessons Learned Applied to Enterprise Datasets -...
MapReduce Best Practices and Lessons Learned Applied to Enterprise Datasets -...StampedeCon
 
Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Webinar: Productionizing Hadoop: Lessons Learned - 20101208Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Webinar: Productionizing Hadoop: Lessons Learned - 20101208Cloudera, Inc.
 
Workflows in the Virtual Observatory
Workflows in the Virtual ObservatoryWorkflows in the Virtual Observatory
Workflows in the Virtual ObservatoryJose Enrique Ruiz
 
Partner Day at DrupalCon Sydney
Partner Day at DrupalCon SydneyPartner Day at DrupalCon Sydney
Partner Day at DrupalCon SydneyAcquia
 
Social media class 3
Social media class 3Social media class 3
Social media class 3Novell
 
Business power point templates linear demonstration of marketing process usin...
Business power point templates linear demonstration of marketing process usin...Business power point templates linear demonstration of marketing process usin...
Business power point templates linear demonstration of marketing process usin...SlideTeam.net
 
Empowering Your Audience Ambassadors with Semantic Publishing
Empowering Your Audience Ambassadors with Semantic Publishing Empowering Your Audience Ambassadors with Semantic Publishing
Empowering Your Audience Ambassadors with Semantic Publishing Rachel Lovinger
 
20100806 cloudera 10 hadoopable problems webinar
20100806 cloudera 10 hadoopable problems webinar20100806 cloudera 10 hadoopable problems webinar
20100806 cloudera 10 hadoopable problems webinarCloudera, Inc.
 
10 Common Hadoop-able Problems Webinar
10 Common Hadoop-able Problems Webinar10 Common Hadoop-able Problems Webinar
10 Common Hadoop-able Problems WebinarCloudera, Inc.
 

Similar to Hadoop World 2011: Mike Olson Keynote Presentation (20)

Hortonworks Presentation at Big Data London
Hortonworks Presentation at Big Data LondonHortonworks Presentation at Big Data London
Hortonworks Presentation at Big Data London
 
The Future of DSpace: Making it Personal (Making it Social)
The Future of DSpace: Making it Personal (Making it Social)The Future of DSpace: Making it Personal (Making it Social)
The Future of DSpace: Making it Personal (Making it Social)
 
2.0 Adoption in the Enterprise - The After
2.0 Adoption in the Enterprise - The After2.0 Adoption in the Enterprise - The After
2.0 Adoption in the Enterprise - The After
 
Hadoop's Impact on the Future of Data Management | Amr Awadallah
Hadoop's Impact on the Future of Data Management | Amr AwadallahHadoop's Impact on the Future of Data Management | Amr Awadallah
Hadoop's Impact on the Future of Data Management | Amr Awadallah
 
Social media class 2 v2
Social media class 2 v2Social media class 2 v2
Social media class 2 v2
 
Social media class 2
Social media class 2Social media class 2
Social media class 2
 
Metadata is a Love Note to the Future
Metadata is a Love Note to the FutureMetadata is a Love Note to the Future
Metadata is a Love Note to the Future
 
The Evolution and Future of Hadoop Storage (Hadoop Conference Japan 2016キーノート...
The Evolution and Future of Hadoop Storage (Hadoop Conference Japan 2016キーノート...The Evolution and Future of Hadoop Storage (Hadoop Conference Japan 2016キーノート...
The Evolution and Future of Hadoop Storage (Hadoop Conference Japan 2016キーノート...
 
MapReduce Best Practices and Lessons Learned Applied to Enterprise Datasets -...
MapReduce Best Practices and Lessons Learned Applied to Enterprise Datasets -...MapReduce Best Practices and Lessons Learned Applied to Enterprise Datasets -...
MapReduce Best Practices and Lessons Learned Applied to Enterprise Datasets -...
 
Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Webinar: Productionizing Hadoop: Lessons Learned - 20101208Webinar: Productionizing Hadoop: Lessons Learned - 20101208
Webinar: Productionizing Hadoop: Lessons Learned - 20101208
 
Workflows in the Virtual Observatory
Workflows in the Virtual ObservatoryWorkflows in the Virtual Observatory
Workflows in the Virtual Observatory
 
Partner Day at DrupalCon Sydney
Partner Day at DrupalCon SydneyPartner Day at DrupalCon Sydney
Partner Day at DrupalCon Sydney
 
Social media class 3
Social media class 3Social media class 3
Social media class 3
 
Cloudbees -Open Source Versus Business - nicolas de loof - fossa2011
Cloudbees -Open Source Versus Business - nicolas de loof - fossa2011Cloudbees -Open Source Versus Business - nicolas de loof - fossa2011
Cloudbees -Open Source Versus Business - nicolas de loof - fossa2011
 
Business power point templates linear demonstration of marketing process usin...
Business power point templates linear demonstration of marketing process usin...Business power point templates linear demonstration of marketing process usin...
Business power point templates linear demonstration of marketing process usin...
 
Empowering Your Audience Ambassadors with Semantic Publishing
Empowering Your Audience Ambassadors with Semantic Publishing Empowering Your Audience Ambassadors with Semantic Publishing
Empowering Your Audience Ambassadors with Semantic Publishing
 
NYC.JS
NYC.JSNYC.JS
NYC.JS
 
20100806 cloudera 10 hadoopable problems webinar
20100806 cloudera 10 hadoopable problems webinar20100806 cloudera 10 hadoopable problems webinar
20100806 cloudera 10 hadoopable problems webinar
 
10 Common Hadoop-able Problems Webinar
10 Common Hadoop-able Problems Webinar10 Common Hadoop-able Problems Webinar
10 Common Hadoop-able Problems Webinar
 
hadoop事例紹介
hadoop事例紹介hadoop事例紹介
hadoop事例紹介
 

More from Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 

More from Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 

Recently uploaded

What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 

Recently uploaded (20)

What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 

Hadoop World 2011: Mike Olson Keynote Presentation

  • 2. Conference Highlights • Four exciting keynotes • Lots networking opportunities • Sixty educational sessions 2 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 3. Thank You Sponsors PLATINUM SPONSORS GOLD SPONSORS SILVER SPONSORS BRONZE SPONSORS 3 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 4. Housekeeping Items • Connecting to the internet – Wireless network = Sheraton Meeting – Code = Vertica • Hashtag = #hw2011 • Take the surveys – Breakout sessions – Overall survey 4 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 5. Mike Olson Chief Executive Officer Cloudera
  • 6. Three Years Ago… We said: Hadoop is going to be huge. This year’s conference: • 1,400 people from 580 companies in 27 countries and 40 of the United States • 75.7% attending Hadoop World for the first time • 71.9% using Hadoop • 66.5% engineers, developers and architects, 33.5% non-technical business roles • Just over 50 of you are “data scientists” 6 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 7. Three Years Ago… We said: Hadoop is going to be huge. Your Hadoop usage: • Less than one year: 36.8% • One to two years: 32.3% • Two to three years: 16.8% • More than three years: 12% • Average usage is 17.4 months this year, versus 8.76 months at last year’s Hadoop World 7 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 8. Three Years Ago… We said: Hadoop is going to be huge. Your clusters: • Average size is 120 nodes, up from 66 last year • 44% between 10 and 100 nodes, 52% between 100 and 1,000 nodes • Total of 202 petabytes under management (60 last year) • Largest cluster bigger than 20PB • 13.1% bigger than 100TB • 12.8% bigger than 1PB 2010 2011 8 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 9. Two Years Ago… We said: Hadoop is at the center of a new platform for big data. • Hadoop • HBase • Pig • Zookeeper • Mahout • Hive • Avro • Whirr • Sqoop • Hcatalog • MRUnit • Bigtop • Oozie 9 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 10. Two Years Ago… We said: Hadoop is at the center of a new platform for big data. 100% 100% Core 58% Hadoop 37% 37% 31% as % of New Contribs 2006 2007 2008 2009 2010 2011 • Core Hadoop • Core Hadoop • Core Hadoop • Core Hadoop • Core Hadoop • Core Hadoop • HBase • HBase • HBase • HBase • Zookeeper • Pig • Pig • Pig • Mahout • Zookeeper • Zookeeper • Zookeeper Relevant • Mahout • Mahout • Mahout Projects • Hive • Hive • Hive • Avro • Avro • Whirr • Whirr • Sqoop • Sqoop • Bigtop • … 10 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 11. Last Year… We said : Hadoop must integrate with data center infrastructure and tools. • Enterprises need software and support that de-risk and simplify the operation of Hadoop in production • Must build on the open source platform to deliver all the innovation Hadoop and value created by the global Apache Operations Hadoop ecosystem 11 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 12. 12 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 13. 13 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 14. 14 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 15. 15 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 16. 16 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 17. 17 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 18. 18 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 19. Last Year… We said : Hadoop must integrate with data center infrastructure and tools. OPERATORS ENGINEERS ANALYSTS BUSINESS USERS Management Enterprise IDE’s BI / Analytics Tools Reporting CUSTOMERS Enterprise Data Warehouse Web Application Relational Logs Files Web Data Databases 19 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 20. This Year… We’re talking about the future. 20 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 21. Building Applications Develop personalized applications on Hadoop and HBase Get it at: http://fonedoktor.com Learn more about Today, 3:30PM, Architecture Track Battery Analysis Mapping Features Aaron Kimball and Available Today… Coming Soon! Garrett Wu www.wibidata.com, @wibidata
  • 22. Data Analysis and Visualization INSTANT INTELLIGENCE Demand for Online App Analytics • Real-time, interactive & visual analytics • Auto-discover data trends • User behavior analytics with data clustering • Investigative and root cause analytics • Simplify data modeling & custom functions for Hadoop data Empower business users, data scientists without-of-the-box analytics www.cetas.net, @CetasAnalytics
  • 23. Powerful Statistical Tools • Why Hadoop and R? • Need to do more than simple statistics • Analyze all of the data • Integration • Make it easy to write MapReduce programs in R • Keep the statisticians focused on the analysis Usage • Fraud and Risk Analysis • Portfolio Optimization • Anything you can model in R! www.revolutionanalytics.com, @RevolutionR
  • 24. Complex Data Exploration Automatic extraction of facts, Who connections, associations, etc. Relationship Who Association Connections Aliases Entity : Alias Where AIG When Location What Time Synthesys Knowledge Base What did.. Connection discovered from AIG to Metlife Equity in Wikipedia: Unstructured Data AIG sells Allco to Metlife Equity for $6.8B Synthesys automatically surfaces critical facts in unstructured data www.digitalreasoning.com, @dreasoning
  • 25. Business Analytics • Metrics Management and Reporting • Strategic, Financial, and Operational Planning, Budgeting, and Forecasting • Profitability Modeling USABLE UNIFIED ACTIONABLE Enterprise Performance Management for the Cloud www.tidemark.net, @TidemarkEPM
  • 26. An Exploding, Diverse Ecosystem 26 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 27. | BE FIRST Big Data Fund Hadoop World — November 2011
  • 28. Big Data Fund • $100MM dedicated to fund entrepreneurs globally in building disruptive, Big Data companies • Funding innovation across every layer of the “Big Data Stack”: Infrastructure • Applications Business Intelligence • Automation • Collaboration • Data Management • Data Analysis/Visualization • Identity & Access • Mobile • Security • Vertical Applications • Storage • … • … • Partnering with thought leaders to foster community and drive innovation: Doug Cutting Gil Elbaz Jeff Hammerbacher Jeff Heer Hilary Mason Jay Parikh Kenny Van Zant Hadoop Factual Cloudera Stanford Bit.ly Facebook Solarwinds Accel Partners 28
  • 29. Who We Are Three decades of technology investing with over $6B of capital in US, Europe, China and India • Partner with category-defining entrepreneurs • Invest at every stage of technology lifecycle – seed, venture and growth capital • Focus deeply on technology innovations in software, infrastructure and internet Big Data consistently drives innovation across our portfolio companies today Data Generators Data Solutions Accel Partners 29
  • 30. Time is Now! The Big Data Wave  Data is exploding  “New” data types are breaking legacy data Data Growth platforms  Big Data platforms such as Hadoop are becoming mainstream 1980 1990 2000 2010  “Native” Big Data Traditional Data Big Data applications and services will quickly emerge Big Data continues to revolutionize data centers across all industries, opening up a massive market for entrepreneurial activity. Accel Partners 30
  • 31. Funding the Big Data Ecosystem Big Data will drive the next-generation of multi-billion dollar software companies 1980 - 2010 2010 and beyond Analytics Security Business Applications Collaboration Intelligence Mobile CRM Vertical Apps: Fin Tech, Healthcare Big Data Platforms Traditional Data Platforms Data Relational Database Management Systems Infrastructure Traditional Infrastructure Platforms Private & Public Cloud Mainframe, Client-Server, Web Platform and Services Accel Partners 31
  • 32. Big Data Fund Contact Info Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners Contact Us ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ accel.com/bigdata Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel bigdatafund@accel.com Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ @bigdatafund Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Big Data Conference - Spring 2012 Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners Want to attend or speak? ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big BigData2012@accel.com Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Stay on top of the latest big data news from Accel Partners by finding us on Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners facebook.com/Accel ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data @Accel_Partners Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Partners ▪ Big Data Fund ▪ Accel Accel Partners 32
  • 33. The Next-Generation Data Center Systems Web Logs Real-time Servers Feeds Trading Systems Sensors Enterprise Sales Data Systems Warehouse People Document Repository ERP System CRM 33 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 34. The Future Tackling Critical Business Issues Better targeted Better and deeper medicines with fewer understanding of risk complications and to avoid credit crisis. side effects. Financial Services Life Sciences A personal experience More reliable with products and offers networks where we that are just what can predict and you need. Telecommunications Retail prevent failure. More content that is Government services lined up with your that are based on hard personal preferences. data, not just gut. Media Government 34 ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  • 35. Thank You Thanks you

Editor's Notes

  1. We ran a survey.1400 people, 580 countries27 countries and 40 of the United StatesMore than 3/4 are first-timers at Hadoop World – Welcome!Nearly 3/4 are using Hadoop today2/3 technical, 1/3 businessAnd the new profession of data science is here in force!
  2. One third each: Less than one year, 1-2 years, more than two years.The average user here is more experienced than the average user at Hadoop World 2010 – 9 months
  3. Average cluster size has doubled in a year.More than half of you have pretty big clusters – more than 100 nodes.202 PB represented on our survey. One company was 10% of that.More of you – 12% -- above a petabyte than I would have guessed.But important: About 3/4 of you have less than 100TB in Hadoop.
  4. Hadoop needed more:Load and share dataQuery tools and ways to schedule and manage obsFast record storage and retrievalAll of that is available from the Apache ecossytem
  5. In 2006 and 2007, all the work was on core Hadoop.2008, the ecosystem began to diversify.Today, nearly 70% of all new contribs are to surrounding projects – only 31% to Hadoop itselfWhat you would expect as platform has matured
  6. Hadoop in production is just one part of your data center.You need to monitor and manage like other critical platforms.
  7. What’s happening right now?Who’s doing what?
  8. How are the services I depend on doing?
  9. I need a high-level service view.Take storage.How is it performing?Latency? Throughput?What’s happening?
  10. Who’s consuming storage?Am I close to capacity?How to I make sure users get what they need?How do I track their use?
  11. Infrastructure is long-lived.I need to add, remove, retire hardware.I can’t shut down the system.
  12. Move between high-level view and detail.HDFS is a service, but it runs on lots of servers.I need to see both.
  13. That’s just storage.Lots of other services: query tools, analytics and more.Complex, multi-tenant, mission-critical infrastructure.Integrate with data center operations.
  14. Hadoop is not an island.It is part of your enterprise IT platform.We were right.
  15. Pick your graph: Big data is a big deal.The platform is here today.The next 12 months will be about use cases.About tooling and apps.Let me show you some cool ones. These companies are all here today.
  16. WibiData is Odiago’s core product – a platform for developing personalized applications with Hadoop and HbaseWibiData provides both programmatic APIs for Application Development and an ODBC interface for easy integration with existing BI / Reporting / Analysis technology + libraries that make personalization quick and easyFoneDoktor is one such application, powered by WibiDataFoneDoktor is free for Consumers:Learn from your dataShare with the community -> get more value from your dataAvailable at fonedoktor.comFoneDoktor is available to Partners (Carriers and OEMs):Lower Device Return RatesLower Support VolumeMeasure Device / Network performanceWibiData + FoneDoktor deep dive in Aaron and Garrett’s talk – check it out!
  17. Need self-service tools for behavioral analytics.Interactive, visual tools for business users to explore data themselves.Cetas provides real-time, interactive analytics.Automatic discover and highlight clusters and trends in data.Mask complexity, deliver big data analysis to business users.
  18. R is a statistical language for developing advanced analyticsWith Hadoop, R can explore all the data: No sampling, no subsetting.R language runs under MapReduceStatistician focuses on analysis, not HadoopFraud and Risk analysisPortfolio optimizationAnything you can model in R
  19. Validated by customers in the US Army and intelligence spaceOperates on key enterprise information (financial intelligence, risk, and patents)Combines enterprise data with public sourcesStructured, semi-structured and complexDiscovers and shows connections, relationships among entities
  20. Enterprise Performance ManagementKey metrics, trends, analysies: Plan, budget, forecastHadoop for trending, diverse data sources, external and internalWith drill-downAimed at busy execs who need clear insight and overviewiPad, iPhone applications
  21. It’s getting crowded in here!Companies contributing to Hadoop, integrating with it or building on top.Sign of a big, robust market.But these aren’t the only people who have spotted the opportunity in big data.I’d like to bring up Ping Li from Accel Partners with an exciting announcement.
  22. Hadoop as the hubCatch, process, summarize the firehoseIntegrate with new and existing platforms for special-purpose workloadsAlready happening
  23. Three years talking speeds and feedsThe story for the future is value:Business problems and solutions built on big data.