Business value from Big Data
Agenda

• What’s Big Data?
• Why now?
• Why care?
• Key technologies
• How can I get started?
What’s Big Data?

• Volume
• Velocity
• Variety
• Value
Large datasets
Challenge


• Growth of data generated per year: 40%
• Growth of IT spending per year: 5%
Source: McKinsey
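
To make the gap between the two growth rates concrete, here is a minimal sketch that compounds both figures over several years. It assumes the 40% and 5% rates hold constant year after year, which the slide does not claim; the function name `growth_gap` is made up for illustration.

```python
def growth_gap(years, data_growth=0.40, spend_growth=0.05):
    """Return how much more data there is per unit of IT spend after `years`.

    Assumes both rates stay constant and compound annually (an assumption,
    not something stated in the source figures).
    """
    data = (1 + data_growth) ** years
    spend = (1 + spend_growth) ** years
    return data / spend


for year in (1, 3, 5, 10):
    print(f"year {year:2d}: data per unit of IT spend x{growth_gap(year):.1f}")
```

Under these assumptions the amount of data each unit of IT budget has to cope with roughly quadruples in five years, which is the "challenge" the slide points at.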
Maybe Big Data is...

• When any of volume, velocity, variety, value
  (cost?) becomes a problem
• When new use cases emerge, new things
  become possible, because of new data
  sources
For example



• US cell updates: 600B/day
• Items shared on social media: 4B/day
• Smart meter readings (2015): 29B/day
Why now?
Cost per gigabyte

[Chart: $ per GB on a log scale, 1992 to 2008, falling from $569 to $0.13]

Source: Deloitte
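
As a rough worked example of what that chart implies, the sketch below derives the average yearly price change from the two labelled values. It assumes $569 belongs to 1992 and $0.13 to 2008 (the chart's first and last axis years) and that the decline was smooth; `annual_change` is a hypothetical helper name.

```python
def annual_change(start_price, end_price, years):
    """Compound annual rate of change between two prices `years` apart."""
    return (end_price / start_price) ** (1 / years) - 1


# Assumed endpoints: $569/GB in 1992 and $0.13/GB in 2008.
rate = annual_change(569.0, 0.13, 2008 - 1992)
print(f"average change per year: {rate:.1%}")
```

On these assumptions storage got around 40% cheaper every year for sixteen years, which goes some way to explaining "why now".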
Guess what?
Disruptive innovation
Why care?

“Companies that can harness big data will
     trample data incompetents”
           The Economist, May 26th 2011
Why care - take 2

• The competition will do it (and you’ll get
  fired)
• Competitive advantage to be gained by
  doing it well (you get promoted)
• It’s not hard to get started (no need for huge
  investment)
What are we looking for?

• Data / Information
• Insights
• Actionable intelligence
Databases




A Relational Model of Data for Large Shared Data Banks
Ted Codd, CACM, June 1970

Image: IBM
Big = Slow?

[Chart: throughput (records/ms) against records (in millions, 0 to 100). Throughput falls as datasets get larger.]

Source: Gerard Maas, http://www.gerardmaas.net/2011/06/bigdata-on-rdbms
Scale-out versus Scale-up




Hadoop
• Great for unstructured data or arbitrary
  queries
• MapReduce framework for distributed
  compute
• Tools now making it accessible
• Still essentially a batch processing system
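
For readers who have not met MapReduce, here is a minimal single-process sketch of the programming model using the usual word-count example. It is not Hadoop's API: the function names are invented for illustration, and a real cluster would run the map and reduce steps in parallel across many machines.

```python
from collections import defaultdict


def map_phase(record):
    """Emit a (word, 1) pair for each word in one input record."""
    for word in record.lower().split():
        yield word, 1


def shuffle(pairs):
    """Group values by key, the step a framework performs between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values seen for one key into a single result."""
    return key, sum(values)


def word_count(records):
    pairs = (pair for record in records for pair in map_phase(record))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


print(word_count(["big data", "big value from data"]))
```

Because every record is mapped independently and every key is reduced independently, the work splits cleanly across servers. The whole input still has to be processed before any answer appears, which is why the slide calls it a batch system.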
What about real-time?
Use cases
• Tracking trending topics on social media
• Network and infrastructure monitoring
• Web and ad analytics dashboards and
  platforms
• Real-time A-B testing
• User profiling
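
To illustrate how a real-time use case differs from a batch job, the sketch below keeps running counts of topics over a sliding time window, so the "trending" answer is updated as each event arrives. The class and method names are hypothetical, and it assumes events arrive in timestamp order on a single machine.

```python
from collections import Counter, deque


class TrendingTopics:
    """Count topic mentions seen within the last `window_seconds`."""

    def __init__(self, window_seconds):
        self.window_seconds = window_seconds
        self.events = deque()      # (timestamp, topic) pairs, oldest first
        self.counts = Counter()    # live counts for events inside the window

    def add(self, timestamp, topic):
        """Record one mention and drop anything that has left the window."""
        self.events.append((timestamp, topic))
        self.counts[topic] += 1
        self._expire(timestamp)

    def _expire(self, now):
        # Assumes timestamps never decrease, so expired events are at the front.
        while self.events and self.events[0][0] <= now - self.window_seconds:
            _, old_topic = self.events.popleft()
            self.counts[old_topic] -= 1
            if self.counts[old_topic] == 0:
                del self.counts[old_topic]

    def top(self, n):
        """Return the `n` most mentioned topics currently in the window."""
        return self.counts.most_common(n)


trends = TrendingTopics(window_seconds=60)
for ts, topic in [(0, "hadoop"), (10, "nosql"), (20, "nosql"), (30, "hadoop"), (40, "nosql")]:
    trends.add(ts, topic)
print(trends.top(1))
```

Each update touches only the newest event and the expired ones, so the answer is available immediately instead of after a full pass over the data.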
NOSQL

        Voldemort
No “one size fits all”

• Column DBs and Key-Value stores
• Document databases
• Graph databases

[Diagram: triangle labelled C, A, P]
Questions to ask

• Who uses it?
• Who can support it? Where are they?
• How does it scale? Perform?
• Maturity, both DB and tool ecosystem
Changing economics

XDR metadata on Oracle and NetApp
• 30 days of SMS
• At capacity ceiling

XDR metadata on 30 x $3k Dell servers
• 1/5th TCO of alternatives
• Cost grows predictably
Start small

• Identify data sources
• Look at capabilities
• Run experiments, PoCs
Data sources

• Web, SCM, Retail
• Location Services
• Infra Monitoring
• Smart Metering
• Oil/Gas Sensors
• Ad Marketplaces
• Fraud Detection
• Social Media
Capabilities

• Open source, supported, or “packaged” solution?
• How do “commodity” servers fit your infrastructure?
• Don’t rule out Cloud deployments to get quick answers
Acunu

Discover the Potential of Real Time Big Data with Acunu Activate

Every CIO, Architect and Analyst knows of existing data with huge untapped potential within their organisation. Evolving Big Data technologies provide new paths to revenue with both customers and prospects.

Acunu partners with you to deliver competitive advantage by capturing data and exploring its benefits. You’ll validate the value of Big Data by building real applications and dashboards to drive new value for your business.

At the outset, we work with you to identify and develop use cases and areas where Big Data tools could be utilised to add significant business value. We work with you to recommend solutions architectures for your specific use cases.

We then deploy Acunu Reflex in your own data center or in the cloud and can include Apache Hadoop for investigative work and Acunu Analytics for real-time decision support.

Once the software is installed, we work with you to integrate, capture and store sources of data from inside your organisation. We provide hands-on assistance to help you showcase the business value of your data through live proof-of-concept applications. You’ll get results quickly, with successive iterations delivering ongoing value.

As a result, you gain an understanding of Big Data’s transformative capability through working demonstrations and have a clear route to deliver that competitive advantage to your business.

“Key business and technology trends are disrupting the traditional data management and processing landscape. Data analysis is increasingly being viewed as a competitive advantage. An increasingly sensor-enabled and instrumented business environment is generating huge volumes of data… Traditional IT infrastructure is simply not able to meet the demands of this new situation.”
-Gartner

A Comprehensive Big Data Discovery Package

• Workshops: Acunu Specialist delivers workshops and provides on-demand consulting to enable your development team to build Big Data applications.
• Structure & Planning: A dedicated Project Lead will keep the project on track through kickoff, reviews and regular calls. Progressively, we’ll help you plan next steps.
• Ecosystem of Expertise: Acunu’s Big Data expertise is complemented through our partners. Together we will build your own Big Data ecosystem.
• Deployment: We deploy Acunu’s database and storage software, complete with management tools, to your own hardware or to Amazon’s public cloud.
• Data Source Integration: We work with you to integrate sources of log, clickstream, sensor, monitoring or similar data into Acunu Reflex.
• Support & Training: We deliver hands on training on the Acunu Reflex infrastructure to your operations staff, and provide support throughout the project.

Acunu Reflex
Makes Big Data results easy, economic and fast

• Zero to Big Data Hero: Build a Big Data database cluster on commodity hardware in hours, not days.
• Save Money versus Open Source Alternatives: Save up to 60% on hardware and operation costs.
• Database lag getting you down? Milliseconds turning into minutes?

What is Acunu Reflex?

Easy
Acunu provides an integrated suite of technologies to support rapid development and deployment of your Big Data applications. Getting started is easy with a single, fast installation, handling all the details usually associated with OS tuning, storage optimization, database integration and management. This alleviates the complexity of NoSQL development, deployment and support. The platform is flexible and scalable, providing simple, one click deployment. Scale linearly with ease and deploy across numerous machines within a data center or across a globally distributed public or private cloud.

Economic
Acunu’s subscription base pricing model insures continuous value, skipping charges for non-production deployment, so you can defer technology expenses until your application goes into production. Acunu provides the NoSQL domain expertise you need, reducing your technology deployment costs without compromising your data security. The platform is architected to store significantly more data per node than competing technologies with a focus on reducing both your initial hardware and operational costs over time. Acunu’s support for commodity hardware and large capacity disks further reduces your costs.

Fast
Acunu provides a suite of products focused on bringing you the performance your Big Data applications demand. Whether it’s a globally distributed database, millions to billions of records, tremendous amounts of machine generated data or managing millions of active users, Acunu provides you with real time results. Acunu has the professional services and support to get your applications up and running in the shortest possible time. Acunu leverages best in class open source solutions, adding additional management and performance technology to accelerate your Big Data results.

www.acunu.com @acunu




Apache, Apache Cassandra, Cassandra, Hadoop, and the eye and
elephant logos are trademarks of the Apache Software Foundation.

Más contenido relacionado

La actualidad más candente

Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest MindsWhitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest MindsHappiest Minds Technologies
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformDATAVERSITY
 
Big data ibm keynote d advani presentation
Big data ibm keynote d advani presentationBig data ibm keynote d advani presentation
Big data ibm keynote d advani presentationMassTLC
 
IBM Big Data References
IBM Big Data ReferencesIBM Big Data References
IBM Big Data ReferencesRob Thomas
 
Telco Big Data 2012 Highlights
Telco Big Data 2012 HighlightsTelco Big Data 2012 Highlights
Telco Big Data 2012 HighlightsAlan Quayle
 
The Future Of Big Data
The Future Of Big DataThe Future Of Big Data
The Future Of Big DataMatthew Dennis
 
NextGen Infrastructure for Big Data
NextGen Infrastructure for Big DataNextGen Infrastructure for Big Data
NextGen Infrastructure for Big DataEd Dodds
 
Analytics 3.0 Measurable business impact from analytics & big data
Analytics 3.0 Measurable business impact from analytics & big dataAnalytics 3.0 Measurable business impact from analytics & big data
Analytics 3.0 Measurable business impact from analytics & big dataMicrosoft
 
Assumptions about Data and Analysis: Briefing room webcast slides
Assumptions about Data and Analysis: Briefing room webcast slidesAssumptions about Data and Analysis: Briefing room webcast slides
Assumptions about Data and Analysis: Briefing room webcast slidesmark madsen
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-HadoopNagarjuna D.N
 
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...Vasu S
 
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUMETHE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUMEGigaom
 
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesWhat is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesTony Pearson
 
Why Data is Drowning the (IT) World?
Why Data is Drowning the (IT) World?Why Data is Drowning the (IT) World?
Why Data is Drowning the (IT) World?Sanjeev Kumar
 
IBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep diveIBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep diveKun Le
 
Big Data Information Architecture PowerPoint Presentation Slide
Big Data Information Architecture PowerPoint Presentation SlideBig Data Information Architecture PowerPoint Presentation Slide
Big Data Information Architecture PowerPoint Presentation SlideSlideTeam
 

La actualidad más candente (18)

Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest MindsWhitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics Platform
 
Big data ibm keynote d advani presentation
Big data ibm keynote d advani presentationBig data ibm keynote d advani presentation
Big data ibm keynote d advani presentation
 
IBM Big Data References
IBM Big Data ReferencesIBM Big Data References
IBM Big Data References
 
Telco Big Data 2012 Highlights
Telco Big Data 2012 HighlightsTelco Big Data 2012 Highlights
Telco Big Data 2012 Highlights
 
The Future Of Big Data
The Future Of Big DataThe Future Of Big Data
The Future Of Big Data
 
NextGen Infrastructure for Big Data
NextGen Infrastructure for Big DataNextGen Infrastructure for Big Data
NextGen Infrastructure for Big Data
 
Analytics 3.0 Measurable business impact from analytics & big data
Analytics 3.0 Measurable business impact from analytics & big dataAnalytics 3.0 Measurable business impact from analytics & big data
Analytics 3.0 Measurable business impact from analytics & big data
 
Assumptions about Data and Analysis: Briefing room webcast slides
Assumptions about Data and Analysis: Briefing room webcast slidesAssumptions about Data and Analysis: Briefing room webcast slides
Assumptions about Data and Analysis: Briefing room webcast slides
 
Big Data on AWS
Big Data on AWSBig Data on AWS
Big Data on AWS
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-Hadoop
 
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
Case Study - Spotad: Rebuilding And Optimizing Real-Time Mobile Adverting Bid...
 
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUMETHE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
THE 3V’S OF BIG DATA: VARIETY, VELOCITY, and VOLUME
 
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesWhat is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use Cases
 
Why Data is Drowning the (IT) World?
Why Data is Drowning the (IT) World?Why Data is Drowning the (IT) World?
Why Data is Drowning the (IT) World?
 
IBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep diveIBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep dive
 
Big Data Overview
Big Data OverviewBig Data Overview
Big Data Overview
 
Big Data Information Architecture PowerPoint Presentation Slide
Big Data Information Architecture PowerPoint Presentation SlideBig Data Information Architecture PowerPoint Presentation Slide
Big Data Information Architecture PowerPoint Presentation Slide
 

Destacado

Value proposition of open government data
Value proposition of open government dataValue proposition of open government data
Value proposition of open government dataAlexander Howard
 
"Using Vision to Improve Waste Collection Efficiency," a Presentation from Co...
"Using Vision to Improve Waste Collection Efficiency," a Presentation from Co..."Using Vision to Improve Waste Collection Efficiency," a Presentation from Co...
"Using Vision to Improve Waste Collection Efficiency," a Presentation from Co...Edge AI and Vision Alliance
 
Value Creation for SMBs with Big Data
Value Creation for SMBs with Big DataValue Creation for SMBs with Big Data
Value Creation for SMBs with Big DataAndrey Sadovykh
 
Turning Data Into Value
Turning Data Into ValueTurning Data Into Value
Turning Data Into ValueMatt Hall
 
Food waste collection in the Netherlands
Food waste collection in the NetherlandsFood waste collection in the Netherlands
Food waste collection in the NetherlandsMilano Recycle City
 
Business Aspects of the IoT: Making Products Smart
Business Aspects of the IoT: Making Products SmartBusiness Aspects of the IoT: Making Products Smart
Business Aspects of the IoT: Making Products SmartDominique Guinard
 
SuperWeek 2016 - Garbage In Garbage Out: Data Quality in a TMS World
SuperWeek 2016 - Garbage In Garbage Out: Data Quality in a TMS WorldSuperWeek 2016 - Garbage In Garbage Out: Data Quality in a TMS World
SuperWeek 2016 - Garbage In Garbage Out: Data Quality in a TMS WorldSimo Ahava
 
Emerging Business Models for the Open Data Industry and Open Data Value Capab...
Emerging Business Models for the Open Data Industry and Open Data Value Capab...Emerging Business Models for the Open Data Industry and Open Data Value Capab...
Emerging Business Models for the Open Data Industry and Open Data Value Capab...Fatemeh Ahmadi
 
Industrial Data Space Key Facts
Industrial Data Space Key FactsIndustrial Data Space Key Facts
Industrial Data Space Key FactsBoris Otto
 
Turning data from insights into value
Turning data from insights into valueTurning data from insights into value
Turning data from insights into valueKoray Kocabas
 
[243] turning data into value
[243] turning data into value[243] turning data into value
[243] turning data into valueNAVER D2
 
Big data and value creation
Big data and value creationBig data and value creation
Big data and value creationRichard Vidgen
 
Turning Industrial Data into Value
Turning Industrial Data into ValueTurning Industrial Data into Value
Turning Industrial Data into ValueBoris Otto
 

Destacado (13)

Value proposition of open government data
Value proposition of open government dataValue proposition of open government data
Value proposition of open government data
 
"Using Vision to Improve Waste Collection Efficiency," a Presentation from Co...
"Using Vision to Improve Waste Collection Efficiency," a Presentation from Co..."Using Vision to Improve Waste Collection Efficiency," a Presentation from Co...
"Using Vision to Improve Waste Collection Efficiency," a Presentation from Co...
 
Value Creation for SMBs with Big Data
Value Creation for SMBs with Big DataValue Creation for SMBs with Big Data
Value Creation for SMBs with Big Data
 
Turning Data Into Value
Turning Data Into ValueTurning Data Into Value
Turning Data Into Value
 
Food waste collection in the Netherlands
Food waste collection in the NetherlandsFood waste collection in the Netherlands
Food waste collection in the Netherlands
 
Business Aspects of the IoT: Making Products Smart
Business Aspects of the IoT: Making Products SmartBusiness Aspects of the IoT: Making Products Smart
Business Aspects of the IoT: Making Products Smart
 
SuperWeek 2016 - Garbage In Garbage Out: Data Quality in a TMS World
SuperWeek 2016 - Garbage In Garbage Out: Data Quality in a TMS WorldSuperWeek 2016 - Garbage In Garbage Out: Data Quality in a TMS World
SuperWeek 2016 - Garbage In Garbage Out: Data Quality in a TMS World
 
Emerging Business Models for the Open Data Industry and Open Data Value Capab...
Emerging Business Models for the Open Data Industry and Open Data Value Capab...Emerging Business Models for the Open Data Industry and Open Data Value Capab...
Emerging Business Models for the Open Data Industry and Open Data Value Capab...
 
Industrial Data Space Key Facts
Industrial Data Space Key FactsIndustrial Data Space Key Facts
Industrial Data Space Key Facts
 
Turning data from insights into value
Turning data from insights into valueTurning data from insights into value
Turning data from insights into value
 
[243] turning data into value
[243] turning data into value[243] turning data into value
[243] turning data into value
 
Big data and value creation
Big data and value creationBig data and value creation
Big data and value creation
 
Turning Industrial Data into Value
Turning Industrial Data into ValueTurning Industrial Data into Value
Turning Industrial Data into Value
 

Similar a Exploring Big Data value for your business

Big data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantBig data? No. Big Decisions are What You Want
Big data? No. Big Decisions are What You WantStuart Miniman
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementTony Bain
 
Enable Better Decision Making with Power BI Visualizations & Modern Data Estate
Enable Better Decision Making with Power BI Visualizations & Modern Data EstateEnable Better Decision Making with Power BI Visualizations & Modern Data Estate
Enable Better Decision Making with Power BI Visualizations & Modern Data EstateCCG
 
Big data4businessusers
Big data4businessusersBig data4businessusers
Big data4businessusersBob Hardaway
 
Introduction of big data and analytics
Introduction of big data and analyticsIntroduction of big data and analytics
Introduction of big data and analyticsSanjeev Solanki
 
Ab cs of big data
Ab cs of big dataAb cs of big data
Ab cs of big dataDigimark
 
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...
Quicker Insights and Sustainable Business Agility Powered By Data Virtualizat...Denodo
 
Analytics big data ibm
Analytics big data ibmAnalytics big data ibm
Analytics big data ibmAccenture
 
OSC2012: Big Data Using Open Source: Netapp Project - Technical
OSC2012: Big Data Using Open Source: Netapp Project - TechnicalOSC2012: Big Data Using Open Source: Netapp Project - Technical
OSC2012: Big Data Using Open Source: Netapp Project - TechnicalAccenture the Netherlands
 
big-data-8722-m8RQ3h1.pptx
big-data-8722-m8RQ3h1.pptxbig-data-8722-m8RQ3h1.pptx
big-data-8722-m8RQ3h1.pptxVaishnavGhadge1
 
Building Confidence in Big Data - IBM Smarter Business 2013
Building Confidence in Big Data - IBM Smarter Business 2013 Building Confidence in Big Data - IBM Smarter Business 2013
Building Confidence in Big Data - IBM Smarter Business 2013 IBM Sverige
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaCloudera, Inc.
 
Connecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnectaDigital
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsCaserta
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataRoi Blanco
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieSunil Ranka
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigDataValarmathi V
 
Key note big data analytics ecosystem strategy
Key note   big data analytics ecosystem strategyKey note   big data analytics ecosystem strategy
Key note big data analytics ecosystem strategyIBM Sverige
 

Exploring Big Data value for your business

  • 2. Agenda • What’s Big Data? • Why now? • Why care? • Key technologies • How can I get started?
  • 3. Agenda • What’s Big Data? • Why now? • Why care? • Key technologies • How can I get started?
  • 4. What’s Big Data? • Volume • Velocity • Variety • Value
  • 6. Challenge • Growth of data generated per year: 40% • Growth of IT spending per year: 5% Source: McKinsey
  • 7. Maybe Big Data is... • When any of volume, velocity, variety, value (cost?) becomes a problem • When new use cases emerge, new things become possible, because of new data sources
  • 8. For example • US cell updates: 600B/day • Items shared (social media): 4B/day • Smart meter readings (2015): 29B/day
  • 9. Agenda • What’s Big Data? • Why now? • Why care? • Key technologies • How can I get started?
  • 11. Cost per gigabyte [Chart: $ per GB on a log scale, 1992 to 2008, falling from $569 to $0.13] Source: Deloitte
  • 14. Agenda • What’s Big Data? • Why now? • Why care? • Key technologies • How can I get started?
  • 15. Why care? “Companies that can harness big data will trample data incompetents” The Economist, May 26th 2011
  • 16. Why care - take 2 • The competition will do it (and you’ll get fired) • Competitive advantage to be gained by doing it well (you get promoted) • It’s not hard to get started (no need for huge investment)
  • 17. What are we looking for? • Data / Information • Insights • Actionable intelligence
  • 18. Agenda • What’s Big Data? • Why now? • Why care? • Key technologies • How can I get started?
  • 19. Databases “A Relational Model of Data for Large Shared Data Banks”, Ted Codd, CACM, June 1970. Image: IBM
  • 20. Big = Slow? Throughput (records/ms) falls as datasets get larger [Chart: throughput against records in millions, 0 to 100] Source: Gerard Maas, http://www.gerardmaas.net/2011/06/bigdata-on-rdbms
  • 22. Hadoop • Great for unstructured data or arbitrary queries • MapReduce framework for distributed compute • Tools now making it accessible • Still essentially a batch processing system
  • 24. Use cases • Tracking trending topics on social media • Network and infrastructure monitoring • Web and ad analytics dashboard and platforms • Real-time A-B testing • User profiling
  • 25. NOSQL Voldemort
  • 26. No “one size fits all” • Column DBs and Key-Value stores • Document databases • Graph databases [Diagram: CAP triangle with corners C, A and P]
  • 27. Questions to ask • Who uses it? • Who can support it? Where are they? • How does it scale? Perform? • Maturity, both DB and tool ecosystem
  • 28. Changing economics [Diagram labels: XDR metadata; Oracle, NetApp; 30 x $3k Dell servers; 30 days of SMS; 1/5th TCO of alternatives; at capacity ceiling; cost grows predictably]
  • 29. Agenda • What’s Big Data? • Why now? • Why care? • What’s the new technology good for? • How can I get started?
  • 30. Start small • Identify data sources • Look at capabilities • Run experiments, PoCs
  • 31. Data sources • Web, SCM, Retail • Location Services • Infra Monitoring • Smart Metering • Oil/Gas Sensors • Ad Marketplaces • Fraud Detection • Social Media
  • 32. Capabilities • Open source, supported, or “packaged” solution? • How do “commodity” servers fit your infrastructure? • Don’t rule out Cloud deployments to get quick answers
  • 33. Acunu
    Discover the Potential of Real Time Big Data with Acunu Activate
    Every CIO, Architect and Analyst knows of existing data with huge untapped potential within their organisation. Evolving Big Data technologies provide new paths to revenue with both customers and prospects. Acunu partners with you to deliver competitive advantage by capturing data and exploring its benefits. You’ll validate the value of Big Data by building real applications and dashboards to drive new value for your business.
    “Key business and technology trends are disrupting the traditional data management and processing landscape. Data analysis is increasingly being viewed as a competitive advantage. An increasingly sensor-enabled and instrumented business environment is generating huge volumes of data… Traditional IT infrastructure is simply not able to meet the demands of this new situation.” -Gartner
    At the outset, we work with you to identify and develop use cases and areas where Big Data tools could be utilised to add significant business value. We work with you to recommend solutions architectures for your specific use cases. We then deploy Acunu Reflex in your own data center or in the cloud and can include Apache Hadoop for investigative work and Acunu Analytics for real-time decision support. Once the software is installed, we work with you to integrate, capture and store sources of data from inside your organisation. We provide hands-on assistance to help you showcase the business value of your data through live proof-of-concept applications. You’ll get results quickly, with successive iterations delivering ongoing value. As a result, you gain an understanding of Big Data’s transformative capability through working demonstrations and have a clear route to deliver that competitive advantage to your business.
    A Comprehensive Big Data Discovery Package
    • Workshops: Acunu Specialist delivers workshops and provides on-demand consulting to enable your development team to build Big Data applications.
    • Structure & Planning: A dedicated Project Lead will keep the project on track through kickoff, reviews and regular calls. Progressively, we’ll help you plan next steps.
    • Ecosystem of Expertise: Acunu’s Big Data expertise is complemented through our partners. Together we will build your own Big Data ecosystem.
    • Deployment: We deploy Acunu’s database and storage software, complete with management tools, to your own hardware or to Amazon’s public cloud.
    • Data Source Integration: We work with you to integrate sources of log, clickstream, sensor, monitoring or similar data into Acunu Reflex.
    • Support & Training: We deliver hands-on training on the Acunu Reflex infrastructure to your operations staff, and provide support throughout the project.
    Acunu Reflex Makes Big Data results easy, economic and fast
    • Zero to Big Data Hero: Build a Big Data database cluster on commodity hardware in hours, not days.
    • Save Money versus Open Source Alternatives: Save up to 60% on hardware and operation costs.
    • Database lag getting you down? Milliseconds turning into minutes?
    What is Acunu Reflex?
    Easy: Acunu provides an integrated suite of technologies to support rapid development and deployment of your Big Data applications. Getting started is easy with a single, fast installation, handling all the details usually associated with OS tuning, storage optimization, database integration and management. This alleviates the complexity of NoSQL development, deployment and support. The platform is flexible and scalable, providing simple, one-click deployment. Scale linearly with ease and deploy across numerous machines within a data center or across a globally distributed public or private cloud.
    Economic: Acunu’s subscription-based pricing model ensures continuous value, skipping charges for non-production deployment, so you can defer technology expenses until your application goes into production. Acunu provides the NoSQL domain expertise you need, reducing your technology deployment costs without compromising your data security. The platform is architected to store significantly more data per node than competing technologies, with a focus on reducing both your initial hardware and operational costs over time. Acunu’s support for commodity hardware and large capacity disks further reduces your costs.
    Fast: Acunu provides a suite of products focused on bringing you the performance your Big Data applications demand. Whether it’s a globally distributed database, millions to billions of records, tremendous amounts of machine-generated data or managing millions of active users, Acunu provides you with real-time results. Acunu has the professional services and support to get your applications up and running in the shortest possible time. Acunu leverages best-in-class open source solutions, adding additional management and performance technology to accelerate your Big Data results.
  • 34. www.acunu.com @acunu Apache, Apache Cassandra, Cassandra, Hadoop, and the eye and elephant logos are trademarks of the Apache Software Foundation.
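
Slide 22 describes Hadoop as a MapReduce framework for distributed, batch-oriented compute, and slide 24 lists tracking trending topics as a use case. The map, shuffle and reduce steps can be sketched in a few lines of plain Python. This is only an illustration of the pattern running in a single process: it does not use Hadoop's API, and the function names (`map_phase`, `shuffle`, `reduce_phase`, `run_job`) and the sample posts are assumptions made up for this sketch.

```python
from collections import defaultdict
from itertools import chain


def map_phase(record):
    """Emit a (hashtag, 1) pair for every hashtag found in one post."""
    return [(word.lower(), 1) for word in record.split() if word.startswith("#")]


def shuffle(pairs):
    """Group emitted values by key, as a framework would between the two phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def run_job(records):
    """Run map, shuffle and reduce over an in-memory list of records."""
    mapped = chain.from_iterable(map_phase(r) for r in records)
    grouped = shuffle(mapped)
    return dict(reduce_phase(k, v) for k, v in grouped.items())


# Hypothetical sample input, invented for illustration only.
posts = [
    "Loving the #BigData talk",
    "#bigdata and #NoSQL everywhere",
    "Is #nosql ready for production?",
]
counts = run_job(posts)
print(counts)  # {'#bigdata': 2, '#nosql': 2}
```

In a real cluster the map and reduce functions run on many machines and the shuffle moves data between them, which is where the batch latency mentioned on the slide comes from.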

Editor's notes

  1.
  2.
  3.
  4. Analyst firms appear to be fighting with each other to come up with the most Vs to define Big Data. “Big” seems to imply size of the dataset, but there’s more to it than this.
  5. There’s nothing new about huge datasets; plenty of people have been working with big data sets for years. Seismic surveys: datasets in the PB range, very high-end supercomputing hardware. Weather: ECMWF has supercomputers with more than a PB of disk storage. HEP: 15 PB/year for the CERN LHC project. The rest of us can’t afford supercomputers.
  6. But we all have a challenge with datasets that grow faster than IT budgets (these numbers are from McKinsey and are probably optimistic with respect to IT budgets).
  7. So maybe Big Data is really when we have one of these two things...and we’ve not already solved the problem. Perhaps there’s a silent [New] in “Big Data” :)
  8. Here are some new datasets that typify both the challenge and the opportunity of big data. So we’ve explored what Big Data might be. Let’s move on to look at why the Big Data hype is happening now.
  9.
  10. Disks got cheaper! See http://everyjoe.com/technology/hard-drive-cost-per-gigabyte-from-1980-to-2009/
  11. Exponential drop in price (1 GB cost around $200K in 1980). Today, I can buy SATA disks at 4p/GB.
  12. Basic economics. Reduce the price and the demand goes up.
  13. With huge reductions in cost and waves of commoditisation, the scene is set for repeated disruptive innovation, not just in storage technology itself but in the products and services that rely on it.
  14.
  15. What’s big data about? We’re looking to get insight from data. Data trumps intuition and common sense every time (funny anecdotal examples). More data means better decisions, based on fact not folklore.
  16.
  17. Odd-coloured cars are more reliable. Vegetarians are less likely to miss flights. Computing hardware doesn’t fail at high temperatures as thought, but changing temperatures kill it. A person who’s just viewed a particular web page is more likely to buy product X.
  18.
  19. RDBMS. Ted Codd, 1970, IBM. System R, DB2, Oracle... By the late 1980s, it was the standard. Usurpers (e.g. object-oriented databases) failed to gain significant market share; hierarchical (IMS, developed for Apollo) and network (CODASYL) databases pretty much disappeared. However, while RDBMSs have become the default choice, they aren’t necessarily the best for some Big Data use cases. Some problems:
  20. Problem 1: Performance. Dealing with time-series data is a common Big Data use case. We’re not looking to do complex transactions, but we need to store the data so we can access it for analytics and so on. Relational databases do not handle this well. I’ve been investigating the performance of our “big vendor” RDBMS to hold months of sensor data. So far, the results are not really encouraging. I observe an exponential performance drop on the single-index (PK) table holding the data as more records are added. Here’s a plot of the performance for 5K records as records are continuously added to the table. Record addition is done with 5 parallel client threads, each inserting 1K records in batch mode. The client is an optimized Java app, using raw JDBC for the batch inserts. I haven’t found a faster way than that to add records to the relational DB.
  21. Problem 2: Increasingly we’d like to scale out rather than scale up. Why? (i) incremental capacity (and cost) management; (ii) availability; (iii) distribution; (iv) potential cloud deployment (onto relatively small machines). Relational DBs tend to push towards a single big machine, or a tightly coupled cluster with expensive hardware like InfiniBand or SAN storage.
  22. From Google via Yahoo. Not really a database, but provides a distributed filesystem intended to store large files, with no schema. Not trivial to set up, but the tools are getting better: you no longer need to write Java code to do queries. See Hive (HQL) for SQL-like access. But it’s still batch, and plenty of the time you want real-time.
  23. We’re looking to act on the insights that the data bring. If we don’t act, we’re just observing. But action is often time critical; the world is changing (e.g. we’re monitoring an oil well, trading financial instruments, trying to understand the behaviour of lots of people), and insights from yesterday’s data are historical documents: interesting, perhaps, but not great as a guide to action.
  24. Some concrete examples of things that people we’re working with are interested in capturing.
  25. Lots of databases.
  26. Lots of different kinds of databases with different goals in mind. One way to view them is to see where they sit in CAP terms (Brewer’s CAP theorem). Many different data models.
  27.
  28. Picking the right solution (or combination) can deliver significant cost savings, increased capability and allow granular growth over time.
  29.
  30. Paradoxically for “big data”, consider starting with something small. Let's look at the other items...
  31.
  32.
  33. How can we help? Acunu Activate: a focused package of work to help you discover big data opportunities and understand how to exploit them. Acunu Reflex: a fully supported distributed database to support real-time big data use cases. Acunu Analytics: currently in preview, launching soon, provides real-time results for queries that would normally be costly to compute.
  34.
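
Note 20 describes measuring insert throughput (records/ms) on a single primary-key table as it grows, using batches of 1K records. A minimal sketch of that kind of measurement is shown below. It makes several assumptions that are not in the original test: it uses Python's built-in `sqlite3` with an in-memory database as a stand-in for the unnamed "big vendor" RDBMS, it runs one thread rather than five parallel clients, and the `readings` table and its columns are hypothetical. Its numbers will not reproduce the curve on slide 20; it only shows how such a curve can be collected.

```python
import sqlite3
import time

BATCH_SIZE = 1_000  # rows per batch insert, as in the note
BATCHES = 20        # kept small so the sketch finishes quickly


def measure_insert_throughput(batches=BATCHES, batch_size=BATCH_SIZE):
    """Return a list of (rows_in_table, records_per_ms), one entry per batch."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical schema: a single table whose only index is the primary key.
    conn.execute(
        "CREATE TABLE readings ("
        "id INTEGER PRIMARY KEY, sensor TEXT, ts REAL, value REAL)"
    )
    results = []
    next_id = 0
    for _ in range(batches):
        rows = [
            (next_id + i, f"sensor-{i % 10}", time.time(), float(i))
            for i in range(batch_size)
        ]
        next_id += batch_size
        start = time.perf_counter()
        with conn:  # commit one transaction per batch
            conn.executemany("INSERT INTO readings VALUES (?, ?, ?, ?)", rows)
        elapsed_ms = (time.perf_counter() - start) * 1000
        total = conn.execute("SELECT COUNT(*) FROM readings").fetchone()[0]
        rate = batch_size / elapsed_ms if elapsed_ms > 0 else float("inf")
        results.append((total, rate))
    conn.close()
    return results


for total_rows, rate in measure_insert_throughput():
    print(f"{total_rows:>7} rows in table: {rate:8.1f} records/ms")
```

Plotting the second column against the first gives the same axes as the chart on slide 20; pointing the same loop at a different database driver would be the natural next step for a proof of concept.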