Dealing with data
Enterprise systems in the cloud


Aaron Kimball
Founding Engineer, Cloudera Inc.
2-5-09
Cloud computing: scalable applications
Cloud computing: broader than any one app



Cloud computing is a method to address scalability and availability
concerns for enterprise applications.
The take-away


Cloud computing represents a new approach to scalability
problems.


Reusable infrastructure components are available to your
organization to build rapidly and scale gracefully.
Outline
Introduction
More data than you’ve ever seen before
Processing large data volumes
Hosting large-scale applications
An evolving ecosystem of components
Data volumes are growing
Amount of data one computer can store: 10,000 GB
Amount of data one computer can process at a time: 32 GB
Amount of data processed by Google per month:
    400,000,000 GB
    … in 2007
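As a rough back-of-the-envelope sketch using only the three figures on this slide, the gap between one machine and Google's 2007 monthly volume can be worked out directly. The function name and the assumption that each machine is used at exactly its quoted capacity are illustrative, not part of the original talk.

```python
# Figures quoted on the slide, all in gigabytes.
STORAGE_PER_MACHINE_GB = 10_000
WORKING_SET_PER_MACHINE_GB = 32
GOOGLE_MONTHLY_GB = 400_000_000


def units_needed(total_gb: int, per_unit_gb: int) -> int:
    """Ceiling division: how many units of size per_unit_gb cover total_gb."""
    return -(-total_gb // per_unit_gb)


# Machines needed just to hold one month of data on disk.
storage_machines = units_needed(GOOGLE_MONTHLY_GB, STORAGE_PER_MACHINE_GB)

# Number of 32 GB batches one machine would have to work through.
memory_batches = units_needed(GOOGLE_MONTHLY_GB, WORKING_SET_PER_MACHINE_GB)

print(storage_machines)  # 40000
print(memory_batches)    # 12500000
```

Even under these generous assumptions, a single month of data needs tens of thousands of disks' worth of machines, which is the motivation for the parallel approach in the slides that follow.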
Where does data come from?
Watching your users
(clicks on web site, pages viewed, items purchased…)
Simulations, scientific/experimental data
(genome sequences, medical imaging, wireless sensor grids…)
User-provided content
(Billions of Flickr images, YouTube videos, blog posts…)
Your infrastructure itself
(10,000 computers reporting their status every second…)
Existing databases
(product catalogs, historical sales data, surveys…)
Large-scale data processing lessons
You can generate vastly more data than you can process with
conventional tools
 No relational database handles petabytes gracefully


Data processing must involve many machines working in parallel
Hadoop: an active storage platform



A community-driven, commercially-supported, extensible
system.
Based on techniques developed by Google.
Separates the problem of extracting information from large
data sets from the problem of performing reliable computation.
Combines a scalable, reliable compute framework with
self-healing, high-bandwidth storage.
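The separation described above can be illustrated with a toy, single-process sketch of the map/reduce programming model that Hadoop adopted from Google's published work. This is not Hadoop's API: `run_job`, `map_words`, and `reduce_counts` are hypothetical names, and the "framework" here is a plain loop standing in for a cluster.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, Iterator, List, Tuple


def map_words(line: str) -> Iterator[Tuple[str, int]]:
    """User code: extract information from one record."""
    for word in line.lower().split():
        yield word, 1


def reduce_counts(word: str, counts: Iterable[int]) -> Tuple[str, int]:
    """User code: combine all values that share a key."""
    return word, sum(counts)


def run_job(
    records: Iterable[str],
    mapper: Callable[[str], Iterator[Tuple[str, int]]],
    reducer: Callable[[str, Iterable[int]], Tuple[str, int]],
) -> Dict[str, int]:
    """Toy stand-in for the framework: map, group by key, then reduce."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)
    return dict(reducer(key, values) for key, values in sorted(groups.items()))


lines = ["the cloud stores data", "the cluster processes data"]
word_counts = run_job(lines, map_words, reduce_counts)
print(word_counts)
```

The two user functions contain only the "extracting information" logic; distribution, grouping, and recovery would live in the framework, which is the division of labor the slide describes.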
Putting it together: active storage




     Data automatically distributed to nodes at load time
       Load balancing implicitly managed by Hadoop
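One way to picture load-time distribution is a simple round-robin placement of data blocks with replicas. The sketch below is an assumption made for illustration only: it is not the placement policy Hadoop actually uses, and `place_blocks` and the node names are hypothetical.

```python
from collections import Counter
from typing import Dict, List


def place_blocks(num_blocks: int, nodes: List[str], replication: int = 3) -> Dict[int, List[str]]:
    """Assign each block to `replication` distinct nodes, rotating the start node."""
    if replication > len(nodes):
        raise ValueError("replication factor cannot exceed the number of nodes")
    placement = {}
    for block in range(num_blocks):
        # Consecutive nodes (wrapping around) hold the replicas of this block.
        placement[block] = [nodes[(block + r) % len(nodes)] for r in range(replication)]
    return placement


nodes = ["node-a", "node-b", "node-c", "node-d"]
placement = place_blocks(8, nodes, replication=3)

# Count how many block replicas each node ends up holding.
load = Counter(node for holders in placement.values() for node in holders)
print(placement[0])  # ['node-a', 'node-b', 'node-c']
print(dict(load))
```

With 8 blocks, 3 replicas each, and 4 nodes, the rotation leaves every node holding the same number of replicas, which is the kind of balance the slide says is managed implicitly.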
Automatic parallel processing




       Data elements processed locally, in parallel
    Reliable computation implicitly managed by Hadoop
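A minimal sketch of the idea, assuming threads in a single process as a stand-in for cluster nodes: each partition is processed independently, and a failed task is simply retried. The names `process_partitions` and `run_with_retry` are hypothetical and do not correspond to Hadoop functions.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Sequence


def count_records(partition: Sequence[int]) -> int:
    """Example task: work that needs only the data in its own partition."""
    return len(partition)


def run_with_retry(task: Callable, partition: Sequence[int], max_attempts: int = 3):
    """Re-run a failed task a few times before giving up."""
    last_error = None
    for _ in range(max_attempts):
        try:
            return task(partition)
        except Exception as error:  # a real system would be more selective
            last_error = error
    raise RuntimeError("task failed after %d attempts" % max_attempts) from last_error


def process_partitions(partitions: List[Sequence[int]], task: Callable, workers: int = 4) -> list:
    """Process every partition in parallel; results keep the input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda p: run_with_retry(task, p), partitions))


partitions = [[1, 2, 3], [4, 5], [6], [7, 8, 9, 10]]
per_partition = process_partitions(partitions, count_records)
total = sum(per_partition)
print(per_partition, total)  # [3, 2, 1, 4] 10
```

The caller never sees the retries, which mirrors the slide's point that reliability is handled by the platform rather than by the analysis code.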
Distributed data, single volume




Output data is written to local disks and forms a single
user-accessible volume
A high-level abstraction for engineers and analysts
A self-healing system




     Loss of nodes causes automatic data rebalance
        Automatic recovery managed by Hadoop
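The rebalancing step can be sketched as follows, under the assumption that each block should always live on a fixed number of distinct nodes. This is an illustrative model, not Hadoop's recovery code; `rebalance_after_loss` and the node names are hypothetical.

```python
from typing import Dict, Iterable, List


def rebalance_after_loss(
    placement: Dict[int, List[str]], live_nodes: Iterable[str], replication: int = 2
) -> Dict[int, List[str]]:
    """Return a new placement with lost replicas re-created on the least-loaded live nodes."""
    live = list(live_nodes)
    if replication > len(live):
        raise ValueError("not enough live nodes for the requested replication")

    # Keep only the replicas that survived, and tally the load on each live node.
    load = {node: 0 for node in live}
    healed = {}
    for block, holders in placement.items():
        survivors = [node for node in holders if node in load]
        if not survivors:
            raise RuntimeError("block %r has no surviving replica" % block)
        healed[block] = survivors
        for node in survivors:
            load[node] += 1

    # Copy under-replicated blocks to the least-loaded node that lacks them.
    for holders in healed.values():
        while len(holders) < replication:
            candidates = [node for node in live if node not in holders]
            target = min(candidates, key=lambda node: load[node])
            holders.append(target)
            load[target] += 1
    return healed


placement = {0: ["a", "b"], 1: ["b", "c"], 2: ["c", "d"], 3: ["d", "a"]}
healed = rebalance_after_loss(placement, live_nodes=["a", "b", "c"], replication=2)
print(healed)
```

After node "d" disappears, blocks 2 and 3 each have one surviving copy; the sketch restores the second copy on a remaining node without any action from the user.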
Existing large-scale systems…
Are augmented by Hadoop
Hosting infrastructure




Managed cloud platforms provide hardware resources for rent.
Think cycles and bytes, not months and machines.
They provide on-demand, low-level infrastructure for hosting
applications.
An evolving ecosystem
Conclusions
Cloud computing makes resources available in an on-demand
fashion.
 From raw hardware up to fully-configured applications
The range of resources available is increasing, with new tools
being aimed at different levels of the hardware/software stack.
These tools allow you to rapidly integrate disparate components
of your infrastructure and handle vastly more data than before.
(c) 2008 Cloudera, Inc. or its licensors. "Cloudera" is a registered trademark of Cloudera, Inc. All rights reserved. 1.0




Iceberg by wikipedia user Calyponte

WTIA Cloud Computing Series - Part I: The Fundamentals
