SlideShare una empresa de Scribd logo
1 de 30
Warehouse-Scale
Computers
CS4342 Advanced Computer Architecture
Dilum Bandara
Dilum.Bandara@uom.lk
Slides adapted from “Computer Architecture, A Quantitative Approach” by John L.
Hennessy and David A. Patterson, 5th Edition, 2012, MK Publishers and
The Datacenter as a Computer:An Introduction to the Design of Warehouse-Scale
Machines by Luiz André Barroso & Urs Hölzle
Outline
 Programming model & workloads
 Architectures
 Cloud computing
2
Warehouse-Scale Computers (WSC)
3
www.laserfocusworld.com/articles/print/volume-48/issue-
12/features/optical-technologies-scale-the-datacenter.html http://www.slashgear.com/google-data-center-hd-photos-
hit-where-the-internet-lives-gallery-17252451/
WSC (Cont.)
4
WSC Layout
5
Source: http://bnrg.cs.berkeley.edu/~randy/Courses/CS294.F07/
Main Components of a WSC
6
Warehouse-Scale Computer (WSC)
 Provides Internet services
 Search, social networking, online maps, video sharing,
online shopping, email, cloud computing, etc.
 Differences with HPC clusters
 Clusters use higher performance processors & network
 Clusters emphasize thread-level parallelism, WSCs
emphasize request/task-level parallelism
 Differences with datacenters
 Datacenters consolidate different machines & software
into a single location
 Datacenters emphasize virtual machines & hardware
heterogeneity to serve varied customers 7
Design Factors for WSC
 Cost-performance
 Small savings add up
 Energy efficiency
 Affects power distribution & cooling
 Work per joule
 Operational costs count
 Power consumption is a primary constraint when
designing a system
 Dependability via redundancy
 Many low-cost components
8
Design Factors (Cont.)
 Network I/O
 Interactive & batch processing workloads
 Web search – interactive
 Web indexing – batch
 Ample computational parallelism isn’t important
 Most jobs are totally independent, “Request-level
parallelism”
 Scale – Its opportunities & problems
 Can afford to build customized systems as WSC
require volume purchase
 Frequent failures
9
Failure Example
 Consider a WSC with 50,000 nodes. MTTF of a node is 5
years. How many failures be there for a day?
MTTF in days = 5 x 365 = 1,825
Failure rate = 1/1,825 per day
No of failures per day = 50,000/1,825 = 27.4
 Consider a WSC with 50,000 nodes & each node with 4
hard disks. Suppose a annual failure rate of a disk is 4%.
What is the time for a disk failure?
No of disks = 50,000 x 4 = 200,000
No of failures per year = 200,000 x 0.04 = 8,000
Time for failure = 365 x 24 / 8,000 = 1.095 hours/failure 10
Programming Models & Workloads
 Batch processing framework
– MapReduce
 Map
 Applies a programmer-
supplied function to each
logical input record
 Runs on thousands of
computers
 Provides new set of (key,
value) pairs as intermediate
values
 Reduce
 Collapses values using
another function 11
http://www.cbsolution.net/techniques/ontarget/mapredu
ce_vs_data_warehouse
MapReduce Execution
12
Source: Dean et. al.,
“MapReduce, OSDI, 2004
Programming Models & Workloads
(Cont.)
13
www.datanami.com/datanami/2012-07-
16/top_5_challenges_for_hadoop_mapreduce
_in_the_enterprise.html
Programming Models & Workloads
(Cont.)
 MapReduce runtime environment schedules
map & reduce task to WSC nodes
 Availability
 Use replicas of data across different servers
 Use relaxed consistency
 No need for all replicas to always agree
 Workload demands
 Often vary considerably
14
Computer Architecture of WSC
 Often uses a hierarchy of networks for
interconnection
 Each 19” rack holds 48 1U servers connected to
a rack switch
 Rack switches are uplinked to a switch(es)
higher in hierarchy
 Uplink has 48/n times lower bandwidth –
Oversubscription
 n – No of uplink ports
 Goal is to maximize locality of communication relative
to the rack
15
Hierarchy of Switches
16
Network Hierarchy
17
Source: www.laserfocusworld.com/articles/print/volume-48/issue-12/features/optical-
technologies-scale-the-datacenter.html
Storage Hierarchy
18
Infrastructure & Costs
 Location
 Proximity to Internet backbones, electricity cost, property tax rates,
low risk from earthquakes, floods, & hurricanes
 Power distribution
19
Power Usage
20
U.S. EPA Report 2007 – 1.5% of total U.S.
power consumption used by data centers
which has more than doubled since 2000 &
costs $4.5 billion
How Many Nodes can a WSC Support?
 Each node
 “Nameplate power rating” gives maximum power
consumption
 To get actual, measure power under actual workloads
 Oversubscribe cumulative nodes power by 40%,
but monitor power closely
21
Cooling
22
Typically operate around 18 – 22 0C
Cooling (Cont.)
23
Cooling system also uses water (evaporation & spills)
e.g. 70,000 to 200,000 gallons per day for an 8 MW facility
Efficiency
 Power Utilization Effectiveness (PUE)
= Total facility power / IT equipment power
 ≥ 1
 Median PUE on 2006 study was 1.69
24
Source: http://hightech.lbl.gov/benchmarking-guides/data-a1.html
Performance
 Latency is important metric because it is seen by
users
 Bing study
 Users will use search less as response time
increases
 Service Level Objectives (SLOs) & Service Level
Agreements (SLAs)
 Typically given at application level
 e.g., 99% of requests be below 100 ms
 In clouds typically given only for static resources
 CPU speed, no of cores, & memory
25
Cost
 Capital expenditures (CAPEX)
 Cost to build a WSC
 Hardware cost dominates
 Operational expenditures (OPEX)
 Cost to operate a WSC
 Power for nodes & cooling dominates
26
Cloud Computing
27
Clients
Other
Cloud Services
Govt.
Cloud Services
Private
Cloud
Cloud
Manager
Public Cloud
Green Cloud Computing by Dr. Rajkumar Buyya
Cloud Computing (Cont.)
 WSCs offer economies of scale that can’t be
achieved with a datacenter
 5.7 times reduction in storage costs
 7.1 times reduction in administrative costs
 7.3 times reduction in networking costs
 This has given rise to cloud services such as Amazon
Web Services
 “Utility Computing”
 Based on using open source virtual machine & operating
system software
28
Amazon Web Services
 Virtual machines
 XEN
 Very low cost
 $ 0.10 per hour per instance
 Primary rely on open source software
 No (initial) service guarantees
 No contract required
 Amazon S3
 Simple Storage Service
 Amazon EC2
 Elastic Computer Cloud 29
Amazon Web Services – Example
30
http://www.ryhug.com/free-art-available-on-amazon-amazon-web-services-that-is/

Más contenido relacionado

Similar a Introduction to Warehouse-Scale Computers

Desktop to Cloud Transformation Planning
Desktop to Cloud Transformation PlanningDesktop to Cloud Transformation Planning
Desktop to Cloud Transformation Planning
Phearin Sok
 
E source energy managers conf 4 24-13-final
E source energy managers conf 4 24-13-finalE source energy managers conf 4 24-13-final
E source energy managers conf 4 24-13-final
josh whitney
 
Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...
Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...
Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...
IJAEMSJORNAL
 

Similar a Introduction to Warehouse-Scale Computers (20)

The Cloud & Its Impact on IT
The Cloud & Its Impact on ITThe Cloud & Its Impact on IT
The Cloud & Its Impact on IT
 
Scheduling in CCE
Scheduling in CCEScheduling in CCE
Scheduling in CCE
 
Desktop to Cloud Transformation Planning
Desktop to Cloud Transformation PlanningDesktop to Cloud Transformation Planning
Desktop to Cloud Transformation Planning
 
Paolo Merialdo, Cloud Computing and Virtualization: una introduzione
Paolo Merialdo, Cloud Computing and Virtualization: una introduzionePaolo Merialdo, Cloud Computing and Virtualization: una introduzione
Paolo Merialdo, Cloud Computing and Virtualization: una introduzione
 
Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland Cloud Roundtable at Microsoft Switzerland
Cloud Roundtable at Microsoft Switzerland
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
Exploring Emerging Technologies in the Extreme Scale HPC Co-Design Space with...
 
E source energy managers conf 4 24-13-final
E source energy managers conf 4 24-13-finalE source energy managers conf 4 24-13-final
E source energy managers conf 4 24-13-final
 
Optimising Service Deployment and Infrastructure Resource Configuration
Optimising Service Deployment and Infrastructure Resource ConfigurationOptimising Service Deployment and Infrastructure Resource Configuration
Optimising Service Deployment and Infrastructure Resource Configuration
 
Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...
Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...
Disaster Recovery Experience at CACIB: Hardening Hadoop for Critical Financia...
 
Green cloud computing
Green cloud computingGreen cloud computing
Green cloud computing
 
Summer Intern Report
Summer Intern ReportSummer Intern Report
Summer Intern Report
 
Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...
Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...
Performance Improvement of Cloud Computing Data Centers Using Energy Efficien...
 
AI Sustainability Mascots 23-f.pptx
AI Sustainability Mascots 23-f.pptxAI Sustainability Mascots 23-f.pptx
AI Sustainability Mascots 23-f.pptx
 
What Makes Software Green?
What Makes Software Green?What Makes Software Green?
What Makes Software Green?
 
Cloudsim & Green Cloud
Cloudsim & Green CloudCloudsim & Green Cloud
Cloudsim & Green Cloud
 
Taking High Performance Computing to the Cloud: Windows HPC and
Taking High Performance Computing to the Cloud: Windows HPC and Taking High Performance Computing to the Cloud: Windows HPC and
Taking High Performance Computing to the Cloud: Windows HPC and
 
cloud computing
cloud computing cloud computing
cloud computing
 
Benefits of Hadoop as Platform as a Service
Benefits of Hadoop as Platform as a ServiceBenefits of Hadoop as Platform as a Service
Benefits of Hadoop as Platform as a Service
 

Más de Dilum Bandara

Más de Dilum Bandara (20)

Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Time Series Analysis and Forecasting in Practice
Time Series Analysis and Forecasting in PracticeTime Series Analysis and Forecasting in Practice
Time Series Analysis and Forecasting in Practice
 
Introduction to Dimension Reduction with PCA
Introduction to Dimension Reduction with PCAIntroduction to Dimension Reduction with PCA
Introduction to Dimension Reduction with PCA
 
Introduction to Descriptive & Predictive Analytics
Introduction to Descriptive & Predictive AnalyticsIntroduction to Descriptive & Predictive Analytics
Introduction to Descriptive & Predictive Analytics
 
Introduction to Concurrent Data Structures
Introduction to Concurrent Data StructuresIntroduction to Concurrent Data Structures
Introduction to Concurrent Data Structures
 
Hard to Paralelize Problems: Matrix-Vector and Matrix-Matrix
Hard to Paralelize Problems: Matrix-Vector and Matrix-MatrixHard to Paralelize Problems: Matrix-Vector and Matrix-Matrix
Hard to Paralelize Problems: Matrix-Vector and Matrix-Matrix
 
Introduction to Map-Reduce Programming with Hadoop
Introduction to Map-Reduce Programming with HadoopIntroduction to Map-Reduce Programming with Hadoop
Introduction to Map-Reduce Programming with Hadoop
 
Embarrassingly/Delightfully Parallel Problems
Embarrassingly/Delightfully Parallel ProblemsEmbarrassingly/Delightfully Parallel Problems
Embarrassingly/Delightfully Parallel Problems
 
Introduction to Thread Level Parallelism
Introduction to Thread Level ParallelismIntroduction to Thread Level Parallelism
Introduction to Thread Level Parallelism
 
CPU Memory Hierarchy and Caching Techniques
CPU Memory Hierarchy and Caching TechniquesCPU Memory Hierarchy and Caching Techniques
CPU Memory Hierarchy and Caching Techniques
 
Data-Level Parallelism in Microprocessors
Data-Level Parallelism in MicroprocessorsData-Level Parallelism in Microprocessors
Data-Level Parallelism in Microprocessors
 
Instruction Level Parallelism – Hardware Techniques
Instruction Level Parallelism – Hardware TechniquesInstruction Level Parallelism – Hardware Techniques
Instruction Level Parallelism – Hardware Techniques
 
Instruction Level Parallelism – Compiler Techniques
Instruction Level Parallelism – Compiler TechniquesInstruction Level Parallelism – Compiler Techniques
Instruction Level Parallelism – Compiler Techniques
 
CPU Pipelining and Hazards - An Introduction
CPU Pipelining and Hazards - An IntroductionCPU Pipelining and Hazards - An Introduction
CPU Pipelining and Hazards - An Introduction
 
High Performance Networking with Advanced TCP
High Performance Networking with Advanced TCPHigh Performance Networking with Advanced TCP
High Performance Networking with Advanced TCP
 
Introduction to Content Delivery Networks
Introduction to Content Delivery NetworksIntroduction to Content Delivery Networks
Introduction to Content Delivery Networks
 
Peer-to-Peer Networking Systems and Streaming
Peer-to-Peer Networking Systems and StreamingPeer-to-Peer Networking Systems and Streaming
Peer-to-Peer Networking Systems and Streaming
 
Mobile Services
Mobile ServicesMobile Services
Mobile Services
 
Wired Broadband Communication
Wired Broadband CommunicationWired Broadband Communication
Wired Broadband Communication
 
Mobile IP
Mobile IPMobile IP
Mobile IP
 

Último

Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
FIDO Alliance
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
Muhammad Subhan
 

Último (20)

Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024Long journey of Ruby Standard library at RubyKaigi 2024
Long journey of Ruby Standard library at RubyKaigi 2024
 
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfIntroduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
 
Using IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & IrelandUsing IESVE for Room Loads Analysis - UK & Ireland
Using IESVE for Room Loads Analysis - UK & Ireland
 
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...Hyatt driving innovation and exceptional customer experiences with FIDO passw...
Hyatt driving innovation and exceptional customer experiences with FIDO passw...
 
State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!State of the Smart Building Startup Landscape 2024!
State of the Smart Building Startup Landscape 2024!
 
ADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptxADP Passwordless Journey Case Study.pptx
ADP Passwordless Journey Case Study.pptx
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
 
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
TrustArc Webinar - Unified Trust Center for Privacy, Security, Compliance, an...
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 
Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
“Iamnobody89757” Understanding the Mysterious of Digital Identity.pdf
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
 
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
 
Overview of Hyperledger Foundation
Overview of Hyperledger FoundationOverview of Hyperledger Foundation
Overview of Hyperledger Foundation
 
2024 May Patch Tuesday
2024 May Patch Tuesday2024 May Patch Tuesday
2024 May Patch Tuesday
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
Design Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptxDesign Guidelines for Passkeys 2024.pptx
Design Guidelines for Passkeys 2024.pptx
 

Introduction to Warehouse-Scale Computers

  • 1. Warehouse-Scale Computers CS4342 Advanced Computer Architecture Dilum Bandara Dilum.Bandara@uom.lk Slides adapted from “Computer Architecture, A Quantitative Approach” by John L. Hennessy and David A. Patterson, 5th Edition, 2012, MK Publishers and The Datacenter as a Computer:An Introduction to the Design of Warehouse-Scale Machines by Luiz André Barroso & Urs Hölzle
  • 2. Outline  Programming model & workloads  Architectures  Cloud computing 2
  • 3. Warehouse-Scale Computers (WSC) 3 www.laserfocusworld.com/articles/print/volume-48/issue- 12/features/optical-technologies-scale-the-datacenter.html http://www.slashgear.com/google-data-center-hd-photos- hit-where-the-internet-lives-gallery-17252451/
  • 7. Warehouse-Scale Computer (WSC)  Provides Internet services  Search, social networking, online maps, video sharing, online shopping, email, cloud computing, etc.  Differences with HPC clusters  Clusters use higher performance processors & network  Clusters emphasize thread-level parallelism, WSCs emphasize request/task-level parallelism  Differences with datacenters  Datacenters consolidate different machines & software into a single location  Datacenters emphasize virtual machines & hardware heterogeneity to serve varied customers 7
  • 8. Design Factors for WSC  Cost-performance  Small savings add up  Energy efficiency  Affects power distribution & cooling  Work per joule  Operational costs count  Power consumption is a primary constraint when designing a system  Dependability via redundancy  Many low-cost components 8
  • 9. Design Factors (Cont.)  Network I/O  Interactive & batch processing workloads  Web search – interactive  Web indexing – batch  Ample computational parallelism isn’t important  Most jobs are totally independent, “Request-level parallelism”  Scale – Its opportunities & problems  Can afford to build customized systems as WSC require volume purchase  Frequent failures 9
  • 10. Failure Example  Consider a WSC with 50,000 nodes. MTTF of a node is 5 years. How many failures be there for a day? MTTF in days = 5 x 365 = 1,825 Failure rate = 1/1,825 per day No of failures per day = 50,000/1,825 = 27.4  Consider a WSC with 50,000 nodes & each node with 4 hard disks. Suppose a annual failure rate of a disk is 4%. What is the time for a disk failure? No of disks = 50,000 x 4 = 200,000 No of failures per year = 200,000 x 0.04 = 8,000 Time for failure = 365 x 24 / 8,000 = 1.095 hours/failure 10
  • 11. Programming Models & Workloads  Batch processing framework – MapReduce  Map  Applies a programmer- supplied function to each logical input record  Runs on thousands of computers  Provides new set of (key, value) pairs as intermediate values  Reduce  Collapses values using another function 11 http://www.cbsolution.net/techniques/ontarget/mapredu ce_vs_data_warehouse
  • 12. MapReduce Execution 12 Source: Dean et. al., “MapReduce, OSDI, 2004
  • 13. Programming Models & Workloads (Cont.) 13 www.datanami.com/datanami/2012-07- 16/top_5_challenges_for_hadoop_mapreduce _in_the_enterprise.html
  • 14. Programming Models & Workloads (Cont.)  MapReduce runtime environment schedules map & reduce task to WSC nodes  Availability  Use replicas of data across different servers  Use relaxed consistency  No need for all replicas to always agree  Workload demands  Often vary considerably 14
  • 15. Computer Architecture of WSC  Often uses a hierarchy of networks for interconnection  Each 19” rack holds 48 1U servers connected to a rack switch  Rack switches are uplinked to a switch(es) higher in hierarchy  Uplink has 48/n times lower bandwidth – Oversubscription  n – No of uplink ports  Goal is to maximize locality of communication relative to the rack 15
  • 19. Infrastructure & Costs  Location  Proximity to Internet backbones, electricity cost, property tax rates, low risk from earthquakes, floods, & hurricanes  Power distribution 19
  • 20. Power Usage 20 U.S. EPA Report 2007 – 1.5% of total U.S. power consumption used by data centers which has more than doubled since 2000 & costs $4.5 billion
  • 21. How Many Nodes can a WSC Support?  Each node  “Nameplate power rating” gives maximum power consumption  To get actual, measure power under actual workloads  Oversubscribe cumulative nodes power by 40%, but monitor power closely 21
  • 23. Cooling (Cont.) 23 Cooling system also uses water (evaporation & spills) e.g. 70,000 to 200,000 gallons per day for an 8 MW facility
  • 24. Efficiency  Power Utilization Effectiveness (PUE) = Total facility power / IT equipment power  ≥ 1  Median PUE on 2006 study was 1.69 24 Source: http://hightech.lbl.gov/benchmarking-guides/data-a1.html
  • 25. Performance  Latency is important metric because it is seen by users  Bing study  Users will use search less as response time increases  Service Level Objectives (SLOs) & Service Level Agreements (SLAs)  Typically given at application level  e.g., 99% of requests be below 100 ms  In clouds typically given only for static resources  CPU speed, no of cores, & memory 25
  • 26. Cost  Capital expenditures (CAPEX)  Cost to build a WSC  Hardware cost dominates  Operational expenditures (OPEX)  Cost to operate a WSC  Power for nodes & cooling dominates 26
  • 27. Cloud Computing 27 Clients Other Cloud Services Govt. Cloud Services Private Cloud Cloud Manager Public Cloud Green Cloud Computing by Dr. Rajkumar Buyya
  • 28. Cloud Computing (Cont.)  WSCs offer economies of scale that can’t be achieved with a datacenter  5.7 times reduction in storage costs  7.1 times reduction in administrative costs  7.3 times reduction in networking costs  This has given rise to cloud services such as Amazon Web Services  “Utility Computing”  Based on using open source virtual machine & operating system software 28
  • 29. Amazon Web Services  Virtual machines  XEN  Very low cost  $ 0.10 per hour per instance  Primary rely on open source software  No (initial) service guarantees  No contract required  Amazon S3  Simple Storage Service  Amazon EC2  Elastic Computer Cloud 29
  • 30. Amazon Web Services – Example 30 http://www.ryhug.com/free-art-available-on-amazon-amazon-web-services-that-is/

Notas del editor

  1. 1U - A rack unit (abbreviated U or RU) is a unit of measure defined as 44.50 mm (1.75 in)
  2. computer room air conditioning (CRAC)
  3. DCiE = 1/PUE
  4. S3 - Simple Storage Service EC2 - Elastic Compute Cloud