SlideShare una empresa de Scribd logo
1 de 8
Descargar para leer sin conexión
MapReduce: Simplified Data
Processing on Large Clusters
          Rob Keisler
           CSCI 638
         Summer 2011
Outline

● Background

● Model

● Examples

● Execution

● Conclusions
Background

● Transformation operations are conceptually straightforward
   ○ Until data is large and the computation must be
     distributed over hundred or thousands of machines

● So, Google created MapReduce

● MapReduce is a programming abstraction
   ○ Expresses simple computations
   ○ Hides complexity details
Model

● Utilizes higher-order shaping functions Map and Reduce to
  take a set of input key/value pairs and produce a set of
  output key/value pairs

● Map
   ○ Takes an input key/value pair and produces a set of
     intermediate key/value pairs

● Reduce
   ○ Accepts an intermediate key I and a set of values for
     that key, and merges those values to form possibly
     smaller sets of values
Examples

● Distributed Grep

● Count of URL Access Frequency

● Reverse Web-Link Graph

● Term-Vector per Host

● Inverted Index

● Distributed Sort
Execution Overview
Conclusions

● The MapReduce programming model proved to be a useful
  abstraction for many different purposes
   ○ Easy to use
       ■ even for programmers without experience with
         parallel and distributed systems
   ○ A large variety of problems are easily expressible as
     MapReduce computations
   ○ The implementation scales to large clusters of machines

● Greatly simplifies large-scale computations at Google
Questions?

http://labs.google.com/papers/mapreduce.html

Más contenido relacionado

La actualidad más candente

FME in Tesera’s HRIS: Slicing through the forest of data to see the trees
FME in Tesera’s HRIS: Slicing through the forest of data to see the treesFME in Tesera’s HRIS: Slicing through the forest of data to see the trees
FME in Tesera’s HRIS: Slicing through the forest of data to see the treesSafe Software
 
Using FME to Automate Data Integration in a City
Using FME to Automate Data Integration in a CityUsing FME to Automate Data Integration in a City
Using FME to Automate Data Integration in a CitySafe Software
 
Extending 3D Model Visualization with FME 2017
Extending 3D Model Visualization with FME 2017Extending 3D Model Visualization with FME 2017
Extending 3D Model Visualization with FME 2017Safe Software
 
FME Cloud as Engine for New Mobility Ideas
FME Cloud as Engine for New Mobility IdeasFME Cloud as Engine for New Mobility Ideas
FME Cloud as Engine for New Mobility IdeasSafe Software
 
Supporting Situational Awareness at LAX using FME Server
Supporting Situational Awareness at LAX using FME ServerSupporting Situational Awareness at LAX using FME Server
Supporting Situational Awareness at LAX using FME ServerSafe Software
 
Prepare LiDAR Data To Meet Your Requirements
Prepare LiDAR Data To Meet Your RequirementsPrepare LiDAR Data To Meet Your Requirements
Prepare LiDAR Data To Meet Your RequirementsSafe Software
 
Using FME to Deliver Map-Based Geological Data for Oil & Gas Companies
Using FME to Deliver Map-Based Geological Data for Oil & Gas CompaniesUsing FME to Deliver Map-Based Geological Data for Oil & Gas Companies
Using FME to Deliver Map-Based Geological Data for Oil & Gas CompaniesSafe Software
 
Using GIS to reassess urban plans based on changing industrial emissions
Using GIS to reassess urban plans based on changing industrial emissionsUsing GIS to reassess urban plans based on changing industrial emissions
Using GIS to reassess urban plans based on changing industrial emissionsniket_narang
 
Map Reduce
Map ReduceMap Reduce
Map Reducemsgroner
 
KDOT Aviation Portal Update: Cesium, FME
KDOT Aviation Portal Update: Cesium, FMEKDOT Aviation Portal Update: Cesium, FME
KDOT Aviation Portal Update: Cesium, FMESafe Software
 
Gain Total Control of Your LiDAR and Point Cloud Data
Gain Total Control of Your LiDAR and Point Cloud DataGain Total Control of Your LiDAR and Point Cloud Data
Gain Total Control of Your LiDAR and Point Cloud DataSafe Software
 
Creating Geometric Networks at the City of Barrie
Creating Geometric Networks at the City of BarrieCreating Geometric Networks at the City of Barrie
Creating Geometric Networks at the City of BarrieSafe Software
 
Some of my favourite QGIS plugins
Some of my favourite QGIS pluginsSome of my favourite QGIS plugins
Some of my favourite QGIS pluginsMark Owen
 
Essential NumPy By ZekeLabs
Essential NumPy By ZekeLabsEssential NumPy By ZekeLabs
Essential NumPy By ZekeLabsAwantik Das
 
Dr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GISDr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GISShaun Lewis
 
ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...
ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...
ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...I3E Technologies
 
Tilemill gwu-wboykinm
Tilemill gwu-wboykinmTilemill gwu-wboykinm
Tilemill gwu-wboykinmBill Morris
 
From Outdoor to Indoor: 3D and Venue Mapping
From Outdoor to Indoor: 3D and Venue MappingFrom Outdoor to Indoor: 3D and Venue Mapping
From Outdoor to Indoor: 3D and Venue MappingSafe Software
 

La actualidad más candente (20)

FME in Tesera’s HRIS: Slicing through the forest of data to see the trees
FME in Tesera’s HRIS: Slicing through the forest of data to see the treesFME in Tesera’s HRIS: Slicing through the forest of data to see the trees
FME in Tesera’s HRIS: Slicing through the forest of data to see the trees
 
Using FME to Automate Data Integration in a City
Using FME to Automate Data Integration in a CityUsing FME to Automate Data Integration in a City
Using FME to Automate Data Integration in a City
 
Map Reduce Presentation
Map Reduce PresentationMap Reduce Presentation
Map Reduce Presentation
 
Extending 3D Model Visualization with FME 2017
Extending 3D Model Visualization with FME 2017Extending 3D Model Visualization with FME 2017
Extending 3D Model Visualization with FME 2017
 
FME Cloud as Engine for New Mobility Ideas
FME Cloud as Engine for New Mobility IdeasFME Cloud as Engine for New Mobility Ideas
FME Cloud as Engine for New Mobility Ideas
 
Supporting Situational Awareness at LAX using FME Server
Supporting Situational Awareness at LAX using FME ServerSupporting Situational Awareness at LAX using FME Server
Supporting Situational Awareness at LAX using FME Server
 
Prepare LiDAR Data To Meet Your Requirements
Prepare LiDAR Data To Meet Your RequirementsPrepare LiDAR Data To Meet Your Requirements
Prepare LiDAR Data To Meet Your Requirements
 
Using FME to Deliver Map-Based Geological Data for Oil & Gas Companies
Using FME to Deliver Map-Based Geological Data for Oil & Gas CompaniesUsing FME to Deliver Map-Based Geological Data for Oil & Gas Companies
Using FME to Deliver Map-Based Geological Data for Oil & Gas Companies
 
Om
OmOm
Om
 
Using GIS to reassess urban plans based on changing industrial emissions
Using GIS to reassess urban plans based on changing industrial emissionsUsing GIS to reassess urban plans based on changing industrial emissions
Using GIS to reassess urban plans based on changing industrial emissions
 
Map Reduce
Map ReduceMap Reduce
Map Reduce
 
KDOT Aviation Portal Update: Cesium, FME
KDOT Aviation Portal Update: Cesium, FMEKDOT Aviation Portal Update: Cesium, FME
KDOT Aviation Portal Update: Cesium, FME
 
Gain Total Control of Your LiDAR and Point Cloud Data
Gain Total Control of Your LiDAR and Point Cloud DataGain Total Control of Your LiDAR and Point Cloud Data
Gain Total Control of Your LiDAR and Point Cloud Data
 
Creating Geometric Networks at the City of Barrie
Creating Geometric Networks at the City of BarrieCreating Geometric Networks at the City of Barrie
Creating Geometric Networks at the City of Barrie
 
Some of my favourite QGIS plugins
Some of my favourite QGIS pluginsSome of my favourite QGIS plugins
Some of my favourite QGIS plugins
 
Essential NumPy By ZekeLabs
Essential NumPy By ZekeLabsEssential NumPy By ZekeLabs
Essential NumPy By ZekeLabs
 
Dr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GISDr Richard Fry - Using R as a GIS
Dr Richard Fry - Using R as a GIS
 
ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...
ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...
ON TRAFFIC-AWARE PARTITION AND AGGREGATION IN MAPREDUCE FOR BIG DATA APPLICAT...
 
Tilemill gwu-wboykinm
Tilemill gwu-wboykinmTilemill gwu-wboykinm
Tilemill gwu-wboykinm
 
From Outdoor to Indoor: 3D and Venue Mapping
From Outdoor to Indoor: 3D and Venue MappingFrom Outdoor to Indoor: 3D and Venue Mapping
From Outdoor to Indoor: 3D and Venue Mapping
 

Similar a MapReduce

Big data processing systems research
Big data processing systems researchBig data processing systems research
Big data processing systems researchVasia Kalavri
 
Superworkflow of Graph Neural Networks with K8S and Fugue
Superworkflow of Graph Neural Networks with K8S and FugueSuperworkflow of Graph Neural Networks with K8S and Fugue
Superworkflow of Graph Neural Networks with K8S and FugueDatabricks
 
Hadoop & Spark Performance tuning using Dr. Elephant
Hadoop & Spark Performance tuning using Dr. ElephantHadoop & Spark Performance tuning using Dr. Elephant
Hadoop & Spark Performance tuning using Dr. ElephantAkshay Rai
 
MapReduce Programming Model
MapReduce Programming ModelMapReduce Programming Model
MapReduce Programming ModelAdarshaDhakal
 
Introduction to Machine Learning with Spark
Introduction to Machine Learning with SparkIntroduction to Machine Learning with Spark
Introduction to Machine Learning with Sparkdatamantra
 
My mapreduce1 presentation
My mapreduce1 presentationMy mapreduce1 presentation
My mapreduce1 presentationNoha Elprince
 
Hadoop Map Reduce OS
Hadoop Map Reduce OSHadoop Map Reduce OS
Hadoop Map Reduce OSVedant Mane
 
An Introduction to MapReduce
An Introduction to MapReduce An Introduction to MapReduce
An Introduction to MapReduce Sina Ebrahimi
 
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...areej qasrawi
 
Mapreduce2008 cacm
Mapreduce2008 cacmMapreduce2008 cacm
Mapreduce2008 cacmlmphuong06
 
Map reduce presentation
Map reduce presentationMap reduce presentation
Map reduce presentationAhmad El Tawil
 
Big Data processing with Apache Spark
Big Data processing with Apache SparkBig Data processing with Apache Spark
Big Data processing with Apache SparkLucian Neghina
 
Managed Cluster Services
Managed Cluster ServicesManaged Cluster Services
Managed Cluster ServicesAdam Doyle
 
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data AnalyticsFugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data AnalyticsDatabricks
 
Netflix machine learning
Netflix machine learningNetflix machine learning
Netflix machine learningAmer Ather
 
Software Design Practices for Large-Scale Automation
Software Design Practices for Large-Scale AutomationSoftware Design Practices for Large-Scale Automation
Software Design Practices for Large-Scale AutomationHao Xu
 
module3part-1-bigdata-230301002404-3db4f2a4 (1).pdf
module3part-1-bigdata-230301002404-3db4f2a4 (1).pdfmodule3part-1-bigdata-230301002404-3db4f2a4 (1).pdf
module3part-1-bigdata-230301002404-3db4f2a4 (1).pdfTSANKARARAO
 

Similar a MapReduce (20)

Big data processing systems research
Big data processing systems researchBig data processing systems research
Big data processing systems research
 
Superworkflow of Graph Neural Networks with K8S and Fugue
Superworkflow of Graph Neural Networks with K8S and FugueSuperworkflow of Graph Neural Networks with K8S and Fugue
Superworkflow of Graph Neural Networks with K8S and Fugue
 
Hadoop & Spark Performance tuning using Dr. Elephant
Hadoop & Spark Performance tuning using Dr. ElephantHadoop & Spark Performance tuning using Dr. Elephant
Hadoop & Spark Performance tuning using Dr. Elephant
 
MapReduce Programming Model
MapReduce Programming ModelMapReduce Programming Model
MapReduce Programming Model
 
Introduction to Machine Learning with Spark
Introduction to Machine Learning with SparkIntroduction to Machine Learning with Spark
Introduction to Machine Learning with Spark
 
Main map reduce
Main map reduceMain map reduce
Main map reduce
 
My mapreduce1 presentation
My mapreduce1 presentationMy mapreduce1 presentation
My mapreduce1 presentation
 
Hadoop Map Reduce OS
Hadoop Map Reduce OSHadoop Map Reduce OS
Hadoop Map Reduce OS
 
An Introduction to MapReduce
An Introduction to MapReduce An Introduction to MapReduce
An Introduction to MapReduce
 
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...MapReduce:Simplified Data Processing on Large Cluster  Presented by Areej Qas...
MapReduce:Simplified Data Processing on Large Cluster Presented by Areej Qas...
 
Mapreduce2008 cacm
Mapreduce2008 cacmMapreduce2008 cacm
Mapreduce2008 cacm
 
Map reduce presentation
Map reduce presentationMap reduce presentation
Map reduce presentation
 
Big Data processing with Apache Spark
Big Data processing with Apache SparkBig Data processing with Apache Spark
Big Data processing with Apache Spark
 
try
trytry
try
 
Managed Cluster Services
Managed Cluster ServicesManaged Cluster Services
Managed Cluster Services
 
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data AnalyticsFugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
Fugue: Unifying Spark and Non-Spark Ecosystems for Big Data Analytics
 
Netflix machine learning
Netflix machine learningNetflix machine learning
Netflix machine learning
 
Software Design Practices for Large-Scale Automation
Software Design Practices for Large-Scale AutomationSoftware Design Practices for Large-Scale Automation
Software Design Practices for Large-Scale Automation
 
module3part-1-bigdata-230301002404-3db4f2a4 (1).pdf
module3part-1-bigdata-230301002404-3db4f2a4 (1).pdfmodule3part-1-bigdata-230301002404-3db4f2a4 (1).pdf
module3part-1-bigdata-230301002404-3db4f2a4 (1).pdf
 
Big Data.pptx
Big Data.pptxBig Data.pptx
Big Data.pptx
 

Último

Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Shubhangi Sonawane
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxNikitaBankoti2
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.MaryamAhmad92
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdfQucHHunhnh
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIShubhangi Sonawane
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 

Último (20)

Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
Ecological Succession. ( ECOSYSTEM, B. Pharmacy, 1st Year, Sem-II, Environmen...
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Role Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptxRole Of Transgenic Animal In Target Validation-1.pptx
Role Of Transgenic Animal In Target Validation-1.pptx
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 

MapReduce

  • 1. MapReduce: Simplified Data Processing on Large Clusters Rob Keisler CSCI 638 Summer 2011
  • 2. Outline ● Background ● Model ● Examples ● Execution ● Conclusions
  • 3. Background ● Transformation operations are conceptually straightforward ○ Until data is large and the computation must be distributed over hundred or thousands of machines ● So, Google created MapReduce ● MapReduce is a programming abstraction ○ Expresses simple computations ○ Hides complexity details
  • 4. Model ● Utilizes higher-order shaping functions Map and Reduce to take a set of input key/value pairs and produce a set of output key/value pairs ● Map ○ Takes an input key/value pair and produces a set of intermediate key/value pairs ● Reduce ○ Accepts an intermediate key I and a set of values for that key, and merges those values to form possibly smaller sets of values
  • 5. Examples ● Distributed Grep ● Count of URL Access Frequency ● Reverse Web-Link Graph ● Term-Vector per Host ● Inverted Index ● Distributed Sort
  • 7. Conclusions ● The MapReduce programming model proved to be a useful abstraction for many different purposes ○ Easy to use ■ even for programmers without experience with parallel and distributed systems ○ A large variety of problems are easily expressible as MapReduce computations ○ The implementation scales to large clusters of machines ● Greatly simplifies large-scale computations at Google