Big Data for Managers: From Hadoop to streaming and beyond
- 2. www.scispike.com Copyright © SciSpike 2016
Dr. Vladimir Bacvanski
§ Founder of SciSpike, a development, consulting, and training firm
§ Passionate about software and data
§ PhD in computer science, RWTH Aachen, Germany
§ Architect, consultant, mentor
§ Custom development: Scalable Web and IoT systems
§ Training and mentoring in Big Data, Scala, node.js, software architecture
@OnSoftware
https://www.linkedin.com/in/vladimirbacvanski
- 9. MapReduce Example: Word Count
§ WordCount is the "Hello World" of Big Data
– You will see various technologies implementing it
– A good first step to compare the expressiveness of Big Data tools
[Figure: word count data flow from pets.txt to pet_freq.txt]
Input (pets.txt):
dog cat bird
dog cat bird
dog dog cat
Map: dog, 1 / cat, 1 / bird, 1 / dog, 1 / cat, 1 / bird, 1 / dog, 1 / dog, 1 / cat, 1
Shuffle: dog, 1 (4 times) / cat, 1 (3 times) / bird, 1 (2 times)
Reduce: dog, 4 / cat, 3 / bird, 2
Output (pet_freq.txt):
dog, 4
cat, 3
bird, 2
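The Map, Shuffle, and Reduce stages of the word count example can be sketched in a few lines of plain Python. This is a single-process illustration only, not Hadoop code: the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are hypothetical, and the input is held in an in-memory list standing in for pets.txt rather than read from a distributed file system.

```python
from collections import defaultdict


def map_phase(lines):
    """Map: emit a (word, 1) pair for every word in every input line."""
    return [(word, 1) for line in lines for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle: group the emitted values by key, so each word gets a list of 1s."""
    groups = defaultdict(list)
    for word, count in pairs:
        groups[word].append(count)
    return dict(groups)


def reduce_phase(groups):
    """Reduce: sum the grouped values to get one total per word."""
    return {word: sum(counts) for word, counts in groups.items()}


def word_count(lines):
    """Chain the three stages, as in the slide's diagram."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


# In-memory stand-in for the three lines of pets.txt (an assumption for this sketch).
pets = ["dog cat bird", "dog cat bird", "dog dog cat"]
pet_freq = word_count(pets)
print(pet_freq)  # {'dog': 4, 'cat': 3, 'bird': 2}
```

In a real MapReduce framework the map and reduce calls run in parallel on different machines and the shuffle moves data across the network; the sketch keeps only the logical shape of the computation.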