SlideShare una empresa de Scribd logo
1 de 2
Descargar para leer sin conexión
www.VirtualNuggets.com
Training@VirtualNuggets.com
India +91-8885560202 ; +91-40-64540202
USA +1-707-666-8949
Course Overview:
The course presents the material as small building blocks with a thorough coverage of each component in
the Hadoop stack. We begin by looking at Hadoop’s architecture and its underlying parts with top-down
identification of component interactions within the Hadoop eco-system. The course then provides in-
depth coverage of Hadoop Distributed File System (HDFS), HBase, Map/Reduce, Oozie, Pig and Hive.
To re-enforce concepts, each section is followed by a set of hands-on exercises. The exercises come in
various complexities to accommodate developers with various levels of expertise.
Course Content:
Module 01 -
1.) What is Big Data & Why Hadoop?
Big Data Characteristics, Challenges with traditional system
2.) Hadoop Overview & it’s Ecosystem
Anatomy of Hadoop Cluster, Installing and Configuring Hadoop
Hands-On Exercise
3.) HDFS – Hadoop Distributed File System
Name Nodes and Data Nodes
Hands-On Exercise
4.) Map Reduce Anatomy
How Map Reduce Works?
The Mapper & Reducer, InputFormats & OutputFormats, Data Type & Customer Writables
Developing Map Reduce Programs
Setting up Eclipse Development Environment, Creating Map Reduce Projects, Debugging
and Unit Testing Map Reduce Code, Testing with MRUnit
Hands-On Exercise
Advanced Map Reduce Concepts
Combiner, Partitioner, Counter, Compression, Setup and teardown, Speculative Execution, Zero
Reducer and Distributed Cache
Hands-On Exercise
Module 02 - HADOOP ECOSYSTEM
1.) Operating System (Linux)
2.) Data Storage Framework
HDFS (Self Healing Distributed File System)
3.) Data Processing Framework
MapReduce (Powerful, parallel data processing framework)
4.) Data Access Framework
Pig
Hive : Hive Basics & Pig Basics
Sqoop : Importing and Exporting data from using RDBMS
Avro
www.VirtualNuggets.com
Training@VirtualNuggets.com
India +91-8885560202 ; +91-40-64540202
USA +1-707-666-8949
5.) Orchestration Framework
Hbase
Chukwa
Flume
ZooKeeper
Module 03
1.) Advanced Map Reduce Concepts
Combiner, Partitioner, Counter, Compression, Setup and teardown, Speculative Execution, Zero
Reducer and Distributed Cache
Hands-On Exercise
2.) Advanced Map Reduce Algorithms
Sorting, Searching and Indexing, Multiple Inputs, Chaining multiple jobs
Joins, Handling Binary & Unstructured data
Hands-On Exercise
3.) Advanced Tips & Techniques
Determining optimal number of reducers, skipping bad records
Partitioning into multiple output files & Passing parameters to tasks
Optimizing Hadoop Cluster & Performance Tuning
4.) Monitoring & Management of Hadoop
Managing HDFS with Tools like fsck and dfsadmin
Using HDFS & Job Tracker Web UI
Routine Administration Procedures
Commissioning and decommissioning of nodes
Hands-On Exercise
Hands-On Exercise
Module -4 Hadoop with Analytics
1.) Hadoop with Analytics using R
Predictive analytics with Mahout
Use of statistics over big data using R.
Introduction over R.
Using R, How to create API which will interact hadoop Ecosystem compoment.
Integration of Java,R,Hadoop,Hive etc.
2.) Graph Manipulation in Hadoop
Introduction to graph techniques
Representing Graphs in Hadoop
Implementing a sample algorithm: Single Source Shortest Path

Más contenido relacionado

Más de Virtual Nuggets

Más de Virtual Nuggets (13)

Best Online Training Institute on Oracle SOA/BPEL
Best Online Training Institute on Oracle SOA/BPELBest Online Training Institute on Oracle SOA/BPEL
Best Online Training Institute on Oracle SOA/BPEL
 
Sap hana
Sap hanaSap hana
Sap hana
 
Qc
QcQc
Qc
 
Qtp
QtpQtp
Qtp
 
Manual testing
Manual testingManual testing
Manual testing
 
Load runner
Load runnerLoad runner
Load runner
 
Etl testing
Etl testingEtl testing
Etl testing
 
Ibm web sphere
Ibm web sphereIbm web sphere
Ibm web sphere
 
Ibm web experience factory
Ibm web experience factoryIbm web experience factory
Ibm web experience factory
 
Ibm maximo
Ibm maximoIbm maximo
Ibm maximo
 
Ibm web sphere portal
Ibm web sphere portalIbm web sphere portal
Ibm web sphere portal
 
TIBCO AMX BPM Online Training Institute
TIBCO AMX BPM Online Training InstituteTIBCO AMX BPM Online Training Institute
TIBCO AMX BPM Online Training Institute
 
Tibco amx bpm Online Training Inatitute
Tibco amx bpm Online Training InatituteTibco amx bpm Online Training Inatitute
Tibco amx bpm Online Training Inatitute
 

Best Hadoop Developer Online Training @ VirtualNuggets

  • 1. www.VirtualNuggets.com Training@VirtualNuggets.com India +91-8885560202 ; +91-40-64540202 USA +1-707-666-8949 Course Overview: The course presents the material as small building blocks with a thorough coverage of each component in the Hadoop stack. We begin by looking at Hadoop’s architecture and its underlying parts with top-down identification of component interactions within the Hadoop eco-system. The course then provides in- depth coverage of Hadoop Distributed File System (HDFS), HBase, Map/Reduce, Oozie, Pig and Hive. To re-enforce concepts, each section is followed by a set of hands-on exercises. The exercises come in various complexities to accommodate developers with various levels of expertise. Course Content: Module 01 - 1.) What is Big Data & Why Hadoop? Big Data Characteristics, Challenges with traditional system 2.) Hadoop Overview & it’s Ecosystem Anatomy of Hadoop Cluster, Installing and Configuring Hadoop Hands-On Exercise 3.) HDFS – Hadoop Distributed File System Name Nodes and Data Nodes Hands-On Exercise 4.) Map Reduce Anatomy How Map Reduce Works? The Mapper & Reducer, InputFormats & OutputFormats, Data Type & Customer Writables Developing Map Reduce Programs Setting up Eclipse Development Environment, Creating Map Reduce Projects, Debugging and Unit Testing Map Reduce Code, Testing with MRUnit Hands-On Exercise Advanced Map Reduce Concepts Combiner, Partitioner, Counter, Compression, Setup and teardown, Speculative Execution, Zero Reducer and Distributed Cache Hands-On Exercise Module 02 - HADOOP ECOSYSTEM 1.) Operating System (Linux) 2.) Data Storage Framework HDFS (Self Healing Distributed File System) 3.) Data Processing Framework MapReduce (Powerful, parallel data processing framework) 4.) Data Access Framework Pig Hive : Hive Basics & Pig Basics Sqoop : Importing and Exporting data from using RDBMS Avro
  • 2. www.VirtualNuggets.com Training@VirtualNuggets.com India +91-8885560202 ; +91-40-64540202 USA +1-707-666-8949 5.) Orchestration Framework Hbase Chukwa Flume ZooKeeper Module 03 1.) Advanced Map Reduce Concepts Combiner, Partitioner, Counter, Compression, Setup and teardown, Speculative Execution, Zero Reducer and Distributed Cache Hands-On Exercise 2.) Advanced Map Reduce Algorithms Sorting, Searching and Indexing, Multiple Inputs, Chaining multiple jobs Joins, Handling Binary & Unstructured data Hands-On Exercise 3.) Advanced Tips & Techniques Determining optimal number of reducers, skipping bad records Partitioning into multiple output files & Passing parameters to tasks Optimizing Hadoop Cluster & Performance Tuning 4.) Monitoring & Management of Hadoop Managing HDFS with Tools like fsck and dfsadmin Using HDFS & Job Tracker Web UI Routine Administration Procedures Commissioning and decommissioning of nodes Hands-On Exercise Hands-On Exercise Module -4 Hadoop with Analytics 1.) Hadoop with Analytics using R Predictive analytics with Mahout Use of statistics over big data using R. Introduction over R. Using R, How to create API which will interact hadoop Ecosystem compoment. Integration of Java,R,Hadoop,Hive etc. 2.) Graph Manipulation in Hadoop Introduction to graph techniques Representing Graphs in Hadoop Implementing a sample algorithm: Single Source Shortest Path