1. Big Data And Hadoop
By easydata
easydata - Online Training
2. Big Data And Hadoop
• Big Data is an asset, often a complex and ambiguous one.
• Hadoop is a program that accomplishes a set of goals and objectives
for dealing with that asset.
• Big data is large sets of data that businesses and other parties put
together for specific goals and operations.
• Businesses / Companies collect these data over a period of time.
easydata - Online Training
3. • These data may include customer identifiers like name , Social
Security number , age group , location or anything.
• On product information in the form of model numbers, sales numbers
, inventory numbers, complain numbers.
• Customer feedback, angry customers, happy customers etc.
• All of this can be called big data.
• But they all are raw and unsorted data.
• Hadoop is one of the tools designed to handle this raw and unsorted
big data.
easydata - Online Training
4. • Hadoop works to interpret or parse the results of big data searches.
• Hadoop uses some algorithms and methods to understand it.
• Hadoop is an open-source program under the Apache license that is
maintained by a global community of users.
• To understand Hadoop, you have to understand two fundamental
things.
• They are: How Hadoop stores files, and how it processes data.
easydata - Online Training
5. • Hadoop includes various main components like MapReduce , HDFS.
• HDFS : Stores raw and unsorted data.
• MapReduce : Its ability to process that data, or provide a framework
for processing that data.
• HDFS == Storage
• MapReduce == Processing
easydata - Online Training