SlideShare una empresa de Scribd logo
1 de 22
Submitted By
J. Subha, M.Tech II Year
M.S. University, Tirunelveli.
 Introduction Big Data
 Data Facts
 Characteristics of Big Data
 Type of Data
 Big Data Tools
 Hadoop
No single definition: here is from Wikipedia:
 Big data is the term for a collection of data
sets so large and complex that it becomes
difficult to process using on-hand database
management tools or traditional data
processing applications.
 Involves various tools, techniques and
frameworks.

Customer
Social
Media
Gamin
g
Entertai
n
Bankin
g
Financ
e
Our
Know
n
Histor
y
Purcha
se
 Over 90% of all the data in the world was
created in the past 2 years.
 Every 2 days we create as much information.
 The total amount of data being captured and
stored by industry doubles every years.
 Every minute we send 204 million emails,
Generate 1.8 million Facebook likes, send
278 thousand Tweets, and upload 200,000
photos to Facebook
 Around 100 hours of video are uploaded to
every minute.
 Big data (TB) cannot fit in a memory of single
computer
 RDBMS fail to handle Big Data
 Processing of Big data in a single computer
will take a lot of time.
 Big data cannot be analyzed with a traditional
tools.
 Characteristics of Big Data:5V’s
 Volume – Data Quantity
 Velocity – Data Speed
 Variety - Data Types
 Veracity – Data Quality and accuracy
 Value - Data Value
 Turning Big Data into Value: The latest
technology such as Distributed systems and
cloud computing together with the latest
software and analysis approaches allow us to
leverage all types of data to gain insights and
add value.
The Model of Generating/Consuming Data has Changed
Old Model: Few companies are generating data, all others are
consuming data
New Model: all of us are generating data, and all of us are
consuming data
Processing Big Data
 Unstructured - Video data, audio data,
( PDF)
 Semi-structured - Many sources of big data
( XML)
 Structured - Most traditional data sources
(Tables)
 Sensors
 Cc-cams
 Social Network- FB..
 Online Shopping
 Airlines
 Hospitality data etc.,
 Big Data is needed – Increase of storage
capacities – Increase of processing power –
Availability of data (different data types).
 Collecting
 Organizing
 Analyzing of Large
set of data to discover
pattern or other
useful information.
Organizing
Analyzing
Collecting
Representation
 Hadoop – Getting huge data, processed in
less time
 Storing and processing huge amount of data
 Hadoop is the Open source frame work
software, that is developed by ‘Apache’ to
support distributed processing of data.
 Initially, Java Language was used to develop
Hadoop script, but today many other
languages are used for scripting Hadoop.
 Hadoop is used to helps in data analytics
 Hadoop implements Google’s MapReduce,
using HDFS
 MapReduce divides applications into many
small blocks of work.
 HDFS creates multiple replicas of data
blocks for reliability, placing them on
compute nodes around the cluster.
 MapReduce can then process the data
where it is located.
 Hadoop ‘s target is to run on clusters of the
order of 10,000-nodes.
 Hardware Requirements
 Quad core processor- 64 bit
 RAM – 8GB
 Disk Free – 20 GB
 Software Requirements
 Windows 7+, MAC Osx10.10+,..
 Several Opensource Software tools including
Apache Hadoop.
Thank You,

Más contenido relacionado

La actualidad más candente

Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
Rohit Dubey
 
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Simplilearn
 
Big data ecosystem
Big data ecosystemBig data ecosystem
Big data ecosystem
magda3695
 

La actualidad más candente (20)

Big Data
Big DataBig Data
Big Data
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
 
Big Data ppt
Big Data pptBig Data ppt
Big Data ppt
 
Big Data
Big DataBig Data
Big Data
 
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
Big Data Tutorial | What Is Big Data | Big Data Hadoop Tutorial For Beginners...
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big data
Big dataBig data
Big data
 
Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)Big Data & Analytics (Conceptual and Practical Introduction)
Big Data & Analytics (Conceptual and Practical Introduction)
 
Big Data Ppt PowerPoint Presentation Slides
Big Data Ppt PowerPoint Presentation Slides Big Data Ppt PowerPoint Presentation Slides
Big Data Ppt PowerPoint Presentation Slides
 
Social media with big data analytics
Social media with big data analyticsSocial media with big data analytics
Social media with big data analytics
 
Big Data & Data Science
Big Data & Data ScienceBig Data & Data Science
Big Data & Data Science
 
Lecture1 introduction to big data
Lecture1 introduction to big dataLecture1 introduction to big data
Lecture1 introduction to big data
 
Big Data
Big DataBig Data
Big Data
 
Big Data - Applications and Technologies Overview
Big Data - Applications and Technologies OverviewBig Data - Applications and Technologies Overview
Big Data - Applications and Technologies Overview
 
Big data by Mithlesh sadh
Big data by Mithlesh sadhBig data by Mithlesh sadh
Big data by Mithlesh sadh
 
Chapter 1 big data
Chapter 1 big dataChapter 1 big data
Chapter 1 big data
 
What is big data?
What is big data?What is big data?
What is big data?
 
Big data ppt
Big data pptBig data ppt
Big data ppt
 
Big data ecosystem
Big data ecosystemBig data ecosystem
Big data ecosystem
 

Destacado

Data Mining – analyse Bank Marketing Data Set
Data Mining – analyse Bank Marketing Data SetData Mining – analyse Bank Marketing Data Set
Data Mining – analyse Bank Marketing Data Set
Mateusz Brzoska
 
Marketing segmentation
Marketing segmentationMarketing segmentation
Marketing segmentation
Maya Humbatova
 

Destacado (15)

Top 6 Tips to Market to Affluent Chinese Consumers - Dragon Trail China
Top 6 Tips to Market to Affluent Chinese Consumers - Dragon Trail ChinaTop 6 Tips to Market to Affluent Chinese Consumers - Dragon Trail China
Top 6 Tips to Market to Affluent Chinese Consumers - Dragon Trail China
 
Cluster Analysis - Keyword Clustering
Cluster Analysis -  Keyword ClusteringCluster Analysis -  Keyword Clustering
Cluster Analysis - Keyword Clustering
 
AXIS BANK (SEGMENTATION AXIS BANK PRODUCTS & SERVICES.)
AXIS BANK (SEGMENTATION AXIS BANK PRODUCTS & SERVICES.)AXIS BANK (SEGMENTATION AXIS BANK PRODUCTS & SERVICES.)
AXIS BANK (SEGMENTATION AXIS BANK PRODUCTS & SERVICES.)
 
Affluent Market
Affluent MarketAffluent Market
Affluent Market
 
Mass Affluent South Asian Business Proposal
Mass Affluent South Asian Business ProposalMass Affluent South Asian Business Proposal
Mass Affluent South Asian Business Proposal
 
Data Mining – analyse Bank Marketing Data Set
Data Mining – analyse Bank Marketing Data SetData Mining – analyse Bank Marketing Data Set
Data Mining – analyse Bank Marketing Data Set
 
Segmenting the SME & Commercial Customer Banking Market
Segmenting the SME & Commercial Customer Banking MarketSegmenting the SME & Commercial Customer Banking Market
Segmenting the SME & Commercial Customer Banking Market
 
Market segmentation & competitive analysis of banking products
Market segmentation & competitive analysis of banking productsMarket segmentation & competitive analysis of banking products
Market segmentation & competitive analysis of banking products
 
Introduction to Market Segmentation
Introduction to Market SegmentationIntroduction to Market Segmentation
Introduction to Market Segmentation
 
Learning & Development Strategy in Banking Industry
Learning & Development Strategy in Banking IndustryLearning & Development Strategy in Banking Industry
Learning & Development Strategy in Banking Industry
 
Towards Future Proof Customer Relations
Towards Future Proof Customer RelationsTowards Future Proof Customer Relations
Towards Future Proof Customer Relations
 
Marketing segmentation
Marketing segmentationMarketing segmentation
Marketing segmentation
 
Market Segmentation
Market SegmentationMarket Segmentation
Market Segmentation
 
Customer centric in a digital world
Customer centric in a digital worldCustomer centric in a digital world
Customer centric in a digital world
 
Market Segmentation, Targeting and Positioning
Market Segmentation, Targeting and PositioningMarket Segmentation, Targeting and Positioning
Market Segmentation, Targeting and Positioning
 

Similar a Big Data

Similar a Big Data (20)

GADLJRIET850691
GADLJRIET850691GADLJRIET850691
GADLJRIET850691
 
How Do I Learn Big Data
How Do I Learn Big DataHow Do I Learn Big Data
How Do I Learn Big Data
 
How Do I Learn Big Data
How Do I Learn Big DataHow Do I Learn Big Data
How Do I Learn Big Data
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
Easylearning Guru online Hadoop class
Easylearning Guru online Hadoop class Easylearning Guru online Hadoop class
Easylearning Guru online Hadoop class
 
Big Data and Big Data Management (BDM) with current Technologies –Review
Big Data and Big Data Management (BDM) with current Technologies –ReviewBig Data and Big Data Management (BDM) with current Technologies –Review
Big Data and Big Data Management (BDM) with current Technologies –Review
 
Big Data
Big DataBig Data
Big Data
 
IRJET- Youtube Data Sensitivity and Analysis using Hadoop Framework
IRJET-  	  Youtube Data Sensitivity and Analysis using Hadoop FrameworkIRJET-  	  Youtube Data Sensitivity and Analysis using Hadoop Framework
IRJET- Youtube Data Sensitivity and Analysis using Hadoop Framework
 
Big Data Hadoop Training by Easylearning Guru
Big Data Hadoop Training by Easylearning GuruBig Data Hadoop Training by Easylearning Guru
Big Data Hadoop Training by Easylearning Guru
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Big data-analytics-cpe8035
Big data-analytics-cpe8035Big data-analytics-cpe8035
Big data-analytics-cpe8035
 
Big data and hadoop introduction
Big data and hadoop introductionBig data and hadoop introduction
Big data and hadoop introduction
 
Big Data
Big DataBig Data
Big Data
 
Big Data Hadoop Tutorial by Easylearning Guru
Big Data Hadoop Tutorial by Easylearning GuruBig Data Hadoop Tutorial by Easylearning Guru
Big Data Hadoop Tutorial by Easylearning Guru
 
Big data abstract
Big data abstractBig data abstract
Big data abstract
 
BIG Data and Methodology-A review
BIG Data and Methodology-A reviewBIG Data and Methodology-A review
BIG Data and Methodology-A review
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big Data
Big DataBig Data
Big Data
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
 

Último

Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 

Último (20)

Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-IIFood Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
Food Chain and Food Web (Ecosystem) EVS, B. Pharmacy 1st Year, Sem-II
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural ResourcesEnergy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
Energy Resources. ( B. Pharmacy, 1st Year, Sem-II) Natural Resources
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 

Big Data

  • 1.
  • 2.
  • 3. Submitted By J. Subha, M.Tech II Year M.S. University, Tirunelveli.
  • 4.  Introduction Big Data  Data Facts  Characteristics of Big Data  Type of Data  Big Data Tools  Hadoop
  • 5. No single definition: here is from Wikipedia:  Big data is the term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications.  Involves various tools, techniques and frameworks.
  • 7.
  • 8.  Over 90% of all the data in the world was created in the past 2 years.  Every 2 days we create as much information.  The total amount of data being captured and stored by industry doubles every years.  Every minute we send 204 million emails, Generate 1.8 million Facebook likes, send 278 thousand Tweets, and upload 200,000 photos to Facebook  Around 100 hours of video are uploaded to every minute.
  • 9.  Big data (TB) cannot fit in a memory of single computer  RDBMS fail to handle Big Data  Processing of Big data in a single computer will take a lot of time.  Big data cannot be analyzed with a traditional tools.
  • 10.  Characteristics of Big Data:5V’s  Volume – Data Quantity  Velocity – Data Speed  Variety - Data Types  Veracity – Data Quality and accuracy  Value - Data Value  Turning Big Data into Value: The latest technology such as Distributed systems and cloud computing together with the latest software and analysis approaches allow us to leverage all types of data to gain insights and add value.
  • 11.
  • 12.
  • 13. The Model of Generating/Consuming Data has Changed Old Model: Few companies are generating data, all others are consuming data New Model: all of us are generating data, and all of us are consuming data
  • 14. Processing Big Data  Unstructured - Video data, audio data, ( PDF)  Semi-structured - Many sources of big data ( XML)  Structured - Most traditional data sources (Tables)
  • 15.  Sensors  Cc-cams  Social Network- FB..  Online Shopping  Airlines  Hospitality data etc.,  Big Data is needed – Increase of storage capacities – Increase of processing power – Availability of data (different data types).
  • 16.  Collecting  Organizing  Analyzing of Large set of data to discover pattern or other useful information. Organizing Analyzing Collecting Representation
  • 17.
  • 18.  Hadoop – Getting huge data, processed in less time  Storing and processing huge amount of data  Hadoop is the Open source frame work software, that is developed by ‘Apache’ to support distributed processing of data.  Initially, Java Language was used to develop Hadoop script, but today many other languages are used for scripting Hadoop.  Hadoop is used to helps in data analytics
  • 19.
  • 20.  Hadoop implements Google’s MapReduce, using HDFS  MapReduce divides applications into many small blocks of work.  HDFS creates multiple replicas of data blocks for reliability, placing them on compute nodes around the cluster.  MapReduce can then process the data where it is located.  Hadoop ‘s target is to run on clusters of the order of 10,000-nodes.
  • 21.  Hardware Requirements  Quad core processor- 64 bit  RAM – 8GB  Disk Free – 20 GB  Software Requirements  Windows 7+, MAC Osx10.10+,..  Several Opensource Software tools including Apache Hadoop.

Notas del editor

  1. B