Introduction: Cloud Computing and
Big Data - Hadoop
Presented By:
Nagarjuna D.N
SAP CTL
AT&T, Bengaluru
Date: 14-07-2015
Overview
• Cloud Computing Evolution
• Why is Cloud Computing needed?
• Cloud Computing Models
• Cloud Solutions
• Cloud Job opportunities
• Criteria for Big Data
• Big Data challenges
• Technologies to process Big Data- Hadoop
• Hadoop History and Architecture
• Hadoop Eco-System
• Hadoop Real-time Use cases
• Hadoop Job opportunities
• Hadoop and SAP HANA integration
• Summary
Internet of Things (IoT)
Big Data: “One of the reasons is Cloud Computing!”
Cloud Computing
(An evolution of the Internet whose infrastructure is hidden from the end user)
• Infrastructure is maintained elsewhere as shared computing resources (servers, storage, network), all delivered over the Internet.
• The Cloud delivers a hosting environment that is:
-immediate,
-flexible,
-scalable,
-secure,
-available,
-economical: it saves corporations money, time and resources.
Cloud Computing (Cont.)
• In addition, the platform provides on-demand services, i.e. always on, anywhere, anytime.
• “Pay-for-what-you-use”: billed on a metered basis.
• It is based on utility computing and virtualization.
Cloud Computing History
Traditional Infrastructure Model
[Figure: forecasted infrastructure demand; axes: capital vs. time]
Acceptable Surplus
[Figure: forecasted infrastructure demand with surplus capacity; axes: capital vs. time]
Actual Infrastructure Model
[Figure: actual infrastructure demand; axes: capital vs. time]
Unacceptable Surplus
[Figure: surplus capacity; axes: capital vs. time]
Unacceptable Deficit
[Figure: capacity deficit; axes: capital vs. time]
Utility Infrastructure Model
(Concept of Cloud Computing)
[Figure: actual infrastructure demand; axes: capital vs. time]
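The surplus and deficit in the infrastructure models above can be made concrete with a small sketch. The demand figures, the step size and the review interval are all invented for illustration; the point is only to contrast capacity bought in fixed steps (the traditional model) with capacity that follows demand (the utility model).

```python
# Hypothetical demand in server units per month; values are illustrative only.
demand = [10, 14, 22, 35, 30, 48, 60, 55]

def stepwise_capacity(demand, step, review_every):
    """Traditional model: capacity is bought in fixed steps at review points,
    sized from the demand observed at the time of the review."""
    capacity, current = [], 0
    for month, d in enumerate(demand):
        if month % review_every == 0:
            # Round the observed demand up to the next multiple of `step`.
            needed = -(-d // step) * step
            current = max(current, needed)
        capacity.append(current)
    return capacity

def surplus_and_deficit(demand, capacity):
    """Total unused capacity (surplus) and total unmet demand (deficit)."""
    surplus = sum(max(c - d, 0) for d, c in zip(demand, capacity))
    deficit = sum(max(d - c, 0) for d, c in zip(demand, capacity))
    return surplus, deficit

traditional = stepwise_capacity(demand, step=20, review_every=3)
utility = list(demand)  # utility model: capacity tracks demand exactly

print(traditional, surplus_and_deficit(demand, traditional))
print(utility, surplus_and_deficit(demand, utility))
```

With these made-up numbers the stepwise plan shows both unused capacity and unmet demand, while the utility plan shows neither; real cloud billing adds granularity and pricing effects that this sketch ignores.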
Cloud Flavors (Service Models)
• IaaS – Infrastructure as a Service
• PaaS – Platform as a Service
• SaaS – Software as a Service
SaaS Examples
IaaS Examples
PaaS Examples
Cloud Deployment Models
• Public Cloud
• Private Cloud
• Hybrid Cloud
• Community Cloud
Cloud Distribution Examined
Enterprise Cloud Solutions
1. Test / Development / QA Platform
o Use cloud infrastructure servers as test and development
platform
2. Disaster Recovery
o Keep images of servers on cloud infrastructure ready to
go in case of a disaster
3. Cloud File Storage
o Back up or archive company data to cloud file storage
4. Load Balancing
o Use cloud infrastructure for overflow management during
peak usage times
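The overflow idea in item 4 can be sketched in a few lines. This is a simplified illustration, not a real load balancer: the load figures and the on-premises capacity of 100 are hypothetical, and the function only splits load between local servers and the cloud.

```python
def route_requests(load, on_prem_capacity):
    """Serve as much load as possible on-premises; send the overflow to the cloud."""
    on_prem = min(load, on_prem_capacity)
    cloud = max(load - on_prem_capacity, 0)
    return on_prem, cloud

# Hypothetical requests per second over five hours.
hourly_load = [40, 80, 150, 220, 90]
split = [route_requests(load, on_prem_capacity=100) for load in hourly_load]

for load, (on_prem, cloud) in zip(hourly_load, split):
    print(f"load={load:3d}  on-prem={on_prem:3d}  cloud={cloud:3d}")
```

Cloud capacity is only consumed (and paid for) in the hours where load exceeds the local limit.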
Enterprise Cloud Solutions (Cont.)
5. Overhead Control
o Lower overhead costs and make bids more competitive
6. Distributed Network Control and Cost Reporting
o Create individual private networks (VPCs) for each subsidiary or contract
7. Rapid Deployment
o Turn up servers immediately to fulfill project timelines
8. Functional IT Labor Shift
o Refocus IT labor expense on revenue producing activities
Preparing for the Future: Cloud IT Jobs
Sampling of IT skills likely to be in demand in the future
o Functional application development and support
   - E.g. Oracle, SAP, SQL, linking hardware to software
o Leveraging data to make strategic business decisions
   - E.g. Business Intelligence: applying sales forecasts to inventory and manufacturing decisions
o Mobile apps
   - Android, iPhone, Windows Mobile
o Wi-Fi engineers
   - USF to include broadband communications (LTE replaces GSM/CDMA)
o Optical engineers
   - Optical offers the highest bandwidth today (PON, CWDM, DWDM)
o Virtualization specialists
   - Economies of scale require virtualization (server, storage, client…)
o IP engineers
o Network security specialists
o Web developers
o Social media developers
o Business Intelligence application development and support
IT Cloud infrastructure
“Big Data- Big Thing”
• Big Data is much like a Rubik’s cube: it has many different solutions.
• Suppose you scramble five Rubik’s cubes in the same way and give them to five different experts.
• Each expert will solve the cube within seconds.
• But if you watch closely, you will notice that even though the final outcome is the same, the route taken to solve the cube is not.
• Every expert will start at a different place (color) and will solve it with a different method.
• It is nearly impossible for two experts to take exactly the same route.
Beginning Big Data
Big Data Definition in general
• Big Data is a collection of data sets that are large and complex in nature.
• They comprise both structured and unstructured data, and they grow so large so fast that traditional relational database management systems (RDBMS) cannot manage them.
Big Data Technically
i. Volume
Petabytes to zettabytes.
ii. Velocity
Batch or real-time (stream) processing.
iii. Variety
Structured, semi-structured & unstructured.
It is estimated that 80% of the world’s data is unstructured and the rest is semi-structured or structured.
iv. Veracity
The quality of the data being captured can vary greatly.
Fig. Big Data, based on Doug Laney’s 3Vs model (extended here with Veracity)
Variety of Data
1. Structured Data: data that is identifiable because it is organized in a structure (a standard, defined format).
E.g.: databases, data warehouses & electronic spreadsheets.
2. Semi-Structured Data: data that is neither raw data nor typed data in a conventional database system.
E.g.: wiki pages, tweets, Facebook data & instant messages.
3. Unstructured Data: data that has no standard, defined structure.
E.g.: data files, audio files, video, graphics & multimedia.
Traditional Data v/s Big Data
Attributes         Traditional Data             Big Data
Volume             Gigabytes to terabytes       Petabytes to zettabytes
Organization       Centralized                  Distributed
Structure          Structured                   Semi-structured & unstructured
Data model         Strict schema based          Flat schema
Data relationship  Complex interrelationships   Almost flat with few relationships
Criteria of Big Data
1. 272 hours of video are uploaded to YouTube every minute, and over 3 billion hours of video are watched every month.
2. Radio Frequency ID (RFID) systems generate up to 1,000 times more data than conventional bar code systems.
3. 340 million tweets are sent every day, which amounts to 7 TB of data.
4. The social networking site Facebook processes over 10 TB of data every day.
5. Over 5 billion people use cell phones to call, send SMS, email, browse the Internet, and interact via social networking sites.
6. The Square Kilometre Array (SKA) radio telescope project is expected to generate 700 TB of data per second.
Challenges with Big Data
1. Scaling is costly.
2. A strategy must be in place before you hit the limit of a single computer.
3. Most enterprises responded to scalability needs only when they started facing problems of poor response and low throughput.
4. Adding hardware to an existing system is labor-intensive and hence error-prone.
5. Mixed data types (structured and unstructured) make scaling even harder.
Exploring Big Data for business insights
Big Data solutions with Hadoop
Organizations That Adopted Big Data
How are Organizations using Big Data Technology?
Feb 14th 2011 – Watson is IBM’s supercomputer built using Big Data technology. It is not online, and it processes information like a human brain.
Tools typically used in Big Data Scenarios
Technology to process Big Data- Hadoop
(Open-source software framework written in Java)
• Open-source software: It's free to download, though more and
more commercial versions of Hadoop are becoming available.
• Framework: Everything you need to develop and run software applications is provided (programs, connections, etc.).
• Distributed storage: The Hadoop framework breaks big data into
blocks, which are stored on clusters of commodity hardware.
• Processing power: Hadoop concurrently processes large amounts
of data using multiple low-cost computers for fast results.
• Hadoop is a distributed file system (DFS) and processing framework, not a database. It is designed for information in many forms.
• Open-source project started by Doug Cutting, then an employee of Yahoo. Hadoop is named after his son’s toy elephant.
• Maintained by the Apache Software Foundation as Apache Hadoop.
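The "distributed storage" point above can be illustrated with a toy sketch of how a file might be cut into blocks and how copies of each block might be spread over machines. The block size of 128 MB and the replication factor of 3 are assumed values (both are configurable in HDFS), the node names are hypothetical, and the round-robin placement is a deliberate simplification rather than HDFS's actual placement policy.

```python
import math

BLOCK_SIZE_MB = 128   # assumed block size (configurable in HDFS)
REPLICATION = 3       # assumed replication factor (configurable in HDFS)

def split_into_blocks(file_size_mb, block_size_mb=BLOCK_SIZE_MB):
    """Return the size of each block; only the last block may be smaller."""
    n_blocks = math.ceil(file_size_mb / block_size_mb)
    sizes = [block_size_mb] * n_blocks
    if n_blocks:
        sizes[-1] = file_size_mb - block_size_mb * (n_blocks - 1)
    return sizes

def place_replicas(n_blocks, nodes, replication=REPLICATION):
    """Toy round-robin placement: each block is copied to `replication` distinct nodes."""
    if replication > len(nodes):
        raise ValueError("need at least as many nodes as replicas")
    return {
        block_id: [nodes[(block_id + r) % len(nodes)] for r in range(replication)]
        for block_id in range(n_blocks)
    }

nodes = ["node1", "node2", "node3", "node4"]   # hypothetical DataNodes
blocks = split_into_blocks(300)                # a 300 MB file
layout = place_replicas(len(blocks), nodes)

print(blocks)   # sizes of the blocks in MB
print(layout)   # block id -> nodes holding a copy
```

Because every block lives on several machines, losing one machine does not lose data, which is what makes commodity hardware acceptable.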
Hadoop Creation History
Hadoop Architecture
Hadoop core has two major components (daemons):
1. HDFS
a. NameNode
b. Secondary NameNode
c. DataNode
2. MapReduce Engine (distributed data processing framework)
a. JobTracker
b. TaskTracker
What components make up Hadoop?
• Hadoop Common – the libraries and utilities used by other Hadoop
modules.
• Hadoop Distributed File System (HDFS) – the Java-based
scalable system that stores data across multiple machines without
prior organization.
• MapReduce – a software programming model for processing large
sets of data in parallel.
• YARN – resource management framework for scheduling and
handling resource requests from distributed applications. (YARN is
an acronym for Yet Another Resource Negotiator.)
Hadoop Architecture
[Figure: a master node running the JobTracker and NameNode (plus its own TaskTracker and DataNode) and four slave nodes, each running a TaskTracker (MapReduce layer) and a DataNode (HDFS layer)]
[Figure: nodes, racks, cluster, data center]
MapReduce Example
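A common illustration of the MapReduce model is word count. The sketch below simulates the map, shuffle and reduce phases in plain Python on a single machine. It does not use Hadoop's API, and the function names and input lines are illustrative assumptions.

```python
from collections import defaultdict

def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]

def shuffle_phase(pairs):
    """Shuffle: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)

def word_count(lines):
    mapped = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(k, v) for k, v in grouped.items())

lines = ["big data needs hadoop", "hadoop stores big data"]
counts = word_count(lines)
print(counts)
```

In a real cluster the mappers run in parallel on the nodes that hold the input blocks, and the framework performs the shuffle across the network before the reducers run.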
Benefits of Hadoop
• Scalable: New nodes can be added without needing to change data formats.
• Cost effective: Hadoop brings massively parallel computing to commodity hardware.
• Flexible: Hadoop is schema-less and can absorb any type of data, structured or not, from any number of sources.
• Fault tolerant: When you lose a node, the system redirects work to another location of the data and continues processing without missing a heartbeat.
• Programming languages: Java (default) or Python.
• Last but not least: it’s free (open source)!
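The fault-tolerance point above, redirecting work to another location of the data, can be sketched as follows. This is a simplified model under stated assumptions: the replica map and node names are hypothetical, and a real cluster detects failures through heartbeats rather than a manually edited set.

```python
def pick_node(block_id, replicas, alive):
    """Return the first live node that holds a replica of the block."""
    for node in replicas[block_id]:
        if node in alive:
            return node
    raise RuntimeError(f"all replicas of block {block_id} are lost")

# Hypothetical replica map: block id -> nodes holding a copy.
replicas = {
    0: ["node1", "node2", "node3"],
    1: ["node2", "node3", "node4"],
}
alive = {"node1", "node2", "node3", "node4"}

before = {b: pick_node(b, replicas, alive) for b in replicas}
alive.discard("node2")   # simulate the loss of one node
after = {b: pick_node(b, replicas, alive) for b in replicas}

print(before)
print(after)
```

Work on block 1 moves from the failed node to the next node holding a copy, while block 0 is unaffected.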
Hadoop is not Suitable for All Kinds of Applications
Hadoop is not suitable to:
• perform real-time, stream-based processing where data is
processed immediately upon its arrival.
• perform online access where low latency is required.
Hadoop Eco-System
Real-Time Hadoop Use Cases
1. Risk modeling (How can banks understand customers & markets?)
2. Customer churn analysis (Why do companies really lose customers?)
3. Ad targeting (How can companies increase campaign efficiency?)
4. Point-of-sale transaction analysis (How do retailers target promotions guaranteed to make you buy?)
5. Search quality (What’s in your search?)
Hadoop Job Opportunities
Apache Hadoop & SAP HANA Integration
(Future Generation Technologies)
In Real-Time Business
Resources
Summary
o Cloud Computing
o Big Data
o Apache Hadoop
o Hadoop and SAP HANA integration
More Details
Nagarjuna D N
nagarjunadn.arjun@gmail.com
nagarjuna_dn@live.com
More Cloud Solutions Architect Skills:
• Amazon Cloud (Amazon Web Services)
• MongoDB (NoSQL Database)
• Play Framework (Web Application Framework)
• Domain/ SSL Certificate setup
• Apache Hadoop, Apache Pig, Apache Hive
Your Valuable Feedback Please
• Please tell me where I must improve!

 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

Introduction to Cloud computing and Big Data-Hadoop

  • 1. Introduction: Cloud Computing and Big Data - Hadoop Presented By: Nagarjuna D.N SAP CTL AT&T, Bengaluru Date: 14-07-2015
  • 2. Overview • Cloud Computing Evolution • Why Is Cloud Computing Needed? • Cloud Computing Models • Cloud Solutions • Cloud Job Opportunities • Criteria for Big Data • Big Data Challenges • Technologies to Process Big Data: Hadoop • Hadoop History and Architecture • Hadoop Ecosystem • Hadoop Real-Time Use Cases • Hadoop Job Opportunities • Hadoop and SAP HANA Integration • Summary
  • 3. Internet of Things (IoT) Big Data “One of the reasons is cloud computing!”
  • 4. Cloud Computing (an evolution of the Internet that is hidden from the end user) • Infrastructure is maintained elsewhere, with shared computing resources (servers, storage, and networking) all delivered over the Internet. • The cloud delivers a hosting environment that is immediate, flexible, scalable, secure, and available, and that saves corporations money, time, and resources.
  • 5. Cloud Computing (cont.) • In addition, the platform provides on-demand services, i.e., always on, anywhere, and anytime. • “Pay-for-what-you-use”: billed on a metered basis. • It is based on utility computing and virtualization.
  • 12. Utility Infrastructure Model (Concept of Cloud Computing) [Figure: actual infrastructure demand; axes: Time, Capital]
  • 13. Cloud Flavors (Service Models) • IaaS – Infrastructure as a Service • PaaS – Platform as a Service • SaaS – Software as a Service
  • 17. Cloud Deployment Models • Public Cloud • Private Cloud • Hybrid Cloud • Community Cloud
  • 19. Enterprise Cloud Solutions 1. Test / Development / QA Platform o Use cloud infrastructure servers as a test and development platform 2. Disaster Recovery o Keep images of servers on cloud infrastructure, ready to go in case of a disaster 3. Cloud File Storage o Back up or archive company data to cloud file storage 4. Load Balancing o Use cloud infrastructure for overflow management during peak usage times
  • 20. Enterprise Cloud Solutions (cont.) 5. Overhead Control o Lower overhead costs and make bids more competitive 6. Distributed Network Control and Cost Reporting o Create individual private networks (VPCs) for each subsidiary or contract 7. Rapid Deployment o Turn up servers immediately to fulfill project timelines 8. Functional IT Labor Shift o Refocus IT labor expense on revenue-producing activities
  • 21. Preparing for the Future: Cloud IT Jobs. Sampling of IT skills likely to be in demand in the future o Functional application development and support - e.g. Oracle, SAP, SQL, linking hardware to software o Leveraging data to make strategic business decisions - e.g. Business Intelligence: applying sales forecasts to inventory and manufacturing decisions o Mobile apps - Android, iPhone, Windows Mobile o Wi-Fi engineers - USF to include broadband communications (LTE replaces GSM/CDMA) o Optical engineers - Optical offers the highest bandwidth today (PON, CWDM, DWDM) o Virtualization specialists - Economies of scale require virtualization (server, storage, client…) o IP engineers o Network security specialists o Web developers o Social media developers o Business Intelligence application development and support
  • 24. “Big Data- Big Thing” • Big Data is exactly like a Rubik’s cube. • Just like a Rubik’s cube, Big Data has many different solutions. • Suppose you take five Rubik’s cubes, mix them up the same way, and give them to five different experts. • They will solve the cubes in a matter of seconds. • But if you pay close attention, you will notice that even though the final outcome is the same, the route taken to solve the cube is not. • Every expert will start at a different place (color) and will try to solve it with different methods. • It is nearly impossible for two experts to take exactly the same route. Beginning Big Data
  • 26. Big Data Definition in General • Big Data is a collection of data sets that are large and complex in nature. • They comprise both structured and unstructured data that grow so large so fast that they are not manageable by traditional relational database management systems (RDBMS).
  • 27. Big Data Technically i. Volume: petabytes to zettabytes. ii. Velocity: batch or real-time (stream) processing. iii. Variety: structured, semi-structured, and unstructured. It is estimated that 80% of the world’s data is unstructured; the rest is semi-structured or structured. iv. Veracity: the quality of the data being captured can vary greatly. Fig. Big Data, based on Doug Laney’s 3Vs model (veracity is a later addition)
  • 28. Variety of Data 1. Structured data: data that is identifiable because it is organized in a structure (a standard, defined format). E.g.: databases, data warehouses, and electronic spreadsheets. 2. Semi-structured data: data that is neither raw data nor typed data in a conventional database system. E.g.: wiki pages, tweets, Facebook data, and instant messages. 3. Unstructured data: data that does not have a standard, defined structure. E.g.: data files, audio files, video, graphics, and multimedia.
  • 29. Traditional Data v/s Big Data
      Attribute | Traditional Data | Big Data
      Volume | Gigabytes to terabytes | Petabytes to zettabytes
      Organization | Centralized | Distributed
      Structure | Structured | Semi-structured & unstructured
      Data model | Strict schema based | Flat schema
      Data relationship | Complex interrelationships | Almost flat with few relationships
  • 30. Criteria of Big Data 1. 272 hours of video are uploaded to YouTube every minute, and over 3 billion hours of video are watched every month. 2. Radio Frequency ID (RFID) systems generate up to 1,000 times more data than conventional bar code systems. 3. 340 million tweets are sent every day, which amounts to 7 TB of data. 4. The social networking site Facebook processes over 10 TB of data every day. 5. Over 5 billion people use cell phones to call, send SMS, email, browse the Internet, and interact via social networking sites. 6. The Square Kilometre Array (SKA) radio telescope project is expected to receive 700 TB of data per second.
  • 31. Challenges with Big Data 1. Scaling is costly. 2. A strategy must be in place before you hit the limit of a single computer. 3. Most enterprises responded to scalability needs only when they started facing problems of poor response and low throughput. 4. Adding hardware to an existing system is labor-intensive and hence error-prone. 5. Mixed data types (structured and unstructured) make scaling even harder.
  • 32. Exploring Big Data for business insights
  • 34. Big Data solutions with Hadoop
  • 36. How are Organizations using Big Data Technology?
  • 38. Feb 14th 2011: Watson, IBM’s supercomputer built using Big Data technology. It is not online, and it processes information like a human brain.
  • 40. Tools typically used in Big Data Scenarios
  • 41. Technology to Process Big Data: Hadoop (open-source software framework written in Java) • Open-source software: It is free to download, though more and more commercial versions of Hadoop are becoming available. • Framework: Everything you need to develop and run software applications is provided (programs, connections, etc.). • Distributed storage: The Hadoop framework breaks big data into blocks, which are stored on clusters of commodity hardware. • Processing power: Hadoop concurrently processes large amounts of data using multiple low-cost computers for fast results. • Hadoop is built around a distributed file system (DFS); it is not a database. It is designed to handle information in many forms. • Open-source project started by Doug Cutting, an employee of Yahoo. Hadoop is the name of his son’s toy elephant. • Apache Software Foundation: Apache Hadoop.
  • 43. Hadoop Architecture Hadoop core has two major components, each with its own daemons: 1. HDFS a. NameNode b. Secondary NameNode c. DataNode 2. MapReduce engine (distributed data processing framework) a. JobTracker b. TaskTracker
  • 44. What components make up Hadoop? • Hadoop Common – the libraries and utilities used by other Hadoop modules. • Hadoop Distributed File System (HDFS) – the Java-based scalable system that stores data across multiple machines without prior organization. • MapReduce – a software programming model for processing large sets of data in parallel. • YARN – resource management framework for scheduling and handling resource requests from distributed applications. (YARN is an acronym for Yet Another Resource Negotiator.)
  • 51. Benefits of Hadoop • Scalable: New nodes can be added without needing to change data formats. • Cost-effective: Hadoop brings massively parallel computing to commodity hardware. • Flexible: Hadoop is schema-less and can absorb any type of data, structured or not, from any number of sources. • Fault-tolerant: When you lose a node, the system redirects work to another location of the data and continues processing without missing a beat. • Programming languages: Java (default) or Python. • Last but not least: it’s free (open source).
  • 52. Hadoop Is Not Suitable for All Kinds of Applications Hadoop is not suitable for: • real-time, stream-based processing where data is processed immediately upon its arrival. • online access where low latency is required.
  • 54. Real-Time Hadoop Use Cases 1. Risk modeling (How can banks understand customers and markets?) 2. Customer churn analysis (Why do companies really lose customers?) 3. Ad targeting (How can companies increase campaign efficiency?) 4. Point-of-sale transaction analysis (How do retailers target promotions guaranteed to make you buy?) 5. Search quality (What’s in your search?)
  • 59. Apache Hadoop & SAP HANA Integration (Future Generation Technologies)
  • 62. Summary o Cloud Computing o Big Data o Apache Hadoop o Hadoop and SAP HANA integration
  • 64. More Details Nagarjuna D N nagarjunadn.arjun@gmail.com nagarjuna_dn@live.com More Cloud Solutions Architect Skills: • Amazon Cloud (Amazon Web Services) • MongoDB (NoSQL Database) • Play Framework (Web Application Framework) • Domain/SSL certificate setup • Apache Hadoop, Apache Pig, Apache Hive
  • 65. Your Valuable Feedback Please • Please tell me where I must improve.
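
Slide 44 describes MapReduce as a programming model for processing large data sets in parallel. The sketch below illustrates the idea with a word count run in a single Python process; it does not use Hadoop itself, and the function names (map_phase, shuffle_phase, reduce_phase, word_count) are hypothetical names chosen for this illustration. On a real cluster the map and reduce steps would run on many machines, with the framework handling the grouping step in between.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by their key (the word)."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence (single process, illustration only)."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data needs big clusters", "hadoop stores big data"]))
```

Because each line is mapped independently and each word is reduced independently, both phases can be spread across machines, which is the property the slide refers to as processing "in parallel".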

Editor's notes

  1. Tentative
  2. Era of IoT, start-up companies; resources: servers, storage, networking
  3. Infrastructure is maintained somewhere, with shared computing resources that can be accessed over the Internet
  4. Public cloud: The cloud infrastructure is available to the public on a commercial basis from a cloud service provider. This enables a consumer to develop and deploy a service in the cloud with very little financial outlay compared to the capital expenditure normally associated with other deployment options. Private cloud: The cloud infrastructure is deployed, maintained, and operated for a specific organization. The operation may be in-house or with a third party on the premises. Hybrid cloud: A hybrid cloud environment consists of some portion of computing resources on-site (on premise) and some off-site (public cloud). By integrating public cloud services, users can leverage cloud solutions for specific functions that are too costly to maintain on-premise, such as virtual server disaster recovery, backups, and test/development environments. Community cloud: A community cloud is formed when several organizations with similar requirements share common infrastructure. Costs are spread over fewer users than in a public cloud but more than in a single-tenant setup.
  5. On-demand, Reserved and Bid
  6. It infrastructure myriad-numerous to think of it
  7. Harnessing big data for business insights Harnessing-exploit, control, keep in check
  8. Issues with existing RDBMS
  9. Big Data solutions
  10. http://saphanatutorial.com/what-is-hadoop/
  11. Default web UI ports: JobTracker 50030, TaskTracker 50060, Secondary NameNode 50090, NameNode 50070, DataNode 50075
  12. http://www.edureka.co/blog/how-essential-is-hadoop-training/
  13. http://www.edureka.co/blog/how-essential-is-hadoop-training/
  14. AT&T has Synaptic Cloud http://connect.att.jobs/careers/big-data-jobs http://att.jobs/careers/technology/big-data http://www.corp.att.com/stateandlocal/big_data/ http://blogs.wsj.com/cio/2014/06/03/att-uses-big-data-to-improve-customer-experience/ http://www.research.att.com/evergreen/working_with_us/job_desc_data_scientist_network?fbid=woFy7dK7UK9
  15. http://saphanatutorial.com/sap-hana-and-hadoop/
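
Note 11 lists a port for each Hadoop daemon named on slide 43. As a small sketch of how those numbers might be used, the snippet below builds a status-page URL for a daemon. The port values are taken from the note and match the Hadoop 1.x-era defaults; a given cluster may override them in its configuration. The names HADOOP1_WEB_UI_PORTS and web_ui_url, and the host names, are hypothetical and exist only for this illustration.

```python
# Default web UI ports as listed in note 11 (Hadoop 1.x-era defaults;
# assumption: the cluster has not overridden them in its configuration).
HADOOP1_WEB_UI_PORTS = {
    "jobtracker": 50030,
    "tasktracker": 50060,
    "secondarynamenode": 50090,
    "namenode": 50070,
    "datanode": 50075,
}


def web_ui_url(daemon: str, host: str = "localhost") -> str:
    """Return the status-page URL for a daemon, looked up case-insensitively."""
    port = HADOOP1_WEB_UI_PORTS[daemon.lower()]
    return f"http://{host}:{port}/"


if __name__ == "__main__":
    for name in HADOOP1_WEB_UI_PORTS:
        print(name, web_ui_url(name))
```

An unknown daemon name raises KeyError, which keeps a typo from silently producing a wrong URL.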