SlideShare una empresa de Scribd logo
1 de 18
BIG DATA is not just HADOOP
Understand and navigate
federated big data sources

Federated Discovery and Navigation

Manage & store huge volume
of any data

Hadoop File System
MapReduce

Structure and control data

Data Warehousing

Manage streaming data

Stream Computing

Analyze unstructured data

Text Analytics Engine

Integrate and govern all
data sources

Integration, Data Quality, Security,
Lifecycle Management, MDM
Business-Centric Big Data Enables You to Start With a Critical Business Pain and Expand the
Foundation for Future Requirements

Corresponding Tools
/products

 “Big data” isn’t just a technology—it’s a
business strategy for capitalizing on
information resources
 Getting started is crucial
 Success at each entry point is
accelerated by products within the Big
Data platform
 Build the foundation for future
requirements by expanding further
into the big data platform
Velocity

Variety

Volume
Merging the Traditional and Big Data Approaches
Traditional Approach

Big Data Approach

Structured & Repeatable Analysis

Iterative & Exploratory Analysis

Business Users
Determine what
question to ask

IT
Delivers a platform to
enable creative
discovery

IT

Business

Structures the
data to answer
that question

Explores what questions
could be asked

Monthly sales reports
Profitability analysis
Customer surveys

Brand sentiment
Product strategy
Maximum asset utilization
Raw Data

Valuable Data Assets
A) Data Refinery Platform
B) Data Discovery Platform
C)Analytical Tools And Techniques
D)Integrated Data Warehouse
E)Distinct Execution Engine
F)Library Of pre-Built analytic functions
G)Interactive Development Tool
SQL for structured and MR for
large scale process analytics
Manage relational & non Relational
data in ins& out of Data Warehouse
Iterative analytics with greater
accuracy and effectiveness
Dig deeper for insights
Within budget
Data Task

Low-cost storage

Potential Workloads

•

Retains raw data in manner that can provide low TCO-per-terabyte storage costs

and retention
• Requires access in deep storage, but not at same speeds as in a front-line system

Loading

•

Brings data into the system from the source system

Pre-processing/

•

Prepares data for downstream processing by, for example, fetching dimension

prep/cleansing/

data, recording a new incoming batch, or archiving old window batch.

constraint
validation

Transformation

•

Converts one structure of data into another structure. This may require going

from third-normal form in a relational database to a star or snowflake schema,
or from text to a relational database, or from relational technology to a graph,
as with structural transformations.

Reporting

•

Queries historical data such as what happened, where it happened, how much
happened, who did it (e.g., sales of a given product by region)

Analytics (including

•

Performs relationship modeling via declarative SQL (e.g., scoring or basic stats)

•

Performs relationship modeling via procedural MapReduce (e.g., model building

user-driven, inter-

active, or ad-hoc)

or time series)
Stable
(structured)
Evolving
(Semi-Structured)

No Schema

(Has Format only)

• Relatively fixed, Infrequent change
• Leverage strength of relational model & SQL

• Fixed and variable of schema, but changes occur too
quickly
• Leverage backend RDBMS, “LATE BINDING” of
structure by queries

• Less relational, No Semantics – stored in native file
formats
• via MapReduce: Interpret the format & pull out
the required data
Stable

Evolving

• ERP Data
• Inventory
Recods

• Web logs,
Call record
• Twitter
feeds

No
Schema
• images
• Videos,
Web Pages
What Does Machine Data Look Like?
Sources

Order Processing

Middleware
Error

Care IVR

Twitter

6
Machine Data Contains Critical Insights
Sources

Customer ID

Order ID

Product ID

Order Processing

Order ID

Customer ID

Middleware
Error

Time Waiting On Hold

Care IVR

Customer ID

Twitter ID

Twitter

Company’s Twitter ID

Customer’s Tweet
Machine Data Contains Critical Insights
Sources

Customer ID

Order ID

Product ID

Order Processing

Order ID

Customer ID

Middleware
Error

Time Waiting On Hold

Care IVR

Customer ID

Twitter ID

Twitter

Company’s Twitter ID

Customer’s Tweet
Di

Hadoop captures, stores and
transforms images and call
records

Traditional Work flow

Capture, Retention
and
Transformation
Layer

Data Sources

ETL TOOLS

Analytic Results

Call Center
Voice Records

Analysis and Marketing
Automation (Customer
Retention Campaign)

Discovery
Platform

Dimensional Data

Hadoop

Check Images

path and sentiment
analysis with multistructured data

Social and web
data

Integrated DW
Federated Big Data Discovery

Más contenido relacionado

La actualidad más candente

Data Warehouse
Data WarehouseData Warehouse
Data Warehouseganblues
 
Business Analysis, Query Tools, Dm unit-3
Business Analysis, Query Tools, Dm unit-3Business Analysis, Query Tools, Dm unit-3
Business Analysis, Query Tools, Dm unit-3Dr. Sunil Kr. Pandey
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data WarehousingEdureka!
 
Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data miningRohit Kumar
 
Data mining (prefinals)
Data mining (prefinals)Data mining (prefinals)
Data mining (prefinals)sadam33146
 
Data Warehouse
Data WarehouseData Warehouse
Data WarehouseSana Alvi
 
SAS/MIT/Sloan Data Analytics
SAS/MIT/Sloan Data AnalyticsSAS/MIT/Sloan Data Analytics
SAS/MIT/Sloan Data AnalyticsSteven Kimber
 
Designing the business process dimensional model
Designing the business process dimensional modelDesigning the business process dimensional model
Designing the business process dimensional modelGersiton Pila Challco
 
Zackman frame work
Zackman frame workZackman frame work
Zackman frame workganblues
 
WITSML to PPDM mapping project
WITSML to PPDM mapping projectWITSML to PPDM mapping project
WITSML to PPDM mapping projectETLSolutions
 
Data warehousing and machine learning primer
Data warehousing and machine learning primerData warehousing and machine learning primer
Data warehousing and machine learning primerTom Donoghue
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousinguncleRhyme
 
Data warehousing and data mining
Data warehousing and data miningData warehousing and data mining
Data warehousing and data miningSnehali Chake
 
Consumer Data Management
Consumer Data ManagementConsumer Data Management
Consumer Data Managementijtsrd
 
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!Caserta
 
Dw & etl concepts
Dw & etl conceptsDw & etl concepts
Dw & etl conceptsjeshocarme
 
Business Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemBusiness Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemKiran kumar
 
A configuration independent score-based benchmark for distributed databases
A configuration independent score-based benchmark for distributed databasesA configuration independent score-based benchmark for distributed databases
A configuration independent score-based benchmark for distributed databasesieeepondy
 
Data warehouse-dimensional-modeling-and-design
Data warehouse-dimensional-modeling-and-designData warehouse-dimensional-modeling-and-design
Data warehouse-dimensional-modeling-and-designSarita Kataria
 

La actualidad más candente (20)

Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Business Analysis, Query Tools, Dm unit-3
Business Analysis, Query Tools, Dm unit-3Business Analysis, Query Tools, Dm unit-3
Business Analysis, Query Tools, Dm unit-3
 
Introduction to Data Warehousing
Introduction to Data WarehousingIntroduction to Data Warehousing
Introduction to Data Warehousing
 
Data warehouse and data mining
Data warehouse and data miningData warehouse and data mining
Data warehouse and data mining
 
Data mining (prefinals)
Data mining (prefinals)Data mining (prefinals)
Data mining (prefinals)
 
Data Warehouse
Data WarehouseData Warehouse
Data Warehouse
 
Preparing Your Data for ECM
Preparing Your Data for ECMPreparing Your Data for ECM
Preparing Your Data for ECM
 
SAS/MIT/Sloan Data Analytics
SAS/MIT/Sloan Data AnalyticsSAS/MIT/Sloan Data Analytics
SAS/MIT/Sloan Data Analytics
 
Designing the business process dimensional model
Designing the business process dimensional modelDesigning the business process dimensional model
Designing the business process dimensional model
 
Zackman frame work
Zackman frame workZackman frame work
Zackman frame work
 
WITSML to PPDM mapping project
WITSML to PPDM mapping projectWITSML to PPDM mapping project
WITSML to PPDM mapping project
 
Data warehousing and machine learning primer
Data warehousing and machine learning primerData warehousing and machine learning primer
Data warehousing and machine learning primer
 
Introduction to data warehousing
Introduction to data warehousingIntroduction to data warehousing
Introduction to data warehousing
 
Data warehousing and data mining
Data warehousing and data miningData warehousing and data mining
Data warehousing and data mining
 
Consumer Data Management
Consumer Data ManagementConsumer Data Management
Consumer Data Management
 
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
 
Dw & etl concepts
Dw & etl conceptsDw & etl concepts
Dw & etl concepts
 
Business Intelligence Data Warehouse System
Business Intelligence Data Warehouse SystemBusiness Intelligence Data Warehouse System
Business Intelligence Data Warehouse System
 
A configuration independent score-based benchmark for distributed databases
A configuration independent score-based benchmark for distributed databasesA configuration independent score-based benchmark for distributed databases
A configuration independent score-based benchmark for distributed databases
 
Data warehouse-dimensional-modeling-and-design
Data warehouse-dimensional-modeling-and-designData warehouse-dimensional-modeling-and-design
Data warehouse-dimensional-modeling-and-design
 

Destacado

Virtual private network
Virtual private networkVirtual private network
Virtual private networkSowmia Sathyan
 
Pm 04 华胜天成openstack实践汇报-20120808
Pm 04 华胜天成openstack实践汇报-20120808Pm 04 华胜天成openstack实践汇报-20120808
Pm 04 华胜天成openstack实践汇报-20120808OpenCity Community
 
General Quiz (Finals) | Elixir '12
General Quiz (Finals) | Elixir '12General Quiz (Finals) | Elixir '12
General Quiz (Finals) | Elixir '12Abinash Shaw
 
MSU DL Workshop Aug 13 2013
MSU DL Workshop Aug 13 2013MSU DL Workshop Aug 13 2013
MSU DL Workshop Aug 13 2013Josh Johnson
 
Puusniekka: Tupakointi ammatillisissa oppilaitoksissa – tuloksia Kouluterveys...
Puusniekka: Tupakointi ammatillisissa oppilaitoksissa – tuloksia Kouluterveys...Puusniekka: Tupakointi ammatillisissa oppilaitoksissa – tuloksia Kouluterveys...
Puusniekka: Tupakointi ammatillisissa oppilaitoksissa – tuloksia Kouluterveys...Kouluterveyskysely
 
English in the FLS, Bulgaria
English in the FLS, BulgariaEnglish in the FLS, Bulgaria
English in the FLS, BulgariaTanya Madjarova
 
real estate agent in patna 9304611353
real estate agent in patna 9304611353real estate agent in patna 9304611353
real estate agent in patna 9304611353Adore Global Pvt. Ltd
 
Upload.ppt
Upload.pptUpload.ppt
Upload.pptMay Mei
 
All you need know about testing
All you need know about testingAll you need know about testing
All you need know about testingJorge Barroso
 
Pastoral Innovation in Somali Region-Town Camels and Milk Villages The Case o...
Pastoral Innovation in Somali Region-Town Camels and Milk VillagesThe Case o...Pastoral Innovation in Somali Region-Town Camels and Milk VillagesThe Case o...
Pastoral Innovation in Somali Region-Town Camels and Milk Villages The Case o...futureagricultures
 
Sesion extraordinaria discusion pdd municipal
Sesion extraordinaria discusion pdd municipalSesion extraordinaria discusion pdd municipal
Sesion extraordinaria discusion pdd municipalAlexander Puertas
 

Destacado (20)

Virtual private network
Virtual private networkVirtual private network
Virtual private network
 
Pm 04 华胜天成openstack实践汇报-20120808
Pm 04 华胜天成openstack实践汇报-20120808Pm 04 华胜天成openstack实践汇报-20120808
Pm 04 华胜天成openstack实践汇报-20120808
 
Lesson 2
Lesson 2Lesson 2
Lesson 2
 
Processor CPU
Processor CPUProcessor CPU
Processor CPU
 
Tbn passion2
Tbn passion2Tbn passion2
Tbn passion2
 
Paperless - smartare pappershantering
Paperless - smartare pappershanteringPaperless - smartare pappershantering
Paperless - smartare pappershantering
 
Taysia
TaysiaTaysia
Taysia
 
Portfolio english
Portfolio englishPortfolio english
Portfolio english
 
Network Interface Layer
Network Interface LayerNetwork Interface Layer
Network Interface Layer
 
General Quiz (Finals) | Elixir '12
General Quiz (Finals) | Elixir '12General Quiz (Finals) | Elixir '12
General Quiz (Finals) | Elixir '12
 
MSU DL Workshop Aug 13 2013
MSU DL Workshop Aug 13 2013MSU DL Workshop Aug 13 2013
MSU DL Workshop Aug 13 2013
 
Puusniekka: Tupakointi ammatillisissa oppilaitoksissa – tuloksia Kouluterveys...
Puusniekka: Tupakointi ammatillisissa oppilaitoksissa – tuloksia Kouluterveys...Puusniekka: Tupakointi ammatillisissa oppilaitoksissa – tuloksia Kouluterveys...
Puusniekka: Tupakointi ammatillisissa oppilaitoksissa – tuloksia Kouluterveys...
 
English in the FLS, Bulgaria
English in the FLS, BulgariaEnglish in the FLS, Bulgaria
English in the FLS, Bulgaria
 
Light painting presentation
Light painting presentationLight painting presentation
Light painting presentation
 
Brookshear 06
Brookshear 06Brookshear 06
Brookshear 06
 
real estate agent in patna 9304611353
real estate agent in patna 9304611353real estate agent in patna 9304611353
real estate agent in patna 9304611353
 
Upload.ppt
Upload.pptUpload.ppt
Upload.ppt
 
All you need know about testing
All you need know about testingAll you need know about testing
All you need know about testing
 
Pastoral Innovation in Somali Region-Town Camels and Milk Villages The Case o...
Pastoral Innovation in Somali Region-Town Camels and Milk VillagesThe Case o...Pastoral Innovation in Somali Region-Town Camels and Milk VillagesThe Case o...
Pastoral Innovation in Somali Region-Town Camels and Milk Villages The Case o...
 
Sesion extraordinaria discusion pdd municipal
Sesion extraordinaria discusion pdd municipalSesion extraordinaria discusion pdd municipal
Sesion extraordinaria discusion pdd municipal
 

Similar a Federated Big Data Discovery

BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)Syaifuddin Ismail
 
Эволюция Big Data и Information Management. Reference Architecture.
Эволюция Big Data и Information Management. Reference Architecture.Эволюция Big Data и Information Management. Reference Architecture.
Эволюция Big Data и Information Management. Reference Architecture.Andrey Akulov
 
Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...Zaloni
 
dw_concepts_2_day_course.ppt
dw_concepts_2_day_course.pptdw_concepts_2_day_course.ppt
dw_concepts_2_day_course.pptDougSchoemaker
 
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?Is the traditional data warehouse dead?
Is the traditional data warehouse dead?James Serra
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefitsRicky Barron
 
Real Time Analytics
Real Time AnalyticsReal Time Analytics
Real Time AnalyticsMohsin Hakim
 
Real Time Analytics
Real Time AnalyticsReal Time Analytics
Real Time AnalyticsMohsin Hakim
 
Digital intelligence satish bhatia
Digital intelligence satish bhatiaDigital intelligence satish bhatia
Digital intelligence satish bhatiaSatish Bhatia
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data SolutionJames Serra
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysNEWYORKSYS-IT SOLUTIONS
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundationshktripathy
 
Chapter 13 data warehousing
Chapter 13   data warehousingChapter 13   data warehousing
Chapter 13 data warehousingsumit621
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2RojaT4
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake OverviewJames Serra
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceCaserta
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonCapgemini
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousingwork
 

Similar a Federated Big Data Discovery (20)

BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)BI Masterclass slides (Reference Architecture v3)
BI Masterclass slides (Reference Architecture v3)
 
Эволюция Big Data и Information Management. Reference Architecture.
Эволюция Big Data и Information Management. Reference Architecture.Эволюция Big Data и Information Management. Reference Architecture.
Эволюция Big Data и Information Management. Reference Architecture.
 
Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...Understanding Metadata: Why it's essential to your big data solution and how ...
Understanding Metadata: Why it's essential to your big data solution and how ...
 
dw_concepts_2_day_course.ppt
dw_concepts_2_day_course.pptdw_concepts_2_day_course.ppt
dw_concepts_2_day_course.ppt
 
Is the traditional data warehouse dead?
Is the traditional data warehouse dead?Is the traditional data warehouse dead?
Is the traditional data warehouse dead?
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefits
 
Real Time Analytics
Real Time AnalyticsReal Time Analytics
Real Time Analytics
 
Real Time Analytics
Real Time AnalyticsReal Time Analytics
Real Time Analytics
 
Digital intelligence satish bhatia
Digital intelligence satish bhatiaDigital intelligence satish bhatia
Digital intelligence satish bhatia
 
Building a Big Data Solution
Building a Big Data SolutionBuilding a Big Data Solution
Building a Big Data Solution
 
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ NewyorksysWhat is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
What is OLAP -Data Warehouse Concepts - IT Online Training @ Newyorksys
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundations
 
Big data and oracle
Big data and oracleBig data and oracle
Big data and oracle
 
Chapter 13 data warehousing
Chapter 13   data warehousingChapter 13   data warehousing
Chapter 13 data warehousing
 
Big data unit 2
Big data unit 2Big data unit 2
Big data unit 2
 
Data Mesh
Data MeshData Mesh
Data Mesh
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake Overview
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Traditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A ComparisonTraditional BI vs. Business Data Lake – A Comparison
Traditional BI vs. Business Data Lake – A Comparison
 
Datawarehousing
DatawarehousingDatawarehousing
Datawarehousing
 

Último

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 

Último (20)

Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 

Federated Big Data Discovery

  • 1.
  • 2.
  • 3. BIG DATA is not just HADOOP Understand and navigate federated big data sources Federated Discovery and Navigation Manage & store huge volume of any data Hadoop File System MapReduce Structure and control data Data Warehousing Manage streaming data Stream Computing Analyze unstructured data Text Analytics Engine Integrate and govern all data sources Integration, Data Quality, Security, Lifecycle Management, MDM
  • 4. Business-Centric Big Data Enables You to Start With a Critical Business Pain and Expand the Foundation for Future Requirements Corresponding Tools /products  “Big data” isn’t just a technology—it’s a business strategy for capitalizing on information resources  Getting started is crucial  Success at each entry point is accelerated by products within the Big Data platform  Build the foundation for future requirements by expanding further into the big data platform
  • 6. Merging the Traditional and Big Data Approaches Traditional Approach Big Data Approach Structured & Repeatable Analysis Iterative & Exploratory Analysis Business Users Determine what question to ask IT Delivers a platform to enable creative discovery IT Business Structures the data to answer that question Explores what questions could be asked Monthly sales reports Profitability analysis Customer surveys Brand sentiment Product strategy Maximum asset utilization
  • 8.
  • 9. A) Data Refinery Platform B) Data Discovery Platform C)Analytical Tools And Techniques D)Integrated Data Warehouse E)Distinct Execution Engine F)Library Of pre-Built analytic functions G)Interactive Development Tool
  • 10. SQL for structured and MR for large scale process analytics Manage relational & non Relational data in ins& out of Data Warehouse Iterative analytics with greater accuracy and effectiveness Dig deeper for insights Within budget
  • 11. Data Task Low-cost storage Potential Workloads • Retains raw data in manner that can provide low TCO-per-terabyte storage costs and retention • Requires access in deep storage, but not at same speeds as in a front-line system Loading • Brings data into the system from the source system Pre-processing/ • Prepares data for downstream processing by, for example, fetching dimension prep/cleansing/ data, recording a new incoming batch, or archiving old window batch. constraint validation Transformation • Converts one structure of data into another structure. This may require going from third-normal form in a relational database to a star or snowflake schema, or from text to a relational database, or from relational technology to a graph, as with structural transformations. Reporting • Queries historical data such as what happened, where it happened, how much happened, who did it (e.g., sales of a given product by region) Analytics (including • Performs relationship modeling via declarative SQL (e.g., scoring or basic stats) • Performs relationship modeling via procedural MapReduce (e.g., model building user-driven, inter- active, or ad-hoc) or time series)
  • 12. Stable (structured) Evolving (Semi-Structured) No Schema (Has Format only) • Relatively fixed, Infrequent change • Leverage strength of relational model & SQL • Fixed and variable of schema, but changes occur too quickly • Leverage backend RDBMS, “LATE BINDING” of structure by queries • Less relational, No Semantics – stored in native file formats • via MapReduce: Interpret the format & pull out the required data
  • 13. Stable Evolving • ERP Data • Inventory Recods • Web logs, Call record • Twitter feeds No Schema • images • Videos, Web Pages
  • 14. What Does Machine Data Look Like? Sources Order Processing Middleware Error Care IVR Twitter 6
  • 15. Machine Data Contains Critical Insights Sources Customer ID Order ID Product ID Order Processing Order ID Customer ID Middleware Error Time Waiting On Hold Care IVR Customer ID Twitter ID Twitter Company’s Twitter ID Customer’s Tweet
  • 16. Machine Data Contains Critical Insights Sources Customer ID Order ID Product ID Order Processing Order ID Customer ID Middleware Error Time Waiting On Hold Care IVR Customer ID Twitter ID Twitter Company’s Twitter ID Customer’s Tweet
  • 17. Di Hadoop captures, stores and transforms images and call records Traditional Work flow Capture, Retention and Transformation Layer Data Sources ETL TOOLS Analytic Results Call Center Voice Records Analysis and Marketing Automation (Customer Retention Campaign) Discovery Platform Dimensional Data Hadoop Check Images path and sentiment analysis with multistructured data Social and web data Integrated DW