SlideShare una empresa de Scribd logo
1 de 23
Data Mashups
for Analytics
Bringing Everything Together for
Actionable Insights
Ben Hopkins
Sr. Product Marketing Manager, Pentaho
We Enable the Modern Data-driven Business
Modern, Cohesive Business Analytics and Data Integration Platform
• Full spectrum of analytics for all key roles
• Embeddable, cloud-ready analytics
• Broadest and deepest big data integration
Innovation Through an Open Heritage
• Open, pluggable, purpose-built for the future
• Sustained leadership in big data ecosystem
Business Momentum
• Over 1,500 commercial customers
• Over 10,000 production deployments
Agenda
① Background
② Approaches to Data Blending
③ The Role of Data Integration
④ Real World Examples & Success
Background
Much of the value from big data
will come from
“mashing up” proprietary data
with external and open data.
McKinsey Global Institute
10 IT-enabled Business Trends
for the Decade Ahead, 2013
Poll Results from pentaho.com
Poll Results from pentaho.com
Poll Results from pentaho.com
Background
Proportion Utilizing Unstructured Data From:
Social Media: 66%
Internet of Things: 65%
Mobile Device Data: 58%
“When individual sources include automated
and/or manual inputs, originate from disparate
systems with different architectures, and are
subject to different levels of governance, an
effective integration process is essential.”
From “Delivering Governed Data For Analytics At Scale,” Forrester Consulting, 2015
The most powerful insights come
from blending data on demand
and at the source
On Demand and At the Source
Architected & Trusted Approach
• Designed with full knowledge of
underlying systems and constraints
• Utilize most efficient point of
processing
• Provide fast access, avoid
unnecessary staging
• Maintains governance rules
• Preserve semantics, auditability
Where Does Data Integration Add Value?
Business Intelligence
and Data Warehousing
“Effective decisions depend on aggregated,
calculated, and time-series data values in a DW
– data and data structure that wouldn’t exist
without data integration”
Builds New and
Valuable Data Sets
“Similar to a value-adding process in
manufacturing, DI collects raw material (data
from sources systems) and assembles it into a
product (new data sets)”
360-Degree Views of
Business Entities
“Success in sales and service often depends on
complete views of each customer, which are
typically assembled with data integration tools
and techniques”
From “Ten Ways Data Integration Provides Business Value,” Philip Russom, TDWI, 2011
Data Readiness Checklist
Do I Need Data Integration Capabilities?
1. Do I need to blend several different data sources?
2. Is my data cleansed and modeled?
3. Do I want to enrich my data with new data sources?
4. Have I already captured all the data I need?
5. Will my data sources change in 6, 12, or 18 months?
6. Do I need ad-hoc and drill-down analytic capabilities?
All Signs Point to
Data Integration
Data Blending
Examples &
Success Stories
Blending Web Analytics and Support Data
Business Question:
Am I supporting all of the right browsers for my web app?
Blended by
region and
browser
Software Product
Manager
Google Analytics
web visits via API
Flat file of historical
product support
requests
Android visits,
but we don’t
support yet
Blending Machine and Production Data
Business Question:
What facility temperature is optimal for manufacturing output?
HVAC sensor data in
Hadoop accessed
via Hive
Production quotas
and actuals from
data warehouse
Blended by
facility and
time
Operations
Manager
Cold temperature ranges associated with
higher production across almost all facilities
See detailed mashup videos:
pentaho.com/blend-of-the-week
Caterpillar
Delivering a 360-Degree View of Equipment
Business Challenge
• Identify opportunities for maintenance
and fuel savings in industrial equipment
operations
• Predict equipment breakdowns to avoid
downtime
• Extend fleet-level insights to equipment
operators
Caterpillar
Delivering a 360-Degree View of Equipment
Pentaho Benefits
• Blend sensor data with customer data
and more into unified analytics service
• Operationalize predictive ‘useful life’
models in the data workflow
• Provide a revenue-generating offering
to customers that drives substantial
fuel and maintenance savings
Entity 360 Marine Asset Intelligence
Business User (COO)
Reporting on
Operations and
Efficiency
End Users
Dashboards and
Reports on Machine
Performance
Business
Analytics
Server
Data
Marts
Data Scientist
Data Mining and
Predictive Data
Governance
Local Machine
and Server
Data
Fleet Data via
Satellite
Cross
Department
Operations Data
Data
Integration
Data
Integration
British Telecom
Protecting Against Cyber Threats
Business Challenge
• Launch new service to market: BT
Assure Cyber, an enterprise
solution for cyber security insights
across many data types
• Previously BT Assure Cyber could
only integrate relational data
sources and not big data sources
British Telecom
Protecting Against Cyber Threats
Pentaho Benefits
• Native support for Hadoop in an
enterprise environment
• Ability to integrate telemetry data
from sensors, security controls and
advanced detection tools
• Reduced detection time of cyber
threats from weeks to seconds
In Closing
Next Steps
 Explore more mashup examples:
www.pentaho.com/blend-of-the-week
 Take a look at Pentaho in the 2016 Gartner
Business Analytics Magic Quadrant
Key Takeaways
 Teams are taking “data mashups” to new heights
 Blend data on demand and at the source
 Data integration can maximize analytic value
Questions
and Discussion
Thank You

Más contenido relacionado

La actualidad más candente

Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR DataExclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR DataPentaho
 
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014Pentaho Analytics for MongoDB - presentation from MongoDB World 2014
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014Pentaho
 
Why Your Product Needs an Analytic Strategy
Why Your Product Needs an Analytic Strategy Why Your Product Needs an Analytic Strategy
Why Your Product Needs an Analytic Strategy Pentaho
 
Competitive edgewithmongod bandpentaho_2014sep_v3[1]
Competitive edgewithmongod bandpentaho_2014sep_v3[1]Competitive edgewithmongod bandpentaho_2014sep_v3[1]
Competitive edgewithmongod bandpentaho_2014sep_v3[1]Pentaho
 
Big Data Integration Webinar: Getting Started With Hadoop Big Data
Big Data Integration Webinar: Getting Started With Hadoop Big DataBig Data Integration Webinar: Getting Started With Hadoop Big Data
Big Data Integration Webinar: Getting Started With Hadoop Big DataPentaho
 
Pentaho Healthcare Solutions
Pentaho Healthcare SolutionsPentaho Healthcare Solutions
Pentaho Healthcare SolutionsPentaho
 
Open Analytics 2014 - Pedro Alves - Innovation though Open Source
Open Analytics 2014 - Pedro Alves - Innovation though Open SourceOpen Analytics 2014 - Pedro Alves - Innovation though Open Source
Open Analytics 2014 - Pedro Alves - Innovation though Open SourceOpenAnalytics Spain
 
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...BICC Thomas More
 
BI congres 2016-2: Diving into weblog data with SAS on Hadoop - Lisa Truyers...
BI congres 2016-2: Diving into weblog data with SAS on Hadoop -  Lisa Truyers...BI congres 2016-2: Diving into weblog data with SAS on Hadoop -  Lisa Truyers...
BI congres 2016-2: Diving into weblog data with SAS on Hadoop - Lisa Truyers...BICC Thomas More
 
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemInformatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemCapgemini
 
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBICC Thomas More
 
30 for 30: Quick Start Your Pentaho Evaluation
30 for 30: Quick Start Your Pentaho Evaluation30 for 30: Quick Start Your Pentaho Evaluation
30 for 30: Quick Start Your Pentaho EvaluationPentaho
 
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...MongoDB
 
Hilton's enterprise data journey
Hilton's enterprise data journeyHilton's enterprise data journey
Hilton's enterprise data journeyDataWorks Summit
 
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...Denodo
 
Unlocking data science in the enterprise - with Oracle and Cloudera
Unlocking data science in the enterprise - with Oracle and ClouderaUnlocking data science in the enterprise - with Oracle and Cloudera
Unlocking data science in the enterprise - with Oracle and ClouderaCloudera, Inc.
 
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Denodo
 
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...MongoDB
 
Revolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleRevolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleBardess Group
 
Pentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho Business Analytics for ISVs and SaaS providers in healthcarePentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho Business Analytics for ISVs and SaaS providers in healthcarePentaho
 

La actualidad más candente (20)

Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR DataExclusive Verizon Employee Webinar: Getting More From Your CDR Data
Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
 
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014Pentaho Analytics for MongoDB - presentation from MongoDB World 2014
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014
 
Why Your Product Needs an Analytic Strategy
Why Your Product Needs an Analytic Strategy Why Your Product Needs an Analytic Strategy
Why Your Product Needs an Analytic Strategy
 
Competitive edgewithmongod bandpentaho_2014sep_v3[1]
Competitive edgewithmongod bandpentaho_2014sep_v3[1]Competitive edgewithmongod bandpentaho_2014sep_v3[1]
Competitive edgewithmongod bandpentaho_2014sep_v3[1]
 
Big Data Integration Webinar: Getting Started With Hadoop Big Data
Big Data Integration Webinar: Getting Started With Hadoop Big DataBig Data Integration Webinar: Getting Started With Hadoop Big Data
Big Data Integration Webinar: Getting Started With Hadoop Big Data
 
Pentaho Healthcare Solutions
Pentaho Healthcare SolutionsPentaho Healthcare Solutions
Pentaho Healthcare Solutions
 
Open Analytics 2014 - Pedro Alves - Innovation though Open Source
Open Analytics 2014 - Pedro Alves - Innovation though Open SourceOpen Analytics 2014 - Pedro Alves - Innovation though Open Source
Open Analytics 2014 - Pedro Alves - Innovation though Open Source
 
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
 
BI congres 2016-2: Diving into weblog data with SAS on Hadoop - Lisa Truyers...
BI congres 2016-2: Diving into weblog data with SAS on Hadoop -  Lisa Truyers...BI congres 2016-2: Diving into weblog data with SAS on Hadoop -  Lisa Truyers...
BI congres 2016-2: Diving into weblog data with SAS on Hadoop - Lisa Truyers...
 
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemInformatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake Ecosystem
 
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
 
30 for 30: Quick Start Your Pentaho Evaluation
30 for 30: Quick Start Your Pentaho Evaluation30 for 30: Quick Start Your Pentaho Evaluation
30 for 30: Quick Start Your Pentaho Evaluation
 
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...
 
Hilton's enterprise data journey
Hilton's enterprise data journeyHilton's enterprise data journey
Hilton's enterprise data journey
 
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
 
Unlocking data science in the enterprise - with Oracle and Cloudera
Unlocking data science in the enterprise - with Oracle and ClouderaUnlocking data science in the enterprise - with Oracle and Cloudera
Unlocking data science in the enterprise - with Oracle and Cloudera
 
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
 
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...
 
Revolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus ExampleRevolution in Business Analytics-Zika Virus Example
Revolution in Business Analytics-Zika Virus Example
 
Pentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho Business Analytics for ISVs and SaaS providers in healthcarePentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho Business Analytics for ISVs and SaaS providers in healthcare
 

Destacado

Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview PresentationFilling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview PresentationPentaho
 
Pentaho Data Integration Introduction
Pentaho Data Integration IntroductionPentaho Data Integration Introduction
Pentaho Data Integration Introductionmattcasters
 
Improving the Business of Healthcare through Better Analytics
Improving the Business of Healthcare through Better Analytics Improving the Business of Healthcare through Better Analytics
Improving the Business of Healthcare through Better Analytics Pentaho
 
Data Is Your Next Product Opportunity
Data Is Your Next Product Opportunity Data Is Your Next Product Opportunity
Data Is Your Next Product Opportunity Pentaho
 
Pentaho-BI
Pentaho-BIPentaho-BI
Pentaho-BIEdureka!
 
Big Data Adoption Status
Big Data Adoption Status Big Data Adoption Status
Big Data Adoption Status Xpand IT
 
What's New in Pentaho 7.0?
What's New in Pentaho 7.0?What's New in Pentaho 7.0?
What's New in Pentaho 7.0?Xpand IT
 
The Next Big Thing in Big Data
The Next Big Thing in Big DataThe Next Big Thing in Big Data
The Next Big Thing in Big DataPentaho
 
Mondrian and OLAP Overview
Mondrian and OLAP OverviewMondrian and OLAP Overview
Mondrian and OLAP OverviewAlex Meadows
 
MongoDB World 2016: MongoDB + Google Cloud
MongoDB World 2016: MongoDB + Google CloudMongoDB World 2016: MongoDB + Google Cloud
MongoDB World 2016: MongoDB + Google CloudMongoDB
 
MongoDB World 2016: Number Crush
MongoDB World 2016: Number CrushMongoDB World 2016: Number Crush
MongoDB World 2016: Number CrushMongoDB
 
Data analysis and visualization with mongo db [mongodb world 2016]
Data analysis and visualization with mongo db [mongodb world 2016]Data analysis and visualization with mongo db [mongodb world 2016]
Data analysis and visualization with mongo db [mongodb world 2016]Alexander Hendorf
 
MongoDB World 2016: From the Polls to the Trolls: Seeing What the World Think...
MongoDB World 2016: From the Polls to the Trolls: Seeing What the World Think...MongoDB World 2016: From the Polls to the Trolls: Seeing What the World Think...
MongoDB World 2016: From the Polls to the Trolls: Seeing What the World Think...MongoDB
 
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache Apex
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache ApexHadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache Apex
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache ApexApache Apex
 

Destacado (17)

Introduction To Pentaho
Introduction To PentahoIntroduction To Pentaho
Introduction To Pentaho
 
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview PresentationFilling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
 
Pentaho Data Integration Introduction
Pentaho Data Integration IntroductionPentaho Data Integration Introduction
Pentaho Data Integration Introduction
 
Pentaho Suite Analysis
Pentaho Suite Analysis Pentaho Suite Analysis
Pentaho Suite Analysis
 
Improving the Business of Healthcare through Better Analytics
Improving the Business of Healthcare through Better Analytics Improving the Business of Healthcare through Better Analytics
Improving the Business of Healthcare through Better Analytics
 
Data Is Your Next Product Opportunity
Data Is Your Next Product Opportunity Data Is Your Next Product Opportunity
Data Is Your Next Product Opportunity
 
Pentaho-BI
Pentaho-BIPentaho-BI
Pentaho-BI
 
Big Data Adoption Status
Big Data Adoption Status Big Data Adoption Status
Big Data Adoption Status
 
What's New in Pentaho 7.0?
What's New in Pentaho 7.0?What's New in Pentaho 7.0?
What's New in Pentaho 7.0?
 
The Next Big Thing in Big Data
The Next Big Thing in Big DataThe Next Big Thing in Big Data
The Next Big Thing in Big Data
 
Mondrian and OLAP Overview
Mondrian and OLAP OverviewMondrian and OLAP Overview
Mondrian and OLAP Overview
 
MongoDB World 2016: MongoDB + Google Cloud
MongoDB World 2016: MongoDB + Google CloudMongoDB World 2016: MongoDB + Google Cloud
MongoDB World 2016: MongoDB + Google Cloud
 
MongoDB World 2016: Number Crush
MongoDB World 2016: Number CrushMongoDB World 2016: Number Crush
MongoDB World 2016: Number Crush
 
Mondrain Harsch
Mondrain HarschMondrain Harsch
Mondrain Harsch
 
Data analysis and visualization with mongo db [mongodb world 2016]
Data analysis and visualization with mongo db [mongodb world 2016]Data analysis and visualization with mongo db [mongodb world 2016]
Data analysis and visualization with mongo db [mongodb world 2016]
 
MongoDB World 2016: From the Polls to the Trolls: Seeing What the World Think...
MongoDB World 2016: From the Polls to the Trolls: Seeing What the World Think...MongoDB World 2016: From the Polls to the Trolls: Seeing What the World Think...
MongoDB World 2016: From the Polls to the Trolls: Seeing What the World Think...
 
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache Apex
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache ApexHadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache Apex
Hadoop Summit SJ 2016: Next Gen Big Data Analytics with Apache Apex
 

Similar a Data Mashups for Analytics

Looking to the Future: Embracing the Cloud for a More Modern Data Quality App...
Looking to the Future: Embracing the Cloud for a More Modern Data Quality App...Looking to the Future: Embracing the Cloud for a More Modern Data Quality App...
Looking to the Future: Embracing the Cloud for a More Modern Data Quality App...Precisely
 
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Denodo
 
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...Precisely
 
How 360 Degree Data Integration Enables the Customer-centric Business
How 360 Degree Data Integration Enables the Customer-centric BusinessHow 360 Degree Data Integration Enables the Customer-centric Business
How 360 Degree Data Integration Enables the Customer-centric BusinessAstera Software
 
Tips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsTips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsAbhishek Sood
 
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSION
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSIONCisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSION
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSIONRenee Yao
 
Realign Process & Data To Improve Your Customer-Centricity
Realign Process & Data To Improve Your Customer-CentricityRealign Process & Data To Improve Your Customer-Centricity
Realign Process & Data To Improve Your Customer-CentricityBizagi
 
Turning Big Data into Better Business Outcomes
Turning Big Data into Better Business OutcomesTurning Big Data into Better Business Outcomes
Turning Big Data into Better Business OutcomesCisco Canada
 
Analytics Service Framework
Analytics Service Framework Analytics Service Framework
Analytics Service Framework Vishwanath Ramdas
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...BigDataEverywhere
 
Analytics in the Cloud and the ROI for B2B
Analytics in the Cloud and the ROI for B2BAnalytics in the Cloud and the ROI for B2B
Analytics in the Cloud and the ROI for B2BVeronica Kirn
 
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision MakingAnalyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision MakingDenodo
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Denodo
 
Drive Business Outcomes for Big Data Environments
Drive Business Outcomes for Big Data EnvironmentsDrive Business Outcomes for Big Data Environments
Drive Business Outcomes for Big Data EnvironmentsCisco Services
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaCloudera, Inc.
 
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Cloudera, Inc.
 
CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014Hortonworks
 

Similar a Data Mashups for Analytics (20)

Looking to the Future: Embracing the Cloud for a More Modern Data Quality App...
Looking to the Future: Embracing the Cloud for a More Modern Data Quality App...Looking to the Future: Embracing the Cloud for a More Modern Data Quality App...
Looking to the Future: Embracing the Cloud for a More Modern Data Quality App...
 
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
 
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
Foundational Strategies for Trust in Big Data Part 1: Getting Data to the Pla...
 
How 360 Degree Data Integration Enables the Customer-centric Business
How 360 Degree Data Integration Enables the Customer-centric BusinessHow 360 Degree Data Integration Enables the Customer-centric Business
How 360 Degree Data Integration Enables the Customer-centric Business
 
Tips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsTips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data Analytics
 
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSION
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSIONCisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSION
Cisco_Big_Data_Webinar_At-A-Glance_ABSOLUTE_FINAL_VERSION
 
Realign Process & Data To Improve Your Customer-Centricity
Realign Process & Data To Improve Your Customer-CentricityRealign Process & Data To Improve Your Customer-Centricity
Realign Process & Data To Improve Your Customer-Centricity
 
Turning Big Data into Better Business Outcomes
Turning Big Data into Better Business OutcomesTurning Big Data into Better Business Outcomes
Turning Big Data into Better Business Outcomes
 
Big data for product managers
Big data for product managersBig data for product managers
Big data for product managers
 
Sgcp14dunlea
Sgcp14dunleaSgcp14dunlea
Sgcp14dunlea
 
Analytics Service Framework
Analytics Service Framework Analytics Service Framework
Analytics Service Framework
 
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
Big Data Everywhere Chicago: Platfora - Practices for Customer Analytics on H...
 
Microsoft's Approach to IoT
Microsoft's Approach to IoT Microsoft's Approach to IoT
Microsoft's Approach to IoT
 
Analytics in the Cloud and the ROI for B2B
Analytics in the Cloud and the ROI for B2BAnalytics in the Cloud and the ROI for B2B
Analytics in the Cloud and the ROI for B2B
 
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision MakingAnalyst Webinar: Best Practices In Enabling Data-Driven Decision Making
Analyst Webinar: Best Practices In Enabling Data-Driven Decision Making
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
 
Drive Business Outcomes for Big Data Environments
Drive Business Outcomes for Big Data EnvironmentsDrive Business Outcomes for Big Data Environments
Drive Business Outcomes for Big Data Environments
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
Increase your ROI with Hadoop in Six Months - Presented by Dell, Cloudera and...
 
CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014
 

Último

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 

Último (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

Data Mashups for Analytics

  • 1. Data Mashups for Analytics Bringing Everything Together for Actionable Insights Ben Hopkins Sr. Product Marketing Manager, Pentaho
  • 2. We Enable the Modern Data-driven Business Modern, Cohesive Business Analytics and Data Integration Platform • Full spectrum of analytics for all key roles • Embeddable, cloud-ready analytics • Broadest and deepest big data integration Innovation Through an Open Heritage • Open, pluggable, purpose-built for the future • Sustained leadership in big data ecosystem Business Momentum • Over 1,500 commercial customers • Over 10,000 production deployments
  • 3. Agenda ① Background ② Approaches to Data Blending ③ The Role of Data Integration ④ Real World Examples & Success
  • 4. Background Much of the value from big data will come from “mashing up” proprietary data with external and open data. McKinsey Global Institute 10 IT-enabled Business Trends for the Decade Ahead, 2013
  • 5. Poll Results from pentaho.com
  • 6. Poll Results from pentaho.com
  • 7. Poll Results from pentaho.com
  • 8. Background Proportion Utilizing Unstructured Data From: Social Media: 66% Internet of Things: 65% Mobile Device Data: 58% “When individual sources include automated and/or manual inputs, originate from disparate systems with different architectures, and are subject to different levels of governance, an effective integration process is essential.” From “Delivering Governed Data For Analytics At Scale,” Forrester Consulting, 2015
  • 9. The most powerful insights come from blending data on demand and at the source
  • 10. On Demand and At the Source Architected & Trusted Approach • Designed with full knowledge of underlying systems and constraints • Utilize most efficient point of processing • Provide fast access, avoid unnecessary staging • Maintains governance rules • Preserve semantics, auditability
  • 11. Where Does Data Integration Add Value? Business Intelligence and Data Warehousing “Effective decisions depend on aggregated, calculated, and time-series data values in a DW – data and data structure that wouldn’t exist without data integration” Builds New and Valuable Data Sets “Similar to a value-adding process in manufacturing, DI collects raw material (data from sources systems) and assembles it into a product (new data sets)” 360-Degree Views of Business Entities “Success in sales and service often depends on complete views of each customer, which are typically assembled with data integration tools and techniques” From “Ten Ways Data Integration Provides Business Value,” Philip Russom, TDWI, 2011
  • 12. Data Readiness Checklist Do I Need Data Integration Capabilities? 1. Do I need to blend several different data sources? 2. Is my data cleansed and modeled? 3. Do I want to enrich my data with new data sources? 4. Have I already captured all the data I need? 5. Will my data sources change in 6, 12, or 18 months? 6. Do I need ad-hoc and drill-down analytic capabilities? All Signs Point to Data Integration
  • 14. Blending Web Analytics and Support Data Business Question: Am I supporting all of the right browsers for my web app? Blended by region and browser Software Product Manager Google Analytics web visits via API Flat file of historical product support requests Android visits, but we don’t support yet
  • 15. Blending Machine and Production Data Business Question: What facility temperature is optimal for manufacturing output? HVAC sensor data in Hadoop accessed via Hive Production quotas and actuals from data warehouse Blended by facility and time Operations Manager Cold temperature ranges associated with higher production across almost all facilities See detailed mashup videos: pentaho.com/blend-of-the-week
  • 16. Caterpillar Delivering a 360-Degree View of Equipment Business Challenge • Identify opportunities for maintenance and fuel savings in industrial equipment operations • Predict equipment breakdowns to avoid downtime • Extend fleet-level insights to equipment operators
  • 17. Caterpillar Delivering a 360-Degree View of Equipment Pentaho Benefits • Blend sensor data with customer data and more into unified analytics service • Operationalize predictive ‘useful life’ models in the data workflow • Provide a revenue-generating offering to customers that drives substantial fuel and maintenance savings
  • 18. Entity 360 Marine Asset Intelligence Business User (COO) Reporting on Operations and Efficiency End Users Dashboards and Reports on Machine Performance Business Analytics Server Data Marts Data Scientist Data Mining and Predictive Data Governance Local Machine and Server Data Fleet Data via Satellite Cross Department Operations Data Data Integration Data Integration
  • 19. British Telecom Protecting Against Cyber Threats Business Challenge • Launch new service to market: BT Assure Cyber, an enterprise solution for cyber security insights across many data types • Previously BT Assure Cyber could only integrate relational data sources and not big data sources
  • 20. British Telecom Protecting Against Cyber Threats Pentaho Benefits • Native support for Hadoop in an enterprise environment • Ability to integrate telemetry data from sensors, security controls and advanced detection tools • Reduced detection time of cyber threats from weeks to seconds
  • 21. In Closing Next Steps  Explore more mashup examples: www.pentaho.com/blend-of-the-week  Take a look at Pentaho in the 2016 Gartner Business Analytics Magic Quadrant Key Takeaways  Teams are taking “data mashups” to new heights  Blend data on demand and at the source  Data integration can maximize analytic value

Notas del editor

  1. Pentaho is an end-to-end business analytics and data integration platform. And in particular, a major area of focus for our platform has been helping customers integrate Big Data sources into their architecture and analyze that blend of traditional and emerging data. At the same time, we also provide analytics software that is highly embeddable in that they can fit seamlessly into existing applications and processes. A lot of our success in these areas has been due to both early innovation in Big Data – I believe we’ve been working with Hadoop for 5 to 6 years – as well as the open architecture and standards that our platform is built on.
  2. On-demand & at the source
  3. Example: Quality of service changes in real time dependent on the network: was the customer able to connect, to hear, to remain connected without being dropped, etc.? you can easily create architected, blended views across both the traditional Call Detail Records in the warehouse, and the network data streaming into Big Data/NoSQLstore (MongoDB in this example) without sacrificing the governance or performance you expect. These blended views allow analysts and customer call centers to get accurate, of-the-minute information in near real time to determine the best action to take
  4. Understand browser usage for my products – what direction around supported browsers? Blend Google Analytics – visitors by region, browser Support – historical support requests by browser What browsers do people use to come to our website, vs. what browsers are we getting support requests on?
  5. Anecdotally heard temp dictates product output; is there an appropriate temperature to max production? Want to look at temperatures by building on an hourly basis Corp data warehouse: production quotas and actuals Does temperature correlate with output? Hadoop has the HVAC data, exposed via Hive – relational layer on Hadoop… Corp data warehouse from table directly… Connect to this transformation from Pentaho – and auto generate model Looking at broad ranges – ‘hot’ and ‘cold’
  6. BT BT Assure Cyber offers comprehensive and fully integrated cyber security for large organisations with complex security needs including the UK Ministry of Defence and other corporate and government customers. UK Government Communications HQ (GCHQ) and other sources claim cyber breaches cost the global economy hundreds of billions of dollars annually. Previously BT Assure Cyber could only integrate relational data sources, which meant that its customers’ big data sources were not being harvested to detect potential security breaches. By embedding Pentaho, event data and telemetry from a rich variety of data sources including business systems, sensors, traditional security controls and advanced detection tools are all integrated and analysed. Incidents that previously would have taken days or weeks to investigate and respond to, can now be identified and acted upon immediately. The Disruptive Insight BT knew that data variety and unpredictability was a mounting problem, but as an ‘Oracle shop’ with a relational heritage, they could not see an obvious way to solve it – customers all have different data landscapes depending on their individual needs, and BT must cater to all. BT started to tackle this by introducing the Hadoop framework. However, relying on Hadoop’s native, immature tools like Sqoop, Flume, Oozie, Kafka to integrate and analyse the data proved massively difficult, time-consuming and risky Pentaho convinced BT that its big data integration and analytics platform combined with a metadata approach was the only way to reliably assimilate and blend data from so many different and unpredictable sources. BT recognised that Pentaho’s visual tools provided much faster time to value with lower risk, requiring fewer specialised development resources than native Hadoop tools. Why the Customer Chose Pentaho Pentaho’s ability to handle data variety and uncertainty in a range of customer scenarios was central to winning this deal. Pentaho successfully proved its viability and offered references in many other high-stakes, complex and secure customer use cases in industries like financial services and energy. Getting the Deal Done The single most important factor was Pentaho’s native support for Hadoop in an enterprise environment. Combined with Pentaho’s commercial flexibility to licence the platform in line with how BT charges its customers – by data volumes ingested – the ROI case finally became too compelling to resist.