SlideShare una empresa de Scribd logo
1 de 32
Reinventing the Information Pipeline
From Big Data Strategy to Big Value December 2016
Agenda
• Introduction
• Challenges in the Information Pipeline
• Paxata in the Converged Data Platform
Paxata’s mission (since 2012)
Deliver the only enterprise-grade data preparation platform
for everyone to transform raw, meaningless data into
valuable, contextual and complete information
4
Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018
83%Companies agree that data is
their most strategic asset
5
Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018
80%Time analysts will spend trying to
create data sets to draw insights
83%Companies agree that data is
their most strategic asset
6
Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018
12%Amount of data most companies
estimate they are analyzing
80%Time analysts will spend trying to
create data sets to draw insights
83%Companies agree that data is
their most strategic asset
7
The data chasm
Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018
12%Amount of data most companies
estimate they are analyzing
80%Time analysts will spend trying to
create data sets to draw insights
83%Companies agree that data is
their most strategic asset
Challenges in the Information Pipeline
Traditional data preparation
creates a bottleneck
Traditional data preparation creates a bottleneck
Business teams have complex data sources for analytics projects
Traditional data preparation creates a bottleneck
Business teams funnel their requirements to IT
IT-centric data preparation
Business
Information
Traditional data preparation creates a bottleneck
IT runs requirements through a linear ETL process
executed with manual scripting or coding
IT-Centric Data Preparation
Model Extract Transform Load Optimize
Business
Information
Traditional data preparation creates a bottleneck
IT reviews with business. Makes changes, fixes errors.
(Repeat)
IT-Centric Data Preparation
Model Extract Transform Load Optimize
Business
Information
Business teams make decisions before data is available
-or-
Ask for changes and restart the process.
IT-Centric Data Preparation
Model Extract Transform Load Optimize
Business
Information
Traditional data preparation creates a bottleneck
Designed for highly specialized technical people to prepare data for
business teams
IT-Centric Data Preparation
Model Extract Transform Load Optimize
Business
Information
Traditional data preparation creates a bottleneck
Designing for highly specialized technical
people to prepare data for business teams.
Expensive
Complicated
Error-prone
Time-consuming
Modern architecture balances
freedom with responsibility
Modern architecture: balancing freedom with responsibility
Built for business
•Freedom and
flexibility with
collaboration
Modern architecture: balancing freedom with responsibility
Collect and manage data
Time
Built for business
•Freedom and
flexibility with
collaboration
Enabled by IT
•Data governance,
scale, efficiency
Modern information pipeline is
Built for business
Freedom and flexibility with collaboration
Enabled by IT
Data governance, scale, efficiency
Data prep must address the
range of information workers
Data prep must address the range of information workers
Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between
Enterprise and Consumer Technologies,” August 30, 2012
Deep Technical Skills Limited Technical Skills
Data Scientist
Data Developer
Data Analyst
Business Analyst
Information
Worker
Data prep must address the range of information workers
Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between
Enterprise and Consumer Technologies,” August 30, 2012
Deep Technical Skills Limited Technical Skills
Data Scientist
(200K)
Data Developer
(600K)
Data Analyst
(100M)
Business Analyst
(275M)
Information
Worker
(460M)
Paxata accelerates the
data to information pipeline
Data Lake
Enterprise
Local
Paxata accelerates the data to information pipeline
Data Lake
Enterprise
Local
Paxata accelerates the data to information pipeline
Data Lake
Enterprise
Local
Paxata accelerates the data to information pipeline
BI/Visualization
Predictive
Data Lake
Enterprise
Local
Paxata accelerates the data to information pipeline
BI/Visualization
Predictive
Consumer experience for preparing data
Architecture of the Paxata Adaptive Information Platform
Architecture of the Paxata Adaptive Information Platform
Contact us
Paxata in the apps gallery
Register for Paxata Live:
www.paxata.com/events
info@paxata.com
www.youtube.com/PaxataTV
www.paxata.com
December 8, 2016© Paxata, Inc. 32
Thank You!

Más contenido relacionado

La actualidad más candente

The Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseThe Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseCaserta
 
A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...DataWorks Summit
 
Data catalog
Data catalogData catalog
Data catalogiamtodor
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Caserta
 
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Caserta
 
Focus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL CodeFocus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL CodeDATAVERSITY
 
Big and fast data strategy 2017 jr
Big and fast data strategy 2017 jrBig and fast data strategy 2017 jr
Big and fast data strategy 2017 jrJonathan Raspaud
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AIDATAVERSITY
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Caserta
 
Data Catalog as the Platform for Data Intelligence
Data Catalog as the Platform for Data IntelligenceData Catalog as the Platform for Data Intelligence
Data Catalog as the Platform for Data IntelligenceAlation
 
Using Machine Learning to Understand and Predict Marketing ROI
Using Machine Learning to Understand and Predict Marketing ROIUsing Machine Learning to Understand and Predict Marketing ROI
Using Machine Learning to Understand and Predict Marketing ROIDATAVERSITY
 
You're the New CDO, Now What?
You're the New CDO, Now What?You're the New CDO, Now What?
You're the New CDO, Now What?Caserta
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analyticsThe Marketing Distillery
 
Big Data Analytics on the Cloud
Big Data Analytics on the CloudBig Data Analytics on the Cloud
Big Data Analytics on the CloudCaserta
 
Data lineage to drive compliance and as a business imperative
Data lineage to drive compliance and as a business imperativeData lineage to drive compliance and as a business imperative
Data lineage to drive compliance and as a business imperativeLeigh Hill
 
DataOps: Nine steps to transform your data science impact Strata London May 18
DataOps: Nine steps to transform your data science impact  Strata London May 18DataOps: Nine steps to transform your data science impact  Strata London May 18
DataOps: Nine steps to transform your data science impact Strata London May 18Harvinder Atwal
 
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...DATAVERSITY
 
Chief Data & Analytics Officer Fall Boston - Presentation
Chief Data & Analytics Officer Fall Boston - PresentationChief Data & Analytics Officer Fall Boston - Presentation
Chief Data & Analytics Officer Fall Boston - PresentationSrinivasan Sankar
 
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...North Texas Chapter of the ISSA
 
DI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDATAVERSITY
 

La actualidad más candente (20)

The Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's EnterpriseThe Rise of the CDO in Today's Enterprise
The Rise of the CDO in Today's Enterprise
 
A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...A modern, flexible approach to Hadoop implementation incorporating innovation...
A modern, flexible approach to Hadoop implementation incorporating innovation...
 
Data catalog
Data catalogData catalog
Data catalog
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
 
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
Integrating the CDO Role Into Your Organization; Managing the Disruption (MIT...
 
Focus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL CodeFocus on Your Analysis, Not Your SQL Code
Focus on Your Analysis, Not Your SQL Code
 
Big and fast data strategy 2017 jr
Big and fast data strategy 2017 jrBig and fast data strategy 2017 jr
Big and fast data strategy 2017 jr
 
How to Consume Your Data for AI
How to Consume Your Data for AIHow to Consume Your Data for AI
How to Consume Your Data for AI
 
Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics Building a New Platform for Customer Analytics
Building a New Platform for Customer Analytics
 
Data Catalog as the Platform for Data Intelligence
Data Catalog as the Platform for Data IntelligenceData Catalog as the Platform for Data Intelligence
Data Catalog as the Platform for Data Intelligence
 
Using Machine Learning to Understand and Predict Marketing ROI
Using Machine Learning to Understand and Predict Marketing ROIUsing Machine Learning to Understand and Predict Marketing ROI
Using Machine Learning to Understand and Predict Marketing ROI
 
You're the New CDO, Now What?
You're the New CDO, Now What?You're the New CDO, Now What?
You're the New CDO, Now What?
 
Getting down to business on Big Data analytics
Getting down to business on Big Data analyticsGetting down to business on Big Data analytics
Getting down to business on Big Data analytics
 
Big Data Analytics on the Cloud
Big Data Analytics on the CloudBig Data Analytics on the Cloud
Big Data Analytics on the Cloud
 
Data lineage to drive compliance and as a business imperative
Data lineage to drive compliance and as a business imperativeData lineage to drive compliance and as a business imperative
Data lineage to drive compliance and as a business imperative
 
DataOps: Nine steps to transform your data science impact Strata London May 18
DataOps: Nine steps to transform your data science impact  Strata London May 18DataOps: Nine steps to transform your data science impact  Strata London May 18
DataOps: Nine steps to transform your data science impact Strata London May 18
 
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
Slides: Case Study — How J.B. Hunt is Driving Efficiency with AI and Real-Tim...
 
Chief Data & Analytics Officer Fall Boston - Presentation
Chief Data & Analytics Officer Fall Boston - PresentationChief Data & Analytics Officer Fall Boston - Presentation
Chief Data & Analytics Officer Fall Boston - Presentation
 
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
NTXISSACSC3 - Why Enterprise Information Management is the Key to GRC by Mika...
 
DI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data WarehouseDI&A Slides: Data Lake vs. Data Warehouse
DI&A Slides: Data Lake vs. Data Warehouse
 

Destacado

Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
Managing uncertainty in data - Presentation at Data Science Northeast Netherl...Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
Managing uncertainty in data - Presentation at Data Science Northeast Netherl...University of Twente
 
Data Culture Series - Keynote - 27th Jan, London
Data Culture Series -  Keynote - 27th Jan, LondonData Culture Series -  Keynote - 27th Jan, London
Data Culture Series - Keynote - 27th Jan, LondonJonathan Woodward
 
Making Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to useMaking Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to useSwiss Big Data User Group
 
Supply chain and Big data : top 5 Trends
Supply chain and Big data : top 5 TrendsSupply chain and Big data : top 5 Trends
Supply chain and Big data : top 5 TrendsRetigence Technologies
 
Continuous Performance Testing
Continuous Performance TestingContinuous Performance Testing
Continuous Performance TestingGrid Dynamics
 
광역화 집단에너지사업제안서
광역화 집단에너지사업제안서광역화 집단에너지사업제안서
광역화 집단에너지사업제안서Seokho Shin
 
Exploring Data Preparation and Visualization Tools for Urban Forestry
Exploring Data Preparation and Visualization Tools for Urban ForestryExploring Data Preparation and Visualization Tools for Urban Forestry
Exploring Data Preparation and Visualization Tools for Urban ForestryAzavea
 
Trace 3 interview questions and answers
Trace 3 interview questions and answersTrace 3 interview questions and answers
Trace 3 interview questions and answersselinasimpson205
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist SoftServe
 
Driving Retail Success with Machine Data Intelligence
Driving Retail Success with Machine Data IntelligenceDriving Retail Success with Machine Data Intelligence
Driving Retail Success with Machine Data IntelligenceSumo Logic
 
How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?DATAVERSITY
 
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTODatabase Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO✔ Eric David Benari, PMP
 
Потоковая обработка больших данных
Потоковая обработка больших данныхПотоковая обработка больших данных
Потоковая обработка больших данныхCEE-SEC(R)
 
Engine Yard Cloud Architecture Enhancements
Engine Yard Cloud Architecture EnhancementsEngine Yard Cloud Architecture Enhancements
Engine Yard Cloud Architecture EnhancementsEngine Yard
 
6 tips for improving ruby performance
6 tips for improving ruby performance6 tips for improving ruby performance
6 tips for improving ruby performanceEngine Yard
 

Destacado (20)

Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
Managing uncertainty in data - Presentation at Data Science Northeast Netherl...Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
Managing uncertainty in data - Presentation at Data Science Northeast Netherl...
 
MonitoringFrameWork
MonitoringFrameWorkMonitoringFrameWork
MonitoringFrameWork
 
Data Culture Series - Keynote - 27th Jan, London
Data Culture Series -  Keynote - 27th Jan, LondonData Culture Series -  Keynote - 27th Jan, London
Data Culture Series - Keynote - 27th Jan, London
 
Making Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to useMaking Hadoop based analytics simple for everyone to use
Making Hadoop based analytics simple for everyone to use
 
Supply chain and Big data : top 5 Trends
Supply chain and Big data : top 5 TrendsSupply chain and Big data : top 5 Trends
Supply chain and Big data : top 5 Trends
 
Continuous Performance Testing
Continuous Performance TestingContinuous Performance Testing
Continuous Performance Testing
 
광역화 집단에너지사업제안서
광역화 집단에너지사업제안서광역화 집단에너지사업제안서
광역화 집단에너지사업제안서
 
Exploring Data Preparation and Visualization Tools for Urban Forestry
Exploring Data Preparation and Visualization Tools for Urban ForestryExploring Data Preparation and Visualization Tools for Urban Forestry
Exploring Data Preparation and Visualization Tools for Urban Forestry
 
Data Preparation for Data Science
Data Preparation for Data ScienceData Preparation for Data Science
Data Preparation for Data Science
 
Trace 3 interview questions and answers
Trace 3 interview questions and answersTrace 3 interview questions and answers
Trace 3 interview questions and answers
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist
 
Jagger release 2.0
Jagger release 2.0Jagger release 2.0
Jagger release 2.0
 
Driving Retail Success with Machine Data Intelligence
Driving Retail Success with Machine Data IntelligenceDriving Retail Success with Machine Data Intelligence
Driving Retail Success with Machine Data Intelligence
 
How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?How Can You Calculate the Cost of Your Data?
How Can You Calculate the Cost of Your Data?
 
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTODatabase Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
Database Camp 2016 @ United Nations, NYC - Javier de la Torre, CEO, CARTO
 
Потоковая обработка больших данных
Потоковая обработка больших данныхПотоковая обработка больших данных
Потоковая обработка больших данных
 
Geemus
GeemusGeemus
Geemus
 
Cohodatawebinar
Cohodatawebinar Cohodatawebinar
Cohodatawebinar
 
Engine Yard Cloud Architecture Enhancements
Engine Yard Cloud Architecture EnhancementsEngine Yard Cloud Architecture Enhancements
Engine Yard Cloud Architecture Enhancements
 
6 tips for improving ruby performance
6 tips for improving ruby performance6 tips for improving ruby performance
6 tips for improving ruby performance
 

Similar a Reinventing the Modern Information Pipeline: Paxata and MapR

Modern Data Management
Modern Data ManagementModern Data Management
Modern Data ManagementSAP Technology
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesDATAVERSITY
 
The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of MetadataDATAVERSITY
 
Decision Ready Data: Power Your Analytics with Great Data
Decision Ready Data: Power Your Analytics with Great DataDecision Ready Data: Power Your Analytics with Great Data
Decision Ready Data: Power Your Analytics with Great DataDLT Solutions
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...Denodo
 
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...Big Data Week
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Denodo
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneySai Paravastu
 
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...CompTIA
 
Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyDataWorks Summit
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?Denodo
 
Oea big-data-guide-1522052
Oea big-data-guide-1522052Oea big-data-guide-1522052
Oea big-data-guide-1522052kavi172
 
Oea big-data-guide-1522052
Oea big-data-guide-1522052Oea big-data-guide-1522052
Oea big-data-guide-1522052Gilbert Rozario
 
Fried data summit big data for lob content
Fried data summit big data for lob contentFried data summit big data for lob content
Fried data summit big data for lob contentJeff Fried
 
How to make your data scientists happy
How to make your data scientists happy How to make your data scientists happy
How to make your data scientists happy Hussain Sultan
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Denodo
 
Data-Centric Analytics and Understanding the Full Data Supply Chain
Data-Centric Analytics and Understanding the Full Data Supply ChainData-Centric Analytics and Understanding the Full Data Supply Chain
Data-Centric Analytics and Understanding the Full Data Supply ChainDATAVERSITY
 

Similar a Reinventing the Modern Information Pipeline: Paxata and MapR (20)

Modern Data Management
Modern Data ManagementModern Data Management
Modern Data Management
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 
The Importance of Metadata
The Importance of MetadataThe Importance of Metadata
The Importance of Metadata
 
Decision Ready Data: Power Your Analytics with Great Data
Decision Ready Data: Power Your Analytics with Great DataDecision Ready Data: Power Your Analytics with Great Data
Decision Ready Data: Power Your Analytics with Great Data
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
 
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
 
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
Why Your Data Science Architecture Should Include a Data Virtualization Tool ...
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
Is Your Staff Big Data Ready? 5 Things to Know About What It Will Take to Suc...
 
Leveraging Streaming Data through Automation
Leveraging Streaming Data through AutomationLeveraging Streaming Data through Automation
Leveraging Streaming Data through Automation
 
Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata Company
 
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?¿En qué se parece el Gobierno del Dato a un parque de atracciones?
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
 
Oea big-data-guide-1522052
Oea big-data-guide-1522052Oea big-data-guide-1522052
Oea big-data-guide-1522052
 
Oea big-data-guide-1522052
Oea big-data-guide-1522052Oea big-data-guide-1522052
Oea big-data-guide-1522052
 
Fried data summit big data for lob content
Fried data summit big data for lob contentFried data summit big data for lob content
Fried data summit big data for lob content
 
How to make your data scientists happy
How to make your data scientists happy How to make your data scientists happy
How to make your data scientists happy
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
 
Data-Centric Analytics and Understanding the Full Data Supply Chain
Data-Centric Analytics and Understanding the Full Data Supply ChainData-Centric Analytics and Understanding the Full Data Supply Chain
Data-Centric Analytics and Understanding the Full Data Supply Chain
 

Último

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsJoseMangaJr1
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...amitlee9823
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...amitlee9823
 

Último (20)

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Predicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science ProjectPredicting Loan Approval: A Data Science Project
Predicting Loan Approval: A Data Science Project
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men  🔝Bangalore🔝   Esc...
➥🔝 7737669865 🔝▻ Bangalore Call-girls in Women Seeking Men 🔝Bangalore🔝 Esc...
 

Reinventing the Modern Information Pipeline: Paxata and MapR

  • 1. Reinventing the Information Pipeline From Big Data Strategy to Big Value December 2016
  • 2. Agenda • Introduction • Challenges in the Information Pipeline • Paxata in the Converged Data Platform
  • 3. Paxata’s mission (since 2012) Deliver the only enterprise-grade data preparation platform for everyone to transform raw, meaningless data into valuable, contextual and complete information
  • 4. 4 Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018 83%Companies agree that data is their most strategic asset
  • 5. 5 Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018 80%Time analysts will spend trying to create data sets to draw insights 83%Companies agree that data is their most strategic asset
  • 6. 6 Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018 12%Amount of data most companies estimate they are analyzing 80%Time analysts will spend trying to create data sets to draw insights 83%Companies agree that data is their most strategic asset
  • 7. 7 The data chasm Source: Gartner News Room: http://www.gartner.com/newsroom/id/2975018 12%Amount of data most companies estimate they are analyzing 80%Time analysts will spend trying to create data sets to draw insights 83%Companies agree that data is their most strategic asset
  • 8. Challenges in the Information Pipeline
  • 10. Traditional data preparation creates a bottleneck Business teams have complex data sources for analytics projects
  • 11. Traditional data preparation creates a bottleneck Business teams funnel their requirements to IT IT-centric data preparation Business Information
  • 12. Traditional data preparation creates a bottleneck IT runs requirements through a linear ETL process executed with manual scripting or coding IT-Centric Data Preparation Model Extract Transform Load Optimize Business Information
  • 13. Traditional data preparation creates a bottleneck IT reviews with business. Makes changes, fixes errors. (Repeat) IT-Centric Data Preparation Model Extract Transform Load Optimize Business Information
  • 14. Business teams make decisions before data is available -or- Ask for changes and restart the process. IT-Centric Data Preparation Model Extract Transform Load Optimize Business Information Traditional data preparation creates a bottleneck
  • 15. Designed for highly specialized technical people to prepare data for business teams IT-Centric Data Preparation Model Extract Transform Load Optimize Business Information Traditional data preparation creates a bottleneck
  • 16. Designing for highly specialized technical people to prepare data for business teams. Expensive Complicated Error-prone Time-consuming
  • 18. Modern architecture: balancing freedom with responsibility Built for business •Freedom and flexibility with collaboration
  • 19. Modern architecture: balancing freedom with responsibility Collect and manage data Time Built for business •Freedom and flexibility with collaboration Enabled by IT •Data governance, scale, efficiency
  • 20. Modern information pipeline is Built for business Freedom and flexibility with collaboration Enabled by IT Data governance, scale, efficiency
  • 21. Data prep must address the range of information workers
  • 22. Data prep must address the range of information workers Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between Enterprise and Consumer Technologies,” August 30, 2012 Deep Technical Skills Limited Technical Skills Data Scientist Data Developer Data Analyst Business Analyst Information Worker
  • 23. Data prep must address the range of information workers Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between Enterprise and Consumer Technologies,” August 30, 2012 Deep Technical Skills Limited Technical Skills Data Scientist (200K) Data Developer (600K) Data Analyst (100M) Business Analyst (275M) Information Worker (460M)
  • 24. Paxata accelerates the data to information pipeline
  • 25. Data Lake Enterprise Local Paxata accelerates the data to information pipeline
  • 26. Data Lake Enterprise Local Paxata accelerates the data to information pipeline
  • 27. Data Lake Enterprise Local Paxata accelerates the data to information pipeline BI/Visualization Predictive
  • 28. Data Lake Enterprise Local Paxata accelerates the data to information pipeline BI/Visualization Predictive Consumer experience for preparing data
  • 29. Architecture of the Paxata Adaptive Information Platform
  • 30. Architecture of the Paxata Adaptive Information Platform
  • 31. Contact us Paxata in the apps gallery Register for Paxata Live: www.paxata.com/events info@paxata.com www.youtube.com/PaxataTV www.paxata.com
  • 32. December 8, 2016© Paxata, Inc. 32 Thank You!

Notas del editor

  1. Deliver the only enterprise-grade data preparation platform that lets everyone transform raw, meaningless data into valuable, contextual and complete information
  2. To seize the opportunity you must cross this data chasm. Why…Because its hard Traditional, legacy technologies and processes that companies currently leverage were NOT designed for the variety and volume of data that companies are working with today. Companies need to be more nimble We have many customers that have 10’s of Millions invested annual in traditional ETL processes, and they were still spending too much time preparing data and not on the value added tasks of analytics. They selected Paxata to help complement these technologies and fill the gaps with a more exploratory, interactive experience.
  3. To seize the opportunity you must cross this data chasm. Why…Because its hard Traditional, legacy technologies and processes that companies currently leverage were NOT designed for the variety and volume of data that companies are working with today. Companies need to be more nimble We have many customers that have 10’s of Millions invested annual in traditional ETL processes, and they were still spending too much time preparing data and not on the value added tasks of analytics. They selected Paxata to help complement these technologies and fill the gaps with a more exploratory, interactive experience.
  4. To seize the opportunity you must cross this data chasm. Why…Because its hard Traditional, legacy technologies and processes that companies currently leverage were NOT designed for the variety and volume of data that companies are working with today. Companies need to be more nimble We have many customers that have 10’s of Millions invested annual in traditional ETL processes, and they were still spending too much time preparing data and not on the value added tasks of analytics. They selected Paxata to help complement these technologies and fill the gaps with a more exploratory, interactive experience.
  5. To seize the opportunity you must cross this data chasm. Why…Because its hard Traditional, legacy technologies and processes that companies currently leverage were NOT designed for the variety and volume of data that companies are working with today. Companies need to be more nimble We have many customers that have 10’s of Millions invested annual in traditional ETL processes, and they were still spending too much time preparing data and not on the value added tasks of analytics. They selected Paxata to help complement these technologies and fill the gaps with a more exploratory, interactive experience.
  6. Deliver the only enterprise-grade data preparation platform that lets everyone transform raw, meaningless data into valuable, contextual and complete information
  7. Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  8. Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  9. Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  10. Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  11. Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  12. Visual Data Discovery Tools – people had a hunger to get at and dig into their data – traditional small spreadsheets or databases 1. Business teams funnel their data requirements to IT 2. IT runs requirements through linear ETL process, executed with manual scripting or coding 3. IT reviews with business, makes changes, fixes errors. Repeats this cycle. 4. By then, business teams make decisions long before data is available or they ask for changes and re-start the process Traditional Technologies Do Not Meet Today’s Needs Batch, Complicated, No Visibility, IT Only, Time Consuming, Error Prone, Expensive Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data and the orders of magnitude more consumers of today’s information-driven world. A model in which a small set of highly skilled IT data scientists and data developers take business requirements and then execute a highly prescribed, lengthy, waterfall process for preparing data only to more often than not realize that they missed the mark as they lack the business context, is not a viable model.
  13. Slide use: problem of data (option 4) This is a five-part slide. Use this along with the 4 slides before it. Talking Points: Big Data and self-service analytics necessitate a fundamental transformation from an IT-centric data preparation process to a self-service data preparation model. In the self-service model, the steps that make of data preparation – data integration, quality, cleansing, enrichment and shaping don’t go away, they need to be re-imagined in a way that enables the business or data analyst to accomplish these tasks on their own which in turn empowers them to work with vertical slices of relevant data and get the results they want, when they need them. However, it’s important that the self-service model also provide the governance and traceability that IT requires to maintain trust in data and analytic results. In this new model, IT’s role changes to collection and centralization of access to raw data and to providing the right infrastructure to the business that drive self-service data preparation and analytics, while maintaining full governance.
  14. Slide use: problem of data (option 4) This is a five-part slide. Use this along with the 4 slides before it. Talking Points: Big Data and self-service analytics necessitate a fundamental transformation from an IT-centric data preparation process to a self-service data preparation model. In the self-service model, the steps that make of data preparation – data integration, quality, cleansing, enrichment and shaping don’t go away, they need to be re-imagined in a way that enables the business or data analyst to accomplish these tasks on their own which in turn empowers them to work with vertical slices of relevant data and get the results they want, when they need them. However, it’s important that the self-service model also provide the governance and traceability that IT requires to maintain trust in data and analytic results. In this new model, IT’s role changes to collection and centralization of access to raw data and to providing the right infrastructure to the business that drive self-service data preparation and analytics, while maintaining full governance.
  15. Slide use: Who are the data analysts Talking points: This pyramid describes the typical information work roles in today’s enterprises and highlights the dramatic scale that self-service data preparation can bring. Legacy and many Big Data tools target the Data Scientist and the Data Developer, but as you can see there are hugely more data analysts our there, and self-service data prep empowers them to drive their own data destiny, breaking the logjam of traditional IT-constrained ETL and data preparation. By Data Analysts, we are referring to Power Excel users or Tableau users who understand data and analytics, but don’t write code or scripts. For self-service data prep to truly transform an organization, it must empower the data analyst; however, self-service data prep simplifies many traditionally complex and time-consuming preparation operations and the work of data scientists and data developers can be dramatically accelerated by self-service data prep. Source: Prakash VC deck
  16. Slide use: Who are the data analysts Talking points: This pyramid describes the typical information work roles in today’s enterprises and highlights the dramatic scale that self-service data preparation can bring. Legacy and many Big Data tools target the Data Scientist and the Data Developer, but as you can see there are hugely more data analysts our there, and self-service data prep empowers them to drive their own data destiny, breaking the logjam of traditional IT-constrained ETL and data preparation. By Data Analysts, we are referring to Power Excel users or Tableau users who understand data and analytics, but don’t write code or scripts. For self-service data prep to truly transform an organization, it must empower the data analyst; however, self-service data prep simplifies many traditionally complex and time-consuming preparation operations and the work of data scientists and data developers can be dramatically accelerated by self-service data prep.
  17. Deliver the only enterprise-grade data preparation platform that lets everyone transform raw, meaningless data into valuable, contextual and complete information