(Presented at MapR's Big Data Everywhere event in Redwood City, CA in December 2016)
The relationship between business teams and IT has changed as data has grown more complex. A traditional data pipeline, built for an IT-centered approach to information management, cannot meet the data demands of today's business decisions. A big data strategy therefore requires modernizing previous approaches. Self-service data preparation in a collaborative, intuitive, governed, and secure environment is the key to a nimble and decisive business unit.
3. Paxata’s mission (since 2012)
Deliver the only enterprise-grade data preparation platform
for everyone to transform raw, meaningless data into
valuable, contextual and complete information
4–7. The data chasm
Source: Gartner Newsroom: http://www.gartner.com/newsroom/id/2975018
83%: companies agree that data is their most strategic asset
80%: time analysts will spend trying to create data sets to draw insights
12%: amount of data most companies estimate they are analyzing
10. Traditional data preparation creates a bottleneck
Business teams have complex data sources for analytics projects
11. Traditional data preparation creates a bottleneck
Business teams funnel their requirements to IT
[Diagram: Business → IT-centric data preparation → Information]
12. Traditional data preparation creates a bottleneck
IT runs requirements through a linear ETL process, executed with manual scripting or coding
[Diagram: Business → IT-centric data preparation (Model → Extract → Transform → Load → Optimize) → Information]
13. Traditional data preparation creates a bottleneck
IT reviews with business, makes changes, fixes errors. (Repeat.)
14. Traditional data preparation creates a bottleneck
Business teams make decisions before data is available, or ask for changes and restart the process.
15. Traditional data preparation creates a bottleneck
Designed for highly specialized technical people to prepare data for business teams
16. Designing for highly specialized technical people to prepare data for business teams is:
Expensive
Complicated
Error-prone
Time-consuming
18. Modern architecture: balancing freedom with responsibility
Built for business: freedom and flexibility with collaboration
19. Modern architecture: balancing freedom with responsibility
Built for business: freedom and flexibility with collaboration
Enabled by IT: data governance, scale, efficiency (collect and manage data over time)
20. A modern information pipeline is:
Built for business: freedom and flexibility with collaboration
Enabled by IT: data governance, scale, efficiency
22–23. Data prep must address the range of information workers
Source: Forrester Research, Inc., “Info Workers Will Erase The Boundary Between Enterprise and Consumer Technologies,” August 30, 2012
[Pyramid, from deep technical skills to limited technical skills:]
Data Scientist (200K)
Data Developer (600K)
Data Analyst (100M)
Business Analyst (275M)
Information Worker (460M)
Deliver the only enterprise-grade data preparation platform that lets everyone transform raw, meaningless data into valuable, contextual and complete information
To seize the opportunity you must cross this data chasm.
Why? Because it's hard.
The traditional, legacy technologies and processes that companies currently use were not designed for the variety and volume of data companies work with today.
Companies need to be more nimble.
We have many customers with tens of millions invested annually in traditional ETL processes who were still spending too much time preparing data rather than on the value-added tasks of analytics.
They selected Paxata to complement these technologies and fill the gaps with a more exploratory, interactive experience.
Visual data discovery tools: people had a hunger to get at and dig into their data beyond traditional small spreadsheets or databases.
1. Business teams funnel their data requirements to IT
2. IT runs requirements through a linear ETL process, executed with manual scripting or coding
3. IT reviews with business, makes changes, fixes errors. Repeats this cycle.
4. By then, business teams have made decisions long before the data is available, or they ask for changes and restart the process
Traditional technologies do not meet today's needs: batch, complicated, no visibility, IT only, time-consuming, error-prone, expensive.
Legacy infrastructure for data preparation was never designed to scale to the orders of magnitude more data, and orders of magnitude more consumers, of today's information-driven world.
A model in which a small set of highly skilled IT data scientists and data developers take business requirements and execute a highly prescribed, lengthy, waterfall process for preparing data, only to realize more often than not that they missed the mark because they lack the business context, is not viable.
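To make the "manual scripting or coding" in step 2 concrete, here is a minimal sketch of what such a hand-written, linear ETL script typically looks like. The data, field names, and cleansing rules are hypothetical, invented for illustration; the point is that the transform logic is hard-coded and opaque to the business.

```python
# A minimal sketch of hand-scripted, linear ETL (extract -> transform -> load).
# All data and rules here are hypothetical, for illustration only.

import csv
import io

# Extract: raw records as they might arrive from a source system.
RAW = """customer,region,revenue
Acme, west ,1200
Beta,EAST,
Acme,west,800
"""

def extract(text):
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    # Transform: hard-coded cleansing rules -- exactly the kind of logic
    # that is brittle and invisible to the business in an IT-only process.
    out = []
    for r in rows:
        if not r["revenue"]:          # silently drop incomplete records
            continue
        out.append({
            "customer": r["customer"].strip(),
            "region": r["region"].strip().lower(),
            "revenue": int(r["revenue"]),
        })
    return out

def load(rows):
    # Load: aggregate into the model the business asked for up front.
    totals = {}
    for r in rows:
        totals[r["customer"]] = totals.get(r["customer"], 0) + r["revenue"]
    return totals

result = load(transform(extract(RAW)))
print(result)  # {'Acme': 2000} -- Beta was dropped without anyone noticing
```

When the business later asks why Beta is missing, the answer lives only in this script, which is why each review cycle in step 3 requires a round trip back to IT.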
Slide use: problem of data (option 4)
This is a five-part slide. Use it along with the 4 slides before it.
Talking points: Big data and self-service analytics necessitate a fundamental transformation from an IT-centric data preparation process to a self-service data preparation model. In the self-service model, the steps that make up data preparation (data integration, quality, cleansing, enrichment, and shaping) don't go away; they need to be re-imagined in a way that enables the business or data analyst to accomplish these tasks on their own, which in turn empowers them to work with vertical slices of relevant data and get the results they want, when they need them. It is equally important that the self-service model provide the governance and traceability that IT requires to maintain trust in data and analytic results. In this new model, IT's role changes to collecting and centralizing access to raw data and providing the business with the right infrastructure to drive self-service data preparation and analytics, while maintaining full governance.
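The governance-and-traceability point above can be sketched in miniature: each prep step (cleansing, enrichment, shaping) is recorded as it runs, so IT retains an audit trail even though the analyst drives the work. Every name below is illustrative and invented for this sketch; it is not Paxata's API.

```python
# Toy sketch: self-service prep steps that record themselves for traceability.
# All function and field names are hypothetical illustrations.

steps = []  # the audit trail IT can inspect

def step(name):
    # Decorator that logs each prep operation as it is applied.
    def wrap(fn):
        def inner(rows):
            steps.append(name)
            return fn(rows)
        return inner
    return wrap

@step("cleanse: normalize region")
def cleanse(rows):
    return [{**r, "region": r["region"].strip().lower()} for r in rows]

@step("enrich: add region code")
def enrich(rows):
    codes = {"west": "W", "east": "E"}  # hypothetical lookup table
    return [{**r, "code": codes.get(r["region"], "?")} for r in rows]

@step("shape: keep needed columns")
def shape(rows):
    return [{"customer": r["customer"], "code": r["code"]} for r in rows]

data = [{"customer": "Acme", "region": " West "}]
result = shape(enrich(cleanse(data)))
print(result)  # [{'customer': 'Acme', 'code': 'W'}]
print(steps)   # every operation is captured for governance review
```

The analyst composes the steps interactively; the recorded `steps` list is what lets IT reconstruct exactly how any result was produced.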
Slide use: Who are the data analysts
Talking points: This pyramid describes the typical information-worker roles in today's enterprises and highlights the dramatic scale that self-service data preparation can bring. Legacy tools and many big data tools target the data scientist and the data developer, but as you can see there are vastly more data analysts out there, and self-service data prep empowers them to drive their own data destiny, breaking the logjam of traditional, IT-constrained ETL and data preparation. By data analysts, we mean power Excel users or Tableau users who understand data and analytics but don't write code or scripts. For self-service data prep to truly transform an organization, it must empower the data analyst. At the same time, because self-service data prep simplifies many traditionally complex and time-consuming preparation operations, it can also dramatically accelerate the work of data scientists and data developers.
Source: Prakash VC deck