SlideShare una empresa de Scribd logo
1 de 52
Descargar para leer sin conexión
Grab some
coffee and
enjoy the
pre-­show
banter
before the
top of the
hour!
The Briefing Room
The Maturity Model: Taking the Growing Pains Out of Hadoop
Twitter Tag: #briefr The Briefing Room
Welcome
Host:
Eric Kavanagh
eric.kavanagh@bloorgroup.com
@eric_kavanagh
Twitter Tag: #briefr The Briefing Room
  Reveal the essential characteristics of enterprise
software, good and bad
  Provide a forum for detailed analysis of today s innovative
technologies
  Give vendors a chance to explain their product to savvy
analysts
  Allow audience members to pose serious questions... and
get answers!
Mission
Twitter Tag: #briefr The Briefing Room
Topics
June: INNOVATORS
July: SQL INNOVATION
August: REAL-TIME DATA
Twitter Tag: #briefr The Briefing Room
Three Types of Evolution
Ø  Disruptive selection
Ø  Stabilizing selection
Ø  Directional selection
Twitter Tag: #briefr The Briefing Room
Types of Evolution?
WHAT GOES AROUND, COMES AROUND!
Twitter Tag: #briefr The Briefing Room
Analyst: Rick van der Lans
Rick F. van der Lans is an independent
analyst, consultant, author and
lecturer specializing in data
warehousing, business intelligence,
analytics, big data and database
technology. He is Managing Director of
R20/Consultancy. He has advised many
large companies worldwide on
defining their business intelligence
architectures. His popular IT books
have been translated into many
languages and have sold over 100,000
copies. Rick writes for TechTarget.com
and B-eye-Network. For the last 25
years, he has been presenting
professionally around the globe and at
international events.
Twitter Tag: #briefr The Briefing Room
Think Big, A Teradata Company
  Last year Teradata acquired Think Big Analytics, Inc., a
consulting and solutions company focused on big data
solutions
  Think Big has expertise in implementing a variety of open
source technologies, such as Hadoop, Hbase, Cassandra,
MongoDB and Storm, as well as experience with
Hortonworks, Cloudera and MapR
  Its consultants can assist with the planning, management
and deployment of big data implementations
Twitter Tag: #briefr The Briefing Room
Guest: Ron Bodkin
Ron Bodkin is Founder & President of Think Big, A
Teradata Company. Ron founded Think Big to help
companies realize measurable value from Big Data. The
company’s expertise spans all facets of data science
and data engineering and helps our customers to drive
maximum value from their Big Data initiative.
Previously, Ron was vice president Engineering at
Quantcast where he led the data science and engineer
teams that pioneered the use of Hadoop and NoSQL for
batch and real-time decision making. Prior to that, Ron
was Founder of New Aspects, which provided enterprise
consulting for Aspect-oriented programming. Ron was
also Co-Founder and CTO of B2B applications provider
C-Bridge, which he led to team of 900 people and a
successful IPO. Ron graduated with honors from McGill
University with a B.S. in Math and Computer Science.
Ron also earned his Master’s Degree in Computer
Science from MIT, leaving the PhD program after
presenting the idea for C-bridge and placing in the
finals of the 50k Entrepreneurship Contest.
MAKING BIG DATA COME ALIVE
​ The Maturity Model: Taking the Growing Pains out of
Hadoop
​ Ron Bodkin
​ Founder and President, Think Big
12
•  What is Big Data?
•  Hadoop Maturity Spectrum
(from MapReduce to Integrated
Hadoop Architecture)
•  Taking the Growing Pains out of
Hadoop
Agenda
13
•  Data sets so large and
complex that they become
awkward to work with using
standard tools and
techniques
•  Google, Quantcast, Yahoo,
LinkedIn… customer
innovation in open source
•  Discover value in dark data
and relationships among
data sets
–  Customer behavior
–  Product behavior
•  Evolution:
insight -> operational ->
business innovation
Source: Hortonworks
What is Big Data?
© 2015 Think Big, a Teradata Company 6/16/15
14
Hadoop (Circa 2008)
•  HDFS - Distributed File System
•  MapReduce distributed parallel
batch processing
•  Community Open Source since 2005
many contributors
•  Industry commitment: over $1 Billion
invested in startups, all major
software companies embrace
•  3 replicas for HA
•  50 PB+ scale
•  Great for analytics, batch processing
© 2015 Think Big, a Teradata Company 6/16/15
15
SQL on Hadoop (Circa 2009 to Today)
•  Expanded access to data in
Hadoop
•  Great for exploration of
“raw” data, e.g., by data
scientists
–  Can integrate UDFs
–  Tools integration expanding
access beyond SQL power users
•  Fast moving space
•  Also used for ETL/ELT
processing
© 2015 Think Big, a Teradata Company 6/16/15
16
•  Support for many processing engines including Spark, Storm, Tez
•  Common cluster with local data access in HDFS
Hadoop (Circa 2012)
YARN - Yet Another Resource Negotiator
16
© 2015 Think Big, a Teradata Company 6/16/15
17 © 2015 Think Big, a Teradata Company 6/16/15
Apache Spark (Circa 2013)
Spark SQL
Spark
Streaming
MLib
(Machine
learning)
GraphX
(Graph)
Apache Spark Core Engine
YARN
JDBC/ODBC
HDFS
18
OLAP on Hadoop (Circa 2014)
Events
Reporting, Visualization and Analytics
Traditional BI on Hadoop
•  Day or week old data
•  Long latencies to retrieve data
•  Doesn’t scale as number of events,
aggregates, or users grow
•  Expensive, siloed event analytics
•  Real time data
•  Unlimited custom reporting
(drill to trillions of aggregates)
•  Scales to thousands of simultaneous
users
•  Subsecond latency
•  Omni-channel analytics
OLAP on Hadoop
© 2015 Think Big, a Teradata Company 6/16/15
19
Today: Integrated Hadoop Architecture
© 2015 Think Big, a Teradata Company 6/16/15
MAKING BIG DATA COME ALIVE
​ How to Take the Growing Pains out of Hadoop
21
•  Picking the wrong starting point
– ho hum, boil the ocean
•  Immature Governance
•  Siloed Organization
•  Change Management
•  Skills Gap
•  Business as usual
•  Not fitting in with ecosystem
Big Data Pitfalls
© 2015 Think Big, a Teradata Company 6/16/15
22
Big Data Center of Excellence
© 2015 Think Big, a Teradata Company 6/16/15
23
Cross-Functional Collaboration
Data
Weak intersection between Data, Systems
and Business often means project failure
Current State Improved State
Collaborative integration of Analytics with
Big Data capabilities for success.
Data
Weak Area with
diminished influence
Cross-functional
collaboration
Data-driven
business alignment
© 2015 Think Big, a Teradata Company 6/16/15
24
A “Data Lake” is a centralized approach to capturing, refining, storing, and
exploring any form of raw data at scale, enabled by low cost technologies
from which downstream facilities may draw.
What is a Data Lake?
Information Sources Data Lake Downstream
Facilities
Data Variety is the driving factor in building a Data Lake.
© 2015 Think Big, a Teradata Company 6/16/15
25
Why a Data Lake?
•  Benefit from open source big data innovation and
cost savings
•  Control over enterprise data with governance
•  Offload history of operational and analytical data
platforms
•  Remix ETL, freeing capacity downstream
•  Gain ultimate flexibility in data use and access
•  Foundation for solving business problems
–  Big data solutions enable new uses of data via working with
new data sets and analytics
© 2015 Think Big, a Teradata Company 6/16/15
26
•  Corporate Repository – Data sourcing repository, possibly system of
record
•  Active Archive – Offloaded history of operational and analytical data
platforms that needs to be used infrequently
•  Discovery – Data Profiling, schema on read, wrangling, discover signals
•  ETL Offload – Move all or parts of current ETL processes into Hadoop
•  NEW ETL Development – Develop new ETL processes on the Data Lake
•  Analytics – Data Science, Custom/product built Analytics solutions (i.e.
churn, machine learning)
•  Business Intelligence/Reporting – Event analysis, dimensional rollup and
pivot reports
Data Lake Use Cases
© 2015 Think Big, a Teradata Company 6/16/15
27
Swamp Reservoir
Data Lake: Swamp or Reservoir?
© 2015 Think Big, a Teradata Company 6/16/15
28
Data Lake
Information Sources
Evaluate
Source Data
Ingest
Collect & Manage
Metadata
Profile - Structure
Sequence
Downstream
Facilities
Generate Reports
Discovery Signals
Compress
Automate
Protect
Prepare Data
for Ingest
Prepare Source
Metadata
A centralized well-governed repository of raw data and metadata into
which all data-producing streams flow from which business analysts &
downstream facilities may draw.
Data Reservoir
© 2015 Think Big, a Teradata Company 6/16/15
29
Metadata is required to facilitate location of
and entitlements to data
Schema Metadata is a foundation, but…
Operational Metadata & Business-Security is
critical to governance
•  Where did it come from? … What is the data serialization?
•  Who owns the data? Who can see the data? Who belongs to
which group?
•  What is the environment (landing zone, OS, Line of Business)?
•  What processes touched my data?
•  Did you lose any data (Checksums, etc.)?
•  When did the data get ingested, transformed?
•  Did it get exported, or archived…when, where how will it be used
(organizational)?
Governance & Security
© 2015 Think Big, a Teradata Company 6/16/15
30
High Tech Manufacturer Example
•  Enterprise-wide data access (ODS) for timely analytics and insights
•  Foundation for large scale proactive analytics
•  Reduce $Millions in scrap waste
•  Drives revenue and market share by accelerating time to market
Legacy Data Lake
Retention 3-6 months scattered DBs
then tape archive
All data online in common
data reservoir for 3+ years
Coverage Summaries, samples for
several data sets
All parametric data
captured in raw and
integrated form
Analysis Reactive resulting in missed
improvement opportunities
In daily operations and on
larger data sets to support
proactive improvements
© 2015 Think Big, a Teradata Company 6/16/15
31
•  Insight into new data (one time
analysis)
•  Decision support models (offline
insights in production)
•  Operational models (real-time
analytics)
•  Hadoop opportunities: new data,
new algorithms, different
resolution, frequency
•  Hadoop blends with MPP,
analytics, tools not the sole tool
Data Science Analytics
© 2015 Think Big, a Teradata Company 6/16/15
32
Real Time Analytics Architecture
Clickstream
Call Center
email
Brick & Mortar
Batch Storage
(HDFS)
Event Repository
(HBase)
API
Server
CIM
ML
Personali
zation
A/B
Testing
Reporting
Call
Center
Channel
Attribution
© 2015 Think Big, a Teradata Company 6/16/15
33
•  We’ve come a long way since
MapReduce 1.0 enabled by YARN
•  Don’t forget about the last mile for
business (SQL on Hadoop, OLAP for
Hadoop, training, etc.)
•  Data lakes without governance,
security, metadata management
equates to failed projects/loss of $$
•  Analytics on big data is where the
real business value lies – from
understanding to models to
operationalization
Exploiting the Power of Hadoop
© 2015 Think Big, a Teradata Company 6/16/15
34
•  100% Big Data Focus
•  Founded in 2010 with100+ engagements across 70 clients
•  Unlock value of big data with data science and data
engineering services
•  Proven vendor-neutral open source integration expertise
•  Agile team-based development methodology
•  Think Big Academy for skills and organizational development
•  Global delivery model
Who is Think Big?
Twitter Tag: #briefr The Briefing Room
Perceptions & Questions
Analyst:
Rick van der Lans
Copyright	
  ©	
  1991	
  -­‐	
  2015	
  R20/Consultancy	
  B.V.,	
  The	
  
Hague,	
  The	
  Netherlands.	
  All	
  rights	
  reserved.	
  No	
  part	
  
of	
  this	
  material	
  may	
  be	
  reproduced,	
  stored	
  in	
  a	
  
retrieval	
  system,	
  or	
  transmiHed	
  in	
  any	
  form	
  or	
  by	
  
any	
  means,	
  electronic,	
  mechanical,	
  photographic,	
  or	
  
otherwise,	
  without	
  the	
  explicit	
  wriHen	
  permission	
  of	
  
the	
  copyright	
  owners.	
  
	
  
The	
  Next	
  Stage	
  of	
  
Hadoop	
  and	
  Big	
  Data:	
  
Simplifica9on	
  
by	
  
Rick	
  F.	
  van	
  der	
  Lans	
  
R20/Consultancy	
  BV	
  
TwiHer	
  @rick_vanderlans	
  
www.r20.nl	
  
Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 37
The	
  Adop9on	
  Stages	
  of	
  Technologies	
  
  Stage 0: Introduction and early adoption
  Stage 1: Adoption by more traditional customers
  Stage 2: Universal adoption
Stage 0 Stage 1 Stage 2 Stage n
Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 38
Adop9on	
  Stage	
  0	
  
  Functionality: Minimal
  Scalability and performance: Promising
  Customers: Those with very specific, urgent needs;
they have tech-savvy developers
  Use cases: Specific
  Integration: Isolated solutions
Stage 0 Stage 1 Stage 2 Stage n
Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 39
Adop9on	
  Stage	
  1	
  
  Functionality: Sufficient for some mission-critical
systems
  Scalability and performance: Medium
  Customers: Those with very specific, urgent needs;
they have tech-savvy developers
  Use cases: More general
  Integration: Isolated solutions
Stage 0 Stage 1 Stage 2 Stage n
Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 40
Adop9on	
  Stage	
  2	
  
  Functionality: Full
  Scalability and performance: High
  Customers: Traditional; not always with tech-savvy
developers; focus on productivity and maintainability
  Use cases: Specific and general
  Integration: Integrated with traditional solutions
Stage 0 Stage 1 Stage 2 Stage n
Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 41
Different	
  SQL-­‐on-­‐Hadoop	
  Solu9ons	
  
Apache	
  
MapReduce	
  API	
  
	
  
	
  
	
  
Apache	
  
MapReduce	
  
Apache	
  HDFS	
  
API	
  
	
  
	
  
	
  Apache	
  HDFS	
  
Apache	
  HDFS	
  
API	
  
	
  
	
  
	
  Apache	
  HDFS	
  
Apache	
  HBase	
  
API	
  
	
  
	
  
	
  Apache	
  HBase	
  
A	
  SQL	
  Dialect	
  
	
  
	
  
	
  SQL-­‐on-­‐Hadoop	
  
A	
  SQL	
  Dialect	
  
	
  
	
  
	
  SQL-­‐on-­‐Hadoop	
  
Apache	
  HDFS	
  
API	
  
	
  
	
  
	
  Any	
  HDFS	
  
A	
  SQL	
  Dialect	
  
	
  
	
  
	
  SQL-­‐on-­‐Hadoop	
  
Apache	
  HDFS	
  
API	
  
	
  
	
  
	
  Any	
  HDFS	
  
A	
  SQL	
  Dialect	
  
	
  
	
  
	
  SQL-­‐on-­‐Hadoop	
  
Spark	
  API	
  
	
  
	
  
	
  Spark	
  
Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 42
Self-­‐Service	
  Data	
  Prepara9on	
  
  Non-technical interface for
studying data files
  Easy way of defining rules
  Data is fixed by defining
filters, not by changing data
in source systems
  Relationship with Data
Blending
Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 43
Example:	
  Data	
  Prepara9on	
  
Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 44
Example:	
  Data	
  Prepara9on	
  
Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 45
Example:	
  Data	
  Prepara9on	
  
Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 46
Stage	
  2	
  =	
  Simplifica9on	
  
Questions on acceptance
You talked about the skills gap. Sometimes Hadoop is used to develop operational systems, and
sometimes to develop BI systems. Now, many specialists working in BICCs are good ETL programming,
reporting, data modeling, but not so good at low-level programming. For example, many of them
have never programmed Java. What would you recommend them to do to be able to use Hadoop?
Should organizations use Hadoop for developing new systems, or for migrating existing systems? How
easy do you think it is to integrate a new Hadoop-based system with the current infrastructure?
Normally, when a technology becomes more and more popular, it becomes clearer and clearer to
everyone what the costs and the risks are. For example, how much man-hours should be spend on
managing a Hadoop environment? So, what’s the productivity of a MapReduce programmer versus a
SQL programmer?
Questions on architectural
Now that so much data is produced in a distributed fashion, is the concept of physically centralizing
data for integration really realistic? Isn’t big data too big to move?
Over the years, large multi-national organizations have always struggled to develop one large
centralized enterprise-wide data warehouse. In most cases, the problems were organizational. Who
owns that data, and so on? Why would the data lake or data reservoir succeed today?
Questions on technical aspects
It was clear that for you YARN is a key module in the Hadoop platform. How mature is YARN today?
How good is it, for example, in avoiding one massive query that monopolizes the entire platform and
slows down all other applications?
My feeling is still that HDFS is not good at joins of files. The SQL-on-Hadoop engines are not strong in
joins either. Do you think this will change? For example, can you take a star model from a SQL-based
data mart, consisting of 7 dimensions and 1 fact, migrate it to Hadoop and expect a great
performance?
Any idea what the tipping point is, when to forget about a proprietary SQL and move to Hadoop with
SQL-on-Hadoop?
You mentioned OLAP on Hadoop using Spark and you indicated that such a solution scales to thousands
of simultaneous users. Is that really true? Has that been proven? Or does that work when the reporting
tools extract data and load it in memory of the client machine?
Almost every vendor of some big data tool publishes benchmark results showing how fast their
product is. But many of these benchmarks are single user, single query benchmarks, which I think is
not that useful. Why are they not showing multi-users benchmark results? Do you think they’re hiding
something?
On one of your slides you said that Hadoop is 50 PB+ scale. But realistically, how many companies on
this planet want to store that much data? And if it is 50PB+ scale, doesn’t that mean for the majority
of the companies it’s overkill of functionality? In general, how big should big be to justify the Hadoop
platform?
To end this list of questions and the finalize the briefing, you indicated that analytics on big data is
where the real business value lies. I fully agree with that, but can you elaborate on this a little more?
Questions on technical aspects, cont’d.
Twitter Tag: #briefr The Briefing Room
Twitter Tag: #briefr The Briefing Room
Upcoming Topics
www.insideanalysis.com
June: INNOVATORS
July: SQL INNOVATION
August: REAL-TIME DATA
Twitter Tag: #briefr The Briefing Room
THANK YOU
for your
ATTENTION!
Some images provided courtesy of Wikimedia Commons and
"Selection Types Chart" by Azcolvin429 - Own work. Licensed under CC BY-SA 3.0 via Wikimedia Commons -
https://commons.wikimedia.org/wiki/File:Selection_Types_Chart.png#/media/
File:Selection_Types_Chart.png and http://sagamer.co.za/img/evolution-phone.jpg

Más contenido relacionado

La actualidad más candente

Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...StampedeCon
 
The Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThe Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThomas Kelly, PMP
 
The principles of the business data lake
The principles of the business data lakeThe principles of the business data lake
The principles of the business data lakeCapgemini
 
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...Revolution Analytics
 
Big Data Modeling and Analytic Patterns – Beyond Schema on Read
Big Data Modeling and Analytic Patterns – Beyond Schema on ReadBig Data Modeling and Analytic Patterns – Beyond Schema on Read
Big Data Modeling and Analytic Patterns – Beyond Schema on ReadThink Big, a Teradata Company
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachSoftServe
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkCaserta
 
Creating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitectureCreating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitecturePerficient, Inc.
 
IDC Retail Insights - What's Possible with a Modern Data Architecture?
IDC Retail Insights - What's Possible with a Modern Data Architecture?IDC Retail Insights - What's Possible with a Modern Data Architecture?
IDC Retail Insights - What's Possible with a Modern Data Architecture?Hortonworks
 
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016StampedeCon
 
Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7mmathipra
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneySai Paravastu
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefitsRicky Barron
 
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...Data Con LA
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonJeffrey T. Pollock
 
Developing a Strategy for Data Lake Governance
Developing a Strategy for Data Lake GovernanceDeveloping a Strategy for Data Lake Governance
Developing a Strategy for Data Lake GovernanceTony Baer
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoopDr. Wilfred Lin (Ph.D.)
 
The Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedThe Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedDunn Solutions Group
 

La actualidad más candente (20)

Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
 
The Emerging Data Lake IT Strategy
The Emerging Data Lake IT StrategyThe Emerging Data Lake IT Strategy
The Emerging Data Lake IT Strategy
 
The principles of the business data lake
The principles of the business data lakeThe principles of the business data lake
The principles of the business data lake
 
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
 
Big Data Modeling and Analytic Patterns – Beyond Schema on Read
Big Data Modeling and Analytic Patterns – Beyond Schema on ReadBig Data Modeling and Analytic Patterns – Beyond Schema on Read
Big Data Modeling and Analytic Patterns – Beyond Schema on Read
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric Approach
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
 
Creating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data ArchitectureCreating a Next-Generation Big Data Architecture
Creating a Next-Generation Big Data Architecture
 
IDC Retail Insights - What's Possible with a Modern Data Architecture?
IDC Retail Insights - What's Possible with a Modern Data Architecture?IDC Retail Insights - What's Possible with a Modern Data Architecture?
IDC Retail Insights - What's Possible with a Modern Data Architecture?
 
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
The Big Data Journey – How Companies Adopt Hadoop - StampedeCon 2016
 
Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7
 
BAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, SydneyBAR360 open data platform presentation at DAMA, Sydney
BAR360 open data platform presentation at DAMA, Sydney
 
Data lake benefits
Data lake benefitsData lake benefits
Data lake benefits
 
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
 
Stream based Data Integration
Stream based Data IntegrationStream based Data Integration
Stream based Data Integration
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
 
Developing a Strategy for Data Lake Governance
Developing a Strategy for Data Lake GovernanceDeveloping a Strategy for Data Lake Governance
Developing a Strategy for Data Lake Governance
 
6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop6 enriching your data warehouse with big data and hadoop
6 enriching your data warehouse with big data and hadoop
 
The Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They NeedThe Data Lake and Getting Buisnesses the Big Data Insights They Need
The Data Lake and Getting Buisnesses the Big Data Insights They Need
 

Similar a The Maturity Model: Taking the Growing Pains Out of Hadoop

2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoptionHortonworks
 
Business in the Driver’s Seat – An Improved Model for Integration
Business in the Driver’s Seat – An Improved Model for IntegrationBusiness in the Driver’s Seat – An Improved Model for Integration
Business in the Driver’s Seat – An Improved Model for IntegrationInside Analysis
 
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution AnalyticsRevolution Analytics
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...Jürgen Ambrosi
 
Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyDataWorks Summit
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata Hortonworks
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsDenodo
 
The Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value ThereafterThe Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value ThereafterInside Analysis
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesDataWorks Summit
 
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti
 
Insights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesInsights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesDataWorks Summit
 
Cloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinarCloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinarHortonworks
 
Big Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San JoseBig Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San JoseJeffrey T. Pollock
 
Are You Killing the Benefits of Your Data Lake?
Are You Killing the Benefits of Your Data Lake?Are You Killing the Benefits of Your Data Lake?
Are You Killing the Benefits of Your Data Lake?Denodo
 
Big Data in Action – Real-World Solution Showcase
 Big Data in Action – Real-World Solution Showcase Big Data in Action – Real-World Solution Showcase
Big Data in Action – Real-World Solution ShowcaseInside Analysis
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationDenodo
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubCloudera, Inc.
 
Level Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop AccelerationLevel Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop AccelerationInside Analysis
 

Similar a The Maturity Model: Taking the Growing Pains Out of Hadoop (20)

2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption2015 02 12 talend hortonworks webinar challenges to hadoop adoption
2015 02 12 talend hortonworks webinar challenges to hadoop adoption
 
Business in the Driver’s Seat – An Improved Model for Integration
Business in the Driver’s Seat – An Improved Model for IntegrationBusiness in the Driver’s Seat – An Improved Model for Integration
Business in the Driver’s Seat – An Improved Model for Integration
 
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
12Nov13 Webinar: Big Data Analysis with Teradata and Revolution Analytics
 
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...1° Sessione Oracle CRUI: Analytics Data Lab,  the power of Big Data Investiga...
1° Sessione Oracle CRUI: Analytics Data Lab, the power of Big Data Investiga...
 
Hadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata CompanyHadoop 2015: what we larned -Think Big, A Teradata Company
Hadoop 2015: what we larned -Think Big, A Teradata Company
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
 
The Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value ThereafterThe Right Data Warehouse: Automation Now, Business Value Thereafter
The Right Data Warehouse: Automation Now, Business Value Thereafter
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
Insights into Real-world Data Management Challenges
Insights into Real-world Data Management ChallengesInsights into Real-world Data Management Challenges
Insights into Real-world Data Management Challenges
 
A6 big data_in_the_cloud
A6 big data_in_the_cloudA6 big data_in_the_cloud
A6 big data_in_the_cloud
 
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to Production
 
Insights into Real World Data Management Challenges
Insights into Real World Data Management ChallengesInsights into Real World Data Management Challenges
Insights into Real World Data Management Challenges
 
Cloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinarCloudian 451-hortonworks - webinar
Cloudian 451-hortonworks - webinar
 
Big Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San JoseBig Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San Jose
 
Are You Killing the Benefits of Your Data Lake?
Are You Killing the Benefits of Your Data Lake?Are You Killing the Benefits of Your Data Lake?
Are You Killing the Benefits of Your Data Lake?
 
Big Data in Action – Real-World Solution Showcase
 Big Data in Action – Real-World Solution Showcase Big Data in Action – Real-World Solution Showcase
Big Data in Action – Real-World Solution Showcase
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
Level Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop AccelerationLevel Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop Acceleration
 

Más de Inside Analysis

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIInside Analysis
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessInside Analysis
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationInside Analysis
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownInside Analysis
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security Inside Analysis
 
The Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeThe Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeInside Analysis
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataInside Analysis
 
The Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionThe Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionInside Analysis
 
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsInside Analysis
 
All Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingAll Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingInside Analysis
 
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLGoodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLInside Analysis
 
The Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelThe Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelInside Analysis
 
Structurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureStructurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureInside Analysis
 
SQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskSQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskInside Analysis
 
The Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataThe Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataInside Analysis
 
A Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseA Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseInside Analysis
 
Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldInside Analysis
 
DisrupTech - Dave Duggal
DisrupTech - Dave DuggalDisrupTech - Dave Duggal
DisrupTech - Dave DuggalInside Analysis
 
Phasic Systems - Dr. Geoffrey Malafsky
Phasic Systems - Dr. Geoffrey MalafskyPhasic Systems - Dr. Geoffrey Malafsky
Phasic Systems - Dr. Geoffrey MalafskyInside Analysis
 

Más de Inside Analysis (20)

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BI
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter Integration
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data Letdown
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security
 
The Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeThe Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On Time
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of Data
 
The Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionThe Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop Adoption
 
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time Analytics
 
All Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingAll Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of Everything
 
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLGoodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
 
The Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelThe Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global Level
 
Structurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureStructurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your Architecture
 
SQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskSQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the Risk
 
The Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataThe Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big Data
 
A Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseA Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data Warehouse
 
Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile World
 
DisrupTech - Dave Duggal
DisrupTech - Dave DuggalDisrupTech - Dave Duggal
DisrupTech - Dave Duggal
 
Modus Operandi
Modus OperandiModus Operandi
Modus Operandi
 
Phasic Systems - Dr. Geoffrey Malafsky
Phasic Systems - Dr. Geoffrey MalafskyPhasic Systems - Dr. Geoffrey Malafsky
Phasic Systems - Dr. Geoffrey Malafsky
 

Último

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 

Último (20)

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 

The Maturity Model: Taking the Growing Pains Out of Hadoop

  • 1. Grab some coffee and enjoy the pre-­show banter before the top of the hour!
  • 2. The Briefing Room The Maturity Model: Taking the Growing Pains Out of Hadoop
  • 3. Twitter Tag: #briefr The Briefing Room Welcome Host: Eric Kavanagh eric.kavanagh@bloorgroup.com @eric_kavanagh
  • 4. Twitter Tag: #briefr The Briefing Room   Reveal the essential characteristics of enterprise software, good and bad   Provide a forum for detailed analysis of today s innovative technologies   Give vendors a chance to explain their product to savvy analysts   Allow audience members to pose serious questions... and get answers! Mission
  • 5. Twitter Tag: #briefr The Briefing Room Topics June: INNOVATORS July: SQL INNOVATION August: REAL-TIME DATA
  • 6. Twitter Tag: #briefr The Briefing Room Three Types of Evolution Ø  Disruptive selection Ø  Stabilizing selection Ø  Directional selection
  • 7. Twitter Tag: #briefr The Briefing Room Types of Evolution? WHAT GOES AROUND, COMES AROUND!
  • 8. Twitter Tag: #briefr The Briefing Room Analyst: Rick van der Lans Rick F. van der Lans is an independent analyst, consultant, author and lecturer specializing in data warehousing, business intelligence, analytics, big data and database technology. He is Managing Director of R20/Consultancy. He has advised many large companies worldwide on defining their business intelligence architectures. His popular IT books have been translated into many languages and have sold over 100,000 copies. Rick writes for TechTarget.com and B-eye-Network. For the last 25 years, he has been presenting professionally around the globe and at international events.
  • 9. Twitter Tag: #briefr The Briefing Room Think Big, A Teradata Company   Last year Teradata acquired Think Big Analytics, Inc., a consulting and solutions company focused on big data solutions   Think Big has expertise in implementing a variety of open source technologies, such as Hadoop, Hbase, Cassandra, MongoDB and Storm, as well as experience with Hortonworks, Cloudera and MapR   Its consultants can assist with the planning, management and deployment of big data implementations
  • 10. Twitter Tag: #briefr The Briefing Room Guest: Ron Bodkin Ron Bodkin is Founder & President of Think Big, A Teradata Company. Ron founded Think Big to help companies realize measurable value from Big Data. The company’s expertise spans all facets of data science and data engineering and helps our customers to drive maximum value from their Big Data initiative. Previously, Ron was vice president Engineering at Quantcast where he led the data science and engineer teams that pioneered the use of Hadoop and NoSQL for batch and real-time decision making. Prior to that, Ron was Founder of New Aspects, which provided enterprise consulting for Aspect-oriented programming. Ron was also Co-Founder and CTO of B2B applications provider C-Bridge, which he led to team of 900 people and a successful IPO. Ron graduated with honors from McGill University with a B.S. in Math and Computer Science. Ron also earned his Master’s Degree in Computer Science from MIT, leaving the PhD program after presenting the idea for C-bridge and placing in the finals of the 50k Entrepreneurship Contest.
  • 11. MAKING BIG DATA COME ALIVE ​ The Maturity Model: Taking the Growing Pains out of Hadoop ​ Ron Bodkin ​ Founder and President, Think Big
  • 12. 12 •  What is Big Data? •  Hadoop Maturity Spectrum (from MapReduce to Integrated Hadoop Architecture) •  Taking the Growing Pains out of Hadoop Agenda
  • 13. 13 •  Data sets so large and complex that they become awkward to work with using standard tools and techniques •  Google, Quantcast, Yahoo, LinkedIn… customer innovation in open source •  Discover value in dark data and relationships among data sets –  Customer behavior –  Product behavior •  Evolution: insight -> operational -> business innovation Source: Hortonworks What is Big Data? © 2015 Think Big, a Teradata Company 6/16/15
  • 14. 14 Hadoop (Circa 2008) •  HDFS - Distributed File System •  MapReduce distributed parallel batch processing •  Community Open Source since 2005 many contributors •  Industry commitment: over $1 Billion invested in startups, all major software companies embrace •  3 replicas for HA •  50 PB+ scale •  Great for analytics, batch processing © 2015 Think Big, a Teradata Company 6/16/15
  • 15. 15 SQL on Hadoop (Circa 2009 to Today) •  Expanded access to data in Hadoop •  Great for exploration of “raw” data, e.g., by data scientists –  Can integrate UDFs –  Tools integration expanding access beyond SQL power users •  Fast moving space •  Also used for ETL/ELT processing © 2015 Think Big, a Teradata Company 6/16/15
  • 16. 16 •  Support for many processing engines including Spark, Storm, Tez •  Common cluster with local data access in HDFS Hadoop (Circa 2012) YARN - Yet Another Resource Negotiator 16 © 2015 Think Big, a Teradata Company 6/16/15
  • 17. 17 © 2015 Think Big, a Teradata Company 6/16/15 Apache Spark (Circa 2013) Spark SQL Spark Streaming MLib (Machine learning) GraphX (Graph) Apache Spark Core Engine YARN JDBC/ODBC HDFS
  • 18. 18 OLAP on Hadoop (Circa 2014) Events Reporting, Visualization and Analytics Traditional BI on Hadoop •  Day or week old data •  Long latencies to retrieve data •  Doesn’t scale as number of events, aggregates, or users grow •  Expensive, siloed event analytics •  Real time data •  Unlimited custom reporting (drill to trillions of aggregates) •  Scales to thousands of simultaneous users •  Subsecond latency •  Omni-channel analytics OLAP on Hadoop © 2015 Think Big, a Teradata Company 6/16/15
  • 19. 19 Today: Integrated Hadoop Architecture © 2015 Think Big, a Teradata Company 6/16/15
  • 20. MAKING BIG DATA COME ALIVE ​ How to Take the Growing Pains out of Hadoop
  • 21. 21 •  Picking the wrong starting point – ho hum, boil the ocean •  Immature Governance •  Siloed Organization •  Change Management •  Skills Gap •  Business as usual •  Not fitting in with ecosystem Big Data Pitfalls © 2015 Think Big, a Teradata Company 6/16/15
  • 22. 22 Big Data Center of Excellence © 2015 Think Big, a Teradata Company 6/16/15
  • 23. 23 Cross-Functional Collaboration Data Weak intersection between Data, Systems and Business often means project failure Current State Improved State Collaborative integration of Analytics with Big Data capabilities for success. Data Weak Area with diminished influence Cross-functional collaboration Data-driven business alignment © 2015 Think Big, a Teradata Company 6/16/15
  • 24. 24 A “Data Lake” is a centralized approach to capturing, refining, storing, and exploring any form of raw data at scale, enabled by low cost technologies from which downstream facilities may draw. What is a Data Lake? Information Sources Data Lake Downstream Facilities Data Variety is the driving factor in building a Data Lake. © 2015 Think Big, a Teradata Company 6/16/15
  • 25. 25 Why a Data Lake? •  Benefit from open source big data innovation and cost savings •  Control over enterprise data with governance •  Offload history of operational and analytical data platforms •  Remix ETL, freeing capacity downstream •  Gain ultimate flexibility in data use and access •  Foundation for solving business problems –  Big data solutions enable new uses of data via working with new data sets and analytics © 2015 Think Big, a Teradata Company 6/16/15
  • 26. 26 •  Corporate Repository – Data sourcing repository, possibly system of record •  Active Archive – Offloaded history of operational and analytical data platforms that needs to be used infrequently •  Discovery – Data Profiling, schema on read, wrangling, discover signals •  ETL Offload – Move all or parts of current ETL processes into Hadoop •  NEW ETL Development – Develop new ETL processes on the Data Lake •  Analytics – Data Science, Custom/product built Analytics solutions (i.e. churn, machine learning) •  Business Intelligence/Reporting – Event analysis, dimensional rollup and pivot reports Data Lake Use Cases © 2015 Think Big, a Teradata Company 6/16/15
  • 27. 27 Swamp Reservoir Data Lake: Swamp or Reservoir? © 2015 Think Big, a Teradata Company 6/16/15
  • 28. 28 Data Lake Information Sources Evaluate Source Data Ingest Collect & Manage Metadata Profile - Structure Sequence Downstream Facilities Generate Reports Discovery Signals Compress Automate Protect Prepare Data for Ingest Prepare Source Metadata A centralized well-governed repository of raw data and metadata into which all data-producing streams flow from which business analysts & downstream facilities may draw. Data Reservoir © 2015 Think Big, a Teradata Company 6/16/15
  • 29. 29 Metadata is required to facilitate location of and entitlements to data Schema Metadata is a foundation, but… Operational Metadata & Business-Security is critical to governance •  Where did it come from? … What is the data serialization? •  Who owns the data? Who can see the data? Who belongs to which group? •  What is the environment (landing zone, OS, Line of Business)? •  What processes touched my data? •  Did you lose any data (Checksums, etc.)? •  When did the data get ingested, transformed? •  Did it get exported, or archived…when, where how will it be used (organizational)? Governance & Security © 2015 Think Big, a Teradata Company 6/16/15
  • 30. 30 High Tech Manufacturer Example •  Enterprise-wide data access (ODS) for timely analytics and insights •  Foundation for large scale proactive analytics •  Reduce $Millions in scrap waste •  Drives revenue and market share by accelerating time to market Legacy Data Lake Retention 3-6 months scattered DBs then tape archive All data online in common data reservoir for 3+ years Coverage Summaries, samples for several data sets All parametric data captured in raw and integrated form Analysis Reactive resulting in missed improvement opportunities In daily operations and on larger data sets to support proactive improvements © 2015 Think Big, a Teradata Company 6/16/15
  • 31. 31 •  Insight into new data (one time analysis) •  Decision support models (offline insights in production) •  Operational models (real-time analytics) •  Hadoop opportunities: new data, new algorithms, different resolution, frequency •  Hadoop blends with MPP, analytics, tools not the sole tool Data Science Analytics © 2015 Think Big, a Teradata Company 6/16/15
  • 32. 32 Real Time Analytics Architecture Clickstream Call Center email Brick & Mortar Batch Storage (HDFS) Event Repository (HBase) API Server CIM ML Personali zation A/B Testing Reporting Call Center Channel Attribution © 2015 Think Big, a Teradata Company 6/16/15
  • 33. 33 •  We’ve come a long way since MapReduce 1.0 enabled by YARN •  Don’t forget about the last mile for business (SQL on Hadoop, OLAP for Hadoop, training, etc.) •  Data lakes without governance, security, metadata management equates to failed projects/loss of $$ •  Analytics on big data is where the real business value lies – from understanding to models to operationalization Exploiting the Power of Hadoop © 2015 Think Big, a Teradata Company 6/16/15
  • 34. 34 •  100% Big Data Focus •  Founded in 2010 with100+ engagements across 70 clients •  Unlock value of big data with data science and data engineering services •  Proven vendor-neutral open source integration expertise •  Agile team-based development methodology •  Think Big Academy for skills and organizational development •  Global delivery model Who is Think Big?
  • 35. Twitter Tag: #briefr The Briefing Room Perceptions & Questions Analyst: Rick van der Lans
  • 36. Copyright  ©  1991  -­‐  2015  R20/Consultancy  B.V.,  The   Hague,  The  Netherlands.  All  rights  reserved.  No  part   of  this  material  may  be  reproduced,  stored  in  a   retrieval  system,  or  transmiHed  in  any  form  or  by   any  means,  electronic,  mechanical,  photographic,  or   otherwise,  without  the  explicit  wriHen  permission  of   the  copyright  owners.     The  Next  Stage  of   Hadoop  and  Big  Data:   Simplifica9on   by   Rick  F.  van  der  Lans   R20/Consultancy  BV   TwiHer  @rick_vanderlans   www.r20.nl  
  • 37. Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 37 The  Adop9on  Stages  of  Technologies     Stage 0: Introduction and early adoption   Stage 1: Adoption by more traditional customers   Stage 2: Universal adoption Stage 0 Stage 1 Stage 2 Stage n
  • 38. Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 38 Adop9on  Stage  0     Functionality: Minimal   Scalability and performance: Promising   Customers: Those with very specific, urgent needs; they have tech-savvy developers   Use cases: Specific   Integration: Isolated solutions Stage 0 Stage 1 Stage 2 Stage n
  • 39. Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 39 Adop9on  Stage  1     Functionality: Sufficient for some mission-critical systems   Scalability and performance: Medium   Customers: Those with very specific, urgent needs; they have tech-savvy developers   Use cases: More general   Integration: Isolated solutions Stage 0 Stage 1 Stage 2 Stage n
  • 40. Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 40 Adop9on  Stage  2     Functionality: Full   Scalability and performance: High   Customers: Traditional; not always with tech-savvy developers; focus on productivity and maintainability   Use cases: Specific and general   Integration: Integrated with traditional solutions Stage 0 Stage 1 Stage 2 Stage n
  • 41. Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 41 Different  SQL-­‐on-­‐Hadoop  Solu9ons   Apache   MapReduce  API         Apache   MapReduce   Apache  HDFS   API        Apache  HDFS   Apache  HDFS   API        Apache  HDFS   Apache  HBase   API        Apache  HBase   A  SQL  Dialect        SQL-­‐on-­‐Hadoop   A  SQL  Dialect        SQL-­‐on-­‐Hadoop   Apache  HDFS   API        Any  HDFS   A  SQL  Dialect        SQL-­‐on-­‐Hadoop   Apache  HDFS   API        Any  HDFS   A  SQL  Dialect        SQL-­‐on-­‐Hadoop   Spark  API        Spark  
  • 42. Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 42 Self-­‐Service  Data  Prepara9on     Non-technical interface for studying data files   Easy way of defining rules   Data is fixed by defining filters, not by changing data in source systems   Relationship with Data Blending
  • 43. Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 43 Example:  Data  Prepara9on  
  • 44. Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 44 Example:  Data  Prepara9on  
  • 45. Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 45 Example:  Data  Prepara9on  
  • 46. Copyright © 1991 - 2015 R20/Consultancy B.V., The Hague, The Netherlands 46 Stage  2  =  Simplifica9on  
  • 47. Questions on acceptance You talked about the skills gap. Sometimes Hadoop is used to develop operational systems, and sometimes to develop BI systems. Now, many specialists working in BICCs are good ETL programming, reporting, data modeling, but not so good at low-level programming. For example, many of them have never programmed Java. What would you recommend them to do to be able to use Hadoop? Should organizations use Hadoop for developing new systems, or for migrating existing systems? How easy do you think it is to integrate a new Hadoop-based system with the current infrastructure? Normally, when a technology becomes more and more popular, it becomes clearer and clearer to everyone what the costs and the risks are. For example, how much man-hours should be spend on managing a Hadoop environment? So, what’s the productivity of a MapReduce programmer versus a SQL programmer? Questions on architectural Now that so much data is produced in a distributed fashion, is the concept of physically centralizing data for integration really realistic? Isn’t big data too big to move? Over the years, large multi-national organizations have always struggled to develop one large centralized enterprise-wide data warehouse. In most cases, the problems were organizational. Who owns that data, and so on? Why would the data lake or data reservoir succeed today?
  • 48. Questions on technical aspects It was clear that for you YARN is a key module in the Hadoop platform. How mature is YARN today? How good is it, for example, in avoiding one massive query that monopolizes the entire platform and slows down all other applications? My feeling is still that HDFS is not good at joins of files. The SQL-on-Hadoop engines are not strong in joins either. Do you think this will change? For example, can you take a star model from a SQL-based data mart, consisting of 7 dimensions and 1 fact, migrate it to Hadoop and expect a great performance? Any idea what the tipping point is, when to forget about a proprietary SQL and move to Hadoop with SQL-on-Hadoop? You mentioned OLAP on Hadoop using Spark and you indicated that such a solution scales to thousands of simultaneous users. Is that really true? Has that been proven? Or does that work when the reporting tools extract data and load it in memory of the client machine? Almost every vendor of some big data tool publishes benchmark results showing how fast their product is. But many of these benchmarks are single user, single query benchmarks, which I think is not that useful. Why are they not showing multi-users benchmark results? Do you think they’re hiding something?
  • 49. On one of your slides you said that Hadoop is 50 PB+ scale. But realistically, how many companies on this planet want to store that much data? And if it is 50PB+ scale, doesn’t that mean for the majority of the companies it’s overkill of functionality? In general, how big should big be to justify the Hadoop platform? To end this list of questions and the finalize the briefing, you indicated that analytics on big data is where the real business value lies. I fully agree with that, but can you elaborate on this a little more? Questions on technical aspects, cont’d.
  • 50. Twitter Tag: #briefr The Briefing Room
  • 51. Twitter Tag: #briefr The Briefing Room Upcoming Topics www.insideanalysis.com June: INNOVATORS July: SQL INNOVATION August: REAL-TIME DATA
  • 52. Twitter Tag: #briefr The Briefing Room THANK YOU for your ATTENTION! Some images provided courtesy of Wikimedia Commons and "Selection Types Chart" by Azcolvin429 - Own work. Licensed under CC BY-SA 3.0 via Wikimedia Commons - https://commons.wikimedia.org/wiki/File:Selection_Types_Chart.png#/media/ File:Selection_Types_Chart.png and http://sagamer.co.za/img/evolution-phone.jpg