SlideShare una empresa de Scribd logo
1 de 39
Descargar para leer sin conexión
Grab some coffee and enjoy 
the pre-show banter before 
the top of the hour!
The New Frontier: Optimizing Big Data Exploration 
The Briefing Room
Twitter Tag: #briefr 
The Briefing Room 
Welcome 
Host: 
Eric Kavanagh 
eric.kavanagh@bloorgroup.com 
@eric_kavanagh
! Reveal the essential characteristics of enterprise software, 
good and bad 
! Provide a forum for detailed analysis of today’s innovative 
technologies 
! Give vendors a chance to explain their product to savvy 
analysts 
! Allow audience members to pose serious questions... and get 
answers! 
Twitter Tag: #briefr 
The Briefing Room 
Mission
Twitter Tag: #briefr 
The Briefing Room 
Topics 
This Month: BIG DATA 
March: CLOUD 
April: BIG DATA 
2014 Editorial Calendar at 
www.insideanalysis.com/webcasts/the-briefing-room
The Age of 
Exploration 
The Age of 
DATA
Twitter Tag: #briefr 
The Briefing Room 
Analyst: Robin Bloor 
Robin Bloor is 
Chief Analyst at 
The Bloor Group 
robin.bloor@bloorgroup.com 
@robinbloor
Twitter Tag: #briefr 
The Briefing Room 
Cirro 
! Cirro provides a single method to access any type of data, 
on any platform, in any environment 
! Its product suite consists of Cirro Data Hub, Analyst for 
Excel and Multi Store – all designed to remove complexity 
from Big Data analytics 
! Cirro’s products are cloud based and can run in public, 
private and on-premise environments
Twitter Tag: #briefr 
The Briefing Room 
Guest: Mark Theissen 
Mark is CEO at Cirro. He is a respected analytics and data 
warehousing expert with more than 22 years in the industry. 
Most recently Mark was the worldwide data warehousing 
technical lead at Microsoft following the acquisition of 
DATAllegro. At DATAllegro Mark was the COO and a member 
of the board of directors. Prior to joining DATAllegro, Mark 
was Vice President and Research Lead at META Group 
(Gartner Group) for Enterprise Analytics Strategies, covering data warehousing, 
business intelligence and data integration markets. Before META, Mark was VP 
of Professional Services at Accruent where he was responsible for domestic and 
overseas services and operations. Mark has a BS in Computer Information 
Systems from Chapman University and a MBA from the University of California, 
Irvine.
Briefing Room 2/11/14 
Next 
Genera*on 
Data 
Federa*on
On Demand Distributed Analysis 
Cirro is the ONLY Solution that can: 
• Access any data 
• On any platform 
• Without ETL or the cost and complexity of a 
semantic layer 
“ 
What 
used 
to 
take 
2-­‐4 
weeks 
is 
now 
done 
in 
a 
ma;er 
of 
minutes. 
Cirro 
is 
a 
‘game-­‐ 
changing’ 
approach 
to 
visualizing 
mul*-­‐ 
structured 
big 
data 
and 
integra*ng 
it 
with 
other 
data 
sources.” 
Director 
of 
Business 
Intelligence 
©2014 Cirro Inc. All rights reserved.
Cirro 
Data Hub 
©2014 Cirro Inc. All rights reserved. 
Cirro Enterprise Data Hub 
Visualization 
Tools 
Real-time 
Federation 
Data 
Language 
Translation 
Data 
Movement & 
Management 
RDBMS 
HDFS 
NoSql 
Legacy 
BI 
Tools 
CLI 
Excel 
SaaS
©2014 Cirro Inc. All rights reserved. 
How Federation Works 
I 
have 
a 
table 
on 
SQL 
Server 
that 
needs 
to 
join 
to 
tables 
on 
Oracle 
and 
Hadoop
©2014 Cirro Inc. All rights reserved. 
How Federation Works 
I 
have 
a 
table 
on 
SQL 
Server 
that 
needs 
to 
join 
to 
tables 
on 
Oracle 
and 
Hadoop 
Oracle 
Hadoop 
SQL 
Server 
SQL 
predicates, 
local 
joins 
SQL 
predicates 
Standard 
SQL 
Row 
processing 
pushed 
into 
data 
systems 
MapReduce
©2014 Cirro Inc. All rights reserved. 
How Federation Works 
I 
have 
a 
table 
on 
SQL 
Server 
that 
needs 
to 
join 
to 
tables 
on 
Oracle 
and 
Hadoop 
Oracle 
Hadoop 
SQL 
Server 
SQL 
predicates, 
local 
joins 
SQL 
predicates 
Standard 
SQL 
Row 
processing 
pushed 
into 
data 
systems 
MapReduce 
50k 
Rows 
50m 
Rows 
5k 
Rows
©2014 Cirro Inc. All rights reserved. 
How Federation Works 
I 
have 
a 
table 
on 
SQL 
Server 
that 
needs 
to 
join 
to 
tables 
on 
Oracle 
and 
Hadoop 
Oracle 
Hadoop 
SQL 
Server 
SQL 
predicates, 
local 
joins 
SQL 
predicates 
Standard 
SQL 
Row 
processing 
pushed 
into 
data 
systems 
MapReduce 
50k 
Rows 
50m 
Rows 
5k 
Rows 
Limited 
movement 
Limited 
movement
©2014 Cirro Inc. All rights reserved. 
How Federation Works 
I 
have 
a 
table 
on 
SQL 
Server 
that 
needs 
to 
join 
to 
tables 
on 
Oracle 
and 
Hadoop 
Oracle 
Hadoop 
SQL 
Server 
SQL 
join, 
aggregaEon 
Standard 
SQL 
Row 
processing 
pushed 
into 
data 
systems
©2014 Cirro Inc. All rights reserved. 
How Federation Works 
I 
have 
a 
table 
on 
SQL 
Server 
that 
needs 
to 
join 
to 
tables 
on 
Oracle 
and 
Hadoop 
Results 
Des)na)on 
Op)ons 
Oracle 
Hadoop 
SQL 
Server 
Results 
Standard 
SQL 
Row 
processing 
pushed 
into 
data 
systems 
UI 
Tools 
Data 
Marts; 
in 
the 
Cloud 
or 
Data 
Center 
BI 
Server
©2014 Cirro Inc. All rights reserved. 
Completing The Solution… 
• Cirro Data Hub – Federated query 
processing 
• Use any tool 
• The fastest distributed 
processing possible 
• Cirro Analyst 
• Data discovery 
• Mash up data like never before 
• Go beyond SQL 
• Publish 
• Cirro Multi Store 
• Stage, Store, Process 
• Highly scalable
Next Generation Data Federation 
Ask Questions You Couldn’t Ask Before 
• Designed & Built for Big Data 
• Compatible with structured, semi-structured & unstructured data 
• Works in the cloud, in the data center, or both 
©2014 Cirro Inc. All rights reserved. 
• Real-Time Federation 
• Queries are dynamically optimized and executed, taking the 
processing to the data 
• Enables ad-hoc query and exploration of all data 
• No Semantic Layer Required
Cirro Federation vs. Data Virtualization 
Cirro 
Data 
Virtualiza)on 
©2014 Cirro Inc. All rights reserved. 
• Excellent 
for 
data 
exploraEon 
and 
discovery 
• Excellent 
for 
ad-­‐hoc 
queries 
• True 
federated 
processing 
– 
minimal 
data 
movement, 
no 
server 
boPlenecks 
– 
‘pushes 
processing 
to 
the 
data’ 
• Easy 
setup, 
maintenance 
& 
administraEon 
• Hadoop 
– 
can 
execute, 
Hive 
& 
Impala 
queries 
along 
with 
MapReduce 
programs 
• Pathway 
to 
NoSQL 
• Not 
appropriate 
for 
data 
exploraEon 
or 
discovery 
– 
requires 
you 
to 
know 
the 
quesEons 
you 
want 
to 
ask 
in 
advance 
of 
accessing 
the 
data 
• Not 
true 
federated 
processing 
– 
final 
joins 
and 
aggregaEons 
done 
on 
VirtualizaEon 
Server 
• Good 
for 
structured 
data 
processing 
workloads 
• Labor 
intensive 
setup, 
maintenance 
& 
administraEon 
for 
modeling 
and 
semanEc 
layer 
• Hadoop 
– 
limited 
to 
Hive 
access
Data Federation Use Cases 
• On Demand Distributed Data Analysis 
• Data warehouse offloading 
• Business intelligence federation 
• Self-service data exploration and discovery 
• Entry point for private cloud analytics 
• SaaS, Hadoop and/or NoSQL integration with 
enterprise data sources 
• Simplify application development 
©2014 Cirro Inc. All rights reserved.
On Demand Intra-Day Analytics 
Solution 
• Cirro Data Hub; Cirro Analyst 
Results 
• On demand analytics supports faster 
trend analysis and the ability to identify 
data anomalies 
• Cross-platform data access reduced 
from weeks to minutes 
• Flexible/iterative using in-house BI tools 
• Enables self-service data mash-ups by 
analysts across all data sources 
©2014 Cirro Inc. All rights reserved. 
Business Challenge 
• Data that drives trading analytics & 
decisions in data silos 
• Inability to analyze data ‘fast enough’ to 
make informed trading decisions 
• ETL tools and manual data consolidation is 
too slow and inflexible for hourly or daily 
iterative analysis 
• Inability to join traditional data with cloud 
sources 
Financial 
& 
Energy 
/ 
UElity 
Markets
Ask Questions You Couldn’t Ask Before 
Last Market Price 
©2014 Cirro Inc. All rights reserved. 
Oracle - Pricing DW 
Transaction Data 
Tableau 
Actionable Visualizations 
Subscrip)on 
Market 
Data
Ask Questions You Couldn’t Ask Before 
©2014 Cirro Inc. All rights reserved. 
A 
Anonymous Behavior 
Transactional Data 
Ads viewed/clicked 
Actionable 
Visualizations
©2014 Cirro Inc. All rights reserved. 
The Business Impact 
• Agility; 
conducEng 
analysis 
previously 
unavailable 
• CompeEEve 
advantage 
• Supports 
ad-­‐hoc 
analysis 
, 
Fastest 
Eme 
to 
value 
• Leverage 
in-­‐house 
BI 
tools 
– 
no 
new 
tools 
to 
learn 
Improved 
Business 
OperaEons 
• TradiEonal 
architectures 
not 
designed 
for 
Big 
Data 
• Easily 
add 
new 
data 
sources 
– 
RDBMS, 
Hadoop, 
NoSQL 
• Easy 
to 
install, 
use 
& 
manage 
• Future 
proof 
analyEcs 
developed 
Streamline 
IT 
Processes 
• ReducEon 
in 
license 
costs 
on 
EDW 
and 
RDBMS 
• Time 
& 
cost 
savings 
associated 
with 
data 
staging, 
modeling, 
ETL 
work, 
etc. 
• No 
new 
BI 
applicaEons 
to 
buy 
-­‐ 
use 
exisEng 
BI 
tools 
• No 
new 
skills 
to 
develop 
Cost 
Savings
Twitter Tag: #briefr 
The Briefing Room 
Perceptions & Questions 
Analyst: 
Robin Bloor
The Visible “Big Data” Trend 
u Corporate data volumes 
grow at about 55% per 
annum - exponentially 
u Data has been growing at 
this rate for, maybe, 40 
years 
u There is nothing new 
about big data; it clings to 
an established exponential 
trend
The Invisible Trend: Moore’s Law Cubed 
u The biggest databases are new 
databases 
u They grow at the cube of Moore’s 
Law 
u Moore’s Law = 10x every 6 years 
u VLDB: 1000x every 6 years 
• 1991/2 megabytes 
• 1997/8 gigabytes 
• 2003/4 terabytes 
• 2009/10 petabytes 
• 2015/16 exabytes
Whys and Wherefores? 
u Why do we assemble such gargantuan heaps of 
data? 
u While the data volume has grown like bamboo 
in spring, the size of executables has not? 
u Why not just move the processing to the data? 
u This is surely an option worth exploring – maybe 
it is even one of the foundations for Big Data 
Architecture…
No Country for Old DBMS (Thinking)
Questions are Easy, Answers Difficult 
The 
WORKLOAD 
Conundrum 
The 
DISTRIBUTION 
Conundrum 
The DATA 
FLOW 
Conundrum 
The REAL-TIME 
Conundrum
u What are the primary applications where Cirro 
makes a big impact? 
u What is (roughly) the largest number of data 
sources Cirro federates in any implementation? 
u What’s the most resource deployed for the 
largest Cirro implementation? How much 
memory? 
u How does “fault tolerance” work?
u How difficult is it to develop applications 
employing Cirro? Is it significantly different to a 
DBMS? 
u Are any companies adopting this technology 
strategically? 
u Which technologies/companies do you regard as 
competition?
Twitter Tag: #briefr 
The Briefing Room
This Month: BIG DATA 
March: CLOUD 
April: BIG DATA 
www.insideanalysis.com/webcasts/the-briefing-room 
Twitter Tag: #briefr 
The Briefing Room 
Upcoming Topics 
2014 Editorial Calendar at 
www.insideanalysis.com
Twitter Tag: #briefr 
THANK YOU 
for your 
ATTENTION! 
The Briefing Room
Photo credit for Slide 28: 
Lenny’s Alice in Wonderland site: http://www.alice-in-wonderland.net/ 
Twitter Tag: #briefr 
The Briefing Room

Más contenido relacionado

La actualidad más candente

Riga dev day 2016 adding a data reservoir and oracle bdd to extend your ora...
Riga dev day 2016   adding a data reservoir and oracle bdd to extend your ora...Riga dev day 2016   adding a data reservoir and oracle bdd to extend your ora...
Riga dev day 2016 adding a data reservoir and oracle bdd to extend your ora...Mark Rittman
 
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...NoSQLmatters
 
Introduction To Big Data & Hadoop
Introduction To Big Data & HadoopIntroduction To Big Data & Hadoop
Introduction To Big Data & HadoopBlackvard
 
Big Data 2.0: YARN Enablement for Distributed ETL & SQL with Hadoop
Big Data 2.0: YARN Enablement for Distributed ETL & SQL with HadoopBig Data 2.0: YARN Enablement for Distributed ETL & SQL with Hadoop
Big Data 2.0: YARN Enablement for Distributed ETL & SQL with HadoopCaserta
 
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...StampedeCon
 
Data Discovery and BI - Is there Really a Difference?
Data Discovery and BI - Is there Really a Difference?Data Discovery and BI - Is there Really a Difference?
Data Discovery and BI - Is there Really a Difference?Inside Analysis
 
Turn Data Into Actionable Insights - StampedeCon 2016
Turn Data Into Actionable Insights - StampedeCon 2016Turn Data Into Actionable Insights - StampedeCon 2016
Turn Data Into Actionable Insights - StampedeCon 2016StampedeCon
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeCaserta
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for EveryoneCaserta
 
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...Mark Rittman
 
Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...
Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...
Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...Mark Rittman
 
LinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchLinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchSheetal Pratik
 
How to get started in Big Data without Big Costs - StampedeCon 2016
How to get started in Big Data without Big Costs - StampedeCon 2016How to get started in Big Data without Big Costs - StampedeCon 2016
How to get started in Big Data without Big Costs - StampedeCon 2016StampedeCon
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachSoftServe
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaCaserta
 
You're the New CDO, Now What?
You're the New CDO, Now What?You're the New CDO, Now What?
You're the New CDO, Now What?Caserta
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation Caserta
 
Big Data & SQL: The On-Ramp to Hadoop
Big Data & SQL: The On-Ramp to Hadoop Big Data & SQL: The On-Ramp to Hadoop
Big Data & SQL: The On-Ramp to Hadoop Inside Analysis
 
Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches DataWorks Summit
 

La actualidad más candente (20)

Riga dev day 2016 adding a data reservoir and oracle bdd to extend your ora...
Riga dev day 2016   adding a data reservoir and oracle bdd to extend your ora...Riga dev day 2016   adding a data reservoir and oracle bdd to extend your ora...
Riga dev day 2016 adding a data reservoir and oracle bdd to extend your ora...
 
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
 
Introduction To Big Data & Hadoop
Introduction To Big Data & HadoopIntroduction To Big Data & Hadoop
Introduction To Big Data & Hadoop
 
Big Data 2.0: YARN Enablement for Distributed ETL & SQL with Hadoop
Big Data 2.0: YARN Enablement for Distributed ETL & SQL with HadoopBig Data 2.0: YARN Enablement for Distributed ETL & SQL with Hadoop
Big Data 2.0: YARN Enablement for Distributed ETL & SQL with Hadoop
 
Smart data for a predictive bank
Smart data for a predictive bankSmart data for a predictive bank
Smart data for a predictive bank
 
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
Enterprise Search: Addressing the First Problem of Big Data & Analytics - Sta...
 
Data Discovery and BI - Is there Really a Difference?
Data Discovery and BI - Is there Really a Difference?Data Discovery and BI - Is there Really a Difference?
Data Discovery and BI - Is there Really a Difference?
 
Turn Data Into Actionable Insights - StampedeCon 2016
Turn Data Into Actionable Insights - StampedeCon 2016Turn Data Into Actionable Insights - StampedeCon 2016
Turn Data Into Actionable Insights - StampedeCon 2016
 
Big Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data LakeBig Data: Setting Up the Big Data Lake
Big Data: Setting Up the Big Data Lake
 
Making Big Data Easy for Everyone
Making Big Data Easy for EveryoneMaking Big Data Easy for Everyone
Making Big Data Easy for Everyone
 
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
Social Network Analysis using Oracle Big Data Spatial & Graph (incl. why I di...
 
Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...
Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...
Using Oracle Big Data SQL 3.0 to add Hadoop & NoSQL to your Oracle Data Wareh...
 
LinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbenchLinkedInSaxoBankDataWorkbench
LinkedInSaxoBankDataWorkbench
 
How to get started in Big Data without Big Costs - StampedeCon 2016
How to get started in Big Data without Big Costs - StampedeCon 2016How to get started in Big Data without Big Costs - StampedeCon 2016
How to get started in Big Data without Big Costs - StampedeCon 2016
 
Agile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric ApproachAgile Big Data Analytics Development: An Architecture-Centric Approach
Agile Big Data Analytics Development: An Architecture-Centric Approach
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with Cloudera
 
You're the New CDO, Now What?
You're the New CDO, Now What?You're the New CDO, Now What?
You're the New CDO, Now What?
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation
 
Big Data & SQL: The On-Ramp to Hadoop
Big Data & SQL: The On-Ramp to Hadoop Big Data & SQL: The On-Ramp to Hadoop
Big Data & SQL: The On-Ramp to Hadoop
 
Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches
 

Similar a The New Frontier: Optimizing Big Data Exploration

Big Data Analytics with Microsoft
Big Data Analytics with MicrosoftBig Data Analytics with Microsoft
Big Data Analytics with MicrosoftCaserta
 
Big Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San JoseBig Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San JoseJeffrey T. Pollock
 
Bridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudBridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudInside Analysis
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopInside Analysis
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database RoundtableEric Kavanagh
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsCaserta
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsDenodo
 
The Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with VirtualizationThe Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with VirtualizationInside Analysis
 
The Great Lakes: How to Approach a Big Data Implementation
The Great Lakes: How to Approach a Big Data ImplementationThe Great Lakes: How to Approach a Big Data Implementation
The Great Lakes: How to Approach a Big Data ImplementationInside Analysis
 
Business in the Driver’s Seat – An Improved Model for Integration
Business in the Driver’s Seat – An Improved Model for IntegrationBusiness in the Driver’s Seat – An Improved Model for Integration
Business in the Driver’s Seat – An Improved Model for IntegrationInside Analysis
 
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti
 
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...jdijcks
 
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsPower to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsLooker
 
Data Pipelines and Tools to Integrate with Power BI and Spotfire.pdf
Data Pipelines and Tools to Integrate with Power BI and Spotfire.pdfData Pipelines and Tools to Integrate with Power BI and Spotfire.pdf
Data Pipelines and Tools to Integrate with Power BI and Spotfire.pdfGregKreutzer2
 
Level Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop AccelerationLevel Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop AccelerationInside Analysis
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubCloudera, Inc.
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Inside Analysis
 
Gab Genai Cloudera - Going Beyond Traditional Analytic
Gab Genai Cloudera - Going Beyond Traditional Analytic Gab Genai Cloudera - Going Beyond Traditional Analytic
Gab Genai Cloudera - Going Beyond Traditional Analytic IntelAPAC
 
Options for Data Prep - A Survey of the Current Market
Options for Data Prep - A Survey of the Current MarketOptions for Data Prep - A Survey of the Current Market
Options for Data Prep - A Survey of the Current MarketDremio Corporation
 

Similar a The New Frontier: Optimizing Big Data Exploration (20)

Big Data Analytics with Microsoft
Big Data Analytics with MicrosoftBig Data Analytics with Microsoft
Big Data Analytics with Microsoft
 
Big Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San JoseBig Data at Oracle - Strata 2015 San Jose
Big Data at Oracle - Strata 2015 San Jose
 
Bridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the CloudBridging the Gap: Analyzing Data in and Below the Cloud
Bridging the Gap: Analyzing Data in and Below the Cloud
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of HadoopThe Maturity Model: Taking the Growing Pains Out of Hadoop
The Maturity Model: Taking the Growing Pains Out of Hadoop
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
Architecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment OptionsArchitecting for Big Data: Trends, Tips, and Deployment Options
Architecting for Big Data: Trends, Tips, and Deployment Options
 
Self-Service Analytics with Guard Rails
Self-Service Analytics with Guard RailsSelf-Service Analytics with Guard Rails
Self-Service Analytics with Guard Rails
 
The Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with VirtualizationThe Agile Analyst: Solving the Data Problem with Virtualization
The Agile Analyst: Solving the Data Problem with Virtualization
 
The Great Lakes: How to Approach a Big Data Implementation
The Great Lakes: How to Approach a Big Data ImplementationThe Great Lakes: How to Approach a Big Data Implementation
The Great Lakes: How to Approach a Big Data Implementation
 
Hadoop and SAP BI
Hadoop and SAP BI   Hadoop and SAP BI
Hadoop and SAP BI
 
Business in the Driver’s Seat – An Improved Model for Integration
Business in the Driver’s Seat – An Improved Model for IntegrationBusiness in the Driver’s Seat – An Improved Model for Integration
Business in the Driver’s Seat – An Improved Model for Integration
 
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to Production
 
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
Oracle Openworld Presentation with Paul Kent (SAS) on Big Data Appliance and ...
 
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven DecisionsPower to the People: A Stack to Empower Every User to Make Data-Driven Decisions
Power to the People: A Stack to Empower Every User to Make Data-Driven Decisions
 
Data Pipelines and Tools to Integrate with Power BI and Spotfire.pdf
Data Pipelines and Tools to Integrate with Power BI and Spotfire.pdfData Pipelines and Tools to Integrate with Power BI and Spotfire.pdf
Data Pipelines and Tools to Integrate with Power BI and Spotfire.pdf
 
Level Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop AccelerationLevel Up – How to Achieve Hadoop Acceleration
Level Up – How to Achieve Hadoop Acceleration
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion
 
Gab Genai Cloudera - Going Beyond Traditional Analytic
Gab Genai Cloudera - Going Beyond Traditional Analytic Gab Genai Cloudera - Going Beyond Traditional Analytic
Gab Genai Cloudera - Going Beyond Traditional Analytic
 
Options for Data Prep - A Survey of the Current Market
Options for Data Prep - A Survey of the Current MarketOptions for Data Prep - A Survey of the Current Market
Options for Data Prep - A Survey of the Current Market
 

Más de Inside Analysis

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIInside Analysis
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessInside Analysis
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationInside Analysis
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownInside Analysis
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security Inside Analysis
 
The Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeThe Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeInside Analysis
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataInside Analysis
 
The Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionThe Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionInside Analysis
 
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsInside Analysis
 
All Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingAll Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingInside Analysis
 
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLGoodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLInside Analysis
 
The Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelThe Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelInside Analysis
 
Structurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureStructurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureInside Analysis
 
SQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskSQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskInside Analysis
 
The Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataThe Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataInside Analysis
 
A Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseA Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseInside Analysis
 
Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldInside Analysis
 
DisrupTech - Dave Duggal
DisrupTech - Dave DuggalDisrupTech - Dave Duggal
DisrupTech - Dave DuggalInside Analysis
 
Phasic Systems - Dr. Geoffrey Malafsky
Phasic Systems - Dr. Geoffrey MalafskyPhasic Systems - Dr. Geoffrey Malafsky
Phasic Systems - Dr. Geoffrey MalafskyInside Analysis
 

Más de Inside Analysis (20)

An Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BIAn Ounce of Prevention: Forging Healthy BI
An Ounce of Prevention: Forging Healthy BI
 
Agile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for SuccessAgile, Automated, Aware: How to Model for Success
Agile, Automated, Aware: How to Model for Success
 
First in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter IntegrationFirst in Class: Optimizing the Data Lake for Tighter Integration
First in Class: Optimizing the Data Lake for Tighter Integration
 
Fit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data LetdownFit For Purpose: Preventing a Big Data Letdown
Fit For Purpose: Preventing a Big Data Letdown
 
To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security To Serve and Protect: Making Sense of Hadoop Security
To Serve and Protect: Making Sense of Hadoop Security
 
The Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On TimeThe Hadoop Guarantee: Keeping Analytics Running On Time
The Hadoop Guarantee: Keeping Analytics Running On Time
 
Introducing: A Complete Algebra of Data
Introducing: A Complete Algebra of DataIntroducing: A Complete Algebra of Data
Introducing: A Complete Algebra of Data
 
The Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop AdoptionThe Role of Data Wrangling in Driving Hadoop Adoption
The Role of Data Wrangling in Driving Hadoop Adoption
 
Ahead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time AnalyticsAhead of the Stream: How to Future-Proof Real-Time Analytics
Ahead of the Stream: How to Future-Proof Real-Time Analytics
 
All Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of EverythingAll Together Now: Connected Analytics for the Internet of Everything
All Together Now: Connected Analytics for the Internet of Everything
 
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETLGoodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
Goodbye, Bottlenecks: How Scale-Out and In-Memory Solve ETL
 
The Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global LevelThe Biggest Picture: Situational Awareness on a Global Level
The Biggest Picture: Situational Awareness on a Global Level
 
Structurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your ArchitectureStructurally Sound: How to Tame Your Architecture
Structurally Sound: How to Tame Your Architecture
 
SQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the RiskSQL In Hadoop: Big Data Innovation Without the Risk
SQL In Hadoop: Big Data Innovation Without the Risk
 
The Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big DataThe Perfect Fit: Scalable Graph for Big Data
The Perfect Fit: Scalable Graph for Big Data
 
A Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data WarehouseA Revolutionary Approach to Modernizing the Data Warehouse
A Revolutionary Approach to Modernizing the Data Warehouse
 
Rethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile WorldRethinking Data Availability and Governance in a Mobile World
Rethinking Data Availability and Governance in a Mobile World
 
DisrupTech - Dave Duggal
DisrupTech - Dave DuggalDisrupTech - Dave Duggal
DisrupTech - Dave Duggal
 
Modus Operandi
Modus OperandiModus Operandi
Modus Operandi
 
Phasic Systems - Dr. Geoffrey Malafsky
Phasic Systems - Dr. Geoffrey MalafskyPhasic Systems - Dr. Geoffrey Malafsky
Phasic Systems - Dr. Geoffrey Malafsky
 

Último

So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024TopCSSGallery
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 

Último (20)

So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024Top 10 Hubspot Development Companies in 2024
Top 10 Hubspot Development Companies in 2024
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 

The New Frontier: Optimizing Big Data Exploration

  • 1. Grab some coffee and enjoy the pre-show banter before the top of the hour!
  • 2. The New Frontier: Optimizing Big Data Exploration The Briefing Room
  • 3. Twitter Tag: #briefr The Briefing Room Welcome Host: Eric Kavanagh eric.kavanagh@bloorgroup.com @eric_kavanagh
  • 4. ! Reveal the essential characteristics of enterprise software, good and bad ! Provide a forum for detailed analysis of today’s innovative technologies ! Give vendors a chance to explain their product to savvy analysts ! Allow audience members to pose serious questions... and get answers! Twitter Tag: #briefr The Briefing Room Mission
  • 5. Twitter Tag: #briefr The Briefing Room Topics This Month: BIG DATA March: CLOUD April: BIG DATA 2014 Editorial Calendar at www.insideanalysis.com/webcasts/the-briefing-room
  • 6. The Age of Exploration The Age of DATA
  • 7. Twitter Tag: #briefr The Briefing Room Analyst: Robin Bloor Robin Bloor is Chief Analyst at The Bloor Group robin.bloor@bloorgroup.com @robinbloor
  • 8. Twitter Tag: #briefr The Briefing Room Cirro ! Cirro provides a single method to access any type of data, on any platform, in any environment ! Its product suite consists of Cirro Data Hub, Analyst for Excel and Multi Store – all designed to remove complexity from Big Data analytics ! Cirro’s products are cloud based and can run in public, private and on-premise environments
  • 9. Twitter Tag: #briefr The Briefing Room Guest: Mark Theissen Mark is CEO at Cirro. He is a respected analytics and data warehousing expert with more than 22 years in the industry. Most recently Mark was the worldwide data warehousing technical lead at Microsoft following the acquisition of DATAllegro. At DATAllegro Mark was the COO and a member of the board of directors. Prior to joining DATAllegro, Mark was Vice President and Research Lead at META Group (Gartner Group) for Enterprise Analytics Strategies, covering data warehousing, business intelligence and data integration markets. Before META, Mark was VP of Professional Services at Accruent where he was responsible for domestic and overseas services and operations. Mark has a BS in Computer Information Systems from Chapman University and a MBA from the University of California, Irvine.
  • 10. Briefing Room 2/11/14 Next Genera*on Data Federa*on
  • 11. On Demand Distributed Analysis Cirro is the ONLY Solution that can: • Access any data • On any platform • Without ETL or the cost and complexity of a semantic layer “ What used to take 2-­‐4 weeks is now done in a ma;er of minutes. Cirro is a ‘game-­‐ changing’ approach to visualizing mul*-­‐ structured big data and integra*ng it with other data sources.” Director of Business Intelligence ©2014 Cirro Inc. All rights reserved.
  • 12. Cirro Data Hub ©2014 Cirro Inc. All rights reserved. Cirro Enterprise Data Hub Visualization Tools Real-time Federation Data Language Translation Data Movement & Management RDBMS HDFS NoSql Legacy BI Tools CLI Excel SaaS
  • 13. ©2014 Cirro Inc. All rights reserved. How Federation Works I have a table on SQL Server that needs to join to tables on Oracle and Hadoop
  • 14. ©2014 Cirro Inc. All rights reserved. How Federation Works I have a table on SQL Server that needs to join to tables on Oracle and Hadoop Oracle Hadoop SQL Server SQL predicates, local joins SQL predicates Standard SQL Row processing pushed into data systems MapReduce
  • 15. ©2014 Cirro Inc. All rights reserved. How Federation Works I have a table on SQL Server that needs to join to tables on Oracle and Hadoop Oracle Hadoop SQL Server SQL predicates, local joins SQL predicates Standard SQL Row processing pushed into data systems MapReduce 50k Rows 50m Rows 5k Rows
  • 16. ©2014 Cirro Inc. All rights reserved. How Federation Works I have a table on SQL Server that needs to join to tables on Oracle and Hadoop Oracle Hadoop SQL Server SQL predicates, local joins SQL predicates Standard SQL Row processing pushed into data systems MapReduce 50k Rows 50m Rows 5k Rows Limited movement Limited movement
  • 17. ©2014 Cirro Inc. All rights reserved. How Federation Works I have a table on SQL Server that needs to join to tables on Oracle and Hadoop Oracle Hadoop SQL Server SQL join, aggregaEon Standard SQL Row processing pushed into data systems
  • 18. ©2014 Cirro Inc. All rights reserved. How Federation Works I have a table on SQL Server that needs to join to tables on Oracle and Hadoop Results Des)na)on Op)ons Oracle Hadoop SQL Server Results Standard SQL Row processing pushed into data systems UI Tools Data Marts; in the Cloud or Data Center BI Server
  • 19. ©2014 Cirro Inc. All rights reserved. Completing The Solution… • Cirro Data Hub – Federated query processing • Use any tool • The fastest distributed processing possible • Cirro Analyst • Data discovery • Mash up data like never before • Go beyond SQL • Publish • Cirro Multi Store • Stage, Store, Process • Highly scalable
  • 20. Next Generation Data Federation Ask Questions You Couldn’t Ask Before • Designed & Built for Big Data • Compatible with structured, semi-structured & unstructured data • Works in the cloud, in the data center, or both ©2014 Cirro Inc. All rights reserved. • Real-Time Federation • Queries are dynamically optimized and executed, taking the processing to the data • Enables ad-hoc query and exploration of all data • No Semantic Layer Required
  • 21. Cirro Federation vs. Data Virtualization Cirro Data Virtualiza)on ©2014 Cirro Inc. All rights reserved. • Excellent for data exploraEon and discovery • Excellent for ad-­‐hoc queries • True federated processing – minimal data movement, no server boPlenecks – ‘pushes processing to the data’ • Easy setup, maintenance & administraEon • Hadoop – can execute, Hive & Impala queries along with MapReduce programs • Pathway to NoSQL • Not appropriate for data exploraEon or discovery – requires you to know the quesEons you want to ask in advance of accessing the data • Not true federated processing – final joins and aggregaEons done on VirtualizaEon Server • Good for structured data processing workloads • Labor intensive setup, maintenance & administraEon for modeling and semanEc layer • Hadoop – limited to Hive access
  • 22. Data Federation Use Cases • On Demand Distributed Data Analysis • Data warehouse offloading • Business intelligence federation • Self-service data exploration and discovery • Entry point for private cloud analytics • SaaS, Hadoop and/or NoSQL integration with enterprise data sources • Simplify application development ©2014 Cirro Inc. All rights reserved.
  • 23. On Demand Intra-Day Analytics Solution • Cirro Data Hub; Cirro Analyst Results • On demand analytics supports faster trend analysis and the ability to identify data anomalies • Cross-platform data access reduced from weeks to minutes • Flexible/iterative using in-house BI tools • Enables self-service data mash-ups by analysts across all data sources ©2014 Cirro Inc. All rights reserved. Business Challenge • Data that drives trading analytics & decisions in data silos • Inability to analyze data ‘fast enough’ to make informed trading decisions • ETL tools and manual data consolidation is too slow and inflexible for hourly or daily iterative analysis • Inability to join traditional data with cloud sources Financial & Energy / UElity Markets
  • 24. Ask Questions You Couldn’t Ask Before Last Market Price ©2014 Cirro Inc. All rights reserved. Oracle - Pricing DW Transaction Data Tableau Actionable Visualizations Subscrip)on Market Data
  • 25. Ask Questions You Couldn’t Ask Before ©2014 Cirro Inc. All rights reserved. A Anonymous Behavior Transactional Data Ads viewed/clicked Actionable Visualizations
  • 26. ©2014 Cirro Inc. All rights reserved. The Business Impact • Agility; conducEng analysis previously unavailable • CompeEEve advantage • Supports ad-­‐hoc analysis , Fastest Eme to value • Leverage in-­‐house BI tools – no new tools to learn Improved Business OperaEons • TradiEonal architectures not designed for Big Data • Easily add new data sources – RDBMS, Hadoop, NoSQL • Easy to install, use & manage • Future proof analyEcs developed Streamline IT Processes • ReducEon in license costs on EDW and RDBMS • Time & cost savings associated with data staging, modeling, ETL work, etc. • No new BI applicaEons to buy -­‐ use exisEng BI tools • No new skills to develop Cost Savings
  • 27. Twitter Tag: #briefr The Briefing Room Perceptions & Questions Analyst: Robin Bloor
  • 28.
  • 29. The Visible “Big Data” Trend u Corporate data volumes grow at about 55% per annum - exponentially u Data has been growing at this rate for, maybe, 40 years u There is nothing new about big data; it clings to an established exponential trend
  • 30. The Invisible Trend: Moore’s Law Cubed u The biggest databases are new databases u They grow at the cube of Moore’s Law u Moore’s Law = 10x every 6 years u VLDB: 1000x every 6 years • 1991/2 megabytes • 1997/8 gigabytes • 2003/4 terabytes • 2009/10 petabytes • 2015/16 exabytes
  • 31. Whys and Wherefores? u Why do we assemble such gargantuan heaps of data? u While the data volume has grown like bamboo in spring, the size of executables has not? u Why not just move the processing to the data? u This is surely an option worth exploring – maybe it is even one of the foundations for Big Data Architecture…
  • 32. No Country for Old DBMS (Thinking)
  • 33. Questions are Easy, Answers Difficult The WORKLOAD Conundrum The DISTRIBUTION Conundrum The DATA FLOW Conundrum The REAL-TIME Conundrum
  • 34. u What are the primary applications where Cirro makes a big impact? u What is (roughly) the largest number of data sources Cirro federates in any implementation? u What’s the most resource deployed for the largest Cirro implementation? How much memory? u How does “fault tolerance” work?
  • 35. u How difficult is it to develop applications employing Cirro? Is it significantly different to a DBMS? u Are any companies adopting this technology strategically? u Which technologies/companies do you regard as competition?
  • 36. Twitter Tag: #briefr The Briefing Room
  • 37. This Month: BIG DATA March: CLOUD April: BIG DATA www.insideanalysis.com/webcasts/the-briefing-room Twitter Tag: #briefr The Briefing Room Upcoming Topics 2014 Editorial Calendar at www.insideanalysis.com
  • 38. Twitter Tag: #briefr THANK YOU for your ATTENTION! The Briefing Room
  • 39. Photo credit for Slide 28: Lenny’s Alice in Wonderland site: http://www.alice-in-wonderland.net/ Twitter Tag: #briefr The Briefing Room