SlideShare una empresa de Scribd logo
1 de 22
Descargar para leer sin conexión
Hortonworks 
Page 1 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
We Do Hadoop. We Do Retail. 
September 22, 2014
Our Mission: Power your Modern Data Architecture 
with HDP and Enterprise Apache Hadoop 
Who we are 
June 2011: Original 24 architects, developers, operators of Hadoop from Yahoo! 
June 2014: An enterprise software company with 500+ Employees 
Our model 
Innovate and deliver Apache Hadoop as a complete enterprise data platform 
completely in the open, backed by a world class support organization 
Key Partners 
Page 2 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Fastest growing Fortune 1000 customer base 
Customer Momentum 
• 300+ customers in seven quarters, growing at 75+/quarter 
• Two thirds of customers come from F1000 
Largest Cluster in North America 
32,000 Nodes 
Page 3 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Largest Cluster in Europe 
1,000 Nodes 
30+ customers migrated from other distributions 
Some notable migrations include many of the early adopters of Hadoop: 
Experience at Scale 
80,000 nodes under contract 
Largest Known Cluster in APAC 
400 Nodes
Enabling a Modern Data Architecture 
with HDP and Apache Hadoop 
Spring 2014 
Version 1.4 
Page 4 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
We Do Hadoop. We Do Retail.
Traditional systems under pressure 
DATA SYSTEM APPLICATIONS 
Business 
Analytics 
Custom 
Applications 
RDBMS EDW MPP 
Page 5 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Packaged 
Applications 
• Silos of Data 
• Costly to Scale 
• Constrained Schemas 
Clickstream 
Geolocation 
Sentiment, Web Data 
Sensor. Machine Data 
Unstructured docs, emails 
Server logs 
SOURCES 
Existing Sources 
(CRM, ERP,…) 
New Data Types 
…and difficult to 
manage new data
Why a Modern Data Architecture? 
Business 
Analytics 
LIMITATIONS 
Silos & Expensive 
Single Purpose 
DATA SYSTEM APPLICATIONS 
Custom 
Applications 
Page 6 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Packaged 
Applications 
RDBMS EDW MPP 
MDA: Key Drivers 
1. Leverage new types of data 
2. IT optimization 
3. Enable a data lake 
GOALS 
• Extend new data sets across 
existing data platforms 
• Common data platform, multiple 
processing engines 
• Batch, interactive and real time on 
a single data platform 
EXISTING 
Systems 
Clickstream 
Web 
&Social 
Geoloca9on 
Sensor 
& 
Machine 
Server 
Logs 
Unstructured 
SOURCES
HDP2 and YARN enable the Modern Data Architecture 
Batch Interactive Real-Time 
HDFS 
(Hadoop Distributed File System) 
Page 7 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Hortonworks architected and 
led development of YARN 
Common data set, multiple applications 
• Optionally land all data in a single cluster 
• Batch, interactive & real-time use cases 
• Support multi-tenant access, processing 
& segmentation of data 
YARN: Architectural center of Hadoop 
• Consistent security, governance & operations 
• Ecosystem applications certified 
by Hortonworks to run natively in Hadoop 
SOURCES 
EXISTING 
Systems 
Clickstream 
Web 
&Social 
Geoloca9on 
Sensor 
& 
Machine 
Server 
Logs 
Unstructured 
DATA SYSTEM APPLICATIONS 
Business 
Analytics 
Custom 
Applications 
Packaged 
Applications 
RDBMS EDW MPP YARN: Data Operating System 
1 ° ° ° ° ° ° ° ° ° 
° ° ° ° ° ° ° ° ° N
HDP delivers a comprehensive data management platform 
HDP 2.1 
Hortonworks Data Platform 
BATCH, INTERACTIVE & REAL-TIME SECURITY 
DATA ACCESS 
Page 8 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Provision, 
Manage & 
Monitor 
Ambari 
Zookeeper 
Scheduling 
Oozie 
Data Workflow, 
Lifecycle & 
Governance 
Falcon 
Sqoop 
Flume 
NFS 
WebHDFS 
In-Memory 
Spark 
YARN: Data Operating System 
DATA MANAGEMENT 
GOVERNANCE 
& INTEGRATION 
Authentication 
Authorization 
Accounting 
Data Protection 
Storage: HDFS 
Resources: YARN 
Access: Hive, … 
Pipeline: Falcon 
Cluster: Knox 
OPERATIONS 
Script 
Pig 
Search 
Solr 
SQL 
Hive 
HCatalog 
NoSQL 
HBase 
Accumulo 
Stream 
Storm 
Others 
ISV 
Engines 
1 ° ° ° ° ° ° ° ° ° 
° ° ° ° ° ° ° ° ° ° 
° ° ° ° ° ° ° ° ° ° 
° 
° 
N 
HDFS 
(Hadoop Distributed File System) 
Deployment Choice 
Linux Windows On-Premise Cloud 
YARN is the architectural 
center of HDP 
• Enables batch, interactive 
and real-time workloads 
• Single SQL engine for both batch 
and interactive 
• Enable existing ISV apps to plug 
directly into Hadoop via YARN 
Provides comprehensive 
enterprise capabilities 
• Governance 
• Security 
• Operations 
The widest range of 
deployment options 
• Linux & Windows 
• On premise & cloud 
TezTez
Our Approach 
Page 9 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Hortonworks Approach 
1 Innovate the Core 
Architect and build 
innovation at the core of 
Hadoop 
• YARN: Data Operating System 
• HDFS as the storage layer 
• Key processing engines 
Script 
Pig 
Search 
Solr 
SQL 
Hive/Tez, 
HCatalog 
NoSQL 
HBase 
Accumulo 
Stream 
Storm 
Batch 
Map 
Reduce 
YARN 
: 
Data 
Opera9ng 
System 
HDFS 
(Hadoop 
Distributed 
File 
System) 
Page 10 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Hortonworks Approach 
1 Innovate the Core 
Architect and build 
innovation at the core of 
Hadoop 
• YARN: Data Operating System 
• HDFS as the storage layer 
• Key processing engines 
Extend Hadoop as an 
2 Enterprise Data Platform 
Extend Hadoop with enterprise 
capabilities for governance, 
security & operations 
Apply enterprise software rigor 
to the open source development 
process 
Script 
Pig 
Search 
Solr 
SQL 
Hive/Tez, 
HCatalog 
NoSQL 
HBase 
Accumulo 
Stream 
Storm 
Batch 
Map 
Reduce 
YARN 
: 
Data 
Opera9ng 
System 
HDFS 
(Hadoop 
Distributed 
File 
System) 
Page 11 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
HDP 2.1 
Governance 
& Integration 
Security 
Operations 
Data Access 
YARN 
Data Management
Hortonworks Approach 
1 Innovate the Core 
Architect and build 
innovation at the core of 
Hadoop 
• YARN: Data Operating System 
• HDFS as the storage layer 
• Key processing engines 
Extend Hadoop as an 
2 Enterprise Data Platform 3 Enable the Ecosystem 
Extend Hadoop with enterprise 
capabilities for governance, 
security & operations 
Apply enterprise software rigor 
to the open source development 
process 
Script 
Pig 
Search 
Solr 
SQL 
Hive/Tez, 
HCatalog 
NoSQL 
HBase 
Accumulo 
Stream 
Storm 
Batch 
Map 
Reduce 
Page 12 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Enable the leaders in the data 
center to easily adopt & extend 
their platforms 
• Establish Hadoop as standard 
component of a modern data 
architecture 
• Joint engineering 
YARN 
: 
Data 
Opera9ng 
System 
HDFS 
(Hadoop 
Distributed 
File 
System) 
HDP 2.1 
Governance 
& Integration 
Security 
Operations 
Data Access 
YARN 
Data Management
…all done completely 4 in Open Source 
Hadoop is a platform decision 
• Open Source: fastest path to innovation for a platform technology 
• Eliminate vendor lock in, no proprietary software 
• Data center leaders have committed to the open source approach 
Script 
Pig 
Contributes more to the Apache Hadoop 
ecosystem in the ASF than any other vendor 
Search 
Solr 
SQL 
Hive/Tez, 
HCatalog 
NoSQL 
HBase 
Accumulo 
Stream 
Storm 
Batch 
Map 
Reduce 
YARN 
: 
Data 
Opera9ng 
System 
HDFS 
(Hadoop 
Distributed 
File 
System) 
Page 13 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Apache 
Project Committers PMC 
Members 
Hadoop 26 20 
Tez 15 13 
Hive 15 5 
HBase 7 3 
Pig 5 5 
Accumulo 2 2 
Flume 1 0 
Storm 2 2 
Sqoop 1 0 
Ambari 32 28 
Oozie 3 2 
Zookeeper 2 1 
Knox 6 6 
Falcon 3 3 
TOTAL 120 90 
HDP 2.1 
Governance 
& Integration 
Security 
Operations 
Data Access 
YARN 
Data Management
The Modern Data Architecture w/ HDP 
Page 14 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
Hadoop Juices Sales in Retail 
FUNCTION USE CASE 
Marketing 
Page 15 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
360° View of Customer: 
Ø Customer Lifetime Value 
Ø Targeted Marketing Campaigns 
Segmentation 
Pricing 
Brand Sentiment Analysis 
eCommerce & Customer Service 
Product Recommendation Engine 
Web Path Optimization 
Call Center Productivity 
Forecasting, Allocation & Merchandizing 
Product Placement 
Store-Level Optimization of Assortment, Prices and Spaces 
Procurement & Supply Chain 
Inventory Management 
Real-time Delivery Management 
Improved Order Picking 
Vendor Management 
Strategic Sourcing
Case Study: 12 month Hadoop evolution at TrueCar 
Data Platform Capabilities 
June 2013 
Begin 
Hadoop 
Execution 
July 2013 
Hortonworks 
Partnership 
12 months execution plan 
Page 16 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
May ‘14 
IPO 
Aug 2013 
Training 
& Dev. 
Begins 
Nov 2013 
Production 
Cluster 
60 Nodes 
2 PB 
Jan 2014 
40% Dev. 
Staff 
Proficient 
Dec 2013 
Three 
Production 
Apps 
(3 total) 
Feb 2014 
Three More 
Production 
Apps 
(6 total) 
12 Month Results at TrueCAR 
• Six Production Hadoop Applications 
• Sixty nodes/2PB data 
• Storage Costs/Compute Costs 
from $19/GB to $0.23/GB 
“We addressed our data platform capabilities 
strategically as a pre-cursor to IPO.”
Hortonworks Support 
Page 17 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
We Do Hadoop. We Do Retail.
End to end support to ensure your Hadoop success 
Mission Critical Hadoop Support 
Page 18 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Hortonworks Support 
Backed by the architects, builders and 
operators of Hadoop, Hortonworks offers 
the most effective and complete Hadoop 
support available 
Support Provided 
• Application Development Support 
• Diagnose Install, Config & Cluster Mgmt Issues 
• Access to Upgrades, Updates and Patches 
• Diagnose Performance Issues 
• Remote Troubleshooting 
• Diagnose Loading, Processing & Query Issues 
• Customer Support Portal 
• Advanced Knowledgebase 
Architect & 
Design Development Implementation Production 
Only Hortonworks provides unlimited support 
across architecture, development, 
implementation & production
End to end support to ensure your Hadoop success 
Mission Critical Hadoop Support 
Services 
Architect & 
Design Development Implementation Production 
Only Hortonworks provides unlimited support 
across architecture, development, 
implementation & production 
Page 19 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Hortonworks Services 
Our services team ensures your Hadoop 
project will be delivered successfully 
Services Provided 
• Architecture 
• Implementation 
• Cluster Tuning 
• Migration 
• Best Practices
End to end support to ensure your Hadoop success 
Mission Critical Hadoop Support 
Services 
Training 
Architect & 
Design Development Implementation Production 
Only Hortonworks provides unlimited support 
across architecture, development, 
implementation & production 
Page 20 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Hortonworks University 
We offer a wide range of training options 
backed by experts and designed to 
evolve your teams Hadoop proficiency 
Custom Coursework 
• On-site training for your team 
• Customized for your requirements 
Public Courses 
• Offered in all geographies 
• Hadoop Architect 
• Hadoop Developer 
• Hadoop Analyst 
• Hadoop Operations 
• Data Science
Hadoop is a Platform Decision 
Open Leadership 
Drive innovation in the open via 
the Apache community-driven 
open source process 
Page 21 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
Enterprise Rigor 
Engineer, test and certify 
Apache Hadoop with the 
enterprise in mind 
Ecosystem Endorsement 
Focus on deep integration with 
existing data center technologies 
and skills 
Fastest Growing Customer and Partner Base 
Largest and most experienced Hadoop adopters have standardized on Hortonworks 
The data center leaders have standardized on Hortonworks
Questions? 
Page 22 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 
We Do Hadoop. We Do Retail. 
September 22, 2014

Más contenido relacionado

La actualidad más candente

Predicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior GraphsPredicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior GraphsHortonworks
 
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - WebinarHortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - WebinarHortonworks
 
Discover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.next
Discover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.nextDiscover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.next
Discover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.nextHortonworks
 
Discover HDP 2.1: Apache Solr for Hadoop Search
Discover HDP 2.1: Apache Solr for Hadoop SearchDiscover HDP 2.1: Apache Solr for Hadoop Search
Discover HDP 2.1: Apache Solr for Hadoop SearchHortonworks
 
Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014Hortonworks
 
Hortonworks Technical Workshop: Real Time Monitoring with Apache Hadoop
Hortonworks Technical Workshop: Real Time Monitoring with Apache HadoopHortonworks Technical Workshop: Real Time Monitoring with Apache Hadoop
Hortonworks Technical Workshop: Real Time Monitoring with Apache HadoopHortonworks
 
Design a Dataflow in 7 minutes with Apache NiFi/HDF
Design a Dataflow in 7 minutes with Apache NiFi/HDFDesign a Dataflow in 7 minutes with Apache NiFi/HDF
Design a Dataflow in 7 minutes with Apache NiFi/HDFHortonworks
 
Hp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHortonworks
 
Enterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageEnterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageHortonworks
 
YARN Ready: Integrating to YARN with Tez
YARN Ready: Integrating to YARN with Tez YARN Ready: Integrating to YARN with Tez
YARN Ready: Integrating to YARN with Tez Hortonworks
 
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...Hortonworks
 
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1Hortonworks
 
HPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare TransformationHPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare TransformationHortonworks
 
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...Hortonworks
 
Discover hdp 2.2: Data storage innovations in Hadoop Distributed Filesystem (...
Discover hdp 2.2: Data storage innovations in Hadoop Distributed Filesystem (...Discover hdp 2.2: Data storage innovations in Hadoop Distributed Filesystem (...
Discover hdp 2.2: Data storage innovations in Hadoop Distributed Filesystem (...Hortonworks
 
Hortonworks Yarn Code Walk Through January 2014
Hortonworks Yarn Code Walk Through January 2014Hortonworks Yarn Code Walk Through January 2014
Hortonworks Yarn Code Walk Through January 2014Hortonworks
 
Hortonworks and Red Hat Webinar - Part 2
Hortonworks and Red Hat Webinar - Part 2Hortonworks and Red Hat Webinar - Part 2
Hortonworks and Red Hat Webinar - Part 2Hortonworks
 
Introduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready ProgramIntroduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready ProgramHortonworks
 
Predictive Analytics and Machine Learning …with SAS and Apache Hadoop
Predictive Analytics and Machine Learning…with SAS and Apache HadoopPredictive Analytics and Machine Learning…with SAS and Apache Hadoop
Predictive Analytics and Machine Learning …with SAS and Apache HadoopHortonworks
 
Hortonworks sqrrl webinar v5.pptx
Hortonworks sqrrl webinar v5.pptxHortonworks sqrrl webinar v5.pptx
Hortonworks sqrrl webinar v5.pptxHortonworks
 

La actualidad más candente (20)

Predicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior GraphsPredicting Customer Experience through Hadoop and Customer Behavior Graphs
Predicting Customer Experience through Hadoop and Customer Behavior Graphs
 
Hortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - WebinarHortonworks and Platfora in Financial Services - Webinar
Hortonworks and Platfora in Financial Services - Webinar
 
Discover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.next
Discover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.nextDiscover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.next
Discover HDP 2.2: Even Faster SQL Queries with Apache Hive and Stinger.next
 
Discover HDP 2.1: Apache Solr for Hadoop Search
Discover HDP 2.1: Apache Solr for Hadoop SearchDiscover HDP 2.1: Apache Solr for Hadoop Search
Discover HDP 2.1: Apache Solr for Hadoop Search
 
Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014Splunk-hortonworks-risk-management-oct-2014
Splunk-hortonworks-risk-management-oct-2014
 
Hortonworks Technical Workshop: Real Time Monitoring with Apache Hadoop
Hortonworks Technical Workshop: Real Time Monitoring with Apache HadoopHortonworks Technical Workshop: Real Time Monitoring with Apache Hadoop
Hortonworks Technical Workshop: Real Time Monitoring with Apache Hadoop
 
Design a Dataflow in 7 minutes with Apache NiFi/HDF
Design a Dataflow in 7 minutes with Apache NiFi/HDFDesign a Dataflow in 7 minutes with Apache NiFi/HDF
Design a Dataflow in 7 minutes with Apache NiFi/HDF
 
Hp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar SlidesHp Converged Systems and Hortonworks - Webinar Slides
Hp Converged Systems and Hortonworks - Webinar Slides
 
Enterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble StorageEnterprise Hadoop with Hortonworks and Nimble Storage
Enterprise Hadoop with Hortonworks and Nimble Storage
 
YARN Ready: Integrating to YARN with Tez
YARN Ready: Integrating to YARN with Tez YARN Ready: Integrating to YARN with Tez
YARN Ready: Integrating to YARN with Tez
 
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
Hortonworks Protegrity Webinar: Leverage Security in Hadoop Without Sacrifici...
 
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
Hortonworks and Red Hat Webinar_Sept.3rd_Part 1
 
HPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare TransformationHPE and Hortonworks join forces to Deliver Healthcare Transformation
HPE and Hortonworks join forces to Deliver Healthcare Transformation
 
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...
Starting Small and Scaling Big with Hadoop (Talend and Hortonworks webinar)) ...
 
Discover hdp 2.2: Data storage innovations in Hadoop Distributed Filesystem (...
Discover hdp 2.2: Data storage innovations in Hadoop Distributed Filesystem (...Discover hdp 2.2: Data storage innovations in Hadoop Distributed Filesystem (...
Discover hdp 2.2: Data storage innovations in Hadoop Distributed Filesystem (...
 
Hortonworks Yarn Code Walk Through January 2014
Hortonworks Yarn Code Walk Through January 2014Hortonworks Yarn Code Walk Through January 2014
Hortonworks Yarn Code Walk Through January 2014
 
Hortonworks and Red Hat Webinar - Part 2
Hortonworks and Red Hat Webinar - Part 2Hortonworks and Red Hat Webinar - Part 2
Hortonworks and Red Hat Webinar - Part 2
 
Introduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready ProgramIntroduction to the Hortonworks YARN Ready Program
Introduction to the Hortonworks YARN Ready Program
 
Predictive Analytics and Machine Learning …with SAS and Apache Hadoop
Predictive Analytics and Machine Learning…with SAS and Apache HadoopPredictive Analytics and Machine Learning…with SAS and Apache Hadoop
Predictive Analytics and Machine Learning …with SAS and Apache Hadoop
 
Hortonworks sqrrl webinar v5.pptx
Hortonworks sqrrl webinar v5.pptxHortonworks sqrrl webinar v5.pptx
Hortonworks sqrrl webinar v5.pptx
 

Destacado

The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata Hortonworks
 
Microsoft and Hortonworks Delivers the Modern Data Architecture for Big Data
Microsoft and Hortonworks Delivers the Modern Data Architecture for Big DataMicrosoft and Hortonworks Delivers the Modern Data Architecture for Big Data
Microsoft and Hortonworks Delivers the Modern Data Architecture for Big DataHortonworks
 
Building a Modern Data Architecture with Enterprise Hadoop
Building a Modern Data Architecture with Enterprise HadoopBuilding a Modern Data Architecture with Enterprise Hadoop
Building a Modern Data Architecture with Enterprise HadoopSlim Baltagi
 
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...Hortonworks
 
The Modern Data Architecture for Advanced Business Intelligence with Hortonwo...
The Modern Data Architecture for Advanced Business Intelligence with Hortonwo...The Modern Data Architecture for Advanced Business Intelligence with Hortonwo...
The Modern Data Architecture for Advanced Business Intelligence with Hortonwo...Hortonworks
 
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...Revolution Analytics
 
Zeta Architecture: The Next Generation Big Data Architecture
Zeta Architecture: The Next Generation Big Data ArchitectureZeta Architecture: The Next Generation Big Data Architecture
Zeta Architecture: The Next Generation Big Data ArchitectureMapR Technologies
 
Hadoop Powers Modern Enterprise Data Architectures
Hadoop Powers Modern Enterprise Data ArchitecturesHadoop Powers Modern Enterprise Data Architectures
Hadoop Powers Modern Enterprise Data ArchitecturesDataWorks Summit
 
Building a Collaborative Data Architecture
Building a Collaborative Data ArchitectureBuilding a Collaborative Data Architecture
Building a Collaborative Data ArchitectureDATAVERSITY
 
Building a Modern Data Architecture by Ben Sharma at Strata + Hadoop World Sa...
Building a Modern Data Architecture by Ben Sharma at Strata + Hadoop World Sa...Building a Modern Data Architecture by Ben Sharma at Strata + Hadoop World Sa...
Building a Modern Data Architecture by Ben Sharma at Strata + Hadoop World Sa...Zaloni
 
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...Hortonworks
 
Developing Hadoop strategy for your Enterprise
Developing Hadoop strategy for your EnterpriseDeveloping Hadoop strategy for your Enterprise
Developing Hadoop strategy for your EnterpriseAvkash Chauhan
 
Information Technology Innovator David Ward 2011
Information Technology Innovator David Ward 2011Information Technology Innovator David Ward 2011
Information Technology Innovator David Ward 2011ward2dr
 
3D IT Architecture - Data Center
3D IT Architecture - Data Center3D IT Architecture - Data Center
3D IT Architecture - Data CenterPaul Brink
 
The LightConnectTM Fabric V-POD Data Center Architecture
The LightConnectTM Fabric V-POD Data Center ArchitectureThe LightConnectTM Fabric V-POD Data Center Architecture
The LightConnectTM Fabric V-POD Data Center ArchitectureCALIENT Technologies
 
Presentation data center and cloud architecture
Presentation   data center and cloud architecturePresentation   data center and cloud architecture
Presentation data center and cloud architecturexKinAnx
 

Destacado (20)

The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata The Value of the Modern Data Architecture with Apache Hadoop and Teradata
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
 
Microsoft and Hortonworks Delivers the Modern Data Architecture for Big Data
Microsoft and Hortonworks Delivers the Modern Data Architecture for Big DataMicrosoft and Hortonworks Delivers the Modern Data Architecture for Big Data
Microsoft and Hortonworks Delivers the Modern Data Architecture for Big Data
 
Building a Modern Data Architecture with Enterprise Hadoop
Building a Modern Data Architecture with Enterprise HadoopBuilding a Modern Data Architecture with Enterprise Hadoop
Building a Modern Data Architecture with Enterprise Hadoop
 
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
 
The Modern Data Architecture for Advanced Business Intelligence with Hortonwo...
The Modern Data Architecture for Advanced Business Intelligence with Hortonwo...The Modern Data Architecture for Advanced Business Intelligence with Hortonwo...
The Modern Data Architecture for Advanced Business Intelligence with Hortonwo...
 
Solution architecture for big data projects
Solution architecture for big data projectsSolution architecture for big data projects
Solution architecture for big data projects
 
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
The Modern Data Architecture for Predictive Analytics with Hortonworks and Re...
 
Zeta Architecture: The Next Generation Big Data Architecture
Zeta Architecture: The Next Generation Big Data ArchitectureZeta Architecture: The Next Generation Big Data Architecture
Zeta Architecture: The Next Generation Big Data Architecture
 
Meetup oslo hortonworks HDP
Meetup oslo hortonworks HDPMeetup oslo hortonworks HDP
Meetup oslo hortonworks HDP
 
Hadoop Powers Modern Enterprise Data Architectures
Hadoop Powers Modern Enterprise Data ArchitecturesHadoop Powers Modern Enterprise Data Architectures
Hadoop Powers Modern Enterprise Data Architectures
 
Building a Collaborative Data Architecture
Building a Collaborative Data ArchitectureBuilding a Collaborative Data Architecture
Building a Collaborative Data Architecture
 
Building a Modern Data Architecture by Ben Sharma at Strata + Hadoop World Sa...
Building a Modern Data Architecture by Ben Sharma at Strata + Hadoop World Sa...Building a Modern Data Architecture by Ben Sharma at Strata + Hadoop World Sa...
Building a Modern Data Architecture by Ben Sharma at Strata + Hadoop World Sa...
 
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
Distilling Hadoop Patterns of Use and How You Can Use Them for Your Big Data ...
 
Developing Hadoop strategy for your Enterprise
Developing Hadoop strategy for your EnterpriseDeveloping Hadoop strategy for your Enterprise
Developing Hadoop strategy for your Enterprise
 
Information Technology Innovator David Ward 2011
Information Technology Innovator David Ward 2011Information Technology Innovator David Ward 2011
Information Technology Innovator David Ward 2011
 
3D IT Architecture - Data Center
3D IT Architecture - Data Center3D IT Architecture - Data Center
3D IT Architecture - Data Center
 
The LightConnectTM Fabric V-POD Data Center Architecture
The LightConnectTM Fabric V-POD Data Center ArchitectureThe LightConnectTM Fabric V-POD Data Center Architecture
The LightConnectTM Fabric V-POD Data Center Architecture
 
Data-center SDN
Data-center  SDN Data-center  SDN
Data-center SDN
 
HTRC Architecture Overview
HTRC Architecture OverviewHTRC Architecture Overview
HTRC Architecture Overview
 
Presentation data center and cloud architecture
Presentation   data center and cloud architecturePresentation   data center and cloud architecture
Presentation data center and cloud architecture
 

Similar a Hortonworks - What's Possible with a Modern Data Architecture?

Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Innovative Management Services
 
Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]Hortonworks
 
Introduction to the Hadoop EcoSystem
Introduction to the Hadoop EcoSystemIntroduction to the Hadoop EcoSystem
Introduction to the Hadoop EcoSystemShivaji Dutta
 
Realtime analytics + hadoop 2.0
Realtime analytics + hadoop 2.0Realtime analytics + hadoop 2.0
Realtime analytics + hadoop 2.0Rommel Garcia
 
Realtime Analytics in Hadoop
Realtime Analytics in HadoopRealtime Analytics in Hadoop
Realtime Analytics in HadoopRommel Garcia
 
Supporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big DataSupporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big DataWANdisco Plc
 
Apache Hadoop on the Open Cloud
Apache Hadoop on the Open CloudApache Hadoop on the Open Cloud
Apache Hadoop on the Open CloudHortonworks
 
Cloud Austin Meetup - Hadoop like a champion
Cloud Austin Meetup - Hadoop like a championCloud Austin Meetup - Hadoop like a champion
Cloud Austin Meetup - Hadoop like a championAmeet Paranjape
 
Discover hdp 2.2 hdfs - final
Discover hdp 2.2   hdfs - finalDiscover hdp 2.2   hdfs - final
Discover hdp 2.2 hdfs - finalHortonworks
 
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUGReal-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUGskumpf
 
YARN - Strata 2014
YARN - Strata 2014YARN - Strata 2014
YARN - Strata 2014Hortonworks
 
Mrinal devadas, Hortonworks Making Sense Of Big Data
Mrinal devadas, Hortonworks Making Sense Of Big DataMrinal devadas, Hortonworks Making Sense Of Big Data
Mrinal devadas, Hortonworks Making Sense Of Big DataPatrickCrompton
 
Discover.hdp2.2.h base.final[2]
Discover.hdp2.2.h base.final[2]Discover.hdp2.2.h base.final[2]
Discover.hdp2.2.h base.final[2]Hortonworks
 
Internet of things Crash Course Workshop
Internet of things Crash Course WorkshopInternet of things Crash Course Workshop
Internet of things Crash Course WorkshopDataWorks Summit
 
Internet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop SummitInternet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop SummitDataWorks Summit
 
Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Mac Moore
 
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder HortonworksThe Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder HortonworksData Con LA
 
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...Hortonworks
 
Transform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksTransform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksHortonworks
 
Create a Smarter Data Lake with HP Haven and Apache Hadoop
Create a Smarter Data Lake with HP Haven and Apache HadoopCreate a Smarter Data Lake with HP Haven and Apache Hadoop
Create a Smarter Data Lake with HP Haven and Apache HadoopHortonworks
 

Similar a Hortonworks - What's Possible with a Modern Data Architecture? (20)

Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
 
Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]Discover.hdp2.2.ambari.final[1]
Discover.hdp2.2.ambari.final[1]
 
Introduction to the Hadoop EcoSystem
Introduction to the Hadoop EcoSystemIntroduction to the Hadoop EcoSystem
Introduction to the Hadoop EcoSystem
 
Realtime analytics + hadoop 2.0
Realtime analytics + hadoop 2.0Realtime analytics + hadoop 2.0
Realtime analytics + hadoop 2.0
 
Realtime Analytics in Hadoop
Realtime Analytics in HadoopRealtime Analytics in Hadoop
Realtime Analytics in Hadoop
 
Supporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big DataSupporting Financial Services with a More Flexible Approach to Big Data
Supporting Financial Services with a More Flexible Approach to Big Data
 
Apache Hadoop on the Open Cloud
Apache Hadoop on the Open CloudApache Hadoop on the Open Cloud
Apache Hadoop on the Open Cloud
 
Cloud Austin Meetup - Hadoop like a champion
Cloud Austin Meetup - Hadoop like a championCloud Austin Meetup - Hadoop like a champion
Cloud Austin Meetup - Hadoop like a champion
 
Discover hdp 2.2 hdfs - final
Discover hdp 2.2   hdfs - finalDiscover hdp 2.2   hdfs - final
Discover hdp 2.2 hdfs - final
 
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUGReal-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
Real-Time Processing in Hadoop for IoT Use Cases - Phoenix HUG
 
YARN - Strata 2014
YARN - Strata 2014YARN - Strata 2014
YARN - Strata 2014
 
Mrinal devadas, Hortonworks Making Sense Of Big Data
Mrinal devadas, Hortonworks Making Sense Of Big DataMrinal devadas, Hortonworks Making Sense Of Big Data
Mrinal devadas, Hortonworks Making Sense Of Big Data
 
Discover.hdp2.2.h base.final[2]
Discover.hdp2.2.h base.final[2]Discover.hdp2.2.h base.final[2]
Discover.hdp2.2.h base.final[2]
 
Internet of things Crash Course Workshop
Internet of things Crash Course WorkshopInternet of things Crash Course Workshop
Internet of things Crash Course Workshop
 
Internet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop SummitInternet of Things Crash Course Workshop at Hadoop Summit
Internet of Things Crash Course Workshop at Hadoop Summit
 
Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015Storm Demo Talk - Colorado Springs May 2015
Storm Demo Talk - Colorado Springs May 2015
 
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder HortonworksThe Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
The Future of Hadoop by Arun Murthy, PMC Apache Hadoop & Cofounder Hortonworks
 
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...
Discover HDP 2.2: Comprehensive Hadoop Security with Apache Ranger and Apache...
 
Transform You Business with Big Data and Hortonworks
Transform You Business with Big Data and HortonworksTransform You Business with Big Data and Hortonworks
Transform You Business with Big Data and Hortonworks
 
Create a Smarter Data Lake with HP Haven and Apache Hadoop
Create a Smarter Data Lake with HP Haven and Apache HadoopCreate a Smarter Data Lake with HP Haven and Apache Hadoop
Create a Smarter Data Lake with HP Haven and Apache Hadoop
 

Más de Hortonworks

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks
 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyHortonworks
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakHortonworks
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsHortonworks
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysHortonworks
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's NewHortonworks
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerHortonworks
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsHortonworks
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeHortonworks
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidHortonworks
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleHortonworks
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATAHortonworks
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Hortonworks
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseHortonworks
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseHortonworks
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationHortonworks
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementHortonworks
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHortonworks
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCHortonworks
 

Más de Hortonworks (20)

Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next LevelHortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
Hortonworks DataFlow (HDF) 3.3 - Taking Stream Processing to the Next Level
 
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT StrategyIoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
IoT Predictions for 2019 and Beyond: Data at the Heart of Your IoT Strategy
 
Getting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with CloudbreakGetting the Most Out of Your Data in the Cloud with Cloudbreak
Getting the Most Out of Your Data in the Cloud with Cloudbreak
 
Johns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log EventsJohns Hopkins - Using Hadoop to Secure Access Log Events
Johns Hopkins - Using Hadoop to Secure Access Log Events
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad GuysCatch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
 
HDF 3.2 - What's New
HDF 3.2 - What's NewHDF 3.2 - What's New
HDF 3.2 - What's New
 
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging ManagerCuring Kafka Blindness with Hortonworks Streams Messaging Manager
Curing Kafka Blindness with Hortonworks Streams Messaging Manager
 
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical EnvironmentsInterpretation Tool for Genomic Sequencing Data in Clinical Environments
Interpretation Tool for Genomic Sequencing Data in Clinical Environments
 
IBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data LandscapeIBM+Hortonworks = Transformation of the Big Data Landscape
IBM+Hortonworks = Transformation of the Big Data Landscape
 
Premier Inside-Out: Apache Druid
Premier Inside-Out: Apache DruidPremier Inside-Out: Apache Druid
Premier Inside-Out: Apache Druid
 
Accelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at ScaleAccelerating Data Science and Real Time Analytics at Scale
Accelerating Data Science and Real Time Analytics at Scale
 
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATATIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
TIME SERIES: APPLYING ADVANCED ANALYTICS TO INDUSTRIAL PROCESS DATA
 
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
Blockchain with Machine Learning Powered by Big Data: Trimble Transportation ...
 
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: ClearsenseDelivering Real-Time Streaming Data for Healthcare Customers: Clearsense
Delivering Real-Time Streaming Data for Healthcare Customers: Clearsense
 
Making Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with EaseMaking Enterprise Big Data Small with Ease
Making Enterprise Big Data Small with Ease
 
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World PresentationWebinewbie to Webinerd in 30 Days - Webinar World Presentation
Webinewbie to Webinerd in 30 Days - Webinar World Presentation
 
Driving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data ManagementDriving Digital Transformation Through Global Data Management
Driving Digital Transformation Through Global Data Management
 
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming FeaturesHDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
HDF 3.1 pt. 2: A Technical Deep-Dive on New Streaming Features
 
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
Hortonworks DataFlow (HDF) 3.1 - Redefining Data-In-Motion with Modern Data A...
 
Unlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDCUnlock Value from Big Data with Apache NiFi and Streaming CDC
Unlock Value from Big Data with Apache NiFi and Streaming CDC
 

Hortonworks - What's Possible with a Modern Data Architecture?

  • 1. Hortonworks Page 1 © Hortonworks Inc. 2011 – 2014. All Rights Reserved We Do Hadoop. We Do Retail. September 22, 2014
  • 2. Our Mission: Power your Modern Data Architecture with HDP and Enterprise Apache Hadoop Who we are June 2011: Original 24 architects, developers, operators of Hadoop from Yahoo! June 2014: An enterprise software company with 500+ Employees Our model Innovate and deliver Apache Hadoop as a complete enterprise data platform completely in the open, backed by a world class support organization Key Partners Page 2 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 3. Fastest growing Fortune 1000 customer base Customer Momentum • 300+ customers in seven quarters, growing at 75+/quarter • Two thirds of customers come from F1000 Largest Cluster in North America 32,000 Nodes Page 3 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Largest Cluster in Europe 1,000 Nodes 30+ customers migrated from other distributions Some notable migrations include many of the early adopters of Hadoop: Experience at Scale 80,000 nodes under contract Largest Known Cluster in APAC 400 Nodes
  • 4. Enabling a Modern Data Architecture with HDP and Apache Hadoop Spring 2014 Version 1.4 Page 4 © Hortonworks Inc. 2011 – 2014. All Rights Reserved We Do Hadoop. We Do Retail.
  • 5. Traditional systems under pressure DATA SYSTEM APPLICATIONS Business Analytics Custom Applications RDBMS EDW MPP Page 5 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Packaged Applications • Silos of Data • Costly to Scale • Constrained Schemas Clickstream Geolocation Sentiment, Web Data Sensor. Machine Data Unstructured docs, emails Server logs SOURCES Existing Sources (CRM, ERP,…) New Data Types …and difficult to manage new data
  • 6. Why a Modern Data Architecture? Business Analytics LIMITATIONS Silos & Expensive Single Purpose DATA SYSTEM APPLICATIONS Custom Applications Page 6 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Packaged Applications RDBMS EDW MPP MDA: Key Drivers 1. Leverage new types of data 2. IT optimization 3. Enable a data lake GOALS • Extend new data sets across existing data platforms • Common data platform, multiple processing engines • Batch, interactive and real time on a single data platform EXISTING Systems Clickstream Web &Social Geoloca9on Sensor & Machine Server Logs Unstructured SOURCES
  • 7. HDP2 and YARN enable the Modern Data Architecture Batch Interactive Real-Time HDFS (Hadoop Distributed File System) Page 7 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Hortonworks architected and led development of YARN Common data set, multiple applications • Optionally land all data in a single cluster • Batch, interactive & real-time use cases • Support multi-tenant access, processing & segmentation of data YARN: Architectural center of Hadoop • Consistent security, governance & operations • Ecosystem applications certified by Hortonworks to run natively in Hadoop SOURCES EXISTING Systems Clickstream Web &Social Geoloca9on Sensor & Machine Server Logs Unstructured DATA SYSTEM APPLICATIONS Business Analytics Custom Applications Packaged Applications RDBMS EDW MPP YARN: Data Operating System 1 ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° N
  • 8. HDP delivers a comprehensive data management platform HDP 2.1 Hortonworks Data Platform BATCH, INTERACTIVE & REAL-TIME SECURITY DATA ACCESS Page 8 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Provision, Manage & Monitor Ambari Zookeeper Scheduling Oozie Data Workflow, Lifecycle & Governance Falcon Sqoop Flume NFS WebHDFS In-Memory Spark YARN: Data Operating System DATA MANAGEMENT GOVERNANCE & INTEGRATION Authentication Authorization Accounting Data Protection Storage: HDFS Resources: YARN Access: Hive, … Pipeline: Falcon Cluster: Knox OPERATIONS Script Pig Search Solr SQL Hive HCatalog NoSQL HBase Accumulo Stream Storm Others ISV Engines 1 ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° ° N HDFS (Hadoop Distributed File System) Deployment Choice Linux Windows On-Premise Cloud YARN is the architectural center of HDP • Enables batch, interactive and real-time workloads • Single SQL engine for both batch and interactive • Enable existing ISV apps to plug directly into Hadoop via YARN Provides comprehensive enterprise capabilities • Governance • Security • Operations The widest range of deployment options • Linux & Windows • On premise & cloud TezTez
  • 9. Our Approach Page 9 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 10. Hortonworks Approach 1 Innovate the Core Architect and build innovation at the core of Hadoop • YARN: Data Operating System • HDFS as the storage layer • Key processing engines Script Pig Search Solr SQL Hive/Tez, HCatalog NoSQL HBase Accumulo Stream Storm Batch Map Reduce YARN : Data Opera9ng System HDFS (Hadoop Distributed File System) Page 10 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 11. Hortonworks Approach 1 Innovate the Core Architect and build innovation at the core of Hadoop • YARN: Data Operating System • HDFS as the storage layer • Key processing engines Extend Hadoop as an 2 Enterprise Data Platform Extend Hadoop with enterprise capabilities for governance, security & operations Apply enterprise software rigor to the open source development process Script Pig Search Solr SQL Hive/Tez, HCatalog NoSQL HBase Accumulo Stream Storm Batch Map Reduce YARN : Data Opera9ng System HDFS (Hadoop Distributed File System) Page 11 © Hortonworks Inc. 2011 – 2014. All Rights Reserved HDP 2.1 Governance & Integration Security Operations Data Access YARN Data Management
  • 12. Hortonworks Approach 1 Innovate the Core Architect and build innovation at the core of Hadoop • YARN: Data Operating System • HDFS as the storage layer • Key processing engines Extend Hadoop as an 2 Enterprise Data Platform 3 Enable the Ecosystem Extend Hadoop with enterprise capabilities for governance, security & operations Apply enterprise software rigor to the open source development process Script Pig Search Solr SQL Hive/Tez, HCatalog NoSQL HBase Accumulo Stream Storm Batch Map Reduce Page 12 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Enable the leaders in the data center to easily adopt & extend their platforms • Establish Hadoop as standard component of a modern data architecture • Joint engineering YARN : Data Opera9ng System HDFS (Hadoop Distributed File System) HDP 2.1 Governance & Integration Security Operations Data Access YARN Data Management
  • 13. …all done completely 4 in Open Source Hadoop is a platform decision • Open Source: fastest path to innovation for a platform technology • Eliminate vendor lock in, no proprietary software • Data center leaders have committed to the open source approach Script Pig Contributes more to the Apache Hadoop ecosystem in the ASF than any other vendor Search Solr SQL Hive/Tez, HCatalog NoSQL HBase Accumulo Stream Storm Batch Map Reduce YARN : Data Opera9ng System HDFS (Hadoop Distributed File System) Page 13 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Apache Project Committers PMC Members Hadoop 26 20 Tez 15 13 Hive 15 5 HBase 7 3 Pig 5 5 Accumulo 2 2 Flume 1 0 Storm 2 2 Sqoop 1 0 Ambari 32 28 Oozie 3 2 Zookeeper 2 1 Knox 6 6 Falcon 3 3 TOTAL 120 90 HDP 2.1 Governance & Integration Security Operations Data Access YARN Data Management
  • 14. The Modern Data Architecture w/ HDP Page 14 © Hortonworks Inc. 2011 – 2014. All Rights Reserved
  • 15. Hadoop Juices Sales in Retail FUNCTION USE CASE Marketing Page 15 © Hortonworks Inc. 2011 – 2014. All Rights Reserved 360° View of Customer: Ø Customer Lifetime Value Ø Targeted Marketing Campaigns Segmentation Pricing Brand Sentiment Analysis eCommerce & Customer Service Product Recommendation Engine Web Path Optimization Call Center Productivity Forecasting, Allocation & Merchandizing Product Placement Store-Level Optimization of Assortment, Prices and Spaces Procurement & Supply Chain Inventory Management Real-time Delivery Management Improved Order Picking Vendor Management Strategic Sourcing
  • 16. Case Study: 12 month Hadoop evolution at TrueCar Data Platform Capabilities June 2013 Begin Hadoop Execution July 2013 Hortonworks Partnership 12 months execution plan Page 16 © Hortonworks Inc. 2011 – 2014. All Rights Reserved May ‘14 IPO Aug 2013 Training & Dev. Begins Nov 2013 Production Cluster 60 Nodes 2 PB Jan 2014 40% Dev. Staff Proficient Dec 2013 Three Production Apps (3 total) Feb 2014 Three More Production Apps (6 total) 12 Month Results at TrueCAR • Six Production Hadoop Applications • Sixty nodes/2PB data • Storage Costs/Compute Costs from $19/GB to $0.23/GB “We addressed our data platform capabilities strategically as a pre-cursor to IPO.”
  • 17. Hortonworks Support Page 17 © Hortonworks Inc. 2011 – 2014. All Rights Reserved We Do Hadoop. We Do Retail.
  • 18. End to end support to ensure your Hadoop success Mission Critical Hadoop Support Page 18 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Hortonworks Support Backed by the architects, builders and operators of Hadoop, Hortonworks offers the most effective and complete Hadoop support available Support Provided • Application Development Support • Diagnose Install, Config & Cluster Mgmt Issues • Access to Upgrades, Updates and Patches • Diagnose Performance Issues • Remote Troubleshooting • Diagnose Loading, Processing & Query Issues • Customer Support Portal • Advanced Knowledgebase Architect & Design Development Implementation Production Only Hortonworks provides unlimited support across architecture, development, implementation & production
  • 19. End to end support to ensure your Hadoop success Mission Critical Hadoop Support Services Architect & Design Development Implementation Production Only Hortonworks provides unlimited support across architecture, development, implementation & production Page 19 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Hortonworks Services Our services team ensures your Hadoop project will be delivered successfully Services Provided • Architecture • Implementation • Cluster Tuning • Migration • Best Practices
  • 20. End to end support to ensure your Hadoop success Mission Critical Hadoop Support Services Training Architect & Design Development Implementation Production Only Hortonworks provides unlimited support across architecture, development, implementation & production Page 20 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Hortonworks University We offer a wide range of training options backed by experts and designed to evolve your teams Hadoop proficiency Custom Coursework • On-site training for your team • Customized for your requirements Public Courses • Offered in all geographies • Hadoop Architect • Hadoop Developer • Hadoop Analyst • Hadoop Operations • Data Science
  • 21. Hadoop is a Platform Decision Open Leadership Drive innovation in the open via the Apache community-driven open source process Page 21 © Hortonworks Inc. 2011 – 2014. All Rights Reserved Enterprise Rigor Engineer, test and certify Apache Hadoop with the enterprise in mind Ecosystem Endorsement Focus on deep integration with existing data center technologies and skills Fastest Growing Customer and Partner Base Largest and most experienced Hadoop adopters have standardized on Hortonworks The data center leaders have standardized on Hortonworks
  • 22. Questions? Page 22 © Hortonworks Inc. 2011 – 2014. All Rights Reserved We Do Hadoop. We Do Retail. September 22, 2014