Page 1 © Hortonworks Inc. 2014
Discover HDP 2.2:
Apache HBase with YARN & Slider for Fast NoSQL Access
Hortonworks. We do Hadoop.
Speakers
Justin Sears
Hortonworks Product Marketing Manager
Carter Shanklin
Hortonworks Director of Product Management & PM for
Apache HBase in Hortonworks Data Platform
Enis Soztutar
Hortonworks Engineer, Apache HBase Committer & PMC Member
Agenda
•  Introduction to Apache HBase
•  New HBase Innovation in HDP 2.2
–  HBase HA
–  Support for rolling upgrades
–  HBase on YARN using Apache Slider
•  Q & A
We’ll move quickly:
•  Attendee phone lines are muted
•  Text any questions to Enis Soztutar using Webex chat
•  Questions answered at the end
•  Unanswered questions and answers in upcoming blog post
Big Data, Hadoop & Data Center Re-platforming
Business Drivers
•  From reactive analytics to proactive interactions
•  Insights that drive competitive advantage & optimal returns
Financial Drivers
•  Cost of data systems, as % of IT spend, continues to grow
•  Cost advantages of commodity hardware & open source software
Technical Drivers
•  Data is growing exponentially & existing systems overwhelmed
•  Predominantly driven by NEW types of data that can inform analytics
There is an inequitable balance between vendor and customer in the market
Hadoop Value: New Types of Data
•  Clickstream: Capture and analyze website visitors’ data trails and optimize your website
•  Sensors: Discover patterns in data streaming automatically from remote sensors and machines
•  Server Logs: Research logs to diagnose process failures and prevent security breaches
•  Sentiment: Understand how your customers feel about your brand and products, right now
•  Geographic: Analyze location-based data to manage operations where they occur
•  Unstructured: Understand patterns in files across millions of web pages, emails, and documents
A Shift from Reactive to Proactive Interactions
HDP and Hadoop allow organizations to use data to shift interactions from reactive (post transaction) to proactive (pre decision):
•  From static branding to real-time personalization
•  From break then fix to repair before break
•  From mass treatment to designer medicine
•  From educated investing to automated algorithms
•  From mass branding to 1x1 targeting
A shift in Advertising
A shift in Financial Services
A shift in Healthcare
A shift in Retail
A shift in Telco
Enterprise Goals for the Modern Data Architecture
•  Consolidate siloed data sets, structured and unstructured
•  Central data set on a single cluster
•  Multiple workloads across batch, interactive and real time
•  Central services for security, governance and operation
•  Preserve existing investment in current tools and platforms
•  Single view of the customer, product, supply chain
Diagram: Modern Data Architecture in three tiers. Sources: existing systems (CRM, ERP, other), clickstream, web & social, geolocation, sensor & machine, server logs, unstructured. Data systems: RDBMS, EDW, MPP, and Hadoop with YARN (Data Operating System) over HDFS (Hadoop Distributed File System) on nodes 1 to N, running batch, interactive and real-time workloads. Applications: business analytics, custom applications, packaged applications.
YARN Transformed Hadoop & Opened a New Era
YARN: The Architectural Center of Hadoop
•  Common data platform, many applications
•  Support multi-tenant access & processing
•  Batch, interactive & real-time use cases
Diagram: YARN: Data Operating System (Cluster Resource Management) running on HDFS (Hadoop Distributed File System), with batch, interactive & real-time data access engines on top: Script (Pig), SQL (Hive), Java/Scala (Cascading), NoSQL (HBase, Accumulo), Stream (Storm), Search (Solr), In-Memory (Spark) and other ISV engines. Tez and Slider appear as layers between YARN and some of these engines.
YARN Extends Hadoop to Other Data Center Leaders
YARN: The Architectural Center of Hadoop
•  Common data platform, many applications
•  Support multi-tenant access & processing
•  Batch, interactive & real-time use cases
•  Supports 3rd-party ISV tools (ex. SAS, Syncsort, Actian, etc.)
YARN Ready Applications
Facilitates ongoing innovation and enterprise adoption via ecosystem of new and existing “YARN Ready” solutions
Diagram: the same YARN data access stack as on the previous slide (Pig, Hive, Cascading, HBase, Accumulo, Storm, Solr, Spark and other ISV engines over YARN and HDFS, with Tez and Slider).
Enterprise Hadoop: Central Set of Services
Enables Apache Hadoop to be an Enterprise Data Platform with centralized services for:
•  Governance: Load data and manage according to policy
•  Operations: Deploy and effectively manage the platform (Provision, Manage & Monitor: Ambari, Zookeeper; Scheduling: Oozie)
•  Security: Provide layered approach to security through Authentication, Authorization, Accounting, and Data Protection
Everything that plugs into Hadoop inherits these services
Diagram: Governance, Security and Operations services surrounding the batch, interactive & real-time data access engines (Pig, Hive, Cascading, HBase, Accumulo, Storm, Solr, Spark and other ISV engines, with Tez and Slider), all on YARN: Data Operating System (Cluster Resource Management) and HDFS (Hadoop Distributed File System).
Hortonworks Data Platform 2.2
HDP Delivers Enterprise Hadoop
YARN is the architectural center of HDP
•  Common data set across all applications
•  Batch, interactive & real-time workloads
•  Multi-tenant access & processing
Provides comprehensive enterprise capabilities
•  Governance
•  Security
•  Operations
Enables broad ecosystem adoption
•  ISVs can plug directly into Hadoop
The widest range of deployment options
•  Linux & Windows
•  On premises & cloud
Diagram: HDP 2.2 component stack on YARN: Data Operating System (Cluster Resource Management) and HDFS (Hadoop Distributed File System).
•  Batch, interactive & real-time data access: Script (Pig), SQL (Hive), Java/Scala (Cascading), NoSQL (HBase, Accumulo), Stream (Storm), Search (Solr), In-Memory (Spark), other ISV engines, with Tez and Slider
•  Governance: Data Workflow, Lifecycle & Governance: Falcon, Sqoop, Flume, Kafka, NFS, WebHDFS
•  Security: Authentication, Authorization, Audit, Data Protection (Storage: HDFS; Resources: YARN; Access: Hive; Pipeline: Falcon; Cluster: Ranger; Cluster: Knox)
•  Operations: Provision, Manage & Monitor: Ambari, Zookeeper; Scheduling: Oozie
•  Deployment Choice: Linux, Windows, On-Premises, Cloud
Introduction to Apache HBase
What Is Apache HBase?
•  Flexible Schema
•  Extreme Low Latency
•  SQL and NoSQL Interfaces
•  Store and Process Petabytes of Data
•  Scale out on Commodity Servers
•  Integrated with YARN
•  100% Open Source
Diagram: HBase RegionServers running on YARN (Data Operating System) across nodes 1 to N, with HDFS as permanent data storage.
Flexible Schema
Extreme Low Latency
Directly Integrated with Hadoop
New in HDP 2.2: HBase HA
HBase HA: 3 Levels of Protection
HBase RegionServer 1: Primary Keys (Read Write) 1-100; Standby Keys (Read Only) 101-200, 201-300
HBase RegionServer 2: Primary Keys (Read Write) 101-200; Standby Keys (Read Only) 201-300, 301-400
HBase RegionServer 3: Primary Keys (Read Write) 201-300; Standby Keys (Read Only) 301-400, 1-100
HBase RegionServer 4: Primary Keys (Read Write) 301-400; Standby Keys (Read Only) 1-100, 101-200
HDFS (3 Copies of All Data, Available to all RegionServers)
1. HBase keys are range partitioned across servers; a node failure affects 1 key range, the rest remain available.
2. HBase HA stores read-only copies in separate RegionServers. Data can still be read if a node fails.
3. 3 copies of all data are stored in HDFS. Data from failed nodes is automatically recovered on other nodes.
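The first two levels of protection can be illustrated with a small model. This is a minimal sketch in Python, assuming only the key ranges and replica placement drawn on the slide; `RANGES`, `PRIMARY`, `STANDBY` and the function names are hypothetical and are not HBase APIs. Region reassignment and HDFS recovery (level 3) are deliberately not modeled.

```python
from bisect import bisect_right

# Key ranges and replica placement as drawn on the slide (illustrative only).
RANGES = [(1, 100), (101, 200), (201, 300), (301, 400)]
PRIMARY = {(1, 100): "rs1", (101, 200): "rs2", (201, 300): "rs3", (301, 400): "rs4"}
STANDBY = {
    (1, 100): ["rs3", "rs4"],
    (101, 200): ["rs1", "rs4"],
    (201, 300): ["rs1", "rs2"],
    (301, 400): ["rs2", "rs3"],
}


def range_for(key):
    """Find the key range that contains `key` (range partitioning)."""
    starts = [low for low, _ in RANGES]
    index = bisect_right(starts, key) - 1
    if index < 0 or key > RANGES[index][1]:
        raise KeyError(key)
    return RANGES[index]


def servers_for_read(key, down=frozenset()):
    """Servers that can still serve a read: the primary first, then standbys."""
    key_range = range_for(key)
    candidates = [PRIMARY[key_range]] + STANDBY[key_range]
    return [server for server in candidates if server not in down]


def server_for_write(key, down=frozenset()):
    """Only the primary accepts writes; None while the primary is down."""
    primary = PRIMARY[range_for(key)]
    return None if primary in down else primary


if __name__ == "__main__":
    # With RegionServer 2 down, key 150 is still readable from standby copies.
    print(servers_for_read(150, down={"rs2"}))
    print(server_for_write(150, down={"rs2"}))
```

In this model a failure of one RegionServer leaves writes to the other three key ranges untouched, and reads of the affected range fall back to the read-only copies.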
  
Comparing HBase HA Phase 1 Versus 2
Item | HA Phase 1 / HDP 2.1 | HA Phase 2 / HDP 2.2
Data Staleness | > 30s | Near Zero
HA in Scans | Unsupported | Supported
Region Split/Merge | Disabled | Supported
META Table Highly Available | Unsupported | Supported
HBCK check for common HA problems | Unsupported | Supported
New in HDP 2.2: Rolling Upgrades
Rolling Upgrade Goals
Zero downtime upgrades
Roll forward and roll backward
Update clients and servers independently
HBase Rolling Upgrade: Component Overview
•  New Package Format: Install multiple versions of Hadoop software on a single node or cluster.
•  hdp-select Utility: Choose the component version you want, roll forward or backward.
•  Decoupled Clients and Servers: Upgrade servers independently of clients.
HBase Rolling Upgrade: Directory Layout
Directory Layout: /usr/hdp
Multiple versions of the HDP stack.
[root@cluster1 hdp]# pwd
/usr/hdp
[root@cluster1 hdp]# ls -l
drwxr-xr-x. 19 root root 4096 Nov 15 07:26 2.2.0.0-1995
drwxr-xr-x.  2 root root 4096 Dec  7 01:22 2.2.0.1-2217
drwxr-xr-x.  2 root root 4096 Dec  6 22:57 current
Within /usr/hdp/current
[root@cluster1 current]# pwd
/usr/hdp/current
[root@cluster1 current]# ls -l | grep hbase
lrwxrwxrwx. 1 root root 27 Dec  6 22:57 hbase-client -> /usr/hdp/2.2.0.0-1995/hbase
lrwxrwxrwx. 1 root root 27 Dec  6 22:57 hbase-master -> /usr/hdp/2.2.0.0-1995/hbase
lrwxrwxrwx. 1 root root 27 Dec  6 22:57 hbase-regionserver -> /usr/hdp/2.2.0.0-1995/hbase
HBase Rolling Upgrade: Upgrade One Component
hdp-select
[root@cluster1 hdp]# hdp-select status | grep hbase
hbase-client - 2.2.0.0-1995
hbase-master - 2.2.0.0-1995
hbase-regionserver - 2.2.0.0-1995
Upgrade Servers Before Clients
[root@cluster1 hdp]# hdp-select set hbase-master 2.2.0.1-2217
[root@cluster1 current]# pwd
/usr/hdp/current
[root@cluster1 current]# ls -l | grep hbase
lrwxrwxrwx. 1 root root 27 Dec  6 22:57 hbase-client -> /usr/hdp/2.2.0.0-1995/hbase
lrwxrwxrwx. 1 root root 27 Dec  7 02:23 hbase-master -> /usr/hdp/2.2.0.1-2217/hbase
lrwxrwxrwx. 1 root root 27 Dec  6 22:57 hbase-regionserver -> /usr/hdp/2.2.0.0-1995/hbase
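The visible effect of the listing above, one component's link under `current` moving to a different version directory, can be reproduced in a scratch directory. This is a minimal sketch in Python, assuming a POSIX filesystem; `make_layout`, `select_version` and `status` are hypothetical helpers and say nothing about how hdp-select itself is implemented.

```python
import os
import tempfile


def make_layout(root, versions, components):
    """Create <root>/<version>/hbase and point every current/<component> at the first version."""
    for version in versions:
        os.makedirs(os.path.join(root, version, "hbase"))
    os.makedirs(os.path.join(root, "current"))
    for component in components:
        os.symlink(os.path.join(root, versions[0], "hbase"),
                   os.path.join(root, "current", component))


def select_version(root, component, version):
    """Repoint current/<component> at <version>/hbase, leaving other components alone."""
    target = os.path.join(root, version, "hbase")
    if not os.path.isdir(target):
        raise FileNotFoundError(target)
    link = os.path.join(root, "current", component)
    temporary = link + ".tmp"
    os.symlink(target, temporary)
    # rename(2) replaces the old link itself, so readers never see a missing link.
    os.replace(temporary, link)


def status(root):
    """Map each component under current/ to the version directory its link points into."""
    current = os.path.join(root, "current")
    result = {}
    for name in sorted(os.listdir(current)):
        target = os.readlink(os.path.join(current, name))
        result[name] = os.path.basename(os.path.dirname(target))
    return result


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as root:
        make_layout(root, ["2.2.0.0-1995", "2.2.0.1-2217"],
                    ["hbase-client", "hbase-master", "hbase-regionserver"])
        select_version(root, "hbase-master", "2.2.0.1-2217")
        for component, version in status(root).items():
            print(component, "-", version)
```

Because both versions stay installed side by side, rolling backward in this model is the same call with the older version string.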
  
Rolling Upgrade Contracts
Rolling Upgrade works for minor upgrades.
•  Example: HDP 2.2.0 to HDP 2.2.1.
Wire compatibility guaranteed between clients and servers.
Binary compatibility guaranteed, e.g. for coprocessors.
Data format compatibility guaranteed.
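A pre-flight check for the "minor upgrades" contract might look like the following. This is a minimal sketch in Python under a stated assumption: the slide gives only the example 2.2.0 to 2.2.1, so the rule "the first two version fields must match" is an interpretation, not a documented HDP rule. `parse_version` and `same_release_line` are hypothetical names.

```python
def parse_version(text):
    """Split '2.2.0.1-2217' into ((2, 2, 0, 1), 2217); the build suffix is optional."""
    numbers, _, build = text.partition("-")
    fields = tuple(int(part) for part in numbers.split("."))
    return fields, (int(build) if build else None)


def same_release_line(current, target):
    """Assumption: a minor upgrade keeps the first two version fields unchanged."""
    current_fields, _ = parse_version(current)
    target_fields, _ = parse_version(target)
    # Direction is not checked, since the deck allows rolling forward and backward.
    return current_fields[:2] == target_fields[:2] and current_fields != target_fields


if __name__ == "__main__":
    print(same_release_line("2.2.0.0-1995", "2.2.0.1-2217"))
    print(same_release_line("2.1.7", "2.2.0"))
```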
Rolling Upgrade Benefits
•  Upgrade with zero downtime.
•  Roll forward and roll backward.
•  Instant switchover / restart preserve data locality when upgrading HBase.
•  Update servers and clients independently.
New in HDP 2.2: HBase on YARN via Slider
Deploying HBase with Slider
What is it?
•  Deploy HBase into the Hadoop cluster using YARN.
Benefit | Details
Simplified Deployment | No need to deploy HBase or its configuration to individual cluster nodes.
Lifecycle Management | Start / stop / process management handled automatically.
Multitenancy | Different users can run HBase clusters within one Hadoop cluster.
Multiple Versions | Run different versions of HBase (e.g. 0.98 and 1.0) on the same cluster.
Elasticity | Cluster size is a parameter and easily changed.
Co-located Analytics | HBase resource usage is known to YARN, so nodes running HBase will not be used as heavily to satisfy MapReduce or Tez jobs.
HBase / Slider Sample
Configure HBase settings in appConfig.json and resources.json
Sample Slider Command:
•  slider create mycluster --template appConfig.json --resources resources.json
{
  "schema": "http://example.org/specification/v2.0.0",
  "metadata": {
  },
  "global": {
    "site.hbase-site.hbase.hstore.flush.retries.number": "120",
    "site.hbase-site.hbase.client.keyvalue.maxsize": "10485760",
    "site.hbase-site.hbase.hstore.compactionThreshold": "3",
    "site.hbase-site.hbase.rootdir": "${DEFAULT_DATA_DIR}/data",
    "site.hbase-site.hbase.stagingdir": "${DEFAULT_DATA_DIR}/staging",
    "site.hbase-site.hbase.regionserver.handler.count": "60",
...
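The naming pattern in the sample, where each hbase-site property is carried under a `site.hbase-site.` prefix with a string value, can be generated rather than typed by hand. This is a minimal sketch in Python that reproduces only the part of the file shown on the slide; `slider_global_section` is a hypothetical helper, and the sample is truncated, so a complete appConfig.json needs further entries that are not shown here.

```python
import json


def slider_global_section(hbase_site):
    """Prefix each hbase-site property with 'site.hbase-site.' and stringify its value."""
    return {f"site.hbase-site.{name}": str(value) for name, value in hbase_site.items()}


# Property names and values taken from the sample on the slide.
hbase_site = {
    "hbase.hstore.flush.retries.number": 120,
    "hbase.client.keyvalue.maxsize": 10485760,
    "hbase.hstore.compactionThreshold": 3,
    "hbase.rootdir": "${DEFAULT_DATA_DIR}/data",
    "hbase.stagingdir": "${DEFAULT_DATA_DIR}/staging",
    "hbase.regionserver.handler.count": 60,
}

# Fragment only: the remaining sections of the file are elided in the source.
app_config_fragment = {
    "schema": "http://example.org/specification/v2.0.0",
    "metadata": {},
    "global": slider_global_section(hbase_site),
}

if __name__ == "__main__":
    print(json.dumps(app_config_fragment, indent=2))
```

Keeping the settings in a plain dictionary makes it easy to diff two HBase clusters' configurations before passing the generated file to `slider create`.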
  
Q & A
Thank you!
Learn more at:
hortonworks.com/hadoop/hbase/
Register for the last
Discover HDP 2.2 Webinar
Hortonworks.com/webinars

10 Trends Likely to Shape Enterprise Technology in 202410 Trends Likely to Shape Enterprise Technology in 2024
10 Trends Likely to Shape Enterprise Technology in 2024
 
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdfThe Ultimate Test Automation Guide_ Best Practices and Tips.pdf
The Ultimate Test Automation Guide_ Best Practices and Tips.pdf
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
%+27788225528 love spells in Colorado Springs Psychic Readings, Attraction sp...
 
Exploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdfExploring the Best Video Editing App.pdf
Exploring the Best Video Editing App.pdf
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdfPayment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
Payment Gateway Testing Simplified_ A Step-by-Step Guide for Beginners.pdf
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Atlanta Psychic Readings, Attraction spells,Brin...
 
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...Chinsurah Escorts ☎️8617697112  Starting From 5K to 15K High Profile Escorts ...
Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...
 

Discover.hdp2.2.h base.final[2]

  • 1. Page 1 © Hortonworks Inc. 2014 Discover HDP 2.2: Apache HBase with YARN & Slider for Fast NoSQL Access Hortonworks. We do Hadoop.
  • 2. Speakers
      • Justin Sears, Hortonworks Product Marketing Manager
      • Carter Shanklin, Hortonworks Director of Product Management & PM for Apache HBase in Hortonworks Data Platform
      • Enis Soztutar, Hortonworks Engineer, Apache HBase Committer & PMC Member
  • 3. Agenda
      • Introduction to Apache HBase
      • New HBase Innovation in HDP 2.2
          – HBase HA
          – Support for rolling upgrades
          – HBase on YARN using Apache Slider
      • Q & A
      We'll move quickly:
      • Attendee phone lines are muted
      • Text any questions to Enis Soztutar using Webex chat
      • Questions answered at the end
      • Unanswered questions and answers in upcoming blog post
  • 4. Big Data, Hadoop & Data Center Re-platforming
      Business Drivers
      • From reactive analytics to proactive interactions
      • Insights that drive competitive advantage & optimal returns
      Financial Drivers
      • Cost of data systems, as % of IT spend, continues to grow
      • Cost advantages of commodity hardware & open source software
      Technical Drivers
      • Data is growing exponentially & existing systems overwhelmed
      • Predominantly driven by NEW types of data that can inform analytics
      There is an inequitable balance between vendor and customer in the market
  • 5. Hadoop Value: New Types of Data
      • Clickstream: Capture and analyze website visitors' data trails and optimize your website
      • Sensors: Discover patterns in data streaming automatically from remote sensors and machines
      • Server Logs: Research logs to diagnose process failures and prevent security breaches
      • Sentiment: Understand how your customers feel about your brand and products – right now
      • Geographic: Analyze location-based data to manage operations where they occur
      • Unstructured: Understand patterns in files across millions of web pages, emails, and documents
  • 6. A Shift from Reactive to Proactive Interactions
      HDP and Hadoop allow organizations to use data to shift interactions from Reactive (Post Transaction) to Proactive (Pre Decision):
      • From static branding to Real-time Personalization
      • From break then fix to repair before break
      • From mass treatment to Designer Medicine
      • From Educated Investing to Automated Algorithms
      • From mass branding to 1x1 Targeting
      Industries shown: a shift in Advertising, Financial Services, Healthcare, Retail and Telco
  • 7. Enterprise Goals for the Modern Data Architecture
      • Consolidate siloed data sets, structured and unstructured
      • Central data set on a single cluster
      • Multiple workloads across batch, interactive and real time
      • Central services for security, governance and operation
      • Preserve existing investment in current tools and platforms
      • Single view of the customer, product, supply chain
      [Diagram: Applications (Business Analytics, Custom Applications, Packaged Applications); Data System (RDBMS, EDW, MPP, and YARN: Data Operating System running Batch, Interactive and Real-Time workloads over HDFS (Hadoop Distributed File System)); Sources (Existing Systems, CRM, ERP, Other, Clickstream, Web & Social, Geolocation, Sensor & Machine, Server Logs, Unstructured)]
  • 8. YARN Transformed Hadoop & Opened a New Era
      YARN: The Architectural Center of Hadoop
      • Common data platform, many applications
      • Support multi-tenant access & processing
      • Batch, interactive & real-time use cases
      [Diagram: Batch, Interactive & Real-Time Data Access engines (Script: Pig; SQL: Hive; Java/Scala: Cascading; Stream: Storm; Search: Solr; NoSQL: HBase, Accumulo; In-Memory: Spark; Others: ISV Engines), with Tez and Slider layers, on YARN: Data Operating System (Cluster Resource Management) over HDFS (Hadoop Distributed File System)]
  • 9. YARN Extends Hadoop to Other Data Center Leaders
      YARN: The Architectural Center of Hadoop
      • Common data platform, many applications
      • Support multi-tenant access & processing
      • Batch, interactive & real-time use cases
      • Supports 3rd-party ISV tools (e.g. SAS, Syncsort, Actian)
      YARN Ready Applications: Facilitates ongoing innovation and enterprise adoption via an ecosystem of new and existing "YARN Ready" solutions
      [Diagram: same data access engines as on slide 8, on YARN: Data Operating System (Cluster Resource Management) over HDFS (Hadoop Distributed File System)]
  • 10. Enterprise Hadoop: Central Set of Services
      Enables Apache Hadoop to be an Enterprise Data Platform with centralized services for:
      • Governance: Load data and manage according to policy
      • Operations: Deploy and effectively manage the platform (Provision, Manage & Monitor: Ambari, Zookeeper; Scheduling: Oozie)
      • Security: Provide layered approach to security through Authentication, Authorization, Accounting, and Data Protection
      Everything that plugs into Hadoop inherits these services
      [Diagram: Batch, Interactive & Real-Time Data Access engines (Script: Pig; SQL: Hive; Java/Scala: Cascading; Stream: Storm; Search: Solr; NoSQL: HBase, Accumulo; In-Memory: Spark; Others: ISV Engines), with Tez and Slider layers, on YARN: Data Operating System (Cluster Resource Management) over HDFS (Hadoop Distributed File System)]
  • 11. Hortonworks Data Platform 2.2: HDP Delivers Enterprise Hadoop
      YARN is the architectural center of HDP
      • Common data set across all applications
      • Batch, interactive & real-time workloads
      • Multi-tenant access & processing
      Provides comprehensive enterprise capabilities
      • Governance
      • Security
      • Operations
      Enables broad ecosystem adoption
      • ISVs can plug directly into Hadoop
      The widest range of deployment options
      • Linux & Windows
      • On premises & cloud
      [Diagram: Governance (Data Workflow, Lifecycle & Governance: Falcon, Sqoop, Flume, Kafka, NFS, WebHDFS); Batch, Interactive & Real-Time Data Access (Script: Pig; SQL: Hive; Java/Scala: Cascading; Stream: Storm; Search: Solr; NoSQL: HBase, Accumulo; In-Memory: Spark; Others: ISV Engines; Tez and Slider layers) on YARN: Data Operating System (Cluster Resource Management) over HDFS (Hadoop Distributed File System); Security (Authentication, Authorization, Audit, Data Protection; Storage: HDFS; Resources: YARN; Access: Hive; Pipeline: Falcon; Cluster: Ranger; Cluster: Knox); Operations (Provision, Manage & Monitor: Ambari, Zookeeper; Scheduling: Oozie); Deployment Choice (Linux, Windows, On-Premises, Cloud)]
  • 12. Hortonworks Data Platform 2.2: HDP Delivers Enterprise Hadoop
      YARN is the architectural center of HDP
      • Common data set across all applications
      • Batch, interactive & real-time workloads
      • Multi-tenant access & processing
      Provides comprehensive enterprise capabilities
      • Governance
      • Security
      • Operations
      Enables broad ecosystem adoption
      • ISVs can plug directly into Hadoop
      The widest range of deployment options
      • Linux & Windows
      • On premises & cloud
      [Diagram: the HDP 2.2 stack from slide 11, with NoSQL (HBase, Accumulo) on Slider called out separately]
  • 13. Introduction to Apache HBase
  • 14. What Is Apache HBase?
      • Flexible Schema
      • Extreme Low Latency
      • SQL and NoSQL Interfaces
      • Store and Process Petabytes of Data
      • Scale out on Commodity Servers
      • Integrated with YARN
      • 100% Open Source
      [Diagram: HBase RegionServers running on YARN: Data Operating System, over HDFS (Permanent Data Storage). Callouts: Flexible Schema; Extreme Low Latency; Directly Integrated with Hadoop]
  • 15. New in HDP 2.2: HBase HA
  • 16. HBase HA: 3 Levels of Protection
      Server | Primary Keys (Read Write) | Standby Keys (Read Only)
      HBase RegionServer 1 | 1-100 | 101-200, 201-300
      HBase RegionServer 2 | 101-200 | 201-300, 301-400
      HBase RegionServer 3 | 201-300 | 301-400, 1-100
      HBase RegionServer 4 | 301-400 | 1-100, 101-200
      HDFS (3 Copies of All Data, Available to all RegionServers)
      1. HBase keys are range partitioned across servers; a node failure affects 1 key range, the rest remain available.
      2. HBase HA stores read-only copies in separate RegionServers. Data can still be read if a node fails.
      3. 3 copies of all data are stored in HDFS. Data from failed nodes is automatically recovered on other nodes.
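
The layout on slide 16 can be sketched as a small model. This is an illustration only, not HBase code: the `LAYOUT` table copies the key ranges from the slide, while the function names `covers`, `readers` and `writable` are hypothetical. It models the window after a RegionServer fails and before HDFS-based recovery reassigns its primary range.

```python
from typing import FrozenSet, List, Tuple

KeyRange = Tuple[int, int]

# Layout copied from the slide: server -> (primary range, standby ranges).
LAYOUT = {
    "RegionServer 1": ((1, 100), [(101, 200), (201, 300)]),
    "RegionServer 2": ((101, 200), [(201, 300), (301, 400)]),
    "RegionServer 3": ((201, 300), [(301, 400), (1, 100)]),
    "RegionServer 4": ((301, 400), [(1, 100), (101, 200)]),
}


def covers(key_range: KeyRange, key: int) -> bool:
    """Return True if key falls inside the inclusive range."""
    low, high = key_range
    return low <= key <= high


def readers(key: int, failed: FrozenSet[str] = frozenset()) -> List[Tuple[str, str]]:
    """List (server, role) pairs that can serve a read for key.

    Servers named in `failed` are skipped. Role is "primary" (read/write)
    or "standby" (read-only copy).
    """
    result = []
    for server, (primary, standbys) in LAYOUT.items():
        if server in failed:
            continue
        if covers(primary, key):
            result.append((server, "primary"))
        elif any(covers(standby, key) for standby in standbys):
            result.append((server, "standby"))
    return result


def writable(key: int, failed: FrozenSet[str] = frozenset()) -> bool:
    """A key accepts writes only while its primary server is alive."""
    return any(role == "primary" for _, role in readers(key, failed))


if __name__ == "__main__":
    down = frozenset({"RegionServer 2"})
    print(readers(150))        # all three copies available
    print(readers(150, down))  # standby copies only
    print(writable(150, down))
```

With RegionServer 2 marked as failed, key 150 can still be read from the standby copies on RegionServers 1 and 4, but it has no writable primary in this model, while keys in the other three ranges are unaffected.
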
  • 17. Comparing HBase HA Phase 1 Versus 2
      Item | HA Phase 1 / HDP 2.1 | HA Phase 2 / HDP 2.2
      Data Staleness | > 30s | Near Zero
      HA in Scans | Unsupported | Supported
      Region Split/Merge | Disabled | Supported
      META Table Highly Available | Unsupported | Supported
      HBCK check for common HA problems | Unsupported | Supported
  • 18. New in HDP 2.2: Rolling Upgrades
  • 19. Rolling Upgrade Goals
      • Zero downtime upgrades
      • Roll forward and roll backward
      • Update clients and servers independently
  • 20. HBase Rolling Upgrade: Component Overview
      • New Package Format: Install multiple versions of Hadoop software on a single node or cluster.
      • hdp-select Utility: Choose the component version you want, roll forward or backward.
      • Decoupled Clients and Servers: Upgrade servers independently of clients.
  • 21. HBase Rolling Upgrade: Directory Layout
      Within /usr/hdp/current:
          [root@cluster1 current]# pwd
          /usr/hdp/current
          [root@cluster1 current]# ls -l | grep hbase
          lrwxrwxrwx. 1 root root 27 Dec  6 22:57 hbase-client -> /usr/hdp/2.2.0.0-1995/hbase
          lrwxrwxrwx. 1 root root 27 Dec  6 22:57 hbase-master -> /usr/hdp/2.2.0.0-1995/hbase
          lrwxrwxrwx. 1 root root 27 Dec  6 22:57 hbase-regionserver -> /usr/hdp/2.2.0.0-1995/hbase
      Directory layout of /usr/hdp (multiple versions of the HDP stack):
          [root@cluster1 hdp]# pwd
          /usr/hdp
          [root@cluster1 hdp]# ls -l
          drwxr-xr-x. 19 root root 4096 Nov 15 07:26 2.2.0.0-1995
          drwxr-xr-x.  2 root root 4096 Dec  7 01:22 2.2.0.1-2217
          drwxr-xr-x.  2 root root 4096 Dec  6 22:57 current
  • 22. HBase Rolling Upgrade: Upgrade One Component
      hdp-select:
          [root@cluster1 hdp]# hdp-select status | grep hbase
          hbase-client - 2.2.0.0-1995
          hbase-master - 2.2.0.0-1995
          hbase-regionserver - 2.2.0.0-1995
      Upgrade Servers Before Clients:
          [root@cluster1 hdp]# hdp-select set hbase-master 2.2.0.1-2217
          [root@cluster1 current]# pwd
          /usr/hdp/current
          [root@cluster1 current]# ls -l | grep hbase
          lrwxrwxrwx. 1 root root 27 Dec  6 22:57 hbase-client -> /usr/hdp/2.2.0.0-1995/hbase
          lrwxrwxrwx. 1 root root 27 Dec  7 02:23 hbase-master -> /usr/hdp/2.2.0.1-2217/hbase
          lrwxrwxrwx. 1 root root 27 Dec  6 22:57 hbase-regionserver -> /usr/hdp/2.2.0.0-1995/hbase
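
The symlink switch shown on slides 21 and 22 can be modeled in a few lines. This is a sketch of the observable behavior only, not the code of the hdp-select utility: the helper names `set_component`, `status` and `build_demo` are hypothetical, and the demo runs in a temporary directory instead of /usr/hdp.

```python
import os
import tempfile


def set_component(root: str, component: str, version: str, package: str = "hbase") -> None:
    """Point <root>/current/<component> at <root>/<version>/<package>."""
    target = os.path.join(root, version, package)
    if not os.path.isdir(target):
        raise FileNotFoundError(f"version not installed: {target}")
    link = os.path.join(root, "current", component)
    tmp = link + ".tmp"
    if os.path.lexists(tmp):
        os.remove(tmp)
    os.symlink(target, tmp)
    # Renaming over the old link swaps it in one step, so readers never
    # observe a missing link.
    os.replace(tmp, link)


def status(root: str) -> dict:
    """Return {component: version} by reading the links under current/."""
    current = os.path.join(root, "current")
    result = {}
    for name in sorted(os.listdir(current)):
        target = os.readlink(os.path.join(current, name))
        # The target looks like <root>/<version>/<package>.
        result[name] = os.path.basename(os.path.dirname(target))
    return result


def build_demo(root: str) -> None:
    """Create two side-by-side versions with every link on the older one."""
    for version in ("2.2.0.0-1995", "2.2.0.1-2217"):
        os.makedirs(os.path.join(root, version, "hbase"))
    os.makedirs(os.path.join(root, "current"))
    for component in ("hbase-client", "hbase-master", "hbase-regionserver"):
        set_component(root, component, "2.2.0.0-1995")


if __name__ == "__main__":
    demo_root = tempfile.mkdtemp()
    build_demo(demo_root)
    # Upgrade the server side first, as the slide recommends.
    set_component(demo_root, "hbase-master", "2.2.0.1-2217")
    for name, version in status(demo_root).items():
        print(f"{name} - {version}")
```

Because each component has its own link, moving `hbase-master` forward leaves `hbase-client` and `hbase-regionserver` on the older version, and rolling back is the same call with the previous version string.
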
  • 23. Rolling Upgrade Contracts
      • Rolling Upgrade works for minor upgrades. Example: HDP 2.2.0 to HDP 2.2.1.
      • Wire compatibility guaranteed between clients and servers.
      • Binary compatibility guaranteed, e.g. for coprocessors.
      • Data format compatibility guaranteed.
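
A pre-flight check for the first contract might look like the sketch below. It rests on an assumption drawn only from the slide's example (2.2.0 to 2.2.1): that a "minor upgrade" is one where both versions share the same first two version components. The helper names `release_line` and `same_release_line` are hypothetical and not part of any HDP tool.

```python
from typing import Tuple


def release_line(version: str) -> Tuple[int, int]:
    """Extract the first two numeric components, e.g. (2, 2).

    Accepts strings such as "2.2.0" or "2.2.0.1-2217"; the part after
    the dash is treated as a build number and ignored.
    """
    numeric = version.split("-", 1)[0]
    parts = numeric.split(".")
    return int(parts[0]), int(parts[1])


def same_release_line(current: str, target: str) -> bool:
    """Assumed eligibility rule: both versions are on the same X.Y line.

    The comparison is symmetric, since the slides describe rolling both
    forward and backward.
    """
    return release_line(current) == release_line(target)


if __name__ == "__main__":
    print(same_release_line("2.2.0.0-1995", "2.2.0.1-2217"))
    print(same_release_line("2.1.0", "2.2.0"))
```
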
  • 24. Rolling Upgrade Benefits
      • Upgrade with zero downtime.
      • Roll forward and roll backward.
      • Instant switchover / restart preserve data locality when upgrading HBase.
      • Update servers and clients independently.
  • 25. New in HDP 2.2: HBase on YARN via Slider
  • 26. Deploying HBase with Slider
      What is it?
      • Deploy HBase into the Hadoop cluster using YARN.
      Benefit | Details
      Simplified Deployment | No need to deploy HBase or its configuration to individual cluster nodes.
      Lifecycle Management | Start / stop / process management handled automatically.
      Multitenancy | Different users can run HBase clusters within one Hadoop cluster.
      Multiple Versions | Run different versions of HBase (e.g. 0.98 and 1.0) on the same cluster.
      Elasticity | Cluster size is a parameter and easily changed.
      Co-located Analytics | HBase resource usage is known to YARN; nodes running HBase will not be used as heavily to satisfy MapReduce or Tez jobs.
  • 27. HBase / Slider Sample
      Configure HBase settings in appConfig.json and resources.json
      Sample Slider Command:
          slider create mycluster --template appConfig.json --resources resources.json
      Sample appConfig.json (truncated on the slide):
          {
            "schema": "http://example.org/specification/v2.0.0",
            "metadata": {
            },
            "global": {
              "site.hbase-site.hbase.hstore.flush.retries.number": "120",
              "site.hbase-site.hbase.client.keyvalue.maxsize": "10485760",
              "site.hbase-site.hbase.hstore.compactionThreshold": "3",
              "site.hbase-site.hbase.rootdir": "${DEFAULT_DATA_DIR}/data",
              "site.hbase-site.hbase.stagingdir": "${DEFAULT_DATA_DIR}/staging",
              "site.hbase-site.hbase.regionserver.handler.count": "60",
              ...
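
To make the key naming in the sample on slide 27 easier to read, the sketch below pulls the HBase settings back out of the `global` section. It assumes that keys prefixed with `site.hbase-site.` correspond to properties of the same name in hbase-site.xml; that reading is inferred from the key names on the slide. The dictionary reproduces only the six settings visible on the slide (the rest are elided there), and `hbase_site_properties` is a hypothetical helper, not a Slider API.

```python
import json

# Only the settings visible on the slide; the original file continues.
APP_CONFIG_FRAGMENT = {
    "schema": "http://example.org/specification/v2.0.0",
    "metadata": {},
    "global": {
        "site.hbase-site.hbase.hstore.flush.retries.number": "120",
        "site.hbase-site.hbase.client.keyvalue.maxsize": "10485760",
        "site.hbase-site.hbase.hstore.compactionThreshold": "3",
        "site.hbase-site.hbase.rootdir": "${DEFAULT_DATA_DIR}/data",
        "site.hbase-site.hbase.stagingdir": "${DEFAULT_DATA_DIR}/staging",
        "site.hbase-site.hbase.regionserver.handler.count": "60",
    },
}

# Assumed convention: "site.<config file>.<property name>".
PREFIX = "site.hbase-site."


def hbase_site_properties(app_config: dict) -> dict:
    """Return {property: value} for keys under the hbase-site prefix."""
    return {
        key[len(PREFIX):]: value
        for key, value in app_config.get("global", {}).items()
        if key.startswith(PREFIX)
    }


if __name__ == "__main__":
    # Values stay strings, matching the sample on the slide.
    print(json.dumps(hbase_site_properties(APP_CONFIG_FRAGMENT), indent=2))
```
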
  • 28. Q & A
  • 29. Thank you!
      • Learn more at: hortonworks.com/hadoop/hbase/
      • Register for the last Discover HDP 2.2 Webinar: Hortonworks.com/webinars