1 hbasecon.com
HBase 2.0 and Beyond Panel
Moderator: Jonathan Hsieh
Panel: Matteo Bertozzi / Sean Busbey / Jingcheng Du / Lars Hofhansl / Enis Soztutar / Jimmy Xiang
Who are we?
 Matteo Bertozzi – HBase PMC, Cloudera
 Sean Busbey – HBase PMC, Cloudera
 Jingcheng Du – Intel
 Lars Hofhansl – HBase PMC, 0.94.x RM, Salesforce.com
 Jonathan Hsieh – HBase PMC
 Enis Soztutar – HBase PMC, 1.0.0 RM, Hortonworks
 Jimmy Xiang – HBase PMC, Cloudera
Outline
 Storing Larger Objects efficiently
 Making DDL Operations fault tolerant
 Better Region Assignment
 Compatibility guarantees for our users
 Improving Availability
 Using all machine resources
 Q+A
Why Moderate Object Storage (MOB)?
 Growing demand for the ability to store moderate objects (MOBs) in HBase (100KB up to 10MB).
 Because of the write amplification created by compactions, write performance degrades as massive MOBs accumulate in HBase.
 Too many store files -> Frequent region compactions -> Massive I/O -> Slow compactions -> Flush delay -> High memory usage -> Blocking updates
[Figure: Data Insertion Average Latency (5MB/record, 32 pre-split regions). Latency (sec) by data volume: 125G = 8.098, 500G = 10.159, 1T = 10.700.]
[Figure: 1T Data Insertion Average Latency (5MB/record, 32 pre-split regions). Latency (sec) over time (hours 1 to 8).]
How MOB I/O works
[Diagram: Write path: Client -> HRegionServer -> HLog and memstore (MOB cell) -> Flush -> HFile (Ref cell) and MOB HFile (MOB cell).]
[Diagram: Read path: Client -> HRegionServer -> memstore, HFile (Ref cell) -> MOB HFile (MOB cell).]
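The write and read paths above can be illustrated with a small, self-contained sketch. This is not HBase code: the `flush` and `read` functions, the `MOB_THRESHOLD` value and the dictionary-based "files" are all hypothetical, chosen only to show how a large value might land in a separate MOB file while the regular store file keeps a small reference cell.

```python
from dataclasses import dataclass, field

# Hypothetical threshold for this sketch; the slides only say that MOBs
# are roughly 100KB up to 10MB.
MOB_THRESHOLD = 100 * 1024


@dataclass
class StoreFiles:
    hfile: dict = field(default_factory=dict)      # row -> ("VAL", bytes) or ("REF", key)
    mob_hfile: dict = field(default_factory=dict)  # mob key -> large value


def flush(memstore: dict, files: StoreFiles) -> None:
    """Move memstore cells to files, splitting out large values."""
    for row, value in memstore.items():
        if len(value) >= MOB_THRESHOLD:
            mob_key = f"mob:{row}"
            files.mob_hfile[mob_key] = value
            files.hfile[row] = ("REF", mob_key)  # small reference cell
        else:
            files.hfile[row] = ("VAL", value)
    memstore.clear()


def read(row: str, files: StoreFiles) -> bytes:
    """Resolve a row, following a reference into the MOB file if needed."""
    kind, payload = files.hfile[row]
    return files.mob_hfile[payload] if kind == "REF" else payload


files = StoreFiles()
memstore = {"r1": b"x" * 10, "r2": b"y" * (5 * 1024 * 1024)}
flush(memstore, files)
print(sorted(files.mob_hfile))   # only the 5MB value was written as a MOB
print(len(read("r2", files)))    # the read follows the reference cell
```

Because the regular file holds only the reference, rewriting it does not move the 5MB payload in this toy model, which is the intuition behind the more predictable write amplification claimed on the next slide.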
Benefits
 Moves the MOBs out of the main I/O path to make write amplification more predictable.
 The same APIs are used to read and write MOBs.
 Works with the HBase export/copy table, bulk load, replication and snapshot features.
 Works with the HBase security mechanism.
[Figure: Data Insertion Average Latency (5MB/record, 32 pre-split regions). Latency (sec) at 125G / 500G / 1T: MOB Disabled = 8.098 / 10.159 / 10.700; MOB Enabled = 6.851 / 6.963 / 7.033.]
[Figure: Average Latency for R/W Mixed Workload (5MB/record, 32 pre-split regions, 300G pre-load, 200G insertion). Latency (sec) for Data Insertion / Data Random Get: MOB Disabled = 10.590 / 57.975; MOB Enabled = 6.212 / 33.886.]
[Figure: Data Insertion Average Latency, MOB Enabled vs. MOB Disabled. Latency (sec) over time (minute).]
[Figure: Data Random Get Average Latency, MOB Enabled vs. MOB Disabled. Latency over time (minute).]
Problem – Multi-Step Ops & Failures
DDL and other operations consist of multiple steps,
e.g. the Create Table Handler:
Create regions on FileSystem
Add regions to META
Assign
cpHost.postCreateTableHandler() -> (ACLs)
If we crash between steps,
we end up in a half-finished state,
e.g. present on the file system but not in META.
hbck MAY be able to repair it.
If we crash in the middle of a single step (e.g. creating N regions on the fs),
hbck does not have enough information to rebuild a correct state.
Manual intervention is required to repair the state.
Solution – Multi-Step Ops & Failures
Rewrite each operation to use a State-Machine,
e.g. the Create Table Handler:
Create regions on FileSystem
Add regions to META
Assign
cpHost.postCreateTableHandler() -> (ACLs)
...each executed step is written to a store.
If the machine goes down,
we know what was pending
and what should be rolled back,
or how to continue to complete the operation.
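The idea of logging each executed step so that a restart can either resume or roll back can be sketched as follows. This is a toy model, not the Procedure v2 implementation: `ProcedureStore`, `run`, `rollback` and the step names are hypothetical, and the sketch assumes each step is idempotent, since a crash in the middle of a step leaves that step unrecorded.

```python
import json

# Step names mirror the Create Table Handler steps listed on the slide.
STEPS = ["create_fs_layout", "add_to_meta", "assign", "post_create"]


class ProcedureStore:
    """Stand-in for a durable store: an append-only list of JSON records."""

    def __init__(self):
        self.log = []

    def append(self, proc_id, step, status):
        record = {"proc": proc_id, "step": step, "status": status}
        self.log.append(json.dumps(record))

    def completed_steps(self, proc_id):
        """Replay the log to find steps that are done and not yet undone."""
        done = []
        for line in self.log:
            rec = json.loads(line)
            if rec["proc"] != proc_id:
                continue
            if rec["status"] == "done":
                done.append(rec["step"])
            elif rec["status"] == "undone":
                done.remove(rec["step"])
        return done


def run(proc_id, store, actions, crash_before=None):
    """Execute the remaining steps in order, logging each after it succeeds."""
    done = store.completed_steps(proc_id)
    for step in STEPS:
        if step in done:
            continue  # already executed before the crash
        if step == crash_before:
            raise RuntimeError(f"simulated crash before {step}")
        actions.append(("do", step))
        store.append(proc_id, step, "done")


def rollback(proc_id, store, actions):
    """Undo the completed steps in reverse order."""
    for step in reversed(store.completed_steps(proc_id)):
        actions.append(("undo", step))
        store.append(proc_id, step, "undone")


store, actions = ProcedureStore(), []
try:
    run(1, store, actions, crash_before="assign")
except RuntimeError:
    pass
print(store.completed_steps(1))  # what was recorded before the crash
run(1, store, actions)           # a restarted process resumes from the log
print(store.completed_steps(1))
```

Calling `rollback` instead of the second `run` would undo the recorded steps in reverse order, which corresponds to the "what should be rolled back" branch on the slide.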
Procedure-v2/Notification-Bus
 The Procedure v2/NotificationBus aims to provide a unified way to build:
 Synchronous calls, with the ability to see the state/result in case of failure.
 Multi-step procedures with rollback/roll-forward ability in case of failure (e.g. create/delete table)
 Notifications across multiple machines (e.g. ACLs/Labels/Quota cache updates)
 Coordination of long-running/heavy procedures (e.g. compactions, splits, …)
 Procedures across multiple machines (e.g. Snapshots, Assignment)
 Replication for Master operations (e.g. grant/revoke)
Procedure-v2/Notification-Bus - Roadmap
 Apache HBase 1.1
 Fault tolerant Master Operations (e.g. create/delete/…)
 Sync Client (We are still wire compatible, both ways)
 Apache HBase 1.2
 Master WebUI
 Notification Bus, and at least Snapshot using it.
 Apache HBase 1.3+ or 2.0 (depending on how hard it is to keep Master/RS compatibility)
 Replace Cache Updates, Assignment Manager, Distributed Log Replay,…
 New Features: Coordinated compactions, Master ops Replication (e.g. grant/revoke)
ZK-based Region Assignment
 Region states could be inconsistent
 Assignment info stored in both meta table and ZooKeeper
 Both Master and RegionServer can update them
 Limited scalability and operations efficiency
 ZooKeeper events used for coordination
ZK-less Region Assignment
 RPC based
 Master, the true coordinator
 Only Master can update meta table
 All state changes are persisted
 Follow the state machine
 RegionServer does what the Master tells it
 Report status to Master
 Each step needs acknowledgement from Master
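A minimal sketch of the coordination pattern described above, where the Master is the only writer of region state and a RegionServer's report takes effect only when the Master acknowledges it. The `Master` class, the `ALLOWED` transition table and the reduced set of states are hypothetical simplifications for illustration, not HBase's actual state machine.

```python
# Simplified, hypothetical subset of region states for this sketch.
ALLOWED = {
    "OFFLINE": {"OPENING"},
    "OPENING": {"OPEN", "FAILED_OPEN"},
    "OPEN": {"CLOSING"},
    "CLOSING": {"CLOSED"},
    "CLOSED": {"OFFLINE"},
    "FAILED_OPEN": {"OFFLINE"},
}


class Master:
    """Sole writer of region state; every change is recorded in `meta`."""

    def __init__(self):
        self.meta = {}  # region -> (state, server)

    def assign(self, region, server):
        self._transition(region, "OPENING", server)
        return ("open", region)  # the request the Master would send by RPC

    def report(self, region, server, new_state):
        """Called by a RegionServer; returns True if the Master acknowledges."""
        state, owner = self.meta.get(region, ("OFFLINE", None))
        if owner != server or new_state not in ALLOWED[state]:
            return False
        self.meta[region] = (new_state, server)
        return True

    def _transition(self, region, new_state, server):
        state, _ = self.meta.get(region, ("OFFLINE", None))
        if new_state not in ALLOWED[state]:
            raise ValueError(f"{region}: {state} -> {new_state} not allowed")
        self.meta[region] = (new_state, server)


master = Master()
master.assign("region-a", "rs1")
print(master.report("region-a", "rs2", "OPEN"))  # wrong server: not acknowledged
print(master.report("region-a", "rs1", "OPEN"))  # valid step: acknowledged
print(master.meta["region-a"])
```

Keeping a single writer means there is one place where an invalid or out-of-order transition can be rejected, at the cost of putting the Master on the critical path.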
Current Status
 Off by default in 1.0
 Impact
 Master is in the critical path
 Meta should be co-located with Master
 Procedure V2 could solve it (future work)
 Deployment topology change
 Master is a RegionServer, serves small system tables
 Blog post has more info
 https://blogs.apache.org/hbase/entry/hbase_zk_less_region_assignment
HBase Semantic Versioning
The Return to Sanity
Client version? Server version? Hadoop version? HFile version?
Protobufs? Binary compatibility? Client/Server compatibility?
ARRGGHHH.
Should be SIMPLE!
Semantic Versioning Makes Things Simple
HBase <Major>.<Minor>.<Patch>
 MAJOR version when you make incompatible API changes
 MINOR version when you add backwards-compatible functionality
 PATCH version when you make backwards-compatible bug fixes
We are adopting this starting with HBase 1.0
Compatibility Dimensions
(the long version)
 Client-Server wire protocol compatibility
 Server-Server protocol compatibility
 File format compatibility
 Client API compatibility
 Client Binary compatibility
 Server-Side Limited API compatibility (taken from Hadoop)
 Dependency Compatibility
 Operational Compatibility
TL;DR:
 A patch upgrade is a drop-in replacement
 A minor upgrade requires no application or client code
modification
 A major upgrade allows us - the HBase community - to make
breaking changes.
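The three rules above can be expressed as a small check on version strings. This is a sketch under stated assumptions: `parse`, `upgrade_kind` and the `EXPECTATION` table are hypothetical helpers written for this example, the version numbers are illustrative, and only plain `<Major>.<Minor>.<Patch>` strings are handled.

```python
def parse(version: str) -> tuple:
    """Split '<Major>.<Minor>.<Patch>' into a tuple of three integers."""
    major, minor, patch = (int(part) for part in version.split("."))
    return major, minor, patch


def upgrade_kind(current: str, target: str) -> str:
    """Classify an upgrade by the most significant component that changes."""
    cur, tgt = parse(current), parse(target)
    if tgt <= cur:
        raise ValueError("target must be newer than current")
    if tgt[0] != cur[0]:
        return "major"
    if tgt[1] != cur[1]:
        return "minor"
    return "patch"


# What each kind of upgrade promises, paraphrasing the TL;DR slide.
EXPECTATION = {
    "patch": "drop-in replacement",
    "minor": "no application or client code modification",
    "major": "breaking changes are allowed",
}

for cur, tgt in [("1.0.0", "1.0.1"), ("1.0.1", "1.1.0"), ("1.1.0", "2.0.0")]:
    kind = upgrade_kind(cur, tgt)
    print(f"{cur} -> {tgt}: {kind} ({EXPECTATION[kind]})")
```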
Simple
Thanks
http://semver.org/
http://hbase.apache.org/book.html#hbase.versioning
Improving read availability
 HBase is CP
 When a node goes down, some regions are unavailable until
recovery
 Some classes of applications want high availability (for reads)
 Region replicas
 TIMELINE consistency reads
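The trade-off behind TIMELINE reads can be shown with a toy model. In the HBase Java client the equivalent request is made with `Get.setConsistency(Consistency.TIMELINE)` and the result is checked with `Result.isStale()`; the Python classes below (`Replica`, `ReplicatedRegion`) are hypothetical and only simulate the behaviour. As a simplification, this sketch falls back to a secondary replica only when the primary is marked down.

```python
class Replica:
    def __init__(self):
        self.data = {}
        self.up = True


class ReplicatedRegion:
    """One primary (index 0) plus read-only secondary replicas."""

    def __init__(self, n_replicas=3):
        self.replicas = [Replica() for _ in range(n_replicas)]
        self.pending = []  # edits not yet shipped to the secondaries

    def put(self, key, value):
        primary = self.replicas[0]
        if not primary.up:
            raise RuntimeError("primary unavailable: writes are not served")
        primary.data[key] = value
        self.pending.append((key, value))

    def ship_edits(self):
        """Apply pending edits to the secondaries, in write order."""
        for key, value in self.pending:
            for replica in self.replicas[1:]:
                replica.data[key] = value
        self.pending.clear()

    def get(self, key, timeline=False):
        """Return (value, stale). Strong reads only ever use the primary."""
        primary = self.replicas[0]
        if primary.up:
            return primary.data.get(key), False
        if not timeline:
            raise RuntimeError("primary unavailable")
        for replica in self.replicas[1:]:
            if replica.up:
                return replica.data.get(key), True
        raise RuntimeError("no replica available")


region = ReplicatedRegion()
region.put("k", "v1")
region.ship_edits()
region.put("k", "v2")          # not yet shipped to the secondaries
region.replicas[0].up = False  # the primary's node goes down
print(region.get("k", timeline=True))  # served by a secondary, flagged stale
```

The read stays available while the primary is down, but it returns the older value together with a stale flag, which is the contract an application accepts when it opts into TIMELINE consistency.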
Phase contents
 Phase 1
 Region replicas
 Stale data up to minutes (15 min)
 in 1.0
 Phase 2
 millisecond-latencies for staleness (WAL replication)
 Replicas for the meta table
 Region splits and merges with region replicas
 Scan support
 In 1.1
[Diagram: RegionServer 1 hosts Region1, Region2 and Region3. Writes are appended to the WAL, which ReplicaReplication tails. Flush/Compaction writes hfiles to HDFS.]
[Diagram: same as above, with Region2 (replica) on RegionServer 15 and Region1 (replica) on RegionServer 20. ReplicaReplication tails the WAL on RegionServer 1 and the edits are replayed on the replicas, which also read the flushed files from HDFS.]
Pluggable WAL Replication
 Pluggable WAL replication endpoint
 You can write your own replicators!
 Similar to co-processors (runs in the same RS process)
hbase> add_peer 'my_peer',
  ENDPOINT_CLASSNAME => 'org.hbase.MyReplicationEndpoint',
  DATA => { "key1" => 1 },
  CONFIG => { "config1" => "value1", "config2" => "value2" }
Workload Throughput
Distributed work will eventually be limited by one of
• CPU
• Disk IO
• Network IO
HBase Under (synthetic) Load Now
Not CPU Bound
Not Disk Bound
Not Network Bound
Modest Gain: Multiple WALs
 All regions write to one write-ahead log (WAL) file.
 Idea: Let’s have multiple write-ahead logs so that we can write more in parallel.
 Follow-up work:
 In the limit, if we were on SSDs, we could have one WAL per region.
[Diagram: two panels, each showing an RS with regions 1, 2 and 3 and the DN disks; some disks are marked IDLE.]
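One way to picture the multiple-WAL idea is as a stable mapping from regions to a fixed number of logs. The sketch below is an assumption made for illustration: `wal_for_region` and `group_regions` are hypothetical helpers, and hashing the region name is just one possible grouping strategy, not necessarily the one HBase uses.

```python
import zlib
from collections import defaultdict


def wal_for_region(region_name: str, num_wals: int) -> int:
    """Pick a WAL index for a region; stable across calls for the same name."""
    return zlib.crc32(region_name.encode("utf-8")) % num_wals


def group_regions(regions, num_wals):
    """Group region names by the WAL index they would write to."""
    groups = defaultdict(list)
    for region in regions:
        groups[wal_for_region(region, num_wals)].append(region)
    return dict(groups)


regions = [f"region-{i}" for i in range(1, 9)]
print(group_regions(regions, 1))  # single WAL: every region shares index 0
print(group_regions(regions, 3))  # regions spread over at most three WALs
```

With one WAL all appends are serialized through a single file; with several, appends from regions in different groups can proceed in parallel on different disks.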
Future Solutions
• Alternative WAL providers
• Read path optimizations based on profiling
• Better tuning
Thanks!

More Related Content

What's hot

Meet HBase 1.0
Meet HBase 1.0Meet HBase 1.0
Meet HBase 1.0enissoz
 
HBaseCon 2012 | Learning HBase Internals - Lars Hofhansl, Salesforce
HBaseCon 2012 | Learning HBase Internals - Lars Hofhansl, SalesforceHBaseCon 2012 | Learning HBase Internals - Lars Hofhansl, Salesforce
HBaseCon 2012 | Learning HBase Internals - Lars Hofhansl, SalesforceCloudera, Inc.
 
HBaseCon 2015: HBase at Scale in an Online and High-Demand Environment
HBaseCon 2015: HBase at Scale in an Online and  High-Demand EnvironmentHBaseCon 2015: HBase at Scale in an Online and  High-Demand Environment
HBaseCon 2015: HBase at Scale in an Online and High-Demand EnvironmentHBaseCon
 
hbaseconasia2017: Large scale data near-line loading method and architecture
hbaseconasia2017: Large scale data near-line loading method and architecturehbaseconasia2017: Large scale data near-line loading method and architecture
hbaseconasia2017: Large scale data near-line loading method and architectureHBaseCon
 
HBase: Where Online Meets Low Latency
HBase: Where Online Meets Low LatencyHBase: Where Online Meets Low Latency
HBase: Where Online Meets Low LatencyHBaseCon
 
Real-time HBase: Lessons from the Cloud
Real-time HBase: Lessons from the CloudReal-time HBase: Lessons from the Cloud
Real-time HBase: Lessons from the CloudHBaseCon
 
HBaseCon 2015: HBase Performance Tuning @ Salesforce
HBaseCon 2015: HBase Performance Tuning @ SalesforceHBaseCon 2015: HBase Performance Tuning @ Salesforce
HBaseCon 2015: HBase Performance Tuning @ SalesforceHBaseCon
 
HBaseCon 2013: How to Get the MTTR Below 1 Minute and More
HBaseCon 2013: How to Get the MTTR Below 1 Minute and MoreHBaseCon 2013: How to Get the MTTR Below 1 Minute and More
HBaseCon 2013: How to Get the MTTR Below 1 Minute and MoreCloudera, Inc.
 
HBaseCon2017 Removable singularity: a story of HBase upgrade in Pinterest
HBaseCon2017 Removable singularity: a story of HBase upgrade in PinterestHBaseCon2017 Removable singularity: a story of HBase upgrade in Pinterest
HBaseCon2017 Removable singularity: a story of HBase upgrade in PinterestHBaseCon
 
HBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseHBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseenissoz
 
Operating and supporting HBase Clusters
Operating and supporting HBase ClustersOperating and supporting HBase Clusters
Operating and supporting HBase Clustersenissoz
 
hbaseconasia2017: HBase在Hulu的使用和实践
hbaseconasia2017: HBase在Hulu的使用和实践hbaseconasia2017: HBase在Hulu的使用和实践
hbaseconasia2017: HBase在Hulu的使用和实践HBaseCon
 
HBase 0.20.0 Performance Evaluation
HBase 0.20.0 Performance EvaluationHBase 0.20.0 Performance Evaluation
HBase 0.20.0 Performance EvaluationSchubert Zhang
 
HBase Read High Availability Using Timeline Consistent Region Replicas
HBase  Read High Availability Using Timeline Consistent Region ReplicasHBase  Read High Availability Using Timeline Consistent Region Replicas
HBase Read High Availability Using Timeline Consistent Region Replicasenissoz
 
HBaseCon 2012 | Gap Inc Direct: Serving Apparel Catalog from HBase for Live W...
HBaseCon 2012 | Gap Inc Direct: Serving Apparel Catalog from HBase for Live W...HBaseCon 2012 | Gap Inc Direct: Serving Apparel Catalog from HBase for Live W...
HBaseCon 2012 | Gap Inc Direct: Serving Apparel Catalog from HBase for Live W...Cloudera, Inc.
 
hbaseconasia2017: hbase-2.0.0
hbaseconasia2017: hbase-2.0.0hbaseconasia2017: hbase-2.0.0
hbaseconasia2017: hbase-2.0.0HBaseCon
 
Meet hbase 2.0
Meet hbase 2.0Meet hbase 2.0
Meet hbase 2.0enissoz
 
Hug Hbase Presentation.
Hug Hbase Presentation.Hug Hbase Presentation.
Hug Hbase Presentation.Jack Levin
 
Apache HBase Performance Tuning
Apache HBase Performance TuningApache HBase Performance Tuning
Apache HBase Performance TuningLars Hofhansl
 

What's hot (19)

Meet HBase 1.0
Meet HBase 1.0Meet HBase 1.0
Meet HBase 1.0
 
HBaseCon 2012 | Learning HBase Internals - Lars Hofhansl, Salesforce
HBaseCon 2012 | Learning HBase Internals - Lars Hofhansl, SalesforceHBaseCon 2012 | Learning HBase Internals - Lars Hofhansl, Salesforce
HBaseCon 2012 | Learning HBase Internals - Lars Hofhansl, Salesforce
 
HBaseCon 2015: HBase at Scale in an Online and High-Demand Environment
HBaseCon 2015: HBase at Scale in an Online and  High-Demand EnvironmentHBaseCon 2015: HBase at Scale in an Online and  High-Demand Environment
HBaseCon 2015: HBase at Scale in an Online and High-Demand Environment
 
hbaseconasia2017: Large scale data near-line loading method and architecture
hbaseconasia2017: Large scale data near-line loading method and architecturehbaseconasia2017: Large scale data near-line loading method and architecture
hbaseconasia2017: Large scale data near-line loading method and architecture
 
HBase: Where Online Meets Low Latency
HBase: Where Online Meets Low LatencyHBase: Where Online Meets Low Latency
HBase: Where Online Meets Low Latency
 
Real-time HBase: Lessons from the Cloud
Real-time HBase: Lessons from the CloudReal-time HBase: Lessons from the Cloud
Real-time HBase: Lessons from the Cloud
 
HBaseCon 2015: HBase Performance Tuning @ Salesforce
HBaseCon 2015: HBase Performance Tuning @ SalesforceHBaseCon 2015: HBase Performance Tuning @ Salesforce
HBaseCon 2015: HBase Performance Tuning @ Salesforce
 
HBaseCon 2013: How to Get the MTTR Below 1 Minute and More
HBaseCon 2013: How to Get the MTTR Below 1 Minute and MoreHBaseCon 2013: How to Get the MTTR Below 1 Minute and More
HBaseCon 2013: How to Get the MTTR Below 1 Minute and More
 
HBaseCon2017 Removable singularity: a story of HBase upgrade in Pinterest
HBaseCon2017 Removable singularity: a story of HBase upgrade in PinterestHBaseCon2017 Removable singularity: a story of HBase upgrade in Pinterest
HBaseCon2017 Removable singularity: a story of HBase upgrade in Pinterest
 
HBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBaseHBase and HDFS: Understanding FileSystem Usage in HBase
HBase and HDFS: Understanding FileSystem Usage in HBase
 
Operating and supporting HBase Clusters
Operating and supporting HBase ClustersOperating and supporting HBase Clusters
Operating and supporting HBase Clusters
 
hbaseconasia2017: HBase在Hulu的使用和实践
hbaseconasia2017: HBase在Hulu的使用和实践hbaseconasia2017: HBase在Hulu的使用和实践
hbaseconasia2017: HBase在Hulu的使用和实践
 
HBase 0.20.0 Performance Evaluation
HBase 0.20.0 Performance EvaluationHBase 0.20.0 Performance Evaluation
HBase 0.20.0 Performance Evaluation
 
HBase Read High Availability Using Timeline Consistent Region Replicas
HBase  Read High Availability Using Timeline Consistent Region ReplicasHBase  Read High Availability Using Timeline Consistent Region Replicas
HBase Read High Availability Using Timeline Consistent Region Replicas
 
HBaseCon 2012 | Gap Inc Direct: Serving Apparel Catalog from HBase for Live W...
HBaseCon 2012 | Gap Inc Direct: Serving Apparel Catalog from HBase for Live W...HBaseCon 2012 | Gap Inc Direct: Serving Apparel Catalog from HBase for Live W...
HBaseCon 2012 | Gap Inc Direct: Serving Apparel Catalog from HBase for Live W...
 
hbaseconasia2017: hbase-2.0.0
hbaseconasia2017: hbase-2.0.0hbaseconasia2017: hbase-2.0.0
hbaseconasia2017: hbase-2.0.0
 
Meet hbase 2.0
Meet hbase 2.0Meet hbase 2.0
Meet hbase 2.0
 
Hug Hbase Presentation.
Hug Hbase Presentation.Hug Hbase Presentation.
Hug Hbase Presentation.
 
Apache HBase Performance Tuning
Apache HBase Performance TuningApache HBase Performance Tuning
Apache HBase Performance Tuning
 

Similar to HBaseCon 2015: HBase 2.0 and Beyond Panel

HBaseCon 2015: Meet HBase 1.0
HBaseCon 2015: Meet HBase 1.0HBaseCon 2015: Meet HBase 1.0
HBaseCon 2015: Meet HBase 1.0HBaseCon
 
Meet HBase 2.0
Meet HBase 2.0Meet HBase 2.0
Meet HBase 2.0enissoz
 
Operating and Supporting Apache HBase Best Practices and Improvements
Operating and Supporting Apache HBase Best Practices and ImprovementsOperating and Supporting Apache HBase Best Practices and Improvements
Operating and Supporting Apache HBase Best Practices and ImprovementsDataWorks Summit/Hadoop Summit
 
Clusters (Distributed computing)
Clusters (Distributed computing)Clusters (Distributed computing)
Clusters (Distributed computing)Sri Prasanna
 
Baidu cloudfoundry english
Baidu cloudfoundry englishBaidu cloudfoundry english
Baidu cloudfoundry englishJames Watters
 
Membase Meetup 2010
Membase Meetup 2010Membase Meetup 2010
Membase Meetup 2010Membase
 
Apache hadoop 3.x state of the union and upgrade guidance - Strata 2019 NY
Apache hadoop 3.x state of the union and upgrade guidance - Strata 2019 NYApache hadoop 3.x state of the union and upgrade guidance - Strata 2019 NY
Apache hadoop 3.x state of the union and upgrade guidance - Strata 2019 NYWangda Tan
 
Splice Machine Overview
Splice Machine OverviewSplice Machine Overview
Splice Machine OverviewKunal Gupta
 
Clustered Architecture Patterns Delivering Scalability And Availability
Clustered Architecture Patterns Delivering Scalability And AvailabilityClustered Architecture Patterns Delivering Scalability And Availability
Clustered Architecture Patterns Delivering Scalability And AvailabilityConSanFrancisco123
 
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0DataWorks Summit
 
How To Set Up SQL Load Balancing with HAProxy - Slides
How To Set Up SQL Load Balancing with HAProxy - SlidesHow To Set Up SQL Load Balancing with HAProxy - Slides
How To Set Up SQL Load Balancing with HAProxy - SlidesSeveralnines
 
Hadoop World 2011: Hadoop Network and Compute Architecture Considerations - J...
Hadoop World 2011: Hadoop Network and Compute Architecture Considerations - J...Hadoop World 2011: Hadoop Network and Compute Architecture Considerations - J...
Hadoop World 2011: Hadoop Network and Compute Architecture Considerations - J...Cloudera, Inc.
 
MySQL HA with PaceMaker
MySQL HA with  PaceMakerMySQL HA with  PaceMaker
MySQL HA with PaceMakerKris Buytaert
 
Scaling wix with microservices architecture devoxx London 2015
Scaling wix with microservices architecture devoxx London 2015Scaling wix with microservices architecture devoxx London 2015
Scaling wix with microservices architecture devoxx London 2015Aviran Mordo
 

Similar to HBaseCon 2015: HBase 2.0 and Beyond Panel (20)

HBaseCon 2015: Meet HBase 1.0
HBaseCon 2015: Meet HBase 1.0HBaseCon 2015: Meet HBase 1.0
HBaseCon 2015: Meet HBase 1.0
 
Meet Apache HBase - 2.0
Meet Apache HBase - 2.0Meet Apache HBase - 2.0
Meet Apache HBase - 2.0
 
Meet HBase 2.0
Meet HBase 2.0Meet HBase 2.0
Meet HBase 2.0
 
PROSE
PROSEPROSE
PROSE
 
Operating and Supporting Apache HBase Best Practices and Improvements
Operating and Supporting Apache HBase Best Practices and ImprovementsOperating and Supporting Apache HBase Best Practices and Improvements
Operating and Supporting Apache HBase Best Practices and Improvements
 
Clusters (Distributed computing)
Clusters (Distributed computing)Clusters (Distributed computing)
Clusters (Distributed computing)
 
Baidu cloudfoundry english
Baidu cloudfoundry englishBaidu cloudfoundry english
Baidu cloudfoundry english
 
Membase Meetup 2010
Membase Meetup 2010Membase Meetup 2010
Membase Meetup 2010
 
Apache hadoop 3.x state of the union and upgrade guidance - Strata 2019 NY
Apache hadoop 3.x state of the union and upgrade guidance - Strata 2019 NYApache hadoop 3.x state of the union and upgrade guidance - Strata 2019 NY
Apache hadoop 3.x state of the union and upgrade guidance - Strata 2019 NY
 
Splice Machine Overview
Splice Machine OverviewSplice Machine Overview
Splice Machine Overview
 
3487570
34875703487570
3487570
 
Clustered Architecture Patterns Delivering Scalability And Availability
Clustered Architecture Patterns Delivering Scalability And AvailabilityClustered Architecture Patterns Delivering Scalability And Availability
Clustered Architecture Patterns Delivering Scalability And Availability
 
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0
 
How To Set Up SQL Load Balancing with HAProxy - Slides
How To Set Up SQL Load Balancing with HAProxy - SlidesHow To Set Up SQL Load Balancing with HAProxy - Slides
How To Set Up SQL Load Balancing with HAProxy - Slides
 
slides (PPT)
slides (PPT)slides (PPT)
slides (PPT)
 
Hadoop World 2011: Hadoop Network and Compute Architecture Considerations - J...
Hadoop World 2011: Hadoop Network and Compute Architecture Considerations - J...Hadoop World 2011: Hadoop Network and Compute Architecture Considerations - J...
Hadoop World 2011: Hadoop Network and Compute Architecture Considerations - J...
 
linuxcluster.ppt
linuxcluster.pptlinuxcluster.ppt
linuxcluster.ppt
 
Open ebs 101
Open ebs 101Open ebs 101
Open ebs 101
 
MySQL HA with PaceMaker
MySQL HA with  PaceMakerMySQL HA with  PaceMaker
MySQL HA with PaceMaker
 
Scaling wix with microservices architecture devoxx London 2015
Scaling wix with microservices architecture devoxx London 2015Scaling wix with microservices architecture devoxx London 2015
Scaling wix with microservices architecture devoxx London 2015
 

More from HBaseCon

hbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kuberneteshbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
hbaseconasia2017: Building online HBase cluster of Zhihu based on KubernetesHBaseCon
 
hbaseconasia2017: HBase on Beam
hbaseconasia2017: HBase on Beamhbaseconasia2017: HBase on Beam
hbaseconasia2017: HBase on BeamHBaseCon
 
hbaseconasia2017: HBase Disaster Recovery Solution at Huawei
hbaseconasia2017: HBase Disaster Recovery Solution at Huaweihbaseconasia2017: HBase Disaster Recovery Solution at Huawei
hbaseconasia2017: HBase Disaster Recovery Solution at HuaweiHBaseCon
 
hbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinterest
hbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinteresthbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinterest
hbaseconasia2017: Removable singularity: a story of HBase upgrade in PinterestHBaseCon
 
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程HBaseCon
 
hbaseconasia2017: Apache HBase at Netease
hbaseconasia2017: Apache HBase at Neteasehbaseconasia2017: Apache HBase at Netease
hbaseconasia2017: Apache HBase at NeteaseHBaseCon
 
hbaseconasia2017: 基于HBase的企业级大数据平台
hbaseconasia2017: 基于HBase的企业级大数据平台hbaseconasia2017: 基于HBase的企业级大数据平台
hbaseconasia2017: 基于HBase的企业级大数据平台HBaseCon
 
hbaseconasia2017: HBase at JD.com
hbaseconasia2017: HBase at JD.comhbaseconasia2017: HBase at JD.com
hbaseconasia2017: HBase at JD.comHBaseCon
 
hbaseconasia2017: Ecosystems with HBase and CloudTable service at Huawei
hbaseconasia2017: Ecosystems with HBase and CloudTable service at Huaweihbaseconasia2017: Ecosystems with HBase and CloudTable service at Huawei
hbaseconasia2017: Ecosystems with HBase and CloudTable service at HuaweiHBaseCon
 
hbaseconasia2017: HBase Practice At XiaoMi
hbaseconasia2017: HBase Practice At XiaoMihbaseconasia2017: HBase Practice At XiaoMi
hbaseconasia2017: HBase Practice At XiaoMiHBaseCon
 
HBaseCon2017 Democratizing HBase
HBaseCon2017 Democratizing HBaseHBaseCon2017 Democratizing HBase
HBaseCon2017 Democratizing HBaseHBaseCon
 
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBase
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBaseHBaseCon2017 Quanta: Quora's hierarchical counting system on HBase
HBaseCon2017 Quanta: Quora's hierarchical counting system on HBaseHBaseCon
 
HBaseCon2017 Transactions in HBase
HBaseCon2017 Transactions in HBaseHBaseCon2017 Transactions in HBase
HBaseCon2017 Transactions in HBaseHBaseCon
 
HBaseCon2017 Highly-Available HBase
HBaseCon2017 Highly-Available HBaseHBaseCon2017 Highly-Available HBase
HBaseCon2017 Highly-Available HBaseHBaseCon
 
HBaseCon2017 Apache HBase at Didi
HBaseCon2017 Apache HBase at DidiHBaseCon2017 Apache HBase at Didi
HBaseCon2017 Apache HBase at DidiHBaseCon
 
HBaseCon2017 gohbase: Pure Go HBase Client
HBaseCon2017 gohbase: Pure Go HBase ClientHBaseCon2017 gohbase: Pure Go HBase Client
HBaseCon2017 gohbase: Pure Go HBase ClientHBaseCon
 
HBaseCon2017 Improving HBase availability in a multi tenant environment
HBaseCon2017 Improving HBase availability in a multi tenant environmentHBaseCon2017 Improving HBase availability in a multi tenant environment
HBaseCon2017 Improving HBase availability in a multi tenant environmentHBaseCon
 
HBaseCon2017 Spark HBase Connector: Feature Rich and Efficient Access to HBas...
HBaseCon2017 Spark HBase Connector: Feature Rich and Efficient Access to HBas...HBaseCon2017 Spark HBase Connector: Feature Rich and Efficient Access to HBas...
HBaseCon2017 Spark HBase Connector: Feature Rich and Efficient Access to HBas...HBaseCon
 
HBaseCon2017 Efficient and portable data processing with Apache Beam and HBase
HBaseCon2017 Efficient and portable data processing with Apache Beam and HBaseHBaseCon2017 Efficient and portable data processing with Apache Beam and HBase
HBaseCon2017 Efficient and portable data processing with Apache Beam and HBaseHBaseCon
 
HBaseCon2017 HBase at Xiaomi
HBaseCon2017 HBase at XiaomiHBaseCon2017 HBase at Xiaomi
HBaseCon2017 HBase at XiaomiHBaseCon
 

More from HBaseCon (20)

hbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kuberneteshbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
hbaseconasia2017: Building online HBase cluster of Zhihu based on Kubernetes
 
hbaseconasia2017: HBase on Beam
hbaseconasia2017: HBase on Beamhbaseconasia2017: HBase on Beam
hbaseconasia2017: HBase on Beam
 
hbaseconasia2017: HBase Disaster Recovery Solution at Huawei
hbaseconasia2017: HBase Disaster Recovery Solution at Huaweihbaseconasia2017: HBase Disaster Recovery Solution at Huawei
hbaseconasia2017: HBase Disaster Recovery Solution at Huawei
 
hbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinterest
hbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinteresthbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinterest
hbaseconasia2017: Removable singularity: a story of HBase upgrade in Pinterest
 
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
hbaseconasia2017: HareQL:快速HBase查詢工具的發展過程
 
hbaseconasia2017: Apache HBase at Netease
hbaseconasia2017: Apache HBase at Neteasehbaseconasia2017: Apache HBase at Netease
hbaseconasia2017: Apache HBase at Netease
 
hbaseconasia2017: 基于HBase的企业级大数据平台
hbaseconasia2017: 基于HBase的企业级大数据平台hbaseconasia2017: 基于HBase的企业级大数据平台
hbaseconasia2017: 基于HBase的企业级大数据平台
 
hbaseconasia2017: HBase at JD.com
hbaseconasia2017: HBase at JD.comhbaseconasia2017: HBase at JD.com
hbaseconasia2017: HBase at JD.com
 
hbaseconasia2017: Ecosystems with HBase and CloudTable service at Huawei
hbaseconasia2017: Ecosystems with HBase and CloudTable service at Huaweihbaseconasia2017: Ecosystems with HBase and CloudTable service at Huawei
hbaseconasia2017: Ecosystems with HBase and CloudTable service at Huawei
 
hbaseconasia2017: HBase Practice At XiaoMi


HBaseCon 2015: HBase 2.0 and Beyond Panel

  • 1. hbasecon.com HBase 2.0 and Beyond Panel
      Moderator: Jonathan Hsieh
      Panel: Matteo Bertozzi / Sean Busbey / Jingcheng Du / Lars Hofhansl / Enis Soztutar / Jimmy Xiang
  • 2. Who are we?
      • Matteo Bertozzi – HBase PMC, Cloudera
      • Sean Busbey – HBase PMC, Cloudera
      • Jingcheng Du – Intel
      • Lars Hofhansl – HBase PMC, 0.94.x RM, Salesforce.com
      • Jonathan Hsieh – HBase PMC
      • Enis Soztutar – HBase PMC, 1.0.0 RM, Hortonworks
      • Jimmy Xiang – HBase PMC, Cloudera
  • 3. Outline
      • Storing Larger Objects efficiently
      • Making DDL Operations fault tolerant
      • Better Region Assignment
      • Compatibility guarantees for our users
      • Improving Availability
      • Using all machine resources
      • Q+A
  • 5. Why Moderate Object Storage (MOB)?
      • There is growing demand for storing moderate objects (MOBs), from 100KB up to 10MB, in HBase.
      • Because of the write amplification created by compactions, write performance degrades as massive MOBs accumulate in HBase.
      • Too many store files -> Frequent region compactions -> Massive I/O -> Slow compactions -> Flush delay -> High memory usage -> Blocking updates
      [Figure: Data Insertion Average Latency (5MB/record, 32 pre-split regions). Latency (sec) by data volume: 125G: 8.098, 500G: 10.159, 1T: 10.700.]
      [Figure: 1T Data Insertion Average Latency (5MB/record, 32 pre-split regions). Latency (sec) over time (hours 1 to 8).]
  • 6. How MOB I/O works
      [Diagram: Write Path: Client -> HRegionServer, where the MOB cell goes to the HLog and the memstore; on Flush, the MOB cell is written to a MOB HFile and a Ref cell is written to the HFile. Read Path: Client -> HRegionServer, which reads the memstore, the HFile (Ref cell), and the MOB HFile (MOB cell).]
  • 7. Benefits
      • Moves the MOBs out of the main I/O path to make the write amplification more predictable.
      • The same APIs are used to read and write MOBs.
      • Works with HBase export/copy table, bulk load, replication and snapshot features.
      • Works with the HBase security mechanism.
      [Figure: Data Insertion Average Latency (5MB/record, 32 pre-split regions). Latency (sec) by data volume. MOB Disabled: 125G: 8.098, 500G: 10.159, 1T: 10.700. MOB Enabled: 125G: 6.851, 500G: 6.963, 1T: 7.033.]
      [Figure: Average Latency for R/W Mixed Workload (5MB/record, 32 pre-split regions, 300G pre-load, 200G insertion). Latency (sec). MOB Disabled: Data Insertion 10.590, Data Random Get 57.975. MOB Enabled: Data Insertion 6.212, Data Random Get 33.886.]
      [Figure: Data Insertion Average Latency over time (minutes 10 to 60), MOB Enabled vs. MOB Disabled.]
      [Figure: Data Random Get Average Latency over time (minutes 10 to 60), MOB Enabled vs. MOB Disabled.]
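The write and read paths on slide 6 can be sketched with a toy in-memory model. This is only an illustration and not HBase code: the class `ToyMobStore`, the 100KB threshold constant, and the dictionary-backed "files" are all assumptions made for the example. Values at or above the threshold go to a separate MOB store and only a small reference cell lands in the main store; reads resolve the reference, so callers use the same `put`/`get` calls either way.

```python
# Toy model of the MOB idea: large values live outside the main store.
# All names here are hypothetical; this is not the HBase implementation.

MOB_THRESHOLD = 100 * 1024  # bytes; assumed cutoff for a "moderate object"


class ToyMobStore:
    def __init__(self, threshold=MOB_THRESHOLD):
        self.threshold = threshold
        self.main = {}  # row -> ("value", bytes) or ("ref", mob_key)
        self.mob = {}   # mob_key -> bytes (stands in for MOB HFiles)

    def put(self, row, value):
        # Drop any MOB value previously stored for this row.
        old = self.main.get(row)
        if old is not None and old[0] == "ref":
            self.mob.pop(old[1], None)
        if len(value) >= self.threshold:
            mob_key = "mob-" + row
            self.mob[mob_key] = value          # large value goes to the MOB store
            self.main[row] = ("ref", mob_key)  # main store keeps a small ref cell
        else:
            self.main[row] = ("value", value)

    def get(self, row):
        kind, payload = self.main[row]
        # Same read API for both kinds of cell: refs are resolved here.
        return self.mob[payload] if kind == "ref" else payload

    def main_store_bytes(self):
        # Bytes a compaction of the main store would have to rewrite.
        return sum(len(payload) for _, payload in self.main.values())


store = ToyMobStore()
store.put("small", b"x" * 10)
store.put("large", b"y" * (5 * 1024 * 1024))  # 5MB record, as in the benchmarks
```

In this model the main store holds only a few bytes for the 5MB record, which is the intuition behind the "more predictable write amplification" benefit: compacting the main store no longer rewrites the large values.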
  • 9. Problem – Multi-Step Ops & Failures
      • DDL and other operations consist of multiple steps, e.g. Create Table Handler:
          • Create regions on FileSystem
          • Add regions to META
          • Assign
          • cpHost.postCreateTableHandler() -> (ACLs)
      • If we crash between steps, we end up in a half-done state (e.g. File-System present, META not present). hbck MAY be able to repair it.
      • If we crash in the middle of a single step (e.g. create N regions on fs), hbck does not have enough information to rebuild a correct state, and repairing the state requires manual intervention.
  • 10. Solution – Multi-Step Ops & Failures
      • Rewrite each operation to use a State-Machine, e.g. Create Table Handler:
          • Create regions on FileSystem
          • Add regions to META
          • Assign
          • cpHost.postCreateTableHandler() -> (ACLs)
      • Each executed step is written to a store.
      • If the machine goes down, we know what was pending and what should be rolled back, or how to continue to complete the operation.
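The state-machine approach on slide 10 can be illustrated with a minimal sketch. Everything here is hypothetical: the step names paraphrase the slide, `ToyProcedure` is an invented class, and a plain Python list stands in for the durable store. It shows only the core idea, that recording each completed step lets a restarted master see what was pending and either roll forward or roll back.

```python
# Toy state machine for a multi-step operation with a persisted step log.
# Hypothetical names throughout; HBase's Procedure v2 is far more involved.

CREATE_TABLE_STEPS = [
    "create_regions_on_fs",
    "add_regions_to_meta",
    "assign_regions",
    "post_create_hook",
]


class ToyProcedure:
    def __init__(self, steps, store):
        self.steps = steps
        self.store = store  # list standing in for a durable procedure store

    def run(self, execute, crash_before=None):
        # Resume from the first step that is not yet recorded as done.
        for step in self.steps[len(self.store):]:
            if step == crash_before:
                raise RuntimeError("simulated crash before " + step)
            execute(step)
            self.store.append(step)  # persist only after the step succeeds

    def rollback(self, undo):
        # Undo completed steps in reverse order, clearing the log as we go.
        while self.store:
            undo(self.store.pop())


durable_log = []
executed = []

try:
    ToyProcedure(CREATE_TABLE_STEPS, durable_log).run(
        executed.append, crash_before="assign_regions")
except RuntimeError:
    pass  # the "master" went down mid-operation

pending = CREATE_TABLE_STEPS[len(durable_log):]

# A new master reads the log and rolls forward from where the old one stopped.
ToyProcedure(CREATE_TABLE_STEPS, durable_log).run(executed.append)
```

Note that a crash between executing a step and recording it would cause that step to run again on recovery, so in a design like this each step needs to be idempotent.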
  • 11. Procedure-v2/Notification-Bus
      • The Procedure v2/NotificationBus aims to provide a unified way to build:
      • Synchronous calls, with the ability to see the state/result in case of failure.
      • Multi-step procedures with a rollback/roll-forward ability in case of failure (e.g. create/delete table)
      • Notifications across multiple machines (e.g. ACLs/Labels/Quota cache updates)
      • Coordination of long-running/heavy procedures (e.g. compactions, splits, …)
      • Procedures across multiple machines (e.g. Snapshots, Assignment)
      • Replication for Master operations (e.g. grant/revoke)
  • 12. Procedure-v2/Notification-Bus - Roadmap
      • Apache HBase 1.1
          • Fault tolerant Master Operations (e.g. create/delete/…)
          • Sync Client (We are still wire compatible, both ways)
      • Apache HBase 1.2
          • Master WebUI
          • Notification BUS, and at least Snapshot using it.
      • Apache HBase 1.3+ or 2.0 (depending on how hard it is to keep Master/RSs compatibility)
          • Replace Cache Updates, Assignment Manager, Distributed Log Replay, …
          • New Features: Coordinated compactions, Master ops Replication (e.g. grant/revoke)
  • 14. ZK-based Region Assignment
      • Region states could be inconsistent
      • Assignment info stored in both meta table and ZooKeeper
      • Both Master and RegionServer can update them
      • Limited scalability and operations efficiency
      • ZooKeeper events used for coordination
  • 15. ZK-less Region Assignment
      • RPC based
      • Master, the true coordinator
      • Only Master can update meta table
      • All state changes are persisted
      • Follow the state machine
      • RegionServer does what it is told by Master
      • Report status to Master
      • Each step needs acknowledgement from Master
  • 16. Current Status
      • Off by default in 1.0
      • Impact
      • Master is in the critical path
      • Meta should be co-located with Master
      • Procedure V2 could solve it (future work)
      • Deployment topology change
      • Master is a RegionServer, serves small system tables
      • Blog post has more info
      • https://blogs.apache.org/hbase/entry/hbase_zk_less_region_assignment
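The ZK-less scheme on slide 15 can be sketched as a toy model in which only the master writes region state and every RegionServer report must be acknowledged against a state machine. The state names, the `ToyMaster` class, and the dictionary standing in for the meta table are simplifying assumptions for illustration, not the actual HBase assignment code.

```python
# Toy model of master-coordinated region assignment.
# Hypothetical names; not the HBase AssignmentManager.

ALLOWED = {
    ("OFFLINE", "OPENING"),
    ("OPENING", "OPEN"),
    ("OPEN", "CLOSING"),
    ("CLOSING", "OFFLINE"),
}


class ToyMaster:
    def __init__(self):
        self.meta = {}  # region -> (state, server); only the master writes here

    def _transition(self, region, new_state, server):
        old_state = self.meta.get(region, ("OFFLINE", None))[0]
        if (old_state, new_state) not in ALLOWED:
            return False  # reject anything that skips a step
        self.meta[region] = (new_state, server)  # persist every state change
        return True

    def assign(self, region, server):
        # The master decides; the RegionServer is told what to do via RPC.
        return self._transition(region, "OPENING", server)

    def report_opened(self, region, server):
        # A RegionServer reports status; the master acknowledges or refuses.
        if self.meta.get(region, (None, None))[1] != server:
            return False
        return self._transition(region, "OPEN", server)


master = ToyMaster()
```

Because there is a single writer, the region state cannot diverge between two stores the way it could when both the meta table and ZooKeeper held assignment info.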
  • 18. HBase Semantic Versioning: The Return to Sanity
  • 20. Semantic Versioning Makes Things Simple
  • 22. MAJOR version when you make incompatible API changes
  • 23. MINOR version when you add backwards-compatible functionality
  • 24. PATCH version when you make backwards-compatible bug fixes
  • 25. We are adopting this starting with HBase 1.0
  • 26. Compatibility Dimensions (the long version)
      • Client-Server wire protocol compatibility
      • Server-Server protocol compatibility
      • File format compatibility
      • Client API compatibility
      • Client Binary compatibility
      • Server-Side Limited API compatibility (taken from Hadoop)
      • Dependency Compatibility
      • Operational Compatibility
  • 27. TL;DR:
      • A patch upgrade is a drop-in replacement
      • A minor upgrade requires no application or client code modification
      • A major upgrade allows us - the HBase community - to make breaking changes.
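The TL;DR on slide 27 maps directly onto a comparison of MAJOR.MINOR.PATCH numbers. The helper below is a small sketch with invented function names (`parse`, `upgrade_kind`); it assumes plain three-part numeric versions and ignores suffixes such as `-SNAPSHOT`.

```python
# Classify an upgrade under MAJOR.MINOR.PATCH semantic versioning.
# Function names are illustrative; only plain numeric versions are handled.

def parse(version):
    major, minor, patch = (int(part) for part in version.split("."))
    return major, minor, patch


def upgrade_kind(current, target):
    cur, tgt = parse(current), parse(target)
    if tgt <= cur:
        return "not an upgrade"
    if tgt[0] != cur[0]:
        return "major"  # breaking changes are allowed
    if tgt[1] != cur[1]:
        return "minor"  # no application or client code modification required
    return "patch"      # drop-in replacement
```

For example, `upgrade_kind("1.0.0", "1.0.1")` classifies the move as a patch upgrade, while going from a 1.x release to 2.0.0 is a major upgrade where breaking changes may appear.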
  • 31. Improving read availability
      • HBase is CP
      • When a node goes down, some regions are unavailable until recovery
      • Some classes of applications want high availability (for reads)
      • Region replicas
      • TIMELINE consistency reads
  • 32. Phase contents
      • Phase 1
          • Region replicas
          • Stale data up to minutes (15 min)
          • In 1.0
      • Phase 2
          • Millisecond latencies for staleness (WAL replication)
          • Replicas for the meta table
          • Region splits and merges with region replicas
          • Scan support
          • In 1.1
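The trade-off behind TIMELINE consistency reads can be sketched with a toy model: a strong read must wait for the primary, while a timeline read may be served by a replica and is flagged as possibly stale. The `ToyRegion` class and its methods are assumptions made for this example. In the HBase Java client the corresponding request is made with `Consistency.TIMELINE` on a `Get` or `Scan`, and staleness is reported by `Result.isStale()`.

```python
# Toy model of TIMELINE-consistency reads against region replicas.
# Hypothetical names; replicas here are plain dicts that lag the primary.

class ToyRegion:
    def __init__(self):
        self.primary = {}
        self.replica = {}
        self.primary_up = True

    def put(self, row, value):
        self.primary[row] = value  # writes always go to the primary

    def replicate(self):
        self.replica = dict(self.primary)  # replica catches up asynchronously

    def get(self, row, timeline=False):
        """Return (value, is_stale)."""
        if self.primary_up:
            return self.primary.get(row), False
        if not timeline:
            raise ConnectionError("primary unavailable; strong read must wait")
        return self.replica.get(row), True  # possibly stale, flagged as such


region = ToyRegion()
region.put("row1", "v1")
region.replicate()
region.put("row1", "v2")   # not yet replicated
region.primary_up = False  # the node hosting the primary goes down
```

With the primary down, `region.get("row1", timeline=True)` still answers, but with the older value and the stale flag set; how old that value can be is what Phase 1 and Phase 2 differ on.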
  • 34. [Diagram: WAL Replica Replication. RegionServer 1 hosts Region1, Region2 and Region3 and writes a WAL. The WAL is tailed and replayed to Region2 (replica) on RegionServer 15 and to Region1 (replica) on RegionServer 20. Flush/Compaction writes hfiles to HDFS, and the replicas read the flushed files.]
  • 35. Pluggable WAL Replication
      • Pluggable WAL replication endpoint
      • You can write your own replicators!
      • Similar to co-processors (runs in the same RS process)
      hbase> add_peer 'my_peer', ENDPOINT_CLASSNAME => 'org.hbase.MyReplicationEndpoint', DATA => { "key1" => 1 }, CONFIG => { "config1" => "value1", "config2" => "value2" }
  • 37. Workload Throughput
      Distributed work will eventually be limited by one of:
      • CPU
      • Disk IO
      • Network IO
  • 38. HBase Under (synthetic) Load Now: Not CPU Bound
  • 39. HBase Under (synthetic) Load Now: Not Disk Bound
  • 40. HBase Under (synthetic) Load Now: Not Network Bound
  • 41. Modest Gain: Multiple WALs
      • All regions write to one write-ahead log (WAL) file.
      • Idea: Let’s have multiple write-ahead logs so that we can write more in parallel.
      • Follow-up work:
          • Taken to the limit, if we were on SSD we could have one WAL per region.
      [Diagram: an RS writing to DN disks 1, 2 and 3, shown twice, with idle disks marked IDLE.]
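The multiple-WAL idea on slide 41 amounts to partitioning regions across several logs so that appends can proceed in parallel. The sketch below is a toy: the function names and the hash-modulo grouping rule are assumptions chosen for illustration, not a description of how HBase groups regions.

```python
# Toy model: spread regions across a fixed number of WALs.
# Hypothetical names; the grouping rule is an assumption for illustration.
import zlib


def wal_for_region(region_name, num_wals):
    # Stable hash so a region always appends to the same WAL.
    return zlib.crc32(region_name.encode("utf-8")) % num_wals


def group_regions(regions, num_wals):
    groups = {i: [] for i in range(num_wals)}
    for region in regions:
        groups[wal_for_region(region, num_wals)].append(region)
    return groups


regions = ["region-%d" % i for i in range(32)]
single_wal = group_regions(regions, 1)  # today's layout: everything in one log
multi_wal = group_regions(regions, 4)   # four logs that can be written in parallel
```

Setting `num_wals` equal to the number of regions gives the "one WAL per region" limit mentioned as follow-up work.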
  • 42. Future Solutions
      • Alternative WAL providers
      • Read path optimizations based on profiling
      • Better tuning

Editor's Notes

  1. When working with a large number of machines, your first optimization step has to be getting to the point where one of these three resources is exhausted. The specifics will depend on your workload, but right now we have a lot of room for improvement.
  2. This is a mixed write / update / read workload after reaching a state where memstore flushes and compactions are happening. It’s mostly waiting on synchronization, AFAICT.
  3. This is a mixed write / update / read workload after reaching a state where memstore flushes and compactions are happening. It’s mostly waiting on synchronization, AFAICT.
  4. This is a mixed write / update / read workload after reaching a state where memstore flushes and compactions are happening. It’s mostly waiting on synchronization, AFAICT.
  5. Historically, one of the long poles in the tent has been the WAL, since all the regions served by a region server hit the same one. As of HBase 1.0, there are options to expand to multiple pipelines, but the gains are modest. As of HBase 1.1, we can make use of HDFS storage policies to keep just the WAL on SSD in mixed-disk deployments. We need more testing and operational feedback from the community, though.
  6. Longer-term solutions that will start showing up in HBase 2.0 involve updates to both the read and write paths. For WAL limitations, we need to examine some base assumptions: HDFS is made for throughput of large blobs, not for many small writes. One option is a custom DFSClient in HBase to show value, then push it upstream. Maybe it’s best to defer to a system made for these kinds of writes, e.g. Kafka. Stack has recently done some excellent work profiling what happens in an HBase system under load, and some optimizations to work better with the JIT compiler have been landing as a result. Frankly, we have a huge number of tuning options now that can eat a lot of hardware, but they remain inaccessible. We need documentation improvements and a round of updating defaults based on current machine specs.