New Journey of HBase in Alibaba and Cloud
Chunhui Shen and Long Cao
August 17, 2018
Eight Years Honing One Sword: The New Journey of HBase at Alibaba and on the Cloud
Content
01 AliHB: Introduction of Alibaba HBase
   History, Tech Overview, Open Source, Core Scenarios
02 Recent Key Challenges & Improvements
   GC Trouble, Separation of Computing & Storage, Cold-Hot Data, Diagnostic System, Migration & Backup
03 HBase Ecosystem & Multi-model DB & Cloud
   KV, Tabular, SQL, Graph, Time Series, Geospatial, Search, Mixed Workloads, Cloud
01 AliHB: Introduction of Alibaba HBase
HBase History in Alibaba
• Why HBase
– In use since 2010
– Active community
– Hadoop ecosystem
– Facebook's successful use case
– Google's famous paper: Bigtable
[Figure: Our Choice in 2010. Labels: Data Burst; Big Data Store System; Open Source; Commercial; Develop New; Cassandra; MySQL, Oracle]
• Versions used
– 0.20->0.90->0.92->0.94->0.98->1.1->2.0
• Earliest use cases in 2010-2011
– Search Store
– Taobao historical orders
– Alipay risk management
• Internal branch: AliHB
Overview of AliHB
• Performance
– High-performance data structures, lock-free design, group IO
• Features
– SQL, secondary index
– Multi-tenancy, cold-hot separation, async API
• Stability
– High-availability architecture
– Faster MTTR
– Verified on the Double 11 shopping day
• Efficient Maintenance
– Effective monitoring
– Full-path tracing
– No-pause migration
• 12,000+ nodes, 100+ clusters, 200+ million OPS, 100+ PB of data
• 20+ BUs, 6,000+ users, 100+ production changes per day
Open Source and Community
• Contributing to open source since 2011
• 3 PMC members and 6 committers at Alibaba
• Sponsoring the Chinese HBase Technology Community
• Already organized 2 HBase meetups
• At least one HBase-related tech article per day
• Tens of thousands of readers now, and more are coming
• Hosting HBaseCon Asia 2018
• Promoting the use of HBase through several conference talks
• Hoping more people will join the HBase community
Core Scenarios in Alibaba
[Figure: scenarios served by Ali-HBase]
• Wangwang (IM), Alipay Bills, Cainiao Logistics
• Log: Monitor, Log, Tracking, IoT Data…
• Message, Orders, Feeds…
• AI Storage
• Ant Intelligent Security, Intelligent Customer Service, Recommendation
• Search, BI Report…
02 Recent Key Challenges & Improvements
GC Trouble
GC problems under 100 GB memory:
• Frequent slow requests
• Very slow requests
• Service unavailable
Options:
• Use HBase only for offline applications
• Rewrite in C++
• Explore a thorough solution
Before optimization:
Type   Pause Time   Frequency
YGC    100ms+       Once per 5 secs
CMS    100~500ms    Once per 5 mins
FGC    20s-180s     Once per 7~60 days

After optimization:
Type   Pause Time   Frequency
YGC    5ms          Once per 5 secs
CMS    100ms        Once per 5 hours
FGC    N/A          N/A

Improvements:
• CCSMap, BucketCacheV2: the major memory is allocated and reclaimed by HBase itself, rather than by the JVM
• ZenGC: new GC algorithm in AJDK
• Try to reuse objects (in the core path) when programming
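
As a rough reading of the two pause tables above, the share of wall-clock time lost to pauses can be estimated as pause length divided by the interval between pauses. The sketch below uses only the figures from the slide. It takes "100ms+" and "100~500ms" at their lower bound and leaves FGC out, so the "before" number is an underestimate. The function and variable names are illustrative, not part of any HBase or JVM tooling.

```python
def pause_fraction(pause_ms: float, interval_s: float) -> float:
    """Fraction of wall-clock time spent paused: pause length / interval between pauses."""
    return (pause_ms / 1000.0) / interval_s


# (pause in ms, interval in seconds), copied from the slide.
# Ranges such as "100ms+" and "100~500ms" are taken at their lower bound.
before = {"YGC": (100, 5), "CMS": (100, 5 * 60)}
after = {"YGC": (5, 5), "CMS": (100, 5 * 3600)}


def total_fraction(table: dict) -> float:
    """Sum the pause fractions of all collector types in one table."""
    return sum(pause_fraction(pause, interval) for pause, interval in table.values())


if __name__ == "__main__":
    print(f"before: {total_fraction(before):.4%}")
    print(f"after:  {total_fraction(after):.4%}")
```

Under these assumptions the young-generation pauses dominate: roughly 2% of time paused before versus roughly 0.1% after, about a 20x reduction.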
In open-source HBase:
• New BucketCache in HBase-2.0
• CCSMap in HBase-3.0
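
The idea of letting the application allocate and reclaim its major memory itself, instead of creating many small objects for the garbage collector to trace, can be illustrated with a pool of fixed-size chunks and a bump-pointer writer. This is a minimal sketch of the general technique only. `ChunkPool` and `ChunkWriter` are hypothetical names and do not reflect the actual CCSMap or BucketCache code.

```python
class ChunkPool:
    """Hypothetical pool of fixed-size byte buffers that are recycled rather than discarded."""

    def __init__(self, chunk_size: int, max_chunks: int):
        self.chunk_size = chunk_size
        self.max_chunks = max_chunks
        self._free = []      # chunks handed back by callers, ready for reuse
        self.allocated = 0   # number of chunks ever created

    def acquire(self) -> bytearray:
        if self._free:
            return self._free.pop()  # reuse an existing chunk, no new allocation
        if self.allocated >= self.max_chunks:
            raise MemoryError("chunk pool exhausted")
        self.allocated += 1
        return bytearray(self.chunk_size)

    def release(self, chunk: bytearray) -> None:
        self._free.append(chunk)


class ChunkWriter:
    """Bump-pointer writer that packs many small values into one pooled chunk."""

    def __init__(self, pool: ChunkPool):
        self.pool = pool
        self.chunk = pool.acquire()
        self.offset = 0

    def write(self, value: bytes):
        """Copy value into the chunk; return (offset, length), or None if it does not fit."""
        end = self.offset + len(value)
        if end > len(self.chunk):
            return None
        self.chunk[self.offset:end] = value
        start, self.offset = self.offset, end
        return start, len(value)

    def read(self, start: int, length: int) -> bytes:
        return bytes(self.chunk[start:start + length])

    def close(self) -> None:
        """Hand the whole chunk back to the pool in one step."""
        self.pool.release(self.chunk)
```

Callers keep only `(offset, length)` pairs, so the number of long-lived objects grows with the number of chunks rather than with the number of stored values.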
Separation of Computing & Storage
Localized Deployment
– Low IO latency with short-circuit read
– Unbalanced storage space, especially between clusters
– Difficult to raise both CPU and disk utilization, especially with many different scenarios
– Cluster scaling is slow because of DataNode decommissioning
Shared-Storage Deployment
– Big shared storage, more balanced
– Compute nodes can scale independently
– Storage nodes can scale independently
– Auto-scaling becomes feasible
– Smart scheduling between clusters, based on load statistics
– Compute resources can be shared with other applications
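
With shared storage, the compute tier and the storage tier can be sized from different signals: request load for compute, data volume for storage. The sketch below shows that separation in its simplest form. The per-node capacities, target utilizations and function names are made-up assumptions for illustration, not values from the talk.

```python
import math


def nodes_needed(demand: float, per_node_capacity: float, target_utilization: float) -> int:
    """Smallest node count that keeps utilization at or below the target."""
    if per_node_capacity <= 0 or not 0 < target_utilization <= 1:
        raise ValueError("capacity must be positive and target utilization in (0, 1]")
    return max(1, math.ceil(demand / (per_node_capacity * target_utilization)))


def plan_cluster(ops_per_sec: float, stored_tb: float,
                 ops_per_compute_node: float = 50_000,  # hypothetical capacity
                 tb_per_storage_node: float = 40,       # hypothetical capacity
                 cpu_target: float = 0.6,               # hypothetical target
                 disk_target: float = 0.8) -> dict:     # hypothetical target
    """Size each tier from its own signal; neither count depends on the other."""
    return {
        "compute_nodes": nodes_needed(ops_per_sec, ops_per_compute_node, cpu_target),
        "storage_nodes": nodes_needed(stored_tb, tb_per_storage_node, disk_target),
    }
```

In this model, growing the stored data changes only `storage_nodes`, and a traffic spike changes only `compute_nodes`, which is the property that a localized deployment lacks.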
Heterogeneous Cold-Hot Storage
• HBase is capable of holding data for its whole life cycle
• But in most cases (monitoring, tracing, orders, logistics):
– Recently generated data is accessed often, but occupies very little storage space
– Historical data is rarely visited, but occupies a lot of storage space
• Common solution
– Cold storage system for historical data
– Hot storage system for recent data
– Move data from the hot storage system to the cold storage system periodically
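
The common solution above, a periodic move from hot to cold storage, can be sketched as a job that re-homes every row older than a cutoff. This is a minimal illustration that uses in-memory dictionaries as stand-ins for the two storage systems. The function names and the `(timestamp, value)` row layout are assumptions.

```python
from datetime import datetime, timedelta


def move_cold_rows(hot: dict, cold: dict, now: datetime, max_hot_age: timedelta) -> int:
    """Move rows older than max_hot_age from hot to cold; return how many were moved."""
    cutoff = now - max_hot_age
    # Collect keys first so the dict is not modified while it is being iterated.
    stale_keys = [key for key, (ts, _value) in hot.items() if ts < cutoff]
    for key in stale_keys:
        cold[key] = hot.pop(key)
    return len(stale_keys)


def read_row(key, hot: dict, cold: dict):
    """Check hot storage first, since recent data is read most often, then fall back to cold."""
    row = hot.get(key)
    if row is None:
        row = cold.get(key)
    return row
```

The drawback of running two separate systems this way is that the application has to know about both tiers and schedule the move itself.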
Heterogeneous Cold-Hot Storage
• Easy To Use
• Auto Tiered
• Heterogeneous
• Read Optimization
Diagnostic System
"Request rush?" -> Monitor
"Big region?" -> Web UI
"Full disk?" -> df
"Bad disk?" -> tsar, dmesg
……
12,000+ nodes, 100+ clusters, 6,000+ users
HBase Diagnostic Center
1. The unified entrance for troubleshooting
2. Experience/solutions => functions of the diagnostic system
• One extra server for all
• No agent
• Rules can be added dynamically
• Uses runtime information
• Checks all components
• Only 10 seconds for a diagnosis
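
The properties listed above (rules added dynamically, runtime information, checks across components) suggest a registry of rules evaluated against collected metrics. The sketch below shows one way such a registry could look. The class, rule names, metric keys and thresholds are all hypothetical and are not taken from the actual diagnostic system.

```python
from typing import Callable, Dict, List, Optional

Metrics = Dict[str, float]
Rule = Callable[[Metrics], Optional[str]]  # returns a finding, or None if healthy


class DiagnosticCenter:
    """Hypothetical rule registry: rules can be registered at any time."""

    def __init__(self) -> None:
        self._rules: Dict[str, Rule] = {}

    def add_rule(self, name: str, rule: Rule) -> None:
        self._rules[name] = rule

    def diagnose(self, metrics: Metrics) -> List[str]:
        """Run every registered rule against one snapshot of runtime metrics."""
        findings = []
        for name, rule in self._rules.items():
            message = rule(metrics)
            if message is not None:
                findings.append(f"{name}: {message}")
        return findings


def disk_space_rule(metrics: Metrics) -> Optional[str]:
    used = metrics.get("disk_used_ratio", 0.0)  # hypothetical metric key
    return f"disk {used:.0%} full" if used > 0.9 else None


def store_file_rule(metrics: Metrics) -> Optional[str]:
    files = metrics.get("store_files", 0)  # hypothetical metric key
    return f"{files:.0f} store files" if files > 100 else None
```

Because each rule is a plain function over a metrics snapshot, operational experience can be captured as a new rule without redeploying anything on the monitored nodes.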
Diagnostic rules: 50+ rules, 80%+ accuracy, shared on Apsara HBase
HBase
• Compaction
• Stuck
• Balance abnormal
• Table abnormal
• Region offline
• Replication delay
• Too many files
• High meta load
• Multi assign
• ……
ZK/HDFS
• ZK unavailable
• Block miss
• NameNode abnormal
• DataNode at full capacity
• Inconsistent state between two NameNodes
• Too many Xceivers
• Disk not mounted
• ……
Hardware
• Insufficient disk space
• Slow disk
• Bad disk
• Too many TCP errors
• Slow ping
• CPU hang
• Load too high
• Port is unreachable
• ……
Migration & Backup
Independent of HBase
• Almost no impact on service
• Easy to upgrade
• Supports multiple versions
• Supports non-HBase targets
Second-level RPO
Minute-level RTO
03 HBase Ecosystem & Multi-model DB & Cloud
[Figures: popularity changes per DB category; ranking scores per category in percent; data size per day]
All in one
Data models: Relational, Key Value, Document, Graph, Time Series, Geospatial, Tabular (NoSQL)
On HBase:
Model         Component
Tabular       HBase
Key Value     HBase
Document      HBase
Relational    Phoenix/AntsDB
Graph         JanusGraph
Time Series   OpenTSDB
Geospatial    GeoMesa
Multi-model: Native or Layer
[Figure: systems shown are Neo4j, InfluxDB, CockroachDB, PG, HBase Ecosystem, DataStax, CosmosDB; diagram labels are Multi-model, KV, Index, Storage]
HBase Meets Cloud – Benefits
• Cloud Native: insight, fix bugs in time, self-driven
• New Hardware: RDMA, flash, GPU, non-volatile memory
• Flexibility: fast add/remove of resources
• Cost Savings (TCO): end up paying for features, flexibility, self-driven, reduce human ……
ApsaraDB HBase Platform – Cloud Native
Components:
• HBase (KV, Tabular, Document)
• Solr/ES (full-text index)
• SQL: Phoenix
• Graph: JanusGraph
• Time Series: OpenTSDB
• Geospatial: GeoMesa
• Spark
Storage:
• Hot data on SSD
• Warm data on SSD & HDD
• Cold data on HDD, using EC like OSS
• Remote read/write uses RDMA and a 25G network
ApsaraDB HBase Platform Advantage
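Comparison of ApsaraDB HBase (Aliyun product, https://cn.aliyun.com/product/hbase) and Apache HBase (software), item by item:
Basic
• High availability: ApsaraDB HBase 99.9% ~ 99.99%; Apache HBase N/A
• Data reliability: ApsaraDB HBase 99.999999999%; Apache HBase N/A
Online Ability
• Multi-master clustering: ApsaraDB HBase multi-master clustering, multi-AZ/region; Apache HBase no
• GC: ApsaraDB HBase no FGC, YGC 5ms; Apache HBase GC 20s~100s, YGC 100ms+
Reduce Cost
• Storage cost: ApsaraDB HBase cut by 50%+ on shared cloud disk, 3 copies in total; Apache HBase possibly on cloud disk, 9 copies in total
• Cold storage support: ApsaraDB HBase supports OSS, cost cut by 70% for less-read data; Apache HBase no
Multi-model DB
• Multi-model DB: ApsaraDB HBase KV, Tabular, SQL, Graph, Time Series, Geospatial, full-text index, search; Apache HBase KV, Tabular
Enterprise Characteristics
• Disaster recovery: ApsaraDB HBase backup and restore; Apache HBase no (maybe in 3.0)
• Security: ApsaraDB HBase user/password, ACL; Apache HBase Kerberos, ACL
• Analytics: ApsaraDB HBase Spark on HBase, with more optimization; Apache HBase Spark on HBase
• Version upgrade: ApsaraDB HBase automatic upgrade; Apache HBase N/A
Self-driven Database
• Control system: ApsaraDB HBase 15 min to create a DB/monitor, add storage and nodes online, elastic power in future; Apache HBase N/A
• Diagnostic: ApsaraDB HBase big request, big table merge, hot region ……; Apache HBase no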
Join us
Hangzhou, Silicon Valley, Shenzhen, Beijing
Thanks for watching

HBaseConAsia2018 Keynote 2: Recent Development of HBase in Alibaba and Cloud

  • 1. New Journey of HBase in Alibaba and Cloud Chunhui Shen and Long Cao August 17,2018 八年磨一剑,HBase在阿里巴巴和云上的新征程
  • 2. Content AliHB-Introduction of Alibaba HBase History,Tech Overview,Open Source,Core Scenarios 01 Recent Key Challenge & Improvements GC Trouble,Separation of Computing & Storage,Cold- Hot Data,Diagnostic System, Migration & Backup 02 03 HBase Ecosystem & Multi-model DB & Cloud KV,Tabular,SQL,Graph,Time Series,Geospatial , Search, Mixed Workloads,Cloud
  • 4. HBase History in Alibaba • Why HBase – Began using since 2010 – Active community – Hadoop ecosystem – Facebook successful case – Google famous paper: Big Table Our Choice in 2010 Open Source Commercial Develop New Big Data Store System Cassandra MySQL、Oracle Data Burst • Used Version – 0.20->0.90->0.92->0.94->0.98->1.1->2.0 • The earliest case in 2010-2011 – Search Store – Taobao History Order – Alipay Risk Management • Internal branch AliHB
  • 5. Overview of AliHB 5 • Performance • High-Performance Data Structure、Lock-Free、 Group IO • Feature • SQL、Secondary Index • Multi-Tenants、Cold-Hot Separation、Async API • Stability • High Availability Architecture • Faster MTTR • Verification in Double 11 Shopping Day • Efficient Maintenance • Effective Monitoring • Full Path Trace • No-pause migration • 12000+ Nodes,100+ Clusters ,200+ Million OPS,100+ PB Data • 20+ BU,6000+ Users, 100+ Production Changes per Day
  • 6. Open Source and Community 6 • Contributing to open source since 2011 • 3 PMC, 6 Committers in Alibaba • Sponsor the Chinese HBase Technology Community • Already Organized 2 HBase Meetup • At least one HBase Related tech article one day • Tens of thousands of readers now, and more are coming • Hosting HBase Con Asia 2018 • Promote the use of HBase through several conference talks • Hope more people to join in HBase Community
  • 7. Core Scenarios in Alibaba 7 Ali-HBase 旺旺(IM) Alipay Bills Cainiao Logistics Log Monitor, Log, Tracking, IoT Data…Message, Orders, Feeds … AI Storage Ant Intelligent Security Intelligent Customer Service Recommendation Search, BI Report…
  • 8. Recent Key Challenge & Improvements02
  • 9. GC Trouble 9 Frequent Slow Request Very Slow Request Service Unavailable GC Problems Under100GB Memory
  • 10. GC Trouble 10 Only for offline application Rewriting with C++ Exploring a Thorough Solution
  • 11. GC Trouble 11 Type Pause Time Frequency YGC 100ms+ Once per 5 Secs CMS 100~500ms Once per 5 Mins FGC 20s-180s Once per 7~60 Days Type Pause Time Frequency YGC 5ms Once per 5 Secs CMS 100ms Once per 5 Hours FGC N/A N/A CCSMap BucketCacheV2 Allocation and reclaim the major memory by hbase itself, rather than JVM ZenGC New GC algorithm in AJDK Try best to reuse object(In Core Path) when programming
  • 12. GC Trouble New BucketCache in HBase-2.0 CCSMap in HBase-3.0
  • 13. Separation of Computing & Storage: Localized Deployment – Low IO latency with short-circuit read – Unbalanced storage space, especially between clusters – Hard to raise both CPU and disk utilization, especially with many scenarios – Cluster scaling is slow because of DataNode decommissioning
  • 14. Separation of Computing & Storage: Shared-Storage Deployment – Large shared storage, more balanced – Compute nodes can scale independently – Storage nodes can scale independently – Auto-scaling becomes feasible – Smart scheduling between clusters, based on load statistics – Compute resources can be shared with other applications
  • 15. Heterogeneous Cold-Hot Storage • HBase can hold data for its whole life cycle • But in most cases (monitoring, tracing, orders, logistics): • Recently generated data is accessed often but occupies very little storage space • Historical data is rarely visited but occupies a lot of storage space • Common solution: • Cold storage system for historical data • Hot storage system for recent data • Move data from the hot storage system to the cold storage system periodically
  • 16. Heterogeneous Cold-Hot Storage • Easy to use • Auto tiered • Heterogeneous • Read optimization
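The "common solution" on the cold-hot storage slides, moving data from hot to cold storage periodically, can be sketched as an age-based tiering decision. This is only an illustration, not AliHB's implementation: the `DataFile` type, the `choose_tier` and `plan_moves` functions, and the 30-day hot window are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataFile:
    name: str
    newest_ts: int   # timestamp of the newest record in the file, epoch seconds
    size_bytes: int


def choose_tier(data_file, now, hot_window_s):
    """Files whose newest record is older than the hot window go to cold storage."""
    age_s = now - data_file.newest_ts
    return "hot" if age_s < hot_window_s else "cold"


def plan_moves(files, now, hot_window_s):
    """Return names of files a periodic job would move to cold storage."""
    return [f.name for f in files if choose_tier(f, now, hot_window_s) == "cold"]


now = 1_000_000_000
day = 86_400
files = [
    DataFile("orders-recent", now - 2 * day, 10 * 2**20),
    DataFile("orders-last-year", now - 400 * day, 900 * 2**20),
]
print(plan_moves(files, now, hot_window_s=30 * day))  # ['orders-last-year']
```

In this toy data the rarely visited file is also the large one, which mirrors the slide's point that historical data takes most of the space.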
  • 17. Diagnostic System • “Request rush?” -> Monitor • “Big region?” -> Web UI • “Full disk?” -> df • “Bad disk?” -> tsar, dmesg • …… • 12,000+ nodes, 100+ clusters, 6,000+ users • HBase Diagnostic Center: 1. The unified entrance for troubleshooting 2. Experience/Solution => Function of Diagnostic System
  • 18. Diagnostic System • One extra server for all • No agent • Adding rules dynamically • Runtime information • Check all components • Only 10 seconds for a diagnosis
  • 19. Diagnostic System • HBase: Compaction, Stuck, Balance Abnormal, Table Abnormal, Region Offline, Replication Delay, Too many files, High Meta Load, Multi Assign, …… • ZK/HDFS: ZK Unavailable, Block Miss, NameNode Abnormal, Full capacity of DataNode, Inconsistent state between two NameNodes, Too many Xceivers, Disk not mounted, …… • Hardware: Insufficient disk space, Slow Disk, Bad Disk, Too many TCP errors, Slow ping, CPU hang, Load too high, Port is unreachable, …… • 50+ rules, 80%+ accuracy, shared on Apsara HBase
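The diagnostic slides describe rules that can be added dynamically and evaluated against runtime information. A minimal sketch of that idea follows; it is not the HBase Diagnostic Center's code. The registry, the metric names (`disk_used_ratio`, `store_files`), and both thresholds are hypothetical and chosen only to echo the "Insufficient disk space" and "Too many files" rules on the slide.

```python
from typing import Callable, Dict, List, Optional

Metrics = Dict[str, float]
Rule = Callable[[Metrics], Optional[str]]

# Hypothetical rule registry: rules can be added at runtime without a restart.
RULES: Dict[str, Rule] = {}


def register_rule(name: str, rule: Rule) -> None:
    RULES[name] = rule


def diagnose(metrics: Metrics) -> List[str]:
    """Run every registered rule against one snapshot of runtime metrics."""
    findings = []
    for name, rule in RULES.items():
        message = rule(metrics)
        if message is not None:
            findings.append(f"{name}: {message}")
    return findings


# Thresholds below are illustrative assumptions, not values from the slides.
register_rule(
    "insufficient_disk_space",
    lambda m: "disk usage above 90%" if m.get("disk_used_ratio", 0.0) > 0.90 else None,
)
register_rule(
    "too_many_files",
    lambda m: "store file count above 100" if m.get("store_files", 0) > 100 else None,
)

print(diagnose({"disk_used_ratio": 0.95, "store_files": 12}))
```

Keeping each rule as a small function over a metrics snapshot is one way to turn operational experience into checks that can be collected centrally, without an agent on every node.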
  • 21. Migration & Backup • Independent of HBase: almost no impact on service; easy to upgrade; supports multiple versions; supports non-HBase targets • Second-level RPO • Minute-level RTO
  • 22. Section 03: HBase Ecosystem & Multi-model DB & Cloud
  • 23. Popularity changes per DB category
  • 24. Ranking scores per category in percent
  • 26. All in one: Relational, Key Value, Document, Graph, Time Series, Geospatial, Tabular, NoSQL
  • 27. All in one: Relational (Phoenix/AntsDB), Key Value (HBase), Document (HBase), Graph (JanusGraph), Time Series (OpenTSDB), Geospatial (GeoMesa), Tabular (HBase), NoSQL
  • 28. Multi-model: Native or Layer • Products shown: Neo4j, InfluxDB, CockroachDB, PG, HBase Ecosystem, DataStax, CosmosDB • Diagram labels: Multi-model, KV, Index, Storage
  • 29. HBase Meets Cloud: Benefits • Cloud native, new hardware, flexibility, cost savings (TCO) • New hardware: RDMA, flash, GPU, non-volatile memory • Fast add/remove of resources • Insight • Fix bugs in time • Self-driven • End up paying for features • Reduce human ……
  • 30. Multi-model: Native or Layer • Products shown: Neo4j, InfluxDB, CockroachDB, PG, HBase Ecosystem, DataStax, CosmosDB • Diagram labels: Multi-model, KV, Index, Storage
  • 31. ApsaraDB HBase Platform: Cloud Native • HBase (KV, Tabular, Document) • Solr/ES (full-text index) • SQL: Phoenix • Graph: JanusGraph • Time Series: OpenTSDB • Geospatial: GeoMesa • Spark • Hot data on SSD • Warm data on SSD & HDD • Cold data on HDD, using EC (erasure coding) like OSS • Remote read/write using RDMA and a 25G network
  • 32. ApsaraDB HBase Platform Advantage
    Category | Item | ApsaraDB HBase (Aliyun product, https://cn.aliyun.com/product/hbase) | Apache HBase (software)
    Basic | High availability | 99.9% ~ 99.99% | N/A
    Basic | Data reliability | 99.999999999% | N/A
    Online ability | Multi-master clustering | Multi-master clustering, multi-AZ/region | No
    Online ability | GC | No FGC, YGC 5ms | GC 20s~100s, YGC 100ms+
    Reduce cost | Storage cost | Cut by 50%+ on shared cloud disk, 3 copies in total | Maybe on cloud disk, 9 copies in total
    Reduce cost | Cold storage support | Supports OSS, cut by 70% for less-read data | No
    Multi-model DB | Multi-model DB | KV, Tabular, SQL, Graph, Time Series, Geospatial, full-text index, search | KV, Tabular
    Enterprise characteristics | Disaster recovery | Backup and restore | No, maybe in 3.0
    Enterprise characteristics | Security | User/password, ACL | Kerberos, ACL
    Enterprise characteristics | Analytics | Spark on HBase, more optimization | Spark on HBase
    Enterprise characteristics | Version upgrade | Automatic upgrade | N/A
    Self-driven | Database control system | 15 min to create a DB; monitoring; online addition of storage and nodes; elastic power in the future | N/A
    Self-driven | Diagnostic | Big request, big table merge, hot region …… | No
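As a rough aid for reading the "High availability 99.9% ~ 99.99%" row, the percentages can be converted into a maximum downtime per year. This is plain arithmetic under the assumption of a 365-day year and continuous measurement; it is not a statement of how the product's availability is actually measured, and the function name is made up for this sketch.

```python
HOURS_PER_YEAR = 365 * 24  # assumes a non-leap year


def max_downtime_hours_per_year(availability_percent):
    """Hours per year a service may be down and still meet the given availability."""
    return HOURS_PER_YEAR * (100 - availability_percent) / 100


for pct in (99.9, 99.99):
    hours = max_downtime_hours_per_year(pct)
    print(f"{pct}% -> {hours:.2f} h/year ({hours * 60:.1f} min)")
```

Under these assumptions, 99.9% allows about 8.76 hours of downtime per year and 99.99% allows about 52.6 minutes.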