Managing multi tenant resource toward Hive 2.0

Managing multi tenant
resource toward Hive 2.0
Kai Sasaki
Treasure Data Inc.

About Me
• Kai Sasaki (佐々木海)
• @Lewuathe (Twitter)
• Software Engineer  
at Treasure Data Inc.
• Maintaining and develop  
Hadoop/Presto infrastructure

Topic
• Treasure Data infrastructure
• Hive 2.0 change
• Migration architecture
• Resource management for multi tenancy
• Performance comparison

• Live Data Management Platform
• Original creator of Fluentd/Embulk/Digdag
• 70+ integrations with
• BI tools
• Mobile/IoT
• Cloud Storage
• and more

• Hive/Pig/Presto data processing interface
• 40000+ Hive queries / day
• 130000+ Presto queries / day
• Plazma Cloud Storage
• 450000+ records/sec imported

Hive 2.0
• Include major new features
• Fixed 600+ bugs
• 140+ improvements or new features
• Backward compatible as much as possible
• Hive 1.x stable line
• 2.1.0 is available from June 20th, 2016
http://www.slideshare.net/HadoopSummit/apache-hive-20-sql-speed-scale

Hive 2.0
• HPLSQL
• LLAP
• HBase metastore
• Improvements of Hive on Spark
• CBO improvements

HPLSQL
• Procedural SQL like Oracle’s PL/SQL
• Cursor
• loops (WHILE, FOR, LOOP)
• branches (IF)
• External library which communicates through JDBC
• http://www.hplsql.org/doc

LLAP
• Sub-second Queries in Hive
• Save JVM container launch time
• Data caching
• Fit to Adhoc or interactive use case
• Beta in 2.0
http://hortonworks.com/wp-content/uploads/2014/09/Screen-Shot-2014-09-02-at-5.03.47-PM.png

LLAP
• Sub-second Queries in Hive

HBase metastore
• Use HBase as metastore of Hive
• Fetching thousands of partitions
• Limitation of concurrent connection
• Will support transaction with Apache Omid
• Alpha in Hive 2.0

Many ﬁxes
and  
Cutting edge features

That’s all?
• Operation cost of migration
• Manage multiple cluster
• Test and verify multiple packages
• Difference of conﬁguration and parameter

That’s all?
• Operation cost of migration
• Manage multiple cluster
• Test and verify multiple packages
• Difference of conﬁguration and parameter
• Need to reduce operation cost at the same time

Challenge
• NO DOWNTIME
• NO HARMFUL OPERATION
• Change package easily
• Separate from other components (Micro service)
• NO DEGRADATION
• Automatic query test and validation

NO DOWNTIME
• Hadoop cluster Blue-Green deployment
• Reliable queue system separated from Hadoop
→ PerfectQueue
• Reliable storage system separated from Hadoop
→ Plazma

PerfectQueue
• Distributed queue built on top of RDBMS
• At-least-once semantics
• Graceful and live restarting
• State consistency by transaction
• https://github.com/treasure-data/perfectqueue

Plazma
• Distributed cloud-based storage
• PostgreSQL + S3/Riak CS
• Enable time-index push down for Hive/Pig/Presto
• Column-oriented IO (mpc1)
• Data consistency with transactional API

Plazma
x
PQ
PQ
App
request
pull
submit

Plazma
x
PQ
PQ
App
request
pull
submit fetch

Plazma
x
PQ
PQ
App
request
pull
submit fetch
disposable
components

Plazma
x
PQ
PQ
App
request
pull
submit fetch
v1
v2

Plazma
x
PQ
PQ
App
request
pull
submit
fetch
v1
v2

Plazma
PQ
PQ
App
request
pull
submit
fetch
v2

NO HARMFUL OPS
• Automatic package version up
• Chef server speciﬁes the version
• Hadoop package repository
• S3 remote package repository
• Hadoop as a REST service
• elephant-server

elephant-server
• Hadoop as REST service
• Pluggable executor
• Hive
• Pig
• Embulk MapReduce executor
• Distributed on-memory queue (Hazelcast)

PQ
PQ
App
request
pull REST
elephant
server

PQ
PQ
App
request
pull REST
elephant
server
elephant
server
elephant
server

PQ
PQ
App
request
pull REST
elephant
server
hazelcast
elephant
server
elephant
server

PQ
PQ
App
request
pull REST
elephant
server
hazelcast
elephant
server
elephant
server
service
discovery

PQ
PQ
App
request
pull REST
elephant
server
hazelcast
elephant
server
elephant
server
service
discovery x
x

PQ
PQ
App
request
pull REST
elephant
server
hazelcast
elephant
server
elephant
server
service
discovery
package
distribution
S3
x
x

PQ
PQ
App
request
pull REST
elephant
server
hazelcast
elephant
server
elephant
server
request
x
x
fetch submit
service
discovery
package
distribution
S3

NO DEGRADATION
• Validation in
• Parameter difference
• Query result difference
• Performance deterioration
• Automatic testing and persistent result tables

PQ
PQ
App
request
pull REST
elephant
server
S3
1. upload param  
and conﬁgurations

PQ
PQ
App
request
pull REST
elephant
server
S3
1. upload param  
and conﬁgurations
x
submit
v1

PQ
PQ
App
request
pull REST
elephant
server
S3
1. upload param  
and conﬁgurations
2. upload query result
Plazma
x
submit
v1
3. send metrics

PQ
PQ
App
request
pull REST
elephant
server
S3
1. upload param  
and conﬁgurations
Plazma
x
submit
v1
3. send metrics
S3 Plazma
x
v2

elephant
server
S3
1. upload param  
and conﬁgurations
Plazma
x
submit
v1
3. send metrics
S3 Plazma
x
v2
Veriﬁcation between  
persistent result set
PQ
PQ
App
request
pull REST

Resource management
• Deﬁne 1 resource per 1 account
• Workload type of an account varies
• Batch, Adhoc, BI tool…
• Require high level resource management  
across clusters
• An account can have multiple resource pools
• For service and internal purpose

request
queue1
queue2
cluster1
cluster2
cluster1
cluster2
Hadoop
queue A
Hadoop
queue B
Hadoop
queue A
Hadoop
queue B
Hadoop
queue A
Hadoop
queue B
Hadoop
queue A
Hadoop
queue B

request
queue1
queue2
cluster1
cluster2
cluster1
cluster2
Hadoop
queue A
Hadoop
queue B
Hadoop
queue A
Hadoop
queue B
Hadoop
queue A
Hadoop
queue B
Hadoop
queue A
Hadoop
queue B
Enables us to deﬁne
which resource the request can use

PQ
PQ
App
request
REST
elephant
server
x

PQ
PQ
App
request
REST
elephant
server
PQ
PQ
x
1. multiple job queue

PQ
PQ
App
request
REST
elephant
server
x
x
PQ
PQ
1. multiple job queue 2. multiple Hadoop cluster

PQ
PQ
App
request
REST
elephant
server
x
q1
q2
q3
x
PQ
PQ
q1
q2
q3
1. multiple job queue 2. multiple Hadoop cluster
3. multiple Hadoop queue

Brieﬂy
performance comparison

130GB+ 70B+ records
Elapsedtime(sec)
0
200
400
600
800
COUNT
Hive 1.x + MapReduce
Hive 2.x + Tez + Vectorization

130GB+ 70B+ records
Elapsedtime(sec)
0
250
500
750
1000
GROUP BY

130GB+ 70B+ records
Elapsedtime(sec)
0
275
550
825
1100
JOIN

Recap
• Hadoop architecture in Treasure Data  
for Hive 2.0 and beyond
• Resource management for multi tenancy

Managing multi tenant resource toward Hive 2.0

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Destacado

Destacado (20)

Similar a Managing multi tenant resource toward Hive 2.0

Similar a Managing multi tenant resource toward Hive 2.0 (20)

Más de Kai Sasaki

Más de Kai Sasaki (20)

Último

Último (20)

Managing multi tenant resource toward Hive 2.0