SlideShare una empresa de Scribd logo
1 de 55
Descargar para leer sin conexión
Scaling and High Performance
Storage System
Yosuke Hara - @yosukehara
A Researcher of R.I.T. and Tech Lead LeoFS
with Hiroki Matsue, LeoFS Support and Rakuten Software Engineer
LeoFS is "Unstructured Big Data Storage for the Web"
and a highly available, distributed, eventually consistent
storage system.
!
Organizations can use LeoFS to store lots of data
efficently, safely and inexpensively.
!
LeoFS was published as OSS
on July of 2012
leo-project.net/leofs
Overview
Brief Benchmark Report
Multi Data Center Replication
LeoFS Administration at Rakuten
Future Plans
“NFS” Support and more
Overview
Tokyo, Japan
The Lion of Storage Systems
HIGH Availability
HIGH Cost
Performance Ratio
HIGH
Scalability
LeoFS Non Stop
Velocity: Low Latency
Minimum Resources
Volume: Petabyte / Exabyte
Variety: Photo, Movie, Unstructured-data
3 Vs in 3 HIGHs
Metadata Object Storage
Storage Engine/Router
Monitor
GUI Console
( Erlang RPC)
LeoFS Overview
Storage
Manager
( Erlang RPC)
Gateway
( TCP/IP,SNMP )
Request from
Web Applications / Browsers
w/HTTP over REST-API / S3-API
Load Balancer
Keeping High Availability
Keeping High Performance
Easy Administration
Metadata Object Storage
Storage Engine/Router
Metadata Object Storage
Storage Engine/Router
Gateway
LeoFS Overview - Gateway
Stateless Proxy + Object Cache
REST-API / S3-API
Use Consistent Hashing
for decision of a primary node
[ Memory Cache, Disc Cache ]
StorageClusterGateway(s)Clients
HTTP Request and Response
Built in Object Cache Mechanism
Storage Cluster
Fast HTTP Server - Cowboy
API Handler
Object Cache Mechanism
Storage
Storage(StorageCluster)GatewayLeoFS Overview - Storage
Use "Consistent Hashing"
for Data Operation
in the Storage Cluster
Choosing Replica Target Node(s)
RING
2 ^ 128 (MD5)
# of replicas = 3
KEY = “bucket/leofs.key”
Hash = md5(Filename)
Secondary-1
Secondary-2
Primary Node
"P2P"
WRITE: Auto Replication
READ : Auto Repair of an Inconsistent Object with Async
Request From Gateway
LeoFS Overview - Storage
...
LeoFS Storage
Replicator
Recoverer
...
Storage Engine
StorageEngine,Metadata+ObjectStorageGateway
Storage consists of Object Storage and Metadata Storage
Includes Replicator and Recoverer for the eventual consistency
Metadata
Storage Object
Storage
LeoFS Overview - Storage - Data StructureMetadata
Storage
ObjectStorage
Robust and
High Performance
Necessary for GC
Offset Version
Time-
stampKey
<Metadata>
Checksum
for Sync
KeySize
Custom
Meta Size File Size
for retrieving an object
Footer (8B)
Checksum KeySize DataSize Offset Version
Time-
stamp
Key User-Meta Footer
Header (Metadata - Fixed length) Body (Variable Length)
User-Meta
Size
Actual
File
<Needle>
Super-block
Needle-1
Needle-2
Needle-3
<Object Container>
Needle-4
Needle-5
To Equalize Disk Usage in Every Storage Node
To Realize High I/O efficiency and High Availability
LeoFS Overview - Storage - Large Object Support
chunk-0
chunk-1
chunk-2
chunk-3
An Original Object’s Metadata
Original Object Name
Original Object Size
# of Chunks
Storage ClusterGatewayClient(s)
[ WRITE Operation ]
Chunked Objects
Every chunked object and
metadata are replicated
in the cluster
Manager
Storage Cluster
LeoFS Overview - Manager
Monitor
Operate
RING, Node State
status, suspend,
resume, detach,
whereis, ...
Gateway(s)
StorageClusterGateway(s)
Manager(s)
Operate LeoFS - Gateway and Storage Cluster
"RING Monitor" and "NodeState Monitor"
Brief
Benchmark
Report
Hokkaido, Japan
LeoFS kept in a stable performance
through the benchmark
Brief Benchmark Report
Bottleneck is Disk I/O
The cache mechanism contributed to reduce
network traffic between Gateway and Storage
Summary of the benchmark results
Brief Benchmark Report
1st Case:
Group of Value Ranges (HDD)
Storage:5, Gateway:1, Manager:2
R:W = 9:1
2nd Case:
Group of Value Ranges (HDD)
Storage:5, Gateway:1, Manager:2
R:W = 8:2
source: https://github.com/leo-project/notes/tree/master/leofs/benchmark/leofs/20140605/tests/1m_r9w1_240min
source: https://github.com/leo-project/notes/tree/master/leofs/benchmark/leofs/20140605/tests/1m_r8w2_120min
Brief Benchmark Report
CPU Intel(R) Xeon(R) CPU X5650 @ 2.67GHz * 2 (12 cores / 24 threads)
Memory 96GB
Disk HDD - 240GB RAID0
Network 10G-Ether
Server Spec - Gateway:
CPU Intel(R) Xeon(R) CPU X5650 @ 2.67GHz * 2 (12 cores / 24 threads)
Memory 96GB
Disk
HDD - 240GB RAID0 (System)
HDD - 2TB RAID0 (Data)
Network 10G-Ether
Server Spec - Storage x5:
Network 10Gbps
OS CentOS release 6.5 (Final)
Erlang OTP R16B03-1
LeoFS v1.0.2
Environment:
System Consistency Level: [ N:3, W:2, R:1, D:2 ]
Duration 4.0h
R:W 9:1
# of Concurrent Processes 64
# of Keys 100,000
Value Size
!
!
!
!
!
Benchmark Configuration:
Range (byte) Percentage
1024 10240 24%
10241 102400 30%
10241 819200 30%
819201 1572864 16%
Brief Benchmark Report - 1st Case (HDD / R:W=9:1)
source: https://github.com/leo-project/notes/tree/master/leofs/benchmark/leofs/20140601/tests/1m_r9w1_240min
50ms
Brief Benchmark Report - 1st Case (HDD / R:W=9:1)
50ms
1,500ops
No Errors
0
50,000
100,000
150,000
200,000
250,000
300,000
350,000
400,000
450,000
500,000
550,000
600,000
650,000
700,000
750,000
800,000
850,000
900,000
950,000
1,000,000
1,050,000
1,100,000
1,150,000
1,200,000
1,250,000
1,300,000
1,350,000
1,400,000
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
7500s
8000s
8500s
9000s
9500s
10000s
10500s
11000s
11500s
12000s
12500s
13000s
13500s
14000s
gateway rxbyt/s gateway txbyt/s
storage-1 rxbyt/s storage-1 txbyt/s
storage-2 rxbyt/s storage-2 txbyt/s
storage-3 rxbyt/s storage-3 txbyt/s
storage-4 rxbyt/s storage-4 txbyt/s
storage-5 rxbyt/s storage-5 txbyt/s
Brief Benchmark Report - 1st Case / Network Traffic
10.0Gbps
7.0Gbps
5.0Gbps
6.0Gbps
StorageGateway
60%
0.0
0.1
0.3
0.4
0.6
0.7
0.9
1.0
1.1
1.3
1.4
1.6
1.7
1.9
2.0
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
7500s
8000s
8500s
9000s
9500s
10000s
10500s
11000s
11500s
12000s
12500s
13000s
13500s
14000s
Memory Usage
CPU Load 5min
Brief Benchmark Report - 1st Case / Memory and CPU
1.0
0
10
20
30
40
50
60
70
80
90
100
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
7500s
8000s
8500s
9000s
9500s
10000s
10500s
11000s
11500s
12000s
12500s
13000s
13500s
14000s
gateway storage-1 storage-2 storage-3 storage-4 storage-5
Network 10Gbps
OS CentOS release 6.5 (Final)
Erlang OTP R16B03-1
LeoFS v1.0.2
Environment:
System Consistency Level: [ N:3, W:2, R:1, D:2 ]
Duration 2.0h
R:W 8:2
# of Concurrent Processes 64
# of Keys 100,000
Value Size
!
!
!
!
!
Benchmark Configuration:
Brief Benchmark Report - 2nd Case (HDD / R:W=8:2)
Range (byte) Percentage
1024 10240 24%
10241 102400 30%
10241 819200 30%
819201 1572864 16%
Brief Benchmark Report - 2nd Case (HDD / R:W=8:2)
60-70ms 80-90ms
1,000ops
No Errors
Compare 1st case
with 2nd case
0
100,000
200,000
300,000
400,000
500,000
600,000
700,000
800,000
900,000
1,000,000
1,100,000
1,200,000
1,300,000
1,400,000
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
gateway rxbyt/s gateway txbyt/s
storage-1 rxbyt/s storage-1 txbyt/s
storage-2 rxbyt/s storage-2 txbyt/s
storage-3 rxbyt/s storage-3 txbyt/s
storage-4 rxbyt/s storage-4 txbyt/s
storage-5 rxbyt/s storage-5 txbyt/s
0
100,000
200,000
300,000
400,000
500,000
600,000
700,000
800,000
900,000
1,000,000
1,100,000
1,200,000
1,300,000
1,400,000
1,500,000
1,600,000
1,700,000
1,800,000
1,900,000
2,000,000
2,100,000
2,200,000
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
6.0Gbps
Brief Benchmark Report
7.0Gbps
6.0Gbps
7.0Gbps
minus 0.7Gbps
1st Case - Network Traffic
2nd Case - Network Traffic
0.0
50.0
100.0
150.0
200.0
250.0
300.0
350.0
400.0
450.0
500.0
550.0
600.0
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
0.0
50.0
100.0
150.0
200.0
250.0
300.0
350.0
400.0
450.0
500.0
550.0
600.0
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
storage-1
storage-2
storage-3
storage-4
storage-5
100
100
Brief Benchmark Report
2nd Case - Disk util%
200
200
1st Case - Disk util%
1.8x high
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
2.0
2.2
2.4
2.6
2.8
3.0
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
Brief Benchmark Report
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
2.0
2.2
2.4
2.6
2.8
3.0
0s
500s
1000s
1500s
2000s
2500s
3000s
3500s
4000s
4500s
5000s
5500s
6000s
6500s
7000s
1.00
1.00
1.6x high
2nd Case - CPU Load 5min
1st Case - CPU Load 5min
LeoFS kept in a stable performance
through the benchmark
Brief Benchmark Report
Bottleneck is Disk I/O
The cache mechanism contributed to reduce
network traffic between Gateway and Storage
Conclusion:
Multi
Data Center
Replication
Hokkaido, Japan
Tokyo
Europe
US
Multi Data Center Replication
HIGH-Scalability
HIGH-Availability
Easy Operation for Admins
+
NO SPOF
NO Performance Degradation
Singapore
1. Easy Operation to build multi clusters.
2. Asynchronous data replication between clusters
Stacked data is transferred to remote cluster(s)
3. Eventual consistency
Multi Data Center Replication
Designed it as simple as possible
DC-3DC-2
StorageclusterManagerclusterClient
DC-1
Monitors and Replicates each “RING” and “System Configuration”
"Leo Storage Platform"
[# of replicas:1] [# of replicas:1][# of replicas:3]
"join cluster DC-2 and DC-3"
leo_rpcleo_rpc
Multi Data Center Replication
Executing “Join Cluster”
on Manager Console
Preparing MDC Replication
DC-3DC-2
StorageclusterManagerclusterClient
Monitors and Replicates each “RING” and “System Configuration”
"Leo Storage Platform"
[# of replicas:1] [# of replicas:1]
Request to
the Target Region
Application(s)
DC-1
[# of replicas:3]
Temporally Stacking objects
- One container's capacity is *32MB
- When capacity is full,
send it to remote cluster(s)
* 32MB: default capacity - able to set optional value
leo_rpcleo_rpc
Multi Data Center Replication
Stacking objects
DC-3DC-2
StorageclusterManagerclusterClient
Monitors and Replicates each “RING” and “System Configuration”
"Leo Storage Platform"
DC-1
Stacked an object with a metadata
Compress it with LZ4
Replicated an object
Request to
the Target Region
Application(s)
leo_rpc
leo_rpc
leo_rpc
Multi Data Center Replication
Transferring stacked objects
Stacked objects
DC-3DC-2
StorageclusterManagerclusterClient
Monitor and Replicate each “RING” and “System Configuration”
"Leo Storage Platform"
Request to
the Target Region
Application(s)
DC-1
1) Receive metadata of stored objects
2) Compare them at the local cluster
3) Fix inconsistent objects
leo_rpcleo_rpc
leo_rpc
leo_rpc
Multi Data Center Replication
Investigating stored objects
LeoFS

Administration
at Rakuten
Presented by Hiroki Matsue
Rakuten Software Engineer
Tokyo, JapanKyoto, Japan
Storage Platform
File Sharing Service
Others
Portal Site
Photo Storage
Background Storage of OpenStack
LeoFS Administration at Rakuten
Storage Platform
Storage Platform - Scaling the Storage Platform
(Movie)
Reduce Costs
High Reliability
Easy to Scale
S3-API
Using Various Services
Total Usage: 450TB	
# of Files: 600Million	
Daily Growth: 100GB	
Daily Reqs: 13Million
Storage Platform - Scaling the Storage Platform
E-Commerce
Blog
Insurance Calendar
Recruiting
Review Photo
share
Portal &
Contents
Bookmark
B
Storage
Platform
(Movie)
Monitor
GUI Console
( Erlang RPC)
( Erlang RPC)
( TCP/IP,SNMP )
Gatewayx4Storagex14
Manager x 2
Requests from
Web Applications / Browsers
w/HTTP over S3-API
Load Balancer / Cache Servers
Storage Platform - System Layout
Total disk space: 600TB
Number of Files: 600Million
Access Stats:
800Mbps (MAX)
400Mbps (AVG)
Monitor
GUI Console
( Erlang RPC)
( Erlang RPC)
( TCP/IP,SNMP )
Gatewayx4Storagex14
Manager x 2
Storage Platform - Monitor
Send Mail Alert
Ganglia and
Nagios Agent
Status Collection (Ganglia)
Status Check (Nagios)
Port + Threshold Check
Storage Platform - Spreading Globally
Covering All Services
with Multi DC Replication
File Sharing Service
+
https://owncloud.com/
+
File Sharing Service - Required Targets
Reduce Costs
Handle Confidential Files
Store Large Files
Scale Easily
+
Share Docs and Movies with Group Companies
Over 20 Companies, Over 10 Countries
Over 4,000 Users, Over 10,000 Teams
File Sharing Service - Usage
LDAP
Monitor
GUI Console
( Erlang RPC)
( Erlang RPC)
( TCP/IP,SNMP )
Manager x 2
Authenticate Users
Manage
Configurations
Manage
Login Session
(KVS)
File Sharing Service - System Layout
Web GUI File Browser
Cover 25 Countries/Regions
Over 20,000 Users
+
File Sharing Service - Future Plans
Empowering the Services and the Users
Through the Cloud Storage
Future Plans
Tokyo, JapanHokkaido, Japan
NFS Support
Future Plans
Data-HUB: Centralize unstructured data in LeoFS
Search / Analysis
PaaS / IaaS Photo-Storage
Many Kind of Data PhotoLog / Event Data
Loading Data
Analysis Data
Stream Processing
SavannaDB for Statistics Data
Retrieve
m
etrics
and
stats
from
SavannaDB's
Agents
Storage Cluster
ManagerGateway
The Lion of Storage Systems
REST-API (JSON)
Operate LeoFS
Notify
a
m
essage
of over #
of req
threshold
SavannaDB's Agent
Insight LeoFS
LeoInsight
Future Plans
+
Set Sail for “Cloud Storage”
Website: leo-project.net
Twitter: @LeoFastStorage
Facebook: www.facebook.com/org.leofs

Más contenido relacionado

La actualidad más candente

Service Function Chaining in Openstack Neutron
Service Function Chaining in Openstack NeutronService Function Chaining in Openstack Neutron
Service Function Chaining in Openstack Neutron
Michelle Holley
 

La actualidad más candente (20)

Ceph and RocksDB
Ceph and RocksDBCeph and RocksDB
Ceph and RocksDB
 
Ceph Performance and Sizing Guide
Ceph Performance and Sizing GuideCeph Performance and Sizing Guide
Ceph Performance and Sizing Guide
 
Introducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes OperatorIntroducing the Apache Flink Kubernetes Operator
Introducing the Apache Flink Kubernetes Operator
 
End-to-end Streaming Between gRPC Services Via Kafka with John Fallows
End-to-end Streaming Between gRPC Services Via Kafka with John FallowsEnd-to-end Streaming Between gRPC Services Via Kafka with John Fallows
End-to-end Streaming Between gRPC Services Via Kafka with John Fallows
 
Intro to open source observability with grafana, prometheus, loki, and tempo(...
Intro to open source observability with grafana, prometheus, loki, and tempo(...Intro to open source observability with grafana, prometheus, loki, and tempo(...
Intro to open source observability with grafana, prometheus, loki, and tempo(...
 
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and PinotExactly-Once Financial Data Processing at Scale with Flink and Pinot
Exactly-Once Financial Data Processing at Scale with Flink and Pinot
 
Apache NiFi Record Processing
Apache NiFi Record ProcessingApache NiFi Record Processing
Apache NiFi Record Processing
 
cLoki: Like Loki but for ClickHouse
cLoki: Like Loki but for ClickHousecLoki: Like Loki but for ClickHouse
cLoki: Like Loki but for ClickHouse
 
Apache Kafka Best Practices
Apache Kafka Best PracticesApache Kafka Best Practices
Apache Kafka Best Practices
 
Tutorial: Using GoBGP as an IXP connecting router
Tutorial: Using GoBGP as an IXP connecting routerTutorial: Using GoBGP as an IXP connecting router
Tutorial: Using GoBGP as an IXP connecting router
 
OpenNebulaConf2018 - Scalable L2 overlay networks with routed VXLAN / BGP EVP...
OpenNebulaConf2018 - Scalable L2 overlay networks with routed VXLAN / BGP EVP...OpenNebulaConf2018 - Scalable L2 overlay networks with routed VXLAN / BGP EVP...
OpenNebulaConf2018 - Scalable L2 overlay networks with routed VXLAN / BGP EVP...
 
Ceph issue 해결 사례
Ceph issue 해결 사례Ceph issue 해결 사례
Ceph issue 해결 사례
 
Observability in Java: Getting Started with OpenTelemetry
Observability in Java: Getting Started with OpenTelemetryObservability in Java: Getting Started with OpenTelemetry
Observability in Java: Getting Started with OpenTelemetry
 
Service Function Chaining in Openstack Neutron
Service Function Chaining in Openstack NeutronService Function Chaining in Openstack Neutron
Service Function Chaining in Openstack Neutron
 
QEMU Disk IO Which performs Better: Native or threads?
QEMU Disk IO Which performs Better: Native or threads?QEMU Disk IO Which performs Better: Native or threads?
QEMU Disk IO Which performs Better: Native or threads?
 
Troubleshooting Kerberos in Hadoop: Taming the Beast
Troubleshooting Kerberos in Hadoop: Taming the BeastTroubleshooting Kerberos in Hadoop: Taming the Beast
Troubleshooting Kerberos in Hadoop: Taming the Beast
 
Evening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in FlinkEvening out the uneven: dealing with skew in Flink
Evening out the uneven: dealing with skew in Flink
 
Apache Pulsar First Overview
Apache PulsarFirst OverviewApache PulsarFirst Overview
Apache Pulsar First Overview
 
Seastore: Next Generation Backing Store for Ceph
Seastore: Next Generation Backing Store for CephSeastore: Next Generation Backing Store for Ceph
Seastore: Next Generation Backing Store for Ceph
 
Linux Profiling at Netflix
Linux Profiling at NetflixLinux Profiling at Netflix
Linux Profiling at Netflix
 

Destacado

Destacado (6)

[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project
[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project
[RakutenTechConf2014] [D-4] The next step of LeoFS and Introducing NewDB Project
 
Rakuten LeoFs - distributed file system
Rakuten LeoFs - distributed file systemRakuten LeoFs - distributed file system
Rakuten LeoFs - distributed file system
 
The State of Ceph, Manila, and Containers in OpenStack
The State of Ceph, Manila, and Containers in OpenStackThe State of Ceph, Manila, and Containers in OpenStack
The State of Ceph, Manila, and Containers in OpenStack
 
하둡 알아보기(Learn about Hadoop basic), NetApp FAS NFS Connector for Hadoop
하둡 알아보기(Learn about Hadoop basic), NetApp FAS NFS Connector for Hadoop하둡 알아보기(Learn about Hadoop basic), NetApp FAS NFS Connector for Hadoop
하둡 알아보기(Learn about Hadoop basic), NetApp FAS NFS Connector for Hadoop
 
Deep Dive on the AWS Storage Gateway - April 2017 AWS Online Tech Talks
Deep Dive on the AWS Storage Gateway - April 2017 AWS Online Tech TalksDeep Dive on the AWS Storage Gateway - April 2017 AWS Online Tech Talks
Deep Dive on the AWS Storage Gateway - April 2017 AWS Online Tech Talks
 
대용량 분산 아키텍쳐 설계 #3 대용량 분산 시스템 아키텍쳐
대용량 분산 아키텍쳐 설계 #3 대용량 분산 시스템 아키텍쳐대용량 분산 아키텍쳐 설계 #3 대용량 분산 시스템 아키텍쳐
대용량 분산 아키텍쳐 설계 #3 대용량 분산 시스템 아키텍쳐
 

Similar a Scaling and High Performance Storage System: LeoFS

Ceph Day Beijing - Ceph on All-Flash Storage - Breaking Performance Barriers
Ceph Day Beijing - Ceph on All-Flash Storage - Breaking Performance BarriersCeph Day Beijing - Ceph on All-Flash Storage - Breaking Performance Barriers
Ceph Day Beijing - Ceph on All-Flash Storage - Breaking Performance Barriers
Ceph Community
 

Similar a Scaling and High Performance Storage System: LeoFS (20)

RakutenTechConf2013] [D-3_1] LeoFS - Open the New Door
RakutenTechConf2013] [D-3_1] LeoFS - Open the New DoorRakutenTechConf2013] [D-3_1] LeoFS - Open the New Door
RakutenTechConf2013] [D-3_1] LeoFS - Open the New Door
 
Ceph Day Berlin: Ceph on All Flash Storage - Breaking Performance Barriers
Ceph Day Berlin: Ceph on All Flash Storage - Breaking Performance BarriersCeph Day Berlin: Ceph on All Flash Storage - Breaking Performance Barriers
Ceph Day Berlin: Ceph on All Flash Storage - Breaking Performance Barriers
 
User-space Network Processing
User-space Network ProcessingUser-space Network Processing
User-space Network Processing
 
optimizing_ceph_flash
optimizing_ceph_flashoptimizing_ceph_flash
optimizing_ceph_flash
 
Stephan Ewen - Experiences running Flink at Very Large Scale
Stephan Ewen -  Experiences running Flink at Very Large ScaleStephan Ewen -  Experiences running Flink at Very Large Scale
Stephan Ewen - Experiences running Flink at Very Large Scale
 
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in HiveLLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
 
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in HiveLLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
 
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in HiveLLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
 
Ceph Day Beijing - Ceph on All-Flash Storage - Breaking Performance Barriers
Ceph Day Beijing - Ceph on All-Flash Storage - Breaking Performance BarriersCeph Day Beijing - Ceph on All-Flash Storage - Breaking Performance Barriers
Ceph Day Beijing - Ceph on All-Flash Storage - Breaking Performance Barriers
 
Ceph
CephCeph
Ceph
 
Apache Big Data EU 2015 - HBase
Apache Big Data EU 2015 - HBaseApache Big Data EU 2015 - HBase
Apache Big Data EU 2015 - HBase
 
Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...
Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...
Технологии работы с дисковыми хранилищами и файловыми системами Windows Serve...
 
IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...
IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...
IMCSummit 2015 - Day 2 IT Business Track - 4 Myths about In-Memory Databases ...
 
Five major tips to maximize performance on a 200+ SQL HBase/Phoenix cluster
Five major tips to maximize performance on a 200+ SQL HBase/Phoenix clusterFive major tips to maximize performance on a 200+ SQL HBase/Phoenix cluster
Five major tips to maximize performance on a 200+ SQL HBase/Phoenix cluster
 
Ceph
CephCeph
Ceph
 
LLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in HiveLLAP: Sub-Second Analytical Queries in Hive
LLAP: Sub-Second Analytical Queries in Hive
 
Near Real time Indexing Kafka Messages to Apache Blur using Spark Streaming
Near Real time Indexing Kafka Messages to Apache Blur using Spark StreamingNear Real time Indexing Kafka Messages to Apache Blur using Spark Streaming
Near Real time Indexing Kafka Messages to Apache Blur using Spark Streaming
 
Plank
PlankPlank
Plank
 
CERN IT Monitoring
CERN IT Monitoring CERN IT Monitoring
CERN IT Monitoring
 
[ACNA2022] Hadoop Vectored IO_ your data just got faster!.pdf
[ACNA2022] Hadoop Vectored IO_ your data just got faster!.pdf[ACNA2022] Hadoop Vectored IO_ your data just got faster!.pdf
[ACNA2022] Hadoop Vectored IO_ your data just got faster!.pdf
 

Más de Rakuten Group, Inc.

Más de Rakuten Group, Inc. (20)

コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
コードレビュー改善のためにJenkinsとIntelliJ IDEAのプラグインを自作してみた話
 
楽天における安全な秘匿情報管理への道のり
楽天における安全な秘匿情報管理への道のり楽天における安全な秘匿情報管理への道のり
楽天における安全な秘匿情報管理への道のり
 
What Makes Software Green?
What Makes Software Green?What Makes Software Green?
What Makes Software Green?
 
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product At...
 
DataSkillCultureを浸透させる楽天の取り組み
DataSkillCultureを浸透させる楽天の取り組みDataSkillCultureを浸透させる楽天の取り組み
DataSkillCultureを浸透させる楽天の取り組み
 
大規模なリアルタイム監視の導入と展開
大規模なリアルタイム監視の導入と展開大規模なリアルタイム監視の導入と展開
大規模なリアルタイム監視の導入と展開
 
楽天における大規模データベースの運用
楽天における大規模データベースの運用楽天における大規模データベースの運用
楽天における大規模データベースの運用
 
楽天サービスを支えるネットワークインフラストラクチャー
楽天サービスを支えるネットワークインフラストラクチャー楽天サービスを支えるネットワークインフラストラクチャー
楽天サービスを支えるネットワークインフラストラクチャー
 
楽天の規模とクラウドプラットフォーム統括部の役割
楽天の規模とクラウドプラットフォーム統括部の役割楽天の規模とクラウドプラットフォーム統括部の役割
楽天の規模とクラウドプラットフォーム統括部の役割
 
Rakuten Services and Infrastructure Team.pdf
Rakuten Services and Infrastructure Team.pdfRakuten Services and Infrastructure Team.pdf
Rakuten Services and Infrastructure Team.pdf
 
The Data Platform Administration Handling the 100 PB.pdf
The Data Platform Administration Handling the 100 PB.pdfThe Data Platform Administration Handling the 100 PB.pdf
The Data Platform Administration Handling the 100 PB.pdf
 
Supporting Internal Customers as Technical Account Managers.pdf
Supporting Internal Customers as Technical Account Managers.pdfSupporting Internal Customers as Technical Account Managers.pdf
Supporting Internal Customers as Technical Account Managers.pdf
 
Making Cloud Native CI_CD Services.pdf
Making Cloud Native CI_CD Services.pdfMaking Cloud Native CI_CD Services.pdf
Making Cloud Native CI_CD Services.pdf
 
How We Defined Our Own Cloud.pdf
How We Defined Our Own Cloud.pdfHow We Defined Our Own Cloud.pdf
How We Defined Our Own Cloud.pdf
 
Travel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech infoTravel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech info
 
Travel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech infoTravel & Leisure Platform Department's tech info
Travel & Leisure Platform Department's tech info
 
OWASPTop10_Introduction
OWASPTop10_IntroductionOWASPTop10_Introduction
OWASPTop10_Introduction
 
Introduction of GORA API Group technology
Introduction of GORA API Group technologyIntroduction of GORA API Group technology
Introduction of GORA API Group technology
 
100PBを越えるデータプラットフォームの実情
100PBを越えるデータプラットフォームの実情100PBを越えるデータプラットフォームの実情
100PBを越えるデータプラットフォームの実情
 
社内エンジニアを支えるテクニカルアカウントマネージャー
社内エンジニアを支えるテクニカルアカウントマネージャー社内エンジニアを支えるテクニカルアカウントマネージャー
社内エンジニアを支えるテクニカルアカウントマネージャー
 

Último

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Último (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 

Scaling and High Performance Storage System: LeoFS