Iris: Inter-cloud Resource Integration System for Elastic Cloud Data Center
Optimizing Overlay-based Virtual Networking Through Optimistic Interrupts and Cut-through Forwarding
1. Optimizing
Overlay-‐based
Virtual
Networking
Through
Optimistic
Interrupts
and
Cut-‐through
Forwarding
Z.
Cui
et
al.,
SC2012
高野 了成
産業技術総合研究所 情報技術研究研究部門
2012年12月13日 福田研輪講@NII
2. SC2012
• 24th ACM/IEEE International Conference for High
performance computing, Networking, Storage and
Analysis
• 11⽉月10⽇日〜~16⽇日 ⽶米国ユタ州ソルトレイクシティ
• HPC関連のトップカンファレンス
– 今年年の採択率率率21% (100/472)
• TOP500、各種Awards、
Workshop、Tutorial、BoFなど
• 巨⼤大な展⽰示会場
– Intel, NVIDIA, Fusion-‐‑‒IO, etc
– SDN、Big data関連が⽬目⽴立立った
2
3. 35 Technical Sessions
• Analysis of I/O Storage • Optimizing Application Performance
• Autotuning and Search-‐‑‒Based Optimization • Resilience
• Breadth First Search • Visualization and Analysis of Massive Data Sets
• Direct Numerical Simulations • Graph Algorithms
• Checkpointing • Locality in Programming Models and Runtimes
• GPU Programming Models and Patterns • Networks
• Maximizing Performance on Multi-‐‑‒Core and • Runtime-‐‑‒Based Analysis and Optimization
Many-‐‑‒Core Architectures
• Cosmology Applications
• Cloud Computing • Fault Detection and Analysis
• Auto-‐‑‒diagnosis of Correctness and
Performance Issues
• Grid Computing
• Performance Modeling
• DRAM Power and Resiliency Management
• Big Data
• Grids/Clouds Networking • Memory Systems
• Weather and Seismic Simulations • Numerical Algorithms
• Compiler-‐‑‒Based Analysis and Optimization • Performance Optimization
• Fast Algorithm
• Massively Parallel Simulations
• Communication Optimization
• Linear Algebra Algorithms
• Optimizing I/O For Analytics
• Datacenter Technologies • New Computer Systems
3
5. なぜHPCで仮想化なのか?
• スパコンの90%はLinuxを使⽤用
• 軽量量カーネル(LWK)と⽐比較すると⾼高いオーバヘッド
– メモリ管理理
– OSノイズ
• 仮想化を⽤用いて、アプリが必要なときだけLWKを使⽤用 Department of Computer Science
– 主張「仮想化はホストOSのオーバヘッドをバイパスできる」
Our Approach
Linux
Compute Node
OS
No Hardware
UNIX HPC Application
Linux
Compute Node
OS
Yes Lightweight Application
Needs LWK? Completes
Kernel
Hardware
Linux VMM layer
Linux
Compute Node
OS
Hardware
出典:http://v3vee.org/talks/ross12.pdf
6
出典:http://en.wikipedia.org/wiki/File:Operating_systems_used_on_top_500_supercomputers.svg
5
6. Palacios VMM
• OS-‐‑‒independent, embeddable virtual machine
monitor (VMM)
– Host OS: Linux, Kitten LWK, Minix
• Minimalist interface
• Low noise
• Contiguous memory allocation User
application
• Passthrough resources and
User
resource partitioning User space application Guest OS
Kernel space
Host OS Palacios
http://www.v3vee.org/palacios/ Hardware
6
7. VNETモデル
• ユーザVMのためのL2仮想オーバレイネットワーク
– VMにlocation independenceを提供
– VMのトラフィックをconfigurable overlay networkに流流す
• 元々はVirtuosoと呼ばれるvirtual machine grid
computingを構築するためのミドルウェア向けに提案
Fast-path links amongst
Resilient Star Backbone
the VNETs hosting VMs
Userʼ’s Foreign host
LAN LAN 1
VM Host 1
1 +
Proxy IP network VNET
+
VNET
Foreign host
LAN 2 A. Sundararaj et al., “Increasing Application
Performance in Virtual Environments
VM 4 VM Through Run-‐‑‒time Inference and
Foreign host
Host 4 VM Host 3 2 Host 2
Adaptation,” HPDC05
+ 3
Foreign host + +
LAN 4 VNET VNET VNET
LAN 3
7
8. VNET/Pアーキテクチャ
VNET/P Architecture
User Space Guest OS Guest OS
Application Application
VNET/P Device Device
Co t o
Control Driver Driver
Linux Kernel Virtual NIC Virtual NIC
VNET/P Core
VNET/P
Bridge
Palacios VMM
Host Driver
Physical Network
L. Xia et al., “VNET/P: Bridging the Cloud and High
Performance Computing Through Fast Overlay
Networking,” HPDC 2012.
9
8
9. Data Path
Data Path (Packet Transmission)
(packet t
( k t transmission)
i i )
Guest
TCP/IP Device Device
stack Driver Driver
VM Exit VM Entry
Palacios vNIC
VNET/P Core: VNET/P Bridge vNIC
Exit Exit
Handler Routing/
R ti / Send
S d packetk t
Handler
encapsulation to host network
VNET/P
Ether IP TCP/UDP Ether IP UDP Ether IP TCP/UDP
hdr hdr hdr Data hdr hdr hdr hdr hdr hdr Data
Time
10
引⽤用:http://v3vee.org/talks/hpdc12-‐‑‒vnet.pdf
9