SlideShare una empresa de Scribd logo
1 de 46
Descargar para leer sin conexión
OSDC 2014 
Overlay Datacenter Information Christian Kniep 
Bull SAS 
2014-04-10
About Me 
❖ Me (>30y) 
❖ SysOps (>10y) 
❖ SysOps v1.1 (>8y) 
❖ BSc (2008-2011) 
❖ DevOps (>4y) 
❖ R&D [OpsDev?](>1y) 
2
❖ Cluster Stack 
❖ Motivation (InfiniBand use-case) 
❖ QNIB/ng 
Agenda 
❖ QNIBTerminal (virtual cluster using docker) 
3 
Cluster 
Stack 
IB QNIBng 
I. 
QNIB 
Terminal 
II. 
III.
Cluster Stack Work Environment 
4
Cluster? 
5 
„A computer cluster consists of a set of loosely connected or tightly connected computers 
that work together so that in many respects they can be viewed as a single system.“ - wikipedia.org 
User
HPC-Cluster 
6 
High Performance Computing 
❖ HPC: Surfing the bottleneck 
❖ Weakest link breaks performance
Cluster Layers 
7 
(rough estimate) 
Software: End user application 
Services: Storage, Job Scheduler, sshd 
MiddleWare: MPI, ISV-libs 
Operating System: Kernel, Userland tools 
Hardware: IMPI, lm_sensors, IB counter 
End 
User 
Excel: KPI, SLA 
Mgmt 
SysOps 
Power User/ISV 
SysOps Mgmt 
ISV Mgmt 
SysOps L2 
SysOps L1 
Events Metrics 
SysOps L3
Layern 
❖ Every Layer is composed of layers 
❖ How deep to go? 
8
Little Data w/o Connection 
❖ No way of connecting them 
❖ Connecting is manual labour 
❖ Experience driven 
❖ Niche solutions misleading 
9 
❖ Multiple data sources
IB + QNIBng Motivation 
10
Modular Switch 
11 
❖ Looks like one „switch“
Modular Switch 
12 
❖ Looks like one „switch“ 
❖ Composed of a network itself
Modular Switch 
13 
❖ Looks like one „switch“ 
❖ Composed of a network itself 
❖ Which route is taken is transparent to 
application 
❖ LB1<>FB1<>LB4
Modular Switch 
14 
❖ Looks like one „switch“ 
❖ Composed of a network itself 
❖ Which route is taken is transparent to 
application 
❖ LB1<>FB1<>LB4 
❖ LB1<>FB2<>LB4
Modular Switch 
15 
❖ Looks like one „switch“ 
❖ Composed of a network itself 
❖ Which route is taken is transparent to 
application 
❖ LB1<>FB1<>LB4 
❖ LB1<>FB2<>LB4 
❖ LB1 ->FB1 ->LB4 / LB1 <-FB2 <-LB4
❖ 96 port switch 
Debug-Nightmare 
❖ multiple autonomous job-cells 
❖ Relevant information 
❖ Job status (Resource Scheduler) 
❖ Routes (IB Subnet Manager) 
❖ IB Counter (Command Line) 
❖ changing one plug, recomputes routes :) 
16 
❖ Job seems to fail due to bad internal link
Communication Networks 
IBPM: An Open-Source-Based Framework for 
InfiniBand Performance Monitoring 
Michael Hoefling1, Michael Menth1, Christian Kniep2, Marcus Camen2 
Background: InfiniBand (IB) IBPM: Demo Overview 
Rate Measurement in IB Networks 
f State-of-the art communication technology for interconnection in 
high-performance computing data centers 
f Point-to-point bidirectional links 
f High throughput (40 Gbit/s with QDR) 
f Low latency 
f Dynamic on-line network reconfiguration 
in cooperation with 
Idea 
f Extract raw network information from IB network 
f Analyze output 
f Derive statistics about performance of the network 
Topology Extraction 
f Subnet discovery using ibnetdiscover 
f Produces human readable file of network topology 
f Process output to produce graphical representation of the 
network 
Remote Counter Readout 
f Each port has its own set of performance counters 
f Counters measure, e.g., transferred data, congestion, errors, 
link states changes 
ibsim-Based Network Simulation 
f ibsim simulates an IB network 
f Simple topology changes possible (GUI) 
f ibsim limitations 
ƒ No performance simulation possible 
ƒ No data rate changes possible 
Real IB Network 
f Physical network 
f Allows performance measurements 
f GUI controlled traffic scenarios 
17
❖ OpenSM Performance Manager 
❖ Sends token to all ports 
❖ All ports reply with metrics 
OpenSM 
❖ Callback triggered for every reply 
❖ Dumps info to file 
Sw 
18 
OpenSM 
PerfMgmt 
osmeventplugin 
Sw 
node 
node node 
node node 
node 
node 
❖ osmeventplugin
OpenSM 
OpenSM 
PerfMgmt 
qnqinbinbg 
19 
❖ qnib 
❖ sends metrics to RRDtool 
❖ events to PostgreSQL 
❖ qnibng 
❖ sends metrics to graphite 
❖ events to logstash
Graphite Events port is up/down 
20
21
22
QNIBTerminal Proof of Concept 
23
Cluster Stack Mock-Up 
❖ IB events and metrics are not enough 
❖ How to get real-world behavior? 
❖ Wanted: 
❖ Slurm (Resource Scheduler) 
❖ MPI enabled compute nodes 
❖ As much additional cluster stack as possible 
(Graphite,elasticsearch/logstash/kibana, Icinga, Cluster-FS, …) 
24
Classical Virtualization 
❖ Big overhead for simple node 
❖ Resources provisioned in advance 
❖ Host resources allocated 
25
LXC (docker) 
❖ minimal overhead ( couple of MB) 
❖ no resource pinning 
❖ cgroups option 
❖ highly automatable 
26 
NOW: Watch OSDC2014 talk ‚Docker‘ by ‚Tobias Schwab‘
Virtual Cluster Nodes 
❖ Master Node (etcd, DNS, slurmctld) 
❖ monitoring (graphite + statsd) 
❖ log mgmt (ELK) 
❖ compute nodes (slurmd) 
❖ alarming (Icinga) [not integrated] 
27 
host 
master 
monitoring 
log mgmt 
compute0 
compute1 
computeN
Master Node 
❖ takes care of inventory (etcd) 
❖ provides DNS (+PTR) 
❖ Integrate Rudder, ansible, chef,…? 
28
Non-Master Nodes (in general) 
❖ are started with master as DNS 
❖ mounting /scratch, /chome (sits on SSDs) 
❖ supervisord kicks in and starts services and setup-scripts 
❖ sending metrics to graphite 
❖ logs to logstash 
29
docker-compute 
❖ slurmd 
❖ sshd 
❖ logstash-forwarder 
❖ openmpi 
❖ qperf 
30
docker-graphite (monitoring) 
❖ full graphite stack + statsd 
❖ stresses IO (<3 SSDs) 
❖ needs more care (optimize IO) 
31
docker-elk (Log Mgmt) 
❖ elasticsearch, logstash, kibana 
❖ inputs: syslog, lumberjack 
❖ filters: none 
❖ outputs: elasticsearch 
32
It’s alive! 
33
Start Compute Node 
34
Start Compute Node 
35
Check Slurm Config 
36
Run MPI-Job 
37
TCP benchmark 
38
QNIBTerminal Future Work 
39
docker-icinga 
40 
❖ Icinga to provide 
❖ state-of-the-cluster overview 
❖ bundle with graphite/elk 
❖ no big deal… 
❖ Is this going to scale?
docker-(GlusterFS,Lustre) 
❖ Cluster scratch to integrate with 
❖ Use of kernel-modules freezes attempt 
❖ Might be pushed in VirtualBox (vagrant) 
41
❖ How is SysOps/DevOps/Mgmt 
❖ react to the changes 
❖ adopt them 
❖ feared by them 
Humans! 
42
❖ Truckload of 
❖ Events 
❖ Metrics 
❖ Interaction 
Big Data! 
43 
node01.system.memory.usage 9 
node13.system.memory.usage 14 
node35.system.memory.usage 12 
node95.system.memory.usage 11 
target=sumSeries(node{01,13,35,95}.system.memory.usage) 
job1.node01.system.memory.usage 9 
job1.node13.system.memory.usage 14 
job1.node35.system.memory.usage 12 
job1.node95.system.memory.usage 11 
target=sumSeries(job01.*.system.memory.usage)
pipework / mininet 
❖ Currently all containers are bound to docker0 bridge 
❖ Creating topology with virtual/real switches would be nice 
❖ First iteration might use pipework 
❖ More complete one should use vSwitches (mininet?) 
44
Dockerfiles 
❖ Only 3 images are fd20 based 
45
Questions? 
❖ Pictures 
❖ p2: http://de.wikipedia.org/wiki/Datei:Audi_logo.svg 
http://commons.wikimedia.org/wiki/File:Daimler_AG.svg 
http://ffb.uni-lueneburg.de/20JahreFFB/ 
❖ p4: https://www.flickr.com/photos/adeneko/4229090961 
❖ p6: cae t100 
https://www.flickr.com/photos/losalamosnatlab/7422429706 
❖ p8: http://www.brendangregg.com/Slides/SCaLE_Linux_Performance2013.pdf 
❖ p9: https://www.flickr.com/photos/riafoge/6796129047 
❖ p10: https://www.flickr.com/photos/119364768@N03/12928685224/ 
❖ p11: http://www.mellanox.com/page/products_dyn?product_family=74 
❖ p23: https://www.flickr.com/photos/jaxport/3077543062 
❖ p25/26: https://blog.trifork.com/2013/08/08/next-step-in-virtualization-docker-lightweight-containers/ 
❖ p33: https://www.flickr.com/photos/fkehren/5139094564 
❖ p39: https://www.flickr.com/photos/brizzlebornandbred/12852909293 
46

Más contenido relacionado

La actualidad más candente

Ceph RBD Update - June 2021
Ceph RBD Update - June 2021Ceph RBD Update - June 2021
Ceph RBD Update - June 2021Ceph Community
 
Ceph Month 2021: RADOS Update
Ceph Month 2021: RADOS UpdateCeph Month 2021: RADOS Update
Ceph Month 2021: RADOS UpdateCeph Community
 
OVN DBs HA with scale test
OVN DBs HA with scale testOVN DBs HA with scale test
OVN DBs HA with scale testAliasgar Ginwala
 
Comprehensive XDP Off‌load-handling the Edge Cases
Comprehensive XDP Off‌load-handling the Edge CasesComprehensive XDP Off‌load-handling the Edge Cases
Comprehensive XDP Off‌load-handling the Edge CasesNetronome
 
2014 11-05 hpcac-kniep_christian_dockermpi
2014 11-05 hpcac-kniep_christian_dockermpi2014 11-05 hpcac-kniep_christian_dockermpi
2014 11-05 hpcac-kniep_christian_dockermpiQNIB Solutions
 
Flink Forward San Francisco 2018: Steven Wu - "Scaling Flink in Cloud"
Flink Forward San Francisco 2018: Steven Wu - "Scaling Flink in Cloud" Flink Forward San Francisco 2018: Steven Wu - "Scaling Flink in Cloud"
Flink Forward San Francisco 2018: Steven Wu - "Scaling Flink in Cloud" Flink Forward
 
Debugging with-wireshark-niels-de-vos
Debugging with-wireshark-niels-de-vosDebugging with-wireshark-niels-de-vos
Debugging with-wireshark-niels-de-vosGluster.org
 
Live migration: pros, cons and gotchas -- Pavel Emelyanov
Live migration: pros, cons and gotchas -- Pavel EmelyanovLive migration: pros, cons and gotchas -- Pavel Emelyanov
Live migration: pros, cons and gotchas -- Pavel EmelyanovOpenVZ
 
Performance optimization for all flash based on aarch64 v2.0
Performance optimization for all flash based on aarch64 v2.0Performance optimization for all flash based on aarch64 v2.0
Performance optimization for all flash based on aarch64 v2.0Ceph Community
 
Marriage with docker
Marriage with dockerMarriage with docker
Marriage with dockerDušan Katona
 
Ceph Block Devices: A Deep Dive
Ceph Block Devices: A Deep DiveCeph Block Devices: A Deep Dive
Ceph Block Devices: A Deep Divejoshdurgin
 
Fast, deterministic, and verifiable computations with WebAssembly. WASM on th...
Fast, deterministic, and verifiable computations with WebAssembly. WASM on th...Fast, deterministic, and verifiable computations with WebAssembly. WASM on th...
Fast, deterministic, and verifiable computations with WebAssembly. WASM on th...Fluence Labs
 
Flink Forward SF 2017: Cliff Resnick & Seth Wiesman - From Zero to Streami...
Flink Forward SF 2017:  Cliff Resnick & Seth Wiesman -   From Zero to Streami...Flink Forward SF 2017:  Cliff Resnick & Seth Wiesman -   From Zero to Streami...
Flink Forward SF 2017: Cliff Resnick & Seth Wiesman - From Zero to Streami...Flink Forward
 
Evaluation of RBD replication options @CERN
Evaluation of RBD replication options @CERNEvaluation of RBD replication options @CERN
Evaluation of RBD replication options @CERNCeph Community
 
Data Reduction for Gluster with VDO
Data Reduction for Gluster with VDOData Reduction for Gluster with VDO
Data Reduction for Gluster with VDOGluster.org
 
CPU Optimizations in the CERN Cloud - February 2016
CPU Optimizations in the CERN Cloud - February 2016CPU Optimizations in the CERN Cloud - February 2016
CPU Optimizations in the CERN Cloud - February 2016Belmiro Moreira
 

La actualidad más candente (20)

Ceph RBD Update - June 2021
Ceph RBD Update - June 2021Ceph RBD Update - June 2021
Ceph RBD Update - June 2021
 
Ceph Month 2021: RADOS Update
Ceph Month 2021: RADOS UpdateCeph Month 2021: RADOS Update
Ceph Month 2021: RADOS Update
 
OVN DBs HA with scale test
OVN DBs HA with scale testOVN DBs HA with scale test
OVN DBs HA with scale test
 
Comprehensive XDP Off‌load-handling the Edge Cases
Comprehensive XDP Off‌load-handling the Edge CasesComprehensive XDP Off‌load-handling the Edge Cases
Comprehensive XDP Off‌load-handling the Edge Cases
 
2014 11-05 hpcac-kniep_christian_dockermpi
2014 11-05 hpcac-kniep_christian_dockermpi2014 11-05 hpcac-kniep_christian_dockermpi
2014 11-05 hpcac-kniep_christian_dockermpi
 
Flink Forward San Francisco 2018: Steven Wu - "Scaling Flink in Cloud"
Flink Forward San Francisco 2018: Steven Wu - "Scaling Flink in Cloud" Flink Forward San Francisco 2018: Steven Wu - "Scaling Flink in Cloud"
Flink Forward San Francisco 2018: Steven Wu - "Scaling Flink in Cloud"
 
Debugging with-wireshark-niels-de-vos
Debugging with-wireshark-niels-de-vosDebugging with-wireshark-niels-de-vos
Debugging with-wireshark-niels-de-vos
 
Ceph on Windows
Ceph on WindowsCeph on Windows
Ceph on Windows
 
Live migration: pros, cons and gotchas -- Pavel Emelyanov
Live migration: pros, cons and gotchas -- Pavel EmelyanovLive migration: pros, cons and gotchas -- Pavel Emelyanov
Live migration: pros, cons and gotchas -- Pavel Emelyanov
 
Performance optimization for all flash based on aarch64 v2.0
Performance optimization for all flash based on aarch64 v2.0Performance optimization for all flash based on aarch64 v2.0
Performance optimization for all flash based on aarch64 v2.0
 
Marriage with docker
Marriage with dockerMarriage with docker
Marriage with docker
 
Ceph Block Devices: A Deep Dive
Ceph Block Devices: A Deep DiveCeph Block Devices: A Deep Dive
Ceph Block Devices: A Deep Dive
 
Kraken mesoscon 2018
Kraken mesoscon 2018Kraken mesoscon 2018
Kraken mesoscon 2018
 
Fast, deterministic, and verifiable computations with WebAssembly. WASM on th...
Fast, deterministic, and verifiable computations with WebAssembly. WASM on th...Fast, deterministic, and verifiable computations with WebAssembly. WASM on th...
Fast, deterministic, and verifiable computations with WebAssembly. WASM on th...
 
Flink Forward SF 2017: Cliff Resnick & Seth Wiesman - From Zero to Streami...
Flink Forward SF 2017:  Cliff Resnick & Seth Wiesman -   From Zero to Streami...Flink Forward SF 2017:  Cliff Resnick & Seth Wiesman -   From Zero to Streami...
Flink Forward SF 2017: Cliff Resnick & Seth Wiesman - From Zero to Streami...
 
Speeding up ps and top
Speeding up ps and topSpeeding up ps and top
Speeding up ps and top
 
Qemu gluster fs
Qemu gluster fsQemu gluster fs
Qemu gluster fs
 
Evaluation of RBD replication options @CERN
Evaluation of RBD replication options @CERNEvaluation of RBD replication options @CERN
Evaluation of RBD replication options @CERN
 
Data Reduction for Gluster with VDO
Data Reduction for Gluster with VDOData Reduction for Gluster with VDO
Data Reduction for Gluster with VDO
 
CPU Optimizations in the CERN Cloud - February 2016
CPU Optimizations in the CERN Cloud - February 2016CPU Optimizations in the CERN Cloud - February 2016
CPU Optimizations in the CERN Cloud - February 2016
 

Destacado (12)

Tik 4
Tik 4Tik 4
Tik 4
 
Zummie makes it hot part two
Zummie makes it hot part twoZummie makes it hot part two
Zummie makes it hot part two
 
Zummi makes it hot part one
Zummi makes it hot part oneZummi makes it hot part one
Zummi makes it hot part one
 
TIK BAB 1
TIK BAB 1TIK BAB 1
TIK BAB 1
 
Credit sem ppt
Credit sem pptCredit sem ppt
Credit sem ppt
 
Tik 5
Tik 5Tik 5
Tik 5
 
TIK 2
TIK 2TIK 2
TIK 2
 
Reglamento paouad
Reglamento paouadReglamento paouad
Reglamento paouad
 
Docker SDN (software-defined-networking) JUG
Docker SDN (software-defined-networking) JUGDocker SDN (software-defined-networking) JUG
Docker SDN (software-defined-networking) JUG
 
A new beginning pt.3
A new beginning pt.3A new beginning pt.3
A new beginning pt.3
 
Ansible docker
Ansible dockerAnsible docker
Ansible docker
 
DockerCon SF 2015: Networking Breakout
DockerCon SF 2015: Networking BreakoutDockerCon SF 2015: Networking Breakout
DockerCon SF 2015: Networking Breakout
 

Similar a QNIBTerminal: Understand your datacenter by overlaying multiple information layers.

Kubernetes @ Squarespace (SRE Portland Meetup October 2017)
Kubernetes @ Squarespace (SRE Portland Meetup October 2017)Kubernetes @ Squarespace (SRE Portland Meetup October 2017)
Kubernetes @ Squarespace (SRE Portland Meetup October 2017)Kevin Lynch
 
SDNDS.TW Mininet
SDNDS.TW MininetSDNDS.TW Mininet
SDNDS.TW MininetNCTU
 
Flink Forward SF 2017: Stephan Ewen - Experiences running Flink at Very Large...
Flink Forward SF 2017: Stephan Ewen - Experiences running Flink at Very Large...Flink Forward SF 2017: Stephan Ewen - Experiences running Flink at Very Large...
Flink Forward SF 2017: Stephan Ewen - Experiences running Flink at Very Large...Flink Forward
 
Network Automation (Bay Area Juniper Networks Meetup)
Network Automation (Bay Area Juniper Networks Meetup)Network Automation (Bay Area Juniper Networks Meetup)
Network Automation (Bay Area Juniper Networks Meetup)Alejandro Salinas
 
Scala & Spark(1.6) in Performance Aspect for Scala Taiwan
Scala & Spark(1.6) in Performance Aspect for Scala TaiwanScala & Spark(1.6) in Performance Aspect for Scala Taiwan
Scala & Spark(1.6) in Performance Aspect for Scala TaiwanJimin Hsieh
 
Scaling the Container Dataplane
Scaling the Container Dataplane Scaling the Container Dataplane
Scaling the Container Dataplane Michelle Holley
 
OpenNebulaConf 2014 - ONE BIT to rule them all - Stefan Kooman
OpenNebulaConf 2014 - ONE BIT to rule them all - Stefan KoomanOpenNebulaConf 2014 - ONE BIT to rule them all - Stefan Kooman
OpenNebulaConf 2014 - ONE BIT to rule them all - Stefan KoomanOpenNebula Project
 
OpenNebula Conf 2014 | ONE BIT to rule them all - Stefan Kooman
OpenNebula Conf 2014 | ONE BIT to rule them all - Stefan KoomanOpenNebula Conf 2014 | ONE BIT to rule them all - Stefan Kooman
OpenNebula Conf 2014 | ONE BIT to rule them all - Stefan KoomanNETWAYS
 
What’s New in ScyllaDB Open Source 5.0
What’s New in ScyllaDB Open Source 5.0What’s New in ScyllaDB Open Source 5.0
What’s New in ScyllaDB Open Source 5.0ScyllaDB
 
Security Monitoring with eBPF
Security Monitoring with eBPFSecurity Monitoring with eBPF
Security Monitoring with eBPFAlex Maestretti
 
Stacks and Layers: Integrating P4, C, OVS and OpenStack
Stacks and Layers: Integrating P4, C, OVS and OpenStackStacks and Layers: Integrating P4, C, OVS and OpenStack
Stacks and Layers: Integrating P4, C, OVS and OpenStackOpen-NFP
 
Evergreen Sysadmin Survival Skills
Evergreen Sysadmin Survival SkillsEvergreen Sysadmin Survival Skills
Evergreen Sysadmin Survival SkillsEvergreen ILS
 
Kubernetes @ Squarespace: Kubernetes in the Datacenter
Kubernetes @ Squarespace: Kubernetes in the DatacenterKubernetes @ Squarespace: Kubernetes in the Datacenter
Kubernetes @ Squarespace: Kubernetes in the DatacenterKevin Lynch
 
How to deal second interface service discovery and load balancer in kubernetes
How to deal second interface  service discovery and load balancer  in kubernetesHow to deal second interface  service discovery and load balancer  in kubernetes
How to deal second interface service discovery and load balancer in kubernetesMeng-Ze Lee
 
Introduction to DPDK
Introduction to DPDKIntroduction to DPDK
Introduction to DPDKKernel TLV
 
Hardware accelerated switching with Linux @ SWLUG Talks May 2014
Hardware accelerated switching with Linux @ SWLUG Talks May 2014Hardware accelerated switching with Linux @ SWLUG Talks May 2014
Hardware accelerated switching with Linux @ SWLUG Talks May 2014Nat Morris
 
Network Stack in Userspace (NUSE)
Network Stack in Userspace (NUSE)Network Stack in Userspace (NUSE)
Network Stack in Userspace (NUSE)Hajime Tazaki
 
Bharath Ram Chandrasekar_Tele 6603_SDN &NFV
Bharath Ram Chandrasekar_Tele 6603_SDN &NFVBharath Ram Chandrasekar_Tele 6603_SDN &NFV
Bharath Ram Chandrasekar_Tele 6603_SDN &NFVBharath Ram Chandrasekar
 
H2O Design and Infrastructure with Matt Dowle
H2O Design and Infrastructure with Matt DowleH2O Design and Infrastructure with Matt Dowle
H2O Design and Infrastructure with Matt DowleSri Ambati
 

Similar a QNIBTerminal: Understand your datacenter by overlaying multiple information layers. (20)

Kubernetes @ Squarespace (SRE Portland Meetup October 2017)
Kubernetes @ Squarespace (SRE Portland Meetup October 2017)Kubernetes @ Squarespace (SRE Portland Meetup October 2017)
Kubernetes @ Squarespace (SRE Portland Meetup October 2017)
 
SDNDS.TW Mininet
SDNDS.TW MininetSDNDS.TW Mininet
SDNDS.TW Mininet
 
Flink Forward SF 2017: Stephan Ewen - Experiences running Flink at Very Large...
Flink Forward SF 2017: Stephan Ewen - Experiences running Flink at Very Large...Flink Forward SF 2017: Stephan Ewen - Experiences running Flink at Very Large...
Flink Forward SF 2017: Stephan Ewen - Experiences running Flink at Very Large...
 
Network Automation (Bay Area Juniper Networks Meetup)
Network Automation (Bay Area Juniper Networks Meetup)Network Automation (Bay Area Juniper Networks Meetup)
Network Automation (Bay Area Juniper Networks Meetup)
 
Scala & Spark(1.6) in Performance Aspect for Scala Taiwan
Scala & Spark(1.6) in Performance Aspect for Scala TaiwanScala & Spark(1.6) in Performance Aspect for Scala Taiwan
Scala & Spark(1.6) in Performance Aspect for Scala Taiwan
 
SDN approach.pptx
SDN approach.pptxSDN approach.pptx
SDN approach.pptx
 
Scaling the Container Dataplane
Scaling the Container Dataplane Scaling the Container Dataplane
Scaling the Container Dataplane
 
OpenNebulaConf 2014 - ONE BIT to rule them all - Stefan Kooman
OpenNebulaConf 2014 - ONE BIT to rule them all - Stefan KoomanOpenNebulaConf 2014 - ONE BIT to rule them all - Stefan Kooman
OpenNebulaConf 2014 - ONE BIT to rule them all - Stefan Kooman
 
OpenNebula Conf 2014 | ONE BIT to rule them all - Stefan Kooman
OpenNebula Conf 2014 | ONE BIT to rule them all - Stefan KoomanOpenNebula Conf 2014 | ONE BIT to rule them all - Stefan Kooman
OpenNebula Conf 2014 | ONE BIT to rule them all - Stefan Kooman
 
What’s New in ScyllaDB Open Source 5.0
What’s New in ScyllaDB Open Source 5.0What’s New in ScyllaDB Open Source 5.0
What’s New in ScyllaDB Open Source 5.0
 
Security Monitoring with eBPF
Security Monitoring with eBPFSecurity Monitoring with eBPF
Security Monitoring with eBPF
 
Stacks and Layers: Integrating P4, C, OVS and OpenStack
Stacks and Layers: Integrating P4, C, OVS and OpenStackStacks and Layers: Integrating P4, C, OVS and OpenStack
Stacks and Layers: Integrating P4, C, OVS and OpenStack
 
Evergreen Sysadmin Survival Skills
Evergreen Sysadmin Survival SkillsEvergreen Sysadmin Survival Skills
Evergreen Sysadmin Survival Skills
 
Kubernetes @ Squarespace: Kubernetes in the Datacenter
Kubernetes @ Squarespace: Kubernetes in the DatacenterKubernetes @ Squarespace: Kubernetes in the Datacenter
Kubernetes @ Squarespace: Kubernetes in the Datacenter
 
How to deal second interface service discovery and load balancer in kubernetes
How to deal second interface  service discovery and load balancer  in kubernetesHow to deal second interface  service discovery and load balancer  in kubernetes
How to deal second interface service discovery and load balancer in kubernetes
 
Introduction to DPDK
Introduction to DPDKIntroduction to DPDK
Introduction to DPDK
 
Hardware accelerated switching with Linux @ SWLUG Talks May 2014
Hardware accelerated switching with Linux @ SWLUG Talks May 2014Hardware accelerated switching with Linux @ SWLUG Talks May 2014
Hardware accelerated switching with Linux @ SWLUG Talks May 2014
 
Network Stack in Userspace (NUSE)
Network Stack in Userspace (NUSE)Network Stack in Userspace (NUSE)
Network Stack in Userspace (NUSE)
 
Bharath Ram Chandrasekar_Tele 6603_SDN &NFV
Bharath Ram Chandrasekar_Tele 6603_SDN &NFVBharath Ram Chandrasekar_Tele 6603_SDN &NFV
Bharath Ram Chandrasekar_Tele 6603_SDN &NFV
 
H2O Design and Infrastructure with Matt Dowle
H2O Design and Infrastructure with Matt DowleH2O Design and Infrastructure with Matt Dowle
H2O Design and Infrastructure with Matt Dowle
 

Último

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Último (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

QNIBTerminal: Understand your datacenter by overlaying multiple information layers.

  • 1. OSDC 2014 Overlay Datacenter Information Christian Kniep Bull SAS 2014-04-10
  • 2. About Me ❖ Me (>30y) ❖ SysOps (>10y) ❖ SysOps v1.1 (>8y) ❖ BSc (2008-2011) ❖ DevOps (>4y) ❖ R&D [OpsDev?](>1y) 2
  • 3. ❖ Cluster Stack ❖ Motivation (InfiniBand use-case) ❖ QNIB/ng Agenda ❖ QNIBTerminal (virtual cluster using docker) 3 Cluster Stack IB QNIBng I. QNIB Terminal II. III.
  • 4. Cluster Stack Work Environment 4
  • 5. Cluster? 5 „A computer cluster consists of a set of loosely connected or tightly connected computers that work together so that in many respects they can be viewed as a single system.“ - wikipedia.org User
  • 6. HPC-Cluster 6 High Performance Computing ❖ HPC: Surfing the bottleneck ❖ Weakest link breaks performance
  • 7. Cluster Layers 7 (rough estimate) Software: End user application Services: Storage, Job Scheduler, sshd MiddleWare: MPI, ISV-libs Operating System: Kernel, Userland tools Hardware: IMPI, lm_sensors, IB counter End User Excel: KPI, SLA Mgmt SysOps Power User/ISV SysOps Mgmt ISV Mgmt SysOps L2 SysOps L1 Events Metrics SysOps L3
  • 8. Layern ❖ Every Layer is composed of layers ❖ How deep to go? 8
  • 9. Little Data w/o Connection ❖ No way of connecting them ❖ Connecting is manual labour ❖ Experience driven ❖ Niche solutions misleading 9 ❖ Multiple data sources
  • 10. IB + QNIBng Motivation 10
  • 11. Modular Switch 11 ❖ Looks like one „switch“
  • 12. Modular Switch 12 ❖ Looks like one „switch“ ❖ Composed of a network itself
  • 13. Modular Switch 13 ❖ Looks like one „switch“ ❖ Composed of a network itself ❖ Which route is taken is transparent to application ❖ LB1<>FB1<>LB4
  • 14. Modular Switch 14 ❖ Looks like one „switch“ ❖ Composed of a network itself ❖ Which route is taken is transparent to application ❖ LB1<>FB1<>LB4 ❖ LB1<>FB2<>LB4
  • 15. Modular Switch 15 ❖ Looks like one „switch“ ❖ Composed of a network itself ❖ Which route is taken is transparent to application ❖ LB1<>FB1<>LB4 ❖ LB1<>FB2<>LB4 ❖ LB1 ->FB1 ->LB4 / LB1 <-FB2 <-LB4
  • 16. ❖ 96 port switch Debug-Nightmare ❖ multiple autonomous job-cells ❖ Relevant information ❖ Job status (Resource Scheduler) ❖ Routes (IB Subnet Manager) ❖ IB Counter (Command Line) ❖ changing one plug, recomputes routes :) 16 ❖ Job seems to fail due to bad internal link
  • 17. Communication Networks IBPM: An Open-Source-Based Framework for InfiniBand Performance Monitoring Michael Hoefling1, Michael Menth1, Christian Kniep2, Marcus Camen2 Background: InfiniBand (IB) IBPM: Demo Overview Rate Measurement in IB Networks f State-of-the art communication technology for interconnection in high-performance computing data centers f Point-to-point bidirectional links f High throughput (40 Gbit/s with QDR) f Low latency f Dynamic on-line network reconfiguration in cooperation with Idea f Extract raw network information from IB network f Analyze output f Derive statistics about performance of the network Topology Extraction f Subnet discovery using ibnetdiscover f Produces human readable file of network topology f Process output to produce graphical representation of the network Remote Counter Readout f Each port has its own set of performance counters f Counters measure, e.g., transferred data, congestion, errors, link states changes ibsim-Based Network Simulation f ibsim simulates an IB network f Simple topology changes possible (GUI) f ibsim limitations ƒ No performance simulation possible ƒ No data rate changes possible Real IB Network f Physical network f Allows performance measurements f GUI controlled traffic scenarios 17
  • 18. ❖ OpenSM Performance Manager ❖ Sends token to all ports ❖ All ports reply with metrics OpenSM ❖ Callback triggered for every reply ❖ Dumps info to file Sw 18 OpenSM PerfMgmt osmeventplugin Sw node node node node node node node ❖ osmeventplugin
  • 19. OpenSM OpenSM PerfMgmt qnqinbinbg 19 ❖ qnib ❖ sends metrics to RRDtool ❖ events to PostgreSQL ❖ qnibng ❖ sends metrics to graphite ❖ events to logstash
  • 20. Graphite Events port is up/down 20
  • 21. 21
  • 22. 22
  • 23. QNIBTerminal Proof of Concept 23
  • 24. Cluster Stack Mock-Up ❖ IB events and metrics are not enough ❖ How to get real-world behavior? ❖ Wanted: ❖ Slurm (Resource Scheduler) ❖ MPI enabled compute nodes ❖ As much additional cluster stack as possible (Graphite,elasticsearch/logstash/kibana, Icinga, Cluster-FS, …) 24
  • 25. Classical Virtualization ❖ Big overhead for simple node ❖ Resources provisioned in advance ❖ Host resources allocated 25
  • 26. LXC (docker) ❖ minimal overhead ( couple of MB) ❖ no resource pinning ❖ cgroups option ❖ highly automatable 26 NOW: Watch OSDC2014 talk ‚Docker‘ by ‚Tobias Schwab‘
  • 27. Virtual Cluster Nodes ❖ Master Node (etcd, DNS, slurmctld) ❖ monitoring (graphite + statsd) ❖ log mgmt (ELK) ❖ compute nodes (slurmd) ❖ alarming (Icinga) [not integrated] 27 host master monitoring log mgmt compute0 compute1 computeN
  • 28. Master Node ❖ takes care of inventory (etcd) ❖ provides DNS (+PTR) ❖ Integrate Rudder, ansible, chef,…? 28
  • 29. Non-Master Nodes (in general) ❖ are started with master as DNS ❖ mounting /scratch, /chome (sits on SSDs) ❖ supervisord kicks in and starts services and setup-scripts ❖ sending metrics to graphite ❖ logs to logstash 29
  • 30. docker-compute ❖ slurmd ❖ sshd ❖ logstash-forwarder ❖ openmpi ❖ qperf 30
  • 31. docker-graphite (monitoring) ❖ full graphite stack + statsd ❖ stresses IO (<3 SSDs) ❖ needs more care (optimize IO) 31
  • 32. docker-elk (Log Mgmt) ❖ elasticsearch, logstash, kibana ❖ inputs: syslog, lumberjack ❖ filters: none ❖ outputs: elasticsearch 32
  • 40. docker-icinga 40 ❖ Icinga to provide ❖ state-of-the-cluster overview ❖ bundle with graphite/elk ❖ no big deal… ❖ Is this going to scale?
  • 41. docker-(GlusterFS,Lustre) ❖ Cluster scratch to integrate with ❖ Use of kernel-modules freezes attempt ❖ Might be pushed in VirtualBox (vagrant) 41
  • 42. ❖ How is SysOps/DevOps/Mgmt ❖ react to the changes ❖ adopt them ❖ feared by them Humans! 42
  • 43. ❖ Truckload of ❖ Events ❖ Metrics ❖ Interaction Big Data! 43 node01.system.memory.usage 9 node13.system.memory.usage 14 node35.system.memory.usage 12 node95.system.memory.usage 11 target=sumSeries(node{01,13,35,95}.system.memory.usage) job1.node01.system.memory.usage 9 job1.node13.system.memory.usage 14 job1.node35.system.memory.usage 12 job1.node95.system.memory.usage 11 target=sumSeries(job01.*.system.memory.usage)
  • 44. pipework / mininet ❖ Currently all containers are bound to docker0 bridge ❖ Creating topology with virtual/real switches would be nice ❖ First iteration might use pipework ❖ More complete one should use vSwitches (mininet?) 44
  • 45. Dockerfiles ❖ Only 3 images are fd20 based 45
  • 46. Questions? ❖ Pictures ❖ p2: http://de.wikipedia.org/wiki/Datei:Audi_logo.svg http://commons.wikimedia.org/wiki/File:Daimler_AG.svg http://ffb.uni-lueneburg.de/20JahreFFB/ ❖ p4: https://www.flickr.com/photos/adeneko/4229090961 ❖ p6: cae t100 https://www.flickr.com/photos/losalamosnatlab/7422429706 ❖ p8: http://www.brendangregg.com/Slides/SCaLE_Linux_Performance2013.pdf ❖ p9: https://www.flickr.com/photos/riafoge/6796129047 ❖ p10: https://www.flickr.com/photos/119364768@N03/12928685224/ ❖ p11: http://www.mellanox.com/page/products_dyn?product_family=74 ❖ p23: https://www.flickr.com/photos/jaxport/3077543062 ❖ p25/26: https://blog.trifork.com/2013/08/08/next-step-in-virtualization-docker-lightweight-containers/ ❖ p33: https://www.flickr.com/photos/fkehren/5139094564 ❖ p39: https://www.flickr.com/photos/brizzlebornandbred/12852909293 46