SlideShare a Scribd company logo
1 of 26
Building service testbeds on FIRE




BonFIRE: A Multi-cloud Test Facility for
 Internet of Services Experimentation

            Ally Hume, EPCC, University of Edinburgh



             Tridentcom 2012
BonFIRE Project



    The BonFIRE project is designing, building and operating
        a multi-cloud facility to support research across
      applications, services and systems targeting services
            research community on Future Internet.




2                         Tridentcom 2012              BonFIRE 2
Three Scenarios


    •   Multi-site clouds connected through normal internet
    •   Cloud scenario with emulated network
    •   Extended Cloud scenario with controlled network (implies
        federation with network facility)




3                           Tridentcom 2012                BonFIRE 3
BonFIRE scenarios and sites
         EPCC (Edinburgh)                        Scenario 3
                                            GEANT AutoBAHN and
                                               SFA, Federica

                                                                                        PSNC (Poznan)

   Scenario 1
                                                                                    Scenario 2
(normal internet)
                                                                               Emulab (Virtual Wall)
       HP (Bristol)



                                                                                IBBT (Ghent)




                                                                     HLRS (Stuttgart)

         INRIA (Rennes)

Permanent (>450cores / 45TB) & On-Request (+3000 cores) infrastructures
Note: network links not representative
   4                                      Tridentcom 2012                                BonFIRE 4
Example experiment categories

    1. Service applications experiments (non-cloud)
       – Including distributed peer to peer applications
    2. Cloud (or multi-cloud) applications
       – e.g. cloud busting scenarios with private and public clouds.
    3. Cloud infrastructure experiments
       – e.g. schedulers on top of BonFIRE, new live-migration strategies,
         contention minimisation
    4. Application-and-network experiments
       – Experimenters configuring network to support applications
       – Bandwidth on demand
       – Application specific routing




5                                Tridentcom 2012                        BonFIRE 5
Four pillars of BonFIRE




6    Tridentcom 2012          BonFIRE 6
Observability

    • To understand the behaviour of the system you need to
      see inside the abstractions normally provided by clouds.




7                           Tridentcom 2012                BonFIRE 7
Application, VM and Physical
                    machine metrics

        Physical Machine




    Virtual           Virtual         Virtual
    Machine           Machine         Machine




8                   Tridentcom 2012             BonFIRE 8
Infrastructure metrics




9   Tridentcom 2012           BonFIRE 9
Observing inside the cloud


      bonfire.epcc.ed.ac.uk

            OpenNebula v 2.0.1                 OpenNebula
                                                event logs
                                      3.6 TB
             Xen Hypervisor v 3.0.3


        vmhost1

      128 GB RAM,
      48 2.3 MHz cores

                                                 vmhost1
                                                  metrics

        vmhost2

      128 GB RAM,
      48 2.3 MHz cores


                                                 vmhost2
                                                  metrics




10        Tridentcom 2012                              BonFIRE 10
Experiments supported

     • Passive use of infrastructure metrics
        – Understanding of contention and its impact on:
           • Application performance
           • General VM performance
     • Active use of infrastructure metrics:
        – by a cloud scheduler to co-locate VMs to minimise contention
            • e.g. by collecting statistics of the previous usage patterns of an
              image
        – by a cloud application to react to the contention currently in the
          system




11                                Tridentcom 2012                         BonFIRE 11
Control

     • Observing is good but control is even better.




12                           Tridentcom 2012           BonFIRE 12
Exclusive physical machines and
                                               controlled placement
                                                              INRIA

                                                                       node1           node2                node3




                                                                      node 5           node6                node7


        Location: inria
        Location: inria
        Cluster: 34563
        Cluster: 34563
          Host: p40                                                   node8            node9

                            Resource                  Open
                            Manager                  Nebula
         Give me
        3 physical
                                                                                     Cluster: 34563
        machines
           from                                                                p23
                                                                                                      p40
     26/4/12 10:00 to
      28/4/12 20:00
                                       Reservation
                                         System
                          Cluster: 34563                                             p92
                               P23
                               P40
                               p92




13                                                            Tridentcom 2012                                       BonFIRE 13
Supported Experiments


     • Exclusive use of physical machine and controlled
       placement
        – Supports elimination of unwanted contention
        – Increases experiment repeatability
        – Supports the implementation of controlled contention
            • possible via common contention templates

            Physical Machine
                                                           Contention
                                                           Template 1




                                                           Contention
         VM under                                          Template 2
                         Contention   Contention
           test          generator     generator




14                                       Tridentcom 2012                BonFIRE 14
Supported Experiments

     • Exclusive use of physical machine and controlled
       placement
             – Experiments using external schedulers




                                                                                 MyExclusiveCluster

                       Application
                                                         BonFIRE           PM1         PM2            PM3
                        Scheduler
                                                         Resource
                       (External to
                                                         Manager
                         BonFIRE)


 Instance type: lite              Instance type : lite
                                    Location: inria
                                       Cluster:
                                  MyExclusiveCluster
                                      Host: PM2




15                                                       Tridentcom 2012                                BonFIRE 15
Controlled (EMULAB) networks with
                                           Virtual Wall

                                                                                  high
                                                                               bandwidth
                                                                                 server
                                                                               connection
               last mile
                 client                    The internet
              connection
                                                                        Software Network B
                                                                        Software Network B
                                                                         RouterBandwidth: 1 Gbps
                                                                              Bandwidth: 1 Gbps
                                                                         Router                    Server 1
                                                                                                          1
                                                                                   Latency: 0
                                                                                    Latency: 0
                                                                                   Loss rate: 0
                                                                                         rate: 0
                 Network C
                 Network C     Software
                                Software          Network D
                                                   Network D                    Traffic: 100Mbps
                                                                                Traffic: 100Mbps
     Client Bandwidth: 100 Mbps Router
     Client Bandwidth: 100 Mbps Router        Bandwidth :: 1Gbps
                                              Bandwidth 1Gbps
               Latency: 10ms
               Latency: 10ms                    Latency: 100ms
                                                Latency: 100ms
                Loss rate: 1%
                Loss rate: 1%                   Loss rate: 20%
                                                 Loss rate: 20%
              Traffic: 10 Mbps
              Traffic: 10 Mbps                Traffic: 200 Mbps
                                               Traffic: 200 Mbps
                                                                                  Network A
                                                                                   Network A
                                                                   Software
                                                                   Software   Bandwidth: 1 Gpbs
                                                                              Bandwidth: 1 Gpbs    Server 2
                                                                                                          2
                                                                    Router
                                                                    Router
                                                                                   Latency: 0
                                                                                   Latency:
                                                                                  Loss rate: 0
                                                                                  Loss rate: 0
                                                                               Traffic: 250Mbps
                                                                               Traffic: 250Mbps




16                                            Tridentcom 2012                                  BonFIRE 16
Supported Experiments

     •   Controlled network quality of service
          –   Testing streaming applications
          –   Testing Internet of Services distributed applications
          –   Multi-cloud applications
     •   Experiments into how bandwidth on demand services could be used by cloud
         applications




                                                      Hybrid
                                                      Cloud


                                                               Public
                                         Private               Cloud
                                         Cloud




17                                                 Tridentcom 2012      BonFIRE 17
FEDERICA offering

 •   FEDERICA is an e-Infrastructure based on virtualization of compute
     and network resources
 •   BonFIRE experimenters will be able to request slices of network
     resources and are able to:
     – Select network topology
     – Decide IP Configuration
     – Configure static routes and dynamic routing protocols (OSPF, BGP)

                                      FEDERICA      FEDERICA
                                        Server        Server




                                 FEDERICA               FEDERICA
                                   Router                 Router
         BonFIRE cloud site                                        BonFIRE cloud site
                              Controlled
                               Network
                                                        FEDERICA
                                             FEDERICA     Server
                                               Router



18                                          Tridentcom 2012                             BonFIRE 18
FEDERICA interconnection

• We will have:
     – A pair of BonFIRE sites connected
       to FEDERICA PoPs
     – Access to resources from four
       FEDERICA PoPs.


• BonFIRE implementing SFA
  adaptor to request FEDERICA
  slices.
     – NOVI is developing an SFA interface
       on top of FEDERICA Web Services.
     – GÉANT3 will deploy the NOVI SFA
       interface.
     – We are using NOVI to test the SFA
       adaptor.

19                                Tridentcom 2012   BonFIRE 19
Integration with AutoBAHN
                                              EPCC                          PSNC
 • Integrate BonFIRE with the GÉANT           BonFIRE                       BonFIRE
   Bandwidth-on-Demand interfaces             Site                          Site

   (AutoBAHN)

 • This will allows network-aware
     experiments with requirements
     for guaranteed bandwidth
         • Control
         • Future service


 • Why AutoBAHN?
     – Reference BoD system for European NRENs and GÉANT
     – Most mature solution in terms of specification, implementation and
       deployment in the multi-domain environment interconnecting some of
       the sites
     – Handles both on demand and advance requests

20                              Tridentcom 2012                      BonFIRE 20
Advanced features




21   Tridentcom 2012     BonFIRE 21
Advanced features

     • Vertical elasticity
        – Scale up or down a virtual machine
        – For some applications scale up may be better than scale out
           • video transcoding
     • Bandwidth on demand services
        – As well as control tool BoD services can also be used by network-
          aware applications
     • Application and network
        – e.g. OpenFlow for application specific routing




22                               Tridentcom 2012                        BonFIRE 22
Architecture
                          Monitoring
                          dashboard                   GUI

                                                                        Portal
                                                                                                      Identity
                                                                                              LDAP
                                                                                                       Server
                                                                                                  (Used by Portal,
                                                              API                                   Experiment
                                                                                                 Manager , Resource
                                                Experiment Manager                                 Manager and
                                                                                                     Testbeds)


           OCCI     Monitoring         Scheduling     Reservation         Accounting


                                                         ResourceManager                              Message
                                                                                              Read/
                                                                                              Write    Queue

                  OCCI         Scheduling       Reservation         Accounting


                                                                         Enactor


                                      OCCI      Scheduling      Reservation      Accounting
     SSH                                                                                                              Key
                                                                         Testbed                                                                             BonFIRE internal interface or API
                                                                                                                        Interface or API accessible by
                                                                                                                 X                                       X
   SSH            Monitoring   Monitoring
                                                                                                                        end users, user-agents or the
                                                                                                                        layer above
                                                                                                                                                             accessible only by the layer above

    SSH
 Gateway            GUI          API
                                                                                                                                                              BonFIRE internal API accessible
                            VM
  Gateway                (Monitoring                VM                     VM                                   VM
                                                                                                                        BonFIRE virtual machine          X
                                                                                                                                                              from multiple layers of the
                                                                                                                                                              architecture or user agents
                         Aggregator)                                                                                                                          running on BONFIRE VMs

                                                                                                                                     Layer of the                  Component of BonFIRE
                                                                                                                                     BonFIRE                       architecture used by multiple
                                                                                                                                     architecture                  layers




23                                                                          Tridentcom 2012                                                                  BonFIRE 23
SFA Adaptor implementation

                     • OCCI: one request per
                       each resource.
                     • Rspec: one single request
                       per experiment (including
                       all resources).
                     • Extensions:
                           – Add an Enactor DB to batch
                             the OCCI requests belonging
                             to one experiment.
                           – Add a SFAAdaptorEndPoint
                             to convert and map the OCCI
                             requests to a RSpec request.
                           – SFA dialog with the
                             Aggregate Manager (AM).


24       Tridentcom 2012                       BonFIRE 24
Summary

     • Multi-cloud test facility for services community
     • Focus on:
        –   Observability
        –   Control
        –   Advanced Features
        –   Usability
     • Supports a wide range of experimenters
     • Not targeting network experimenters but network is key
       part of experiments
     • Uses a broker architecture for federation
     • Being used by open call experiments
     • Open for general use by around October 2012



25                              Tridentcom 2012            BonFIRE 25
Acknowledgements


     Copyright © 2012, EPCC, The University of Edinburgh, on
       behalf of the BonFIRE Consortium.

     Licensed under Creative Commons “Attribution-NoDerivs”



     BonFIRE is funded by the European Union Seventh
     Framework Programme (FP7/2007-2013) under grant
        agreement number 257386.

                   www.bonfire-project.eu


26                          Tridentcom 2012               BonFIRE 26

More Related Content

What's hot

H.264 Library
H.264 LibraryH.264 Library
H.264 Library
Videoguy
 
The size and complexity of the CERN network
The size and complexity of the CERN networkThe size and complexity of the CERN network
The size and complexity of the CERN network
Suma Pria Tunggal
 
Diplomatiki word
Diplomatiki wordDiplomatiki word
Diplomatiki word
Xaris1985
 
Concurrency in Python
Concurrency in PythonConcurrency in Python
Concurrency in Python
konryd
 
Honeypot Farms using Ethernet Bridging over a TCP Connection
Honeypot Farms using Ethernet Bridging over a TCP Connection Honeypot Farms using Ethernet Bridging over a TCP Connection
Honeypot Farms using Ethernet Bridging over a TCP Connection
morisson
 
Juniper Networks MX Series 3D with Junos Trio Chipset - EANTC Report
Juniper Networks MX Series 3D with Junos Trio Chipset - EANTC ReportJuniper Networks MX Series 3D with Junos Trio Chipset - EANTC Report
Juniper Networks MX Series 3D with Junos Trio Chipset - EANTC Report
Sergii Liventsev
 
Gfarm Fs Tatebe Tip2004
Gfarm Fs Tatebe Tip2004Gfarm Fs Tatebe Tip2004
Gfarm Fs Tatebe Tip2004
xlight
 

What's hot (20)

Using Photonics to Prototype the Research Campus Infrastructure of the Future...
Using Photonics to Prototype the Research Campus Infrastructure of the Future...Using Photonics to Prototype the Research Campus Infrastructure of the Future...
Using Photonics to Prototype the Research Campus Infrastructure of the Future...
 
Lxcloud
LxcloudLxcloud
Lxcloud
 
H.264 Library
H.264 LibraryH.264 Library
H.264 Library
 
Virtual net performance
Virtual net performanceVirtual net performance
Virtual net performance
 
Tma ph d_school_2011
Tma ph d_school_2011Tma ph d_school_2011
Tma ph d_school_2011
 
Instrumenting parsecs raytrace
Instrumenting parsecs raytraceInstrumenting parsecs raytrace
Instrumenting parsecs raytrace
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
Wireless Hacking Talk
Wireless Hacking TalkWireless Hacking Talk
Wireless Hacking Talk
 
The size and complexity of the CERN network
The size and complexity of the CERN networkThe size and complexity of the CERN network
The size and complexity of the CERN network
 
Diplomatiki word
Diplomatiki wordDiplomatiki word
Diplomatiki word
 
Concurrency in Python
Concurrency in PythonConcurrency in Python
Concurrency in Python
 
Honeypot Farms using Ethernet Bridging over a TCP Connection
Honeypot Farms using Ethernet Bridging over a TCP Connection Honeypot Farms using Ethernet Bridging over a TCP Connection
Honeypot Farms using Ethernet Bridging over a TCP Connection
 
Juniper Networks MX Series 3D with Junos Trio Chipset - EANTC Report
Juniper Networks MX Series 3D with Junos Trio Chipset - EANTC ReportJuniper Networks MX Series 3D with Junos Trio Chipset - EANTC Report
Juniper Networks MX Series 3D with Junos Trio Chipset - EANTC Report
 
2011_12_4K research in PSNC
2011_12_4K research in PSNC2011_12_4K research in PSNC
2011_12_4K research in PSNC
 
A High-Performance Campus-Scale Cyberinfrastructure for Effectively Bridging ...
A High-Performance Campus-Scale Cyberinfrastructure for Effectively Bridging ...A High-Performance Campus-Scale Cyberinfrastructure for Effectively Bridging ...
A High-Performance Campus-Scale Cyberinfrastructure for Effectively Bridging ...
 
2011.jtr.pbasanta.
2011.jtr.pbasanta.2011.jtr.pbasanta.
2011.jtr.pbasanta.
 
FlexRay Product Days RTaW
FlexRay Product Days RTaWFlexRay Product Days RTaW
FlexRay Product Days RTaW
 
Gfarm Fs Tatebe Tip2004
Gfarm Fs Tatebe Tip2004Gfarm Fs Tatebe Tip2004
Gfarm Fs Tatebe Tip2004
 
LTTng-UST: Efficient System-Wide User-Space Tracing
LTTng-UST: Efficient System-Wide User-Space TracingLTTng-UST: Efficient System-Wide User-Space Tracing
LTTng-UST: Efficient System-Wide User-Space Tracing
 
ISBI MPI Tutorial
ISBI MPI TutorialISBI MPI Tutorial
ISBI MPI Tutorial
 

Viewers also liked

Viewers also liked (9)

COCOMA presentation, FIA 2013
COCOMA presentation, FIA 2013COCOMA presentation, FIA 2013
COCOMA presentation, FIA 2013
 
Virus informatico
Virus informaticoVirus informatico
Virus informatico
 
BonFIRE: features, sites and tools
BonFIRE: features, sites and toolsBonFIRE: features, sites and tools
BonFIRE: features, sites and tools
 
Dialers
DialersDialers
Dialers
 
Learn BEM: CSS Naming Convention
Learn BEM: CSS Naming ConventionLearn BEM: CSS Naming Convention
Learn BEM: CSS Naming Convention
 
How to Build a Dynamic Social Media Plan
How to Build a Dynamic Social Media PlanHow to Build a Dynamic Social Media Plan
How to Build a Dynamic Social Media Plan
 
SEO: Getting Personal
SEO: Getting PersonalSEO: Getting Personal
SEO: Getting Personal
 
Lightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika Aldaba
Lightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika AldabaLightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika Aldaba
Lightning Talk #9: How UX and Data Storytelling Can Shape Policy by Mika Aldaba
 
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job? Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
Succession “Losers”: What Happens to Executives Passed Over for the CEO Job?
 

Similar to BonFIRE TridentCom presentation

The Future of R&E networks and cyber-infrastructure
The Future of R&E networks and cyber-infrastructureThe Future of R&E networks and cyber-infrastructure
The Future of R&E networks and cyber-infrastructure
Bill St. Arnaud
 
_Solve Cloud Packet Mysteries_4
_Solve Cloud Packet Mysteries_4_Solve Cloud Packet Mysteries_4
_Solve Cloud Packet Mysteries_4
Laura Taylor
 
Qo E E2 E3 European Response To Network Neutrality Liyang Hou
Qo E E2 E3   European Response To Network Neutrality   Liyang HouQo E E2 E3   European Response To Network Neutrality   Liyang Hou
Qo E E2 E3 European Response To Network Neutrality Liyang Hou
imec.archive
 
Selfxl Project Solutions Linux Ow2
Selfxl Project Solutions Linux Ow2Selfxl Project Solutions Linux Ow2
Selfxl Project Solutions Linux Ow2
Catherine Nuel
 

Similar to BonFIRE TridentCom presentation (20)

Delay Tolerant Streaming Services, Thomas Plagemann, UiO
Delay Tolerant Streaming Services, Thomas Plagemann, UiODelay Tolerant Streaming Services, Thomas Plagemann, UiO
Delay Tolerant Streaming Services, Thomas Plagemann, UiO
 
Contrail Deep-dive - Cloud Network Services at Scale
Contrail Deep-dive - Cloud Network Services at ScaleContrail Deep-dive - Cloud Network Services at Scale
Contrail Deep-dive - Cloud Network Services at Scale
 
Neutron-to-Neutron: interconnecting multiple OpenStack deployments
Neutron-to-Neutron: interconnecting multiple OpenStack deploymentsNeutron-to-Neutron: interconnecting multiple OpenStack deployments
Neutron-to-Neutron: interconnecting multiple OpenStack deployments
 
Chep2012
Chep2012Chep2012
Chep2012
 
Sensor networks
Sensor networksSensor networks
Sensor networks
 
case study avio group
case study avio groupcase study avio group
case study avio group
 
Avio Aerospace Case Study
Avio Aerospace Case Study Avio Aerospace Case Study
Avio Aerospace Case Study
 
Design & Deployment of a Future Internet Testbed
Design & Deployment of a Future Internet Testbed Design & Deployment of a Future Internet Testbed
Design & Deployment of a Future Internet Testbed
 
The Future of R&E networks and cyber-infrastructure
The Future of R&E networks and cyber-infrastructureThe Future of R&E networks and cyber-infrastructure
The Future of R&E networks and cyber-infrastructure
 
_Solve Cloud Packet Mysteries_4
_Solve Cloud Packet Mysteries_4_Solve Cloud Packet Mysteries_4
_Solve Cloud Packet Mysteries_4
 
Inria Tech Talk : Améliorez vos applications de robotique & réalité augmentée
Inria Tech Talk : Améliorez vos applications de robotique & réalité augmentéeInria Tech Talk : Améliorez vos applications de robotique & réalité augmentée
Inria Tech Talk : Améliorez vos applications de robotique & réalité augmentée
 
Userspace networking
Userspace networkingUserspace networking
Userspace networking
 
FIBRE & beyond
FIBRE & beyondFIBRE & beyond
FIBRE & beyond
 
Generic network architecture discussion
Generic network architecture discussionGeneric network architecture discussion
Generic network architecture discussion
 
Testbed for Heterogeneous Cloud
Testbed for Heterogeneous CloudTestbed for Heterogeneous Cloud
Testbed for Heterogeneous Cloud
 
Ad hoc Networks
Ad hoc NetworksAd hoc Networks
Ad hoc Networks
 
Qo E E2 E3 European Response To Network Neutrality Liyang Hou
Qo E E2 E3   European Response To Network Neutrality   Liyang HouQo E E2 E3   European Response To Network Neutrality   Liyang Hou
Qo E E2 E3 European Response To Network Neutrality Liyang Hou
 
Prairie DevCon-What's New in Hyper-V in Windows Server "8" Beta - Part 2
Prairie DevCon-What's New in Hyper-V in Windows Server "8" Beta - Part 2Prairie DevCon-What's New in Hyper-V in Windows Server "8" Beta - Part 2
Prairie DevCon-What's New in Hyper-V in Windows Server "8" Beta - Part 2
 
Selfxl Project Solutions Linux Ow2
Selfxl Project Solutions Linux Ow2Selfxl Project Solutions Linux Ow2
Selfxl Project Solutions Linux Ow2
 
Monitoring federation open stack infrastructure
Monitoring federation open stack infrastructureMonitoring federation open stack infrastructure
Monitoring federation open stack infrastructure
 

Recently uploaded

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

BonFIRE TridentCom presentation

  • 1. Building service testbeds on FIRE BonFIRE: A Multi-cloud Test Facility for Internet of Services Experimentation Ally Hume, EPCC, University of Edinburgh Tridentcom 2012
  • 2. BonFIRE Project The BonFIRE project is designing, building and operating a multi-cloud facility to support research across applications, services and systems targeting services research community on Future Internet. 2 Tridentcom 2012 BonFIRE 2
  • 3. Three Scenarios • Multi-site clouds connected through normal internet • Cloud scenario with emulated network • Extended Cloud scenario with controlled network (implies federation with network facility) 3 Tridentcom 2012 BonFIRE 3
  • 4. BonFIRE scenarios and sites EPCC (Edinburgh) Scenario 3 GEANT AutoBAHN and SFA, Federica PSNC (Poznan) Scenario 1 Scenario 2 (normal internet) Emulab (Virtual Wall) HP (Bristol) IBBT (Ghent) HLRS (Stuttgart) INRIA (Rennes) Permanent (>450cores / 45TB) & On-Request (+3000 cores) infrastructures Note: network links not representative 4 Tridentcom 2012 BonFIRE 4
  • 5. Example experiment categories 1. Service applications experiments (non-cloud) – Including distributed peer to peer applications 2. Cloud (or multi-cloud) applications – e.g. cloud busting scenarios with private and public clouds. 3. Cloud infrastructure experiments – e.g. schedulers on top of BonFIRE, new live-migration strategies, contention minimisation 4. Application-and-network experiments – Experimenters configuring network to support applications – Bandwidth on demand – Application specific routing 5 Tridentcom 2012 BonFIRE 5
  • 6. Four pillars of BonFIRE 6 Tridentcom 2012 BonFIRE 6
  • 7. Observability • To understand the behaviour of the system you need to see inside the abstractions normally provided by clouds. 7 Tridentcom 2012 BonFIRE 7
  • 8. Application, VM and Physical machine metrics Physical Machine Virtual Virtual Virtual Machine Machine Machine 8 Tridentcom 2012 BonFIRE 8
  • 9. Infrastructure metrics 9 Tridentcom 2012 BonFIRE 9
  • 10. Observing inside the cloud bonfire.epcc.ed.ac.uk OpenNebula v 2.0.1 OpenNebula event logs 3.6 TB Xen Hypervisor v 3.0.3 vmhost1 128 GB RAM, 48 2.3 MHz cores vmhost1 metrics vmhost2 128 GB RAM, 48 2.3 MHz cores vmhost2 metrics 10 Tridentcom 2012 BonFIRE 10
  • 11. Experiments supported • Passive use of infrastructure metrics – Understanding of contention and its impact on: • Application performance • General VM performance • Active use of infrastructure metrics: – by a cloud scheduler to co-locate VMs to minimise contention • e.g. by collecting statistics of the previous usage patterns of an image – by a cloud application to react to the contention currently in the system 11 Tridentcom 2012 BonFIRE 11
  • 12. Control • Observing is good but control is even better. 12 Tridentcom 2012 BonFIRE 12
  • 13. Exclusive physical machines and controlled placement INRIA node1 node2 node3 node 5 node6 node7 Location: inria Location: inria Cluster: 34563 Cluster: 34563 Host: p40 node8 node9 Resource Open Manager Nebula Give me 3 physical Cluster: 34563 machines from p23 p40 26/4/12 10:00 to 28/4/12 20:00 Reservation System Cluster: 34563 p92 P23 P40 p92 13 Tridentcom 2012 BonFIRE 13
  • 14. Supported Experiments • Exclusive use of physical machine and controlled placement – Supports elimination of unwanted contention – Increases experiment repeatability – Supports the implementation of controlled contention • possible via common contention templates Physical Machine Contention Template 1 Contention VM under Template 2 Contention Contention test generator generator 14 Tridentcom 2012 BonFIRE 14
  • 15. Supported Experiments • Exclusive use of physical machine and controlled placement – Experiments using external schedulers MyExclusiveCluster Application BonFIRE PM1 PM2 PM3 Scheduler Resource (External to Manager BonFIRE) Instance type: lite Instance type : lite Location: inria Cluster: MyExclusiveCluster Host: PM2 15 Tridentcom 2012 BonFIRE 15
  • 16. Controlled (EMULAB) networks with Virtual Wall high bandwidth server connection last mile client The internet connection Software Network B Software Network B RouterBandwidth: 1 Gbps Bandwidth: 1 Gbps Router Server 1 1 Latency: 0 Latency: 0 Loss rate: 0 rate: 0 Network C Network C Software Software Network D Network D Traffic: 100Mbps Traffic: 100Mbps Client Bandwidth: 100 Mbps Router Client Bandwidth: 100 Mbps Router Bandwidth :: 1Gbps Bandwidth 1Gbps Latency: 10ms Latency: 10ms Latency: 100ms Latency: 100ms Loss rate: 1% Loss rate: 1% Loss rate: 20% Loss rate: 20% Traffic: 10 Mbps Traffic: 10 Mbps Traffic: 200 Mbps Traffic: 200 Mbps Network A Network A Software Software Bandwidth: 1 Gpbs Bandwidth: 1 Gpbs Server 2 2 Router Router Latency: 0 Latency: Loss rate: 0 Loss rate: 0 Traffic: 250Mbps Traffic: 250Mbps 16 Tridentcom 2012 BonFIRE 16
  • 17. Supported Experiments • Controlled network quality of service – Testing streaming applications – Testing Internet of Services distributed applications – Multi-cloud applications • Experiments into how bandwidth on demand services could be used by cloud applications Hybrid Cloud Public Private Cloud Cloud 17 Tridentcom 2012 BonFIRE 17
  • 18. FEDERICA offering • FEDERICA is an e-Infrastructure based on virtualization of compute and network resources • BonFIRE experimenters will be able to request slices of network resources and are able to: – Select network topology – Decide IP Configuration – Configure static routes and dynamic routing protocols (OSPF, BGP) FEDERICA FEDERICA Server Server FEDERICA FEDERICA Router Router BonFIRE cloud site BonFIRE cloud site Controlled Network FEDERICA FEDERICA Server Router 18 Tridentcom 2012 BonFIRE 18
  • 19. FEDERICA interconnection • We will have: – A pair of BonFIRE sites connected to FEDERICA PoPs – Access to resources from four FEDERICA PoPs. • BonFIRE implementing SFA adaptor to request FEDERICA slices. – NOVI is developing an SFA interface on top of FEDERICA Web Services. – GÉANT3 will deploy the NOVI SFA interface. – We are using NOVI to test the SFA adaptor. 19 Tridentcom 2012 BonFIRE 19
  • 20. Integration with AutoBAHN EPCC PSNC • Integrate BonFIRE with the GÉANT BonFIRE BonFIRE Bandwidth-on-Demand interfaces Site Site (AutoBAHN) • This will allows network-aware experiments with requirements for guaranteed bandwidth • Control • Future service • Why AutoBAHN? – Reference BoD system for European NRENs and GÉANT – Most mature solution in terms of specification, implementation and deployment in the multi-domain environment interconnecting some of the sites – Handles both on demand and advance requests 20 Tridentcom 2012 BonFIRE 20
  • 21. Advanced features 21 Tridentcom 2012 BonFIRE 21
  • 22. Advanced features • Vertical elasticity – Scale up or down a virtual machine – For some applications scale up may be better than scale out • video transcoding • Bandwidth on demand services – As well as control tool BoD services can also be used by network- aware applications • Application and network – e.g. OpenFlow for application specific routing 22 Tridentcom 2012 BonFIRE 22
  • 23. Architecture Monitoring dashboard GUI Portal Identity LDAP Server (Used by Portal, API Experiment Manager , Resource Experiment Manager Manager and Testbeds) OCCI Monitoring Scheduling Reservation Accounting ResourceManager Message Read/ Write Queue OCCI Scheduling Reservation Accounting Enactor OCCI Scheduling Reservation Accounting SSH Key Testbed BonFIRE internal interface or API Interface or API accessible by X X SSH Monitoring Monitoring end users, user-agents or the layer above accessible only by the layer above SSH Gateway GUI API BonFIRE internal API accessible VM Gateway (Monitoring VM VM VM BonFIRE virtual machine X from multiple layers of the architecture or user agents Aggregator) running on BONFIRE VMs Layer of the Component of BonFIRE BonFIRE architecture used by multiple architecture layers 23 Tridentcom 2012 BonFIRE 23
  • 24. SFA Adaptor implementation • OCCI: one request per each resource. • Rspec: one single request per experiment (including all resources). • Extensions: – Add an Enactor DB to batch the OCCI requests belonging to one experiment. – Add a SFAAdaptorEndPoint to convert and map the OCCI requests to a RSpec request. – SFA dialog with the Aggregate Manager (AM). 24 Tridentcom 2012 BonFIRE 24
  • 25. Summary • Multi-cloud test facility for services community • Focus on: – Observability – Control – Advanced Features – Usability • Supports a wide range of experimenters • Not targeting network experimenters but network is key part of experiments • Uses a broker architecture for federation • Being used by open call experiments • Open for general use by around October 2012 25 Tridentcom 2012 BonFIRE 25
  • 26. Acknowledgements Copyright © 2012, EPCC, The University of Edinburgh, on behalf of the BonFIRE Consortium. Licensed under Creative Commons “Attribution-NoDerivs” BonFIRE is funded by the European Union Seventh Framework Programme (FP7/2007-2013) under grant agreement number 257386. www.bonfire-project.eu 26 Tridentcom 2012 BonFIRE 26

Editor's Notes

  1. Extended cloud scenario : It consists of using four BonFIRE computing sites interconnected through today’s public internet. Cloud with a controlled experimental network scenario. This scenario consists of using nodes connected over a user-controlled emulated virtual internet. This second scenario is supported by the Virtual Wall environment enriched with Cloud capabilities. Extended cloud with complex network implications scenario. This consists of using two cloud sites interconnected through a controlled network that allows studying the implications of the network behavior on services and clouds, and experiment with new solutions and approaches. This scenario implies a vertical federation with a cloud-to-network interface. If we implement 3 rd scenario  very powerful - To allow analysing CROSS CUTTING issues – network and service level aspects – in the same test ---- Scenario 1 - Extended Cloud Federation of cloud sites using normal Internet Scenario 2 - Cloud with emulated and controlled network Network emulation environment for cross-cutting tests Scenario 3 - Federation with other external facilities Potential use of Federica network slices (network interactions / experimental internet) Connection to Open Cirrus or other cloud infrastructures ---
  2. Skip the cluster VM, do only the physical host one.