SlideShare una empresa de Scribd logo
OpenSAF in the Cloud. Why an HA 
Middleware is still needed 
Anders Widell 
Ericsson 
Mathivanan NP 
Oracle 
opensaf.sourceforge.net
Agenda 
● The OpenSAF Project 
●High Availability and Service Availability 
●Why Application HA is necessary in the 
cloud 
●OpenSAF HA capabilities 
● Proposal to leverage OpenSAF HA with 
existing cloud solutions for unified 
availability management 
●OpenSAF roadmap
OpenSAF High Availability and the Cloud 
‘The cloud 
people are 
here’ 
They 
have 5 
Nines 
What is 
OpenSAF? 
They have 
APIs? 
Should we 
consider the 
telcos? 
We have 
99.99% 
uptime. We 
are good 
What is SA? 
Deployments 
will anyway 
have 
standbys 
SAF/OpenSAF 
Cloud
The OpenSAF project 
● Most comprehensive Service 
Availability middleware providing 
availability, manageability and 
platform services for developing 
HA available applications 
● Interface APIs in C with support 
for Java and Python bindings 
● LGPL v2.1 license 
● Implements SA Forum AIS 
specification 
● Supported by the OpenSAF 
foundation
High Availability and Service Availability 
● The probability that a service is available to 
its users at a random point in time 
● In telecom, 99.999% availability (five nines) 
is often required 
●HA and SA are essentially the same, but 
SA enables more – for example planned 
updates of hardware and software
Two Opinions about Application HA 
in the Cloud 
The cloud doesn't 
change anything 
regarding HA – it is the 
same as outside the 
cloud 
You don't need to worry 
about HA – the cloud 
will take care of that for 
you
High Availability and Service Availability
Hardware Faults 
● The cloud infrastructure can handle 
hardware faults for you – all the application 
sees is a node reboot 
●With a hot standby VM, even a reboot may 
be avoided 
● Problem with co-located VMs – we don't 
want to have active and standby app on 
the same physical node
Software Faults 
● Applications currently have no or limited 
HA support from cloud infrastructure 
●Using HA middleware, we can also get 
shorter fail-over time in the event of a 
hardware fault
The Cloud Gives You More Faults 
●Hypervisor and cloud infrastructure are 
also subject to faults 
●Hardware used in cloud may be less 
reliable (not carrier grade) 
●Geographic distribution may decrease the 
risk of total outage, at the cost of network 
latency and increased risk for split-brain
The cloud way – pets vs. cattle 
• Pets: few powerful nodes, scale-up 
• Cattle: many cheap nodes, scale-out 
• “architecting for failure” vs “architecting for 
scale”
The cloud way – Standardized 
Service Level Agreement 
Your problem was 
triggered by some other 
vendor/service inside 
the cloud 
Provide service 
throughout the 
year
OpenSAF based HA 
●OpenSAF based HA solutions are 
applicable across the availability spectrum: 
● Enterprise 
● Telecom and aerospace/defense 
● Millisecond failover
OpenSAF based HA 
Supports all 
redundancy 
configurations 
(Including no 
redundancy) 
Fault 
Management 
policies 
(Recovery and 
Repair) 
Express 
Dependencies 
between 
distributed/ 
stand alone 
software 
Code intrusive 
or Not? 
Lifecycle 
scripts and 
timeouts 
configuration, 
workload 
management 
Orchestration 
of rolling 
upgrade of the 
cluster nodes. 
Standardized 
manageability 
Monitoring 
and 
Healthcheck
OpenSAF based HA - Fault Management 
• Detection - Component Health Checks, Active/Passive Monitoring, 
api based error reporting, resource agents 
• Isolation - Node Power off 
or Resource isolation 
• Recovery - Failover of role 
assignments to standby/spare 
resources 
● Repair - Automatic restart of 
failed resource 
● Notifications – Standardized state change notifications (and 
logging)
OpenSAF HA – Key Advantages 
● Provide for Availability as a service in the 
cloud 
●Centralized/Streamlined orchestration of 
workload management (maintaining 
affinity) 
● Enable cloud software to be more carrier 
grade 
● Ease of Integration – With Both API based 
and scripts based entities (software, vm, 
agents, etc)
OpenSAF HA – Key Advantages 
● Enables reliability for stateful applications 
● Application level failure detection and 
recovery. Enables fault mitigation and 
milli second failover 
● Support for automated rolling upgrades across 
the cluster involving application and cluster 
expansion/shrinking 
● Pythonic interface for provisioning, status and 
management of HA entities. (Java mappings 
also supported)
Leveraging existing cloud solutions with 
OpenSAF
OOppeennSSAAFF aanndd VVmmwwaarree ((AA ssttuuddyy)) 
● Outage time measured with/without adding OpenSAF 
capabilities to existing VMware solutions (FT and HA) 
● Outage time measurement by running OpenSAF within and 
outside the VMs and other combinations 
● OpenSAF can detect Hardware, OS and Application failures 
● The study concluded that outage time significantly reduced 
when combining OpenSAF with existing Vmware capabilities 
Reference: Ali Nikzad's thesis: 'OpenSAF and Vmware: From the perspective of HA' 
http://spectrum.library.concordia.ca/978013/4/Nikzad_MASc_S2014.pdf
Leveraging openstack and OpenSAF 
● OpenSAF can provide HighAvailability as a service in 
openstack – Uniform, centralized, automated availability 
management across openstack 
● Openstack's flexible deployment architectures enables 
easy integration with OpenSAF for all redundancy 
configurations for any of the OpenStack infrastructure 
software (distributed and standalone) 
● Monitoring (Intrusive and Non-Intrusive) a basic 
requirement 
- With/Without Resource agents. 
● Provide for a perspective of TRY_AGAIN /TIME_OUT 
semantics
OpenSAF provides for a Unified HA 
Unified 
HA from 
OpenSAF 
Integrated HA architecture 
for compute, network, 
storage, dashboard 
Application HA 
VM HA 
Unified view 
and/of Availability 
Management 
Provides for openstack 
'availability architecture, 
hierarchy' and 'standardized 
management' (admin, log, 
notification, upgrade) 
interface
OpenSAF Roadmap 
● Enhanced cluster management 
(quorum/consensus based membership) 
● Scaling out even further 
● Feature rich CLI 
● Container - contained
& 
Thank You

Más contenido relacionado

La actualidad más candente

Infrastructure as Code 101: Steve Tegeler + Nathan Ness, VMware
Infrastructure as Code 101: Steve Tegeler + Nathan Ness, VMwareInfrastructure as Code 101: Steve Tegeler + Nathan Ness, VMware
Infrastructure as Code 101: Steve Tegeler + Nathan Ness, VMware
OpenStack
 
Making Openstack Really Easy - Why Build Open Source When You Can Buy? Danny ...
Making Openstack Really Easy - Why Build Open Source When You Can Buy? Danny ...Making Openstack Really Easy - Why Build Open Source When You Can Buy? Danny ...
Making Openstack Really Easy - Why Build Open Source When You Can Buy? Danny ...
OpenStack
 
OpenStack for devops environment
OpenStack for devops environment OpenStack for devops environment
OpenStack for devops environment
Orgad Kimchi
 
How cloud native vn fs deployed on open stack will change the telecom industry
How cloud native vn fs deployed on open stack will change the telecom industryHow cloud native vn fs deployed on open stack will change the telecom industry
How cloud native vn fs deployed on open stack will change the telecom industry
Arthur Berezin
 
Role of sdn controllers in open stack
Role of sdn controllers in open stackRole of sdn controllers in open stack
Role of sdn controllers in open stack
openstackindia
 
[OpenStack Day in Korea 2015] Track 2-2 - OpenStack for PaaS: Why it's Hot
[OpenStack Day in Korea 2015] Track 2-2 - OpenStack for PaaS: Why it's Hot[OpenStack Day in Korea 2015] Track 2-2 - OpenStack for PaaS: Why it's Hot
[OpenStack Day in Korea 2015] Track 2-2 - OpenStack for PaaS: Why it's Hot
OpenStack Korea Community
 
OpenStack at Scale Inside NetApp
OpenStack at Scale Inside NetAppOpenStack at Scale Inside NetApp
OpenStack at Scale Inside NetApp
Tesora
 
OPNFV & OpenStack
OPNFV & OpenStackOPNFV & OpenStack
OPNFV & OpenStack
openstackindia
 
Telco open stack use cases james thorne
Telco open stack use cases   james thorneTelco open stack use cases   james thorne
Telco open stack use cases james thorne
Sriram Subramanian
 
Modest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYROModest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYRO
Igor Sfiligoi
 
AppOrbit DevOps NYC
AppOrbit DevOps NYCAppOrbit DevOps NYC
AppOrbit DevOps NYC
Bob Sokol
 
Helix core on aws webinar
Helix core on aws webinar Helix core on aws webinar
Helix core on aws webinar
Perforce
 
MySQL Manchester TT - Replication Features
MySQL Manchester TT  - Replication FeaturesMySQL Manchester TT  - Replication Features
MySQL Manchester TT - Replication Features
Mark Swarbrick
 
ChinaNetCloud Online Lecture:Something About Tshark
ChinaNetCloud Online Lecture:Something About TsharkChinaNetCloud Online Lecture:Something About Tshark
ChinaNetCloud Online Lecture:Something About Tshark
ChinaNetCloud
 
Open Source, Open Architecture, Open Standards NFV MANO
Open Source, Open Architecture, Open Standards NFV MANOOpen Source, Open Architecture, Open Standards NFV MANO
Open Source, Open Architecture, Open Standards NFV MANO
Cloudify Community
 
OpsStack--Integrated Operation Platform
OpsStack--Integrated Operation PlatformOpsStack--Integrated Operation Platform
OpsStack--Integrated Operation Platform
ChinaNetCloud
 
How to deliver High Performance OpenStack Cloud: Christoph Dwertmann, Vault S...
How to deliver High Performance OpenStack Cloud: Christoph Dwertmann, Vault S...How to deliver High Performance OpenStack Cloud: Christoph Dwertmann, Vault S...
How to deliver High Performance OpenStack Cloud: Christoph Dwertmann, Vault S...
OpenStack
 
Highly available cloud_foundry
Highly available cloud_foundryHighly available cloud_foundry
Highly available cloud_foundry
Henry Sinclair
 
How Cloudify uses Chef as a Foundation for PaaS
How Cloudify uses Chef as a Foundation for PaaSHow Cloudify uses Chef as a Foundation for PaaS
How Cloudify uses Chef as a Foundation for PaaS
Nati Shalom
 
Open Connect Firmware Delivery With Spinnaker (Spinnaker Summit 2018)
Open Connect Firmware Delivery With Spinnaker (Spinnaker Summit 2018)Open Connect Firmware Delivery With Spinnaker (Spinnaker Summit 2018)
Open Connect Firmware Delivery With Spinnaker (Spinnaker Summit 2018)
Asher Feldman
 

La actualidad más candente (20)

Infrastructure as Code 101: Steve Tegeler + Nathan Ness, VMware
Infrastructure as Code 101: Steve Tegeler + Nathan Ness, VMwareInfrastructure as Code 101: Steve Tegeler + Nathan Ness, VMware
Infrastructure as Code 101: Steve Tegeler + Nathan Ness, VMware
 
Making Openstack Really Easy - Why Build Open Source When You Can Buy? Danny ...
Making Openstack Really Easy - Why Build Open Source When You Can Buy? Danny ...Making Openstack Really Easy - Why Build Open Source When You Can Buy? Danny ...
Making Openstack Really Easy - Why Build Open Source When You Can Buy? Danny ...
 
OpenStack for devops environment
OpenStack for devops environment OpenStack for devops environment
OpenStack for devops environment
 
How cloud native vn fs deployed on open stack will change the telecom industry
How cloud native vn fs deployed on open stack will change the telecom industryHow cloud native vn fs deployed on open stack will change the telecom industry
How cloud native vn fs deployed on open stack will change the telecom industry
 
Role of sdn controllers in open stack
Role of sdn controllers in open stackRole of sdn controllers in open stack
Role of sdn controllers in open stack
 
[OpenStack Day in Korea 2015] Track 2-2 - OpenStack for PaaS: Why it's Hot
[OpenStack Day in Korea 2015] Track 2-2 - OpenStack for PaaS: Why it's Hot[OpenStack Day in Korea 2015] Track 2-2 - OpenStack for PaaS: Why it's Hot
[OpenStack Day in Korea 2015] Track 2-2 - OpenStack for PaaS: Why it's Hot
 
OpenStack at Scale Inside NetApp
OpenStack at Scale Inside NetAppOpenStack at Scale Inside NetApp
OpenStack at Scale Inside NetApp
 
OPNFV & OpenStack
OPNFV & OpenStackOPNFV & OpenStack
OPNFV & OpenStack
 
Telco open stack use cases james thorne
Telco open stack use cases   james thorneTelco open stack use cases   james thorne
Telco open stack use cases james thorne
 
Modest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYROModest scale HPC on Azure using CGYRO
Modest scale HPC on Azure using CGYRO
 
AppOrbit DevOps NYC
AppOrbit DevOps NYCAppOrbit DevOps NYC
AppOrbit DevOps NYC
 
Helix core on aws webinar
Helix core on aws webinar Helix core on aws webinar
Helix core on aws webinar
 
MySQL Manchester TT - Replication Features
MySQL Manchester TT  - Replication FeaturesMySQL Manchester TT  - Replication Features
MySQL Manchester TT - Replication Features
 
ChinaNetCloud Online Lecture:Something About Tshark
ChinaNetCloud Online Lecture:Something About TsharkChinaNetCloud Online Lecture:Something About Tshark
ChinaNetCloud Online Lecture:Something About Tshark
 
Open Source, Open Architecture, Open Standards NFV MANO
Open Source, Open Architecture, Open Standards NFV MANOOpen Source, Open Architecture, Open Standards NFV MANO
Open Source, Open Architecture, Open Standards NFV MANO
 
OpsStack--Integrated Operation Platform
OpsStack--Integrated Operation PlatformOpsStack--Integrated Operation Platform
OpsStack--Integrated Operation Platform
 
How to deliver High Performance OpenStack Cloud: Christoph Dwertmann, Vault S...
How to deliver High Performance OpenStack Cloud: Christoph Dwertmann, Vault S...How to deliver High Performance OpenStack Cloud: Christoph Dwertmann, Vault S...
How to deliver High Performance OpenStack Cloud: Christoph Dwertmann, Vault S...
 
Highly available cloud_foundry
Highly available cloud_foundryHighly available cloud_foundry
Highly available cloud_foundry
 
How Cloudify uses Chef as a Foundation for PaaS
How Cloudify uses Chef as a Foundation for PaaSHow Cloudify uses Chef as a Foundation for PaaS
How Cloudify uses Chef as a Foundation for PaaS
 
Open Connect Firmware Delivery With Spinnaker (Spinnaker Summit 2018)
Open Connect Firmware Delivery With Spinnaker (Spinnaker Summit 2018)Open Connect Firmware Delivery With Spinnaker (Spinnaker Summit 2018)
Open Connect Firmware Delivery With Spinnaker (Spinnaker Summit 2018)
 

Similar a OpenSAF in the cloud: Why an HA middleware is still needed

Kubernetes Connectivity to Cloud Native Kafka | Christina Lin and Evan Shorti...
Kubernetes Connectivity to Cloud Native Kafka | Christina Lin and Evan Shorti...Kubernetes Connectivity to Cloud Native Kafka | Christina Lin and Evan Shorti...
Kubernetes Connectivity to Cloud Native Kafka | Christina Lin and Evan Shorti...
HostedbyConfluent
 
Mule soft cloudhub deployment basics
Mule soft cloudhub deployment basicsMule soft cloudhub deployment basics
Mule soft cloudhub deployment basics
Arun Yaligar
 
The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...
The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...
The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...
jeckels
 
Red hat NFV Roadmap - OpenStack Summit 2016/Red Hat NFV Mini Summit
Red hat NFV Roadmap    - OpenStack Summit 2016/Red Hat NFV Mini SummitRed hat NFV Roadmap    - OpenStack Summit 2016/Red Hat NFV Mini Summit
Red hat NFV Roadmap - OpenStack Summit 2016/Red Hat NFV Mini Summit
kimw001
 
OpenStack Telco Cloud Challenges, David Fick, Oracle
OpenStack Telco Cloud Challenges, David Fick, OracleOpenStack Telco Cloud Challenges, David Fick, Oracle
OpenStack Telco Cloud Challenges, David Fick, Oracle
Sriram Subramanian
 
OpenShift In a Nutshell - Episode 03 - Infrastructure part I
OpenShift In a Nutshell - Episode 03 - Infrastructure part IOpenShift In a Nutshell - Episode 03 - Infrastructure part I
OpenShift In a Nutshell - Episode 03 - Infrastructure part I
Behnam Loghmani
 
Making workload nomadic when accelerated
Making workload nomadic when acceleratedMaking workload nomadic when accelerated
Making workload nomadic when accelerated
Zhipeng Huang
 
DevOps and BigData Analytics
DevOps and BigData Analytics DevOps and BigData Analytics
DevOps and BigData Analytics
sbbabu
 
Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...
Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...
Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...
Nagios
 
Putting the M in MANO: Major new Ensemble release delivers NFV management and...
Putting the M in MANO: Major new Ensemble release delivers NFV management and...Putting the M in MANO: Major new Ensemble release delivers NFV management and...
Putting the M in MANO: Major new Ensemble release delivers NFV management and...
ADVA
 
Stratus Fault-Tolerant Cloud Infrastructure Software for NFV using OpenStack
Stratus Fault-Tolerant Cloud Infrastructure Software for NFV using OpenStackStratus Fault-Tolerant Cloud Infrastructure Software for NFV using OpenStack
Stratus Fault-Tolerant Cloud Infrastructure Software for NFV using OpenStack
Ali Kafel
 
Oracle Ravello
Oracle Ravello Oracle Ravello
Oracle Ravello
Andrey Akulov
 
Lookout on Scaling Security to 100 Million Devices
Lookout on Scaling Security to 100 Million DevicesLookout on Scaling Security to 100 Million Devices
Lookout on Scaling Security to 100 Million Devices
ScyllaDB
 
Hp serviceguard
Hp serviceguard Hp serviceguard
Hp serviceguard
dkamalraj
 
A complete Open Source cloud: Storage, Virt, IaaS, PaaS
A complete Open Source cloud: Storage, Virt, IaaS, PaaSA complete Open Source cloud: Storage, Virt, IaaS, PaaS
A complete Open Source cloud: Storage, Virt, IaaS, PaaS
Dave Neary
 
X-Tour: Hochverfuegbare Anwendungen mit Nutanix bereitstellen
X-Tour: Hochverfuegbare Anwendungen mit Nutanix bereitstellenX-Tour: Hochverfuegbare Anwendungen mit Nutanix bereitstellen
X-Tour: Hochverfuegbare Anwendungen mit Nutanix bereitstellen
NEXTtour
 
As34269277
As34269277As34269277
As34269277
IJERA Editor
 
Oracle Coherence Strategy and Roadmap (OpenWorld, September 2014)
Oracle Coherence Strategy and Roadmap (OpenWorld, September 2014)Oracle Coherence Strategy and Roadmap (OpenWorld, September 2014)
Oracle Coherence Strategy and Roadmap (OpenWorld, September 2014)
jeckels
 
Cloudjiffy vs AWS ops works
Cloudjiffy vs AWS ops worksCloudjiffy vs AWS ops works
Cloudjiffy vs AWS ops works
Sharma Aashish
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
confluent
 

Similar a OpenSAF in the cloud: Why an HA middleware is still needed (20)

Kubernetes Connectivity to Cloud Native Kafka | Christina Lin and Evan Shorti...
Kubernetes Connectivity to Cloud Native Kafka | Christina Lin and Evan Shorti...Kubernetes Connectivity to Cloud Native Kafka | Christina Lin and Evan Shorti...
Kubernetes Connectivity to Cloud Native Kafka | Christina Lin and Evan Shorti...
 
Mule soft cloudhub deployment basics
Mule soft cloudhub deployment basicsMule soft cloudhub deployment basics
Mule soft cloudhub deployment basics
 
The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...
The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...
The Power of Java and Oracle WebLogic Server in the Public Cloud (OpenWorld, ...
 
Red hat NFV Roadmap - OpenStack Summit 2016/Red Hat NFV Mini Summit
Red hat NFV Roadmap    - OpenStack Summit 2016/Red Hat NFV Mini SummitRed hat NFV Roadmap    - OpenStack Summit 2016/Red Hat NFV Mini Summit
Red hat NFV Roadmap - OpenStack Summit 2016/Red Hat NFV Mini Summit
 
OpenStack Telco Cloud Challenges, David Fick, Oracle
OpenStack Telco Cloud Challenges, David Fick, OracleOpenStack Telco Cloud Challenges, David Fick, Oracle
OpenStack Telco Cloud Challenges, David Fick, Oracle
 
OpenShift In a Nutshell - Episode 03 - Infrastructure part I
OpenShift In a Nutshell - Episode 03 - Infrastructure part IOpenShift In a Nutshell - Episode 03 - Infrastructure part I
OpenShift In a Nutshell - Episode 03 - Infrastructure part I
 
Making workload nomadic when accelerated
Making workload nomadic when acceleratedMaking workload nomadic when accelerated
Making workload nomadic when accelerated
 
DevOps and BigData Analytics
DevOps and BigData Analytics DevOps and BigData Analytics
DevOps and BigData Analytics
 
Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...
Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...
Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...
 
Putting the M in MANO: Major new Ensemble release delivers NFV management and...
Putting the M in MANO: Major new Ensemble release delivers NFV management and...Putting the M in MANO: Major new Ensemble release delivers NFV management and...
Putting the M in MANO: Major new Ensemble release delivers NFV management and...
 
Stratus Fault-Tolerant Cloud Infrastructure Software for NFV using OpenStack
Stratus Fault-Tolerant Cloud Infrastructure Software for NFV using OpenStackStratus Fault-Tolerant Cloud Infrastructure Software for NFV using OpenStack
Stratus Fault-Tolerant Cloud Infrastructure Software for NFV using OpenStack
 
Oracle Ravello
Oracle Ravello Oracle Ravello
Oracle Ravello
 
Lookout on Scaling Security to 100 Million Devices
Lookout on Scaling Security to 100 Million DevicesLookout on Scaling Security to 100 Million Devices
Lookout on Scaling Security to 100 Million Devices
 
Hp serviceguard
Hp serviceguard Hp serviceguard
Hp serviceguard
 
A complete Open Source cloud: Storage, Virt, IaaS, PaaS
A complete Open Source cloud: Storage, Virt, IaaS, PaaSA complete Open Source cloud: Storage, Virt, IaaS, PaaS
A complete Open Source cloud: Storage, Virt, IaaS, PaaS
 
X-Tour: Hochverfuegbare Anwendungen mit Nutanix bereitstellen
X-Tour: Hochverfuegbare Anwendungen mit Nutanix bereitstellenX-Tour: Hochverfuegbare Anwendungen mit Nutanix bereitstellen
X-Tour: Hochverfuegbare Anwendungen mit Nutanix bereitstellen
 
As34269277
As34269277As34269277
As34269277
 
Oracle Coherence Strategy and Roadmap (OpenWorld, September 2014)
Oracle Coherence Strategy and Roadmap (OpenWorld, September 2014)Oracle Coherence Strategy and Roadmap (OpenWorld, September 2014)
Oracle Coherence Strategy and Roadmap (OpenWorld, September 2014)
 
Cloudjiffy vs AWS ops works
Cloudjiffy vs AWS ops worksCloudjiffy vs AWS ops works
Cloudjiffy vs AWS ops works
 
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
Modern Cloud-Native Streaming Platforms: Event Streaming Microservices with A...
 

Último

Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
kumardaparthi1024
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
Brandon Minnick, MBA
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
Mariano Tinti
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
MichaelKnudsen27
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Jeffrey Haguewood
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
Pixlogix Infotech
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
Jason Packer
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Malak Abu Hammad
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
Hiroshi SHIBATA
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Alpen-Adria-Universität
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 

Último (20)

Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
GenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizationsGenAI Pilot Implementation in the organizations
GenAI Pilot Implementation in the organizations
 
Choosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptxChoosing The Best AWS Service For Your Website + API.pptx
Choosing The Best AWS Service For Your Website + API.pptx
 
Mariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceXMariano G Tinti - Decoding SpaceX
Mariano G Tinti - Decoding SpaceX
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Nordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptxNordic Marketo Engage User Group_June 13_ 2024.pptx
Nordic Marketo Engage User Group_June 13_ 2024.pptx
 
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
Salesforce Integration for Bonterra Impact Management (fka Social Solutions A...
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
Best 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERPBest 20 SEO Techniques To Improve Website Visibility In SERP
Best 20 SEO Techniques To Improve Website Visibility In SERP
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024Columbus Data & Analytics Wednesdays - June 2024
Columbus Data & Analytics Wednesdays - June 2024
 
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfUnlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdf
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
Introduction of Cybersecurity with OSS at Code Europe 2024
Introduction of Cybersecurity with OSS  at Code Europe 2024Introduction of Cybersecurity with OSS  at Code Europe 2024
Introduction of Cybersecurity with OSS at Code Europe 2024
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing InstancesEnergy Efficient Video Encoding for Cloud and Edge Computing Instances
Energy Efficient Video Encoding for Cloud and Edge Computing Instances
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 

OpenSAF in the cloud: Why an HA middleware is still needed

  • 1. OpenSAF in the Cloud. Why an HA Middleware is still needed Anders Widell Ericsson Mathivanan NP Oracle opensaf.sourceforge.net
  • 2. Agenda ● The OpenSAF Project ●High Availability and Service Availability ●Why Application HA is necessary in the cloud ●OpenSAF HA capabilities ● Proposal to leverage OpenSAF HA with existing cloud solutions for unified availability management ●OpenSAF roadmap
  • 3. OpenSAF High Availability and the Cloud ‘The cloud people are here’ They have 5 Nines What is OpenSAF? They have APIs? Should we consider the telcos? We have 99.99% uptime. We are good What is SA? Deployments will anyway have standbys SAF/OpenSAF Cloud
  • 4. The OpenSAF project ● Most comprehensive Service Availability middleware providing availability, manageability and platform services for developing HA available applications ● Interface APIs in C with support for Java and Python bindings ● LGPL v2.1 license ● Implements SA Forum AIS specification ● Supported by the OpenSAF foundation
  • 5. High Availability and Service Availability ● The probability that a service is available to its users at a random point in time ● In telecom, 99.999% availability (five nines) is often required ●HA and SA are essentially the same, but SA enables more – for example planned updates of hardware and software
  • 6. Two Opinions about Application HA in the Cloud The cloud doesn't change anything regarding HA – it is the same as outside the cloud You don't need to worry about HA – the cloud will take care of that for you
  • 7. High Availability and Service Availability
  • 8. Hardware Faults ● The cloud infrastructure can handle hardware faults for you – all the application sees is a node reboot ●With a hot standby VM, even a reboot may be avoided ● Problem with co-located VMs – we don't want to have active and standby app on the same physical node
  • 9. Software Faults ● Applications currently have no or limited HA support from cloud infrastructure ●Using HA middleware, we can also get shorter fail-over time in the event of a hardware fault
  • 10. The Cloud Gives You More Faults ●Hypervisor and cloud infrastructure are also subject to faults ●Hardware used in cloud may be less reliable (not carrier grade) ●Geographic distribution may decrease the risk of total outage, at the cost of network latency and increased risk for split-brain
  • 11. The cloud way – pets vs. cattle • Pets: few powerful nodes, scale-up • Cattle: many cheap nodes, scale-out • “architecting for failure” vs “architecting for scale”
  • 12. The cloud way – Standardized Service Level Agreement Your problem was triggered by some other vendor/service inside the cloud Provide service throughout the year
  • 13. OpenSAF based HA ●OpenSAF based HA solutions are applicable across the availability spectrum: ● Enterprise ● Telecom and aerospace/defense ● Millisecond failover
  • 14. OpenSAF based HA Supports all redundancy configurations (Including no redundancy) Fault Management policies (Recovery and Repair) Express Dependencies between distributed/ stand alone software Code intrusive or Not? Lifecycle scripts and timeouts configuration, workload management Orchestration of rolling upgrade of the cluster nodes. Standardized manageability Monitoring and Healthcheck
  • 15. OpenSAF based HA - Fault Management • Detection - Component Health Checks, Active/Passive Monitoring, api based error reporting, resource agents • Isolation - Node Power off or Resource isolation • Recovery - Failover of role assignments to standby/spare resources ● Repair - Automatic restart of failed resource ● Notifications – Standardized state change notifications (and logging)
  • 16. OpenSAF HA – Key Advantages ● Provide for Availability as a service in the cloud ●Centralized/Streamlined orchestration of workload management (maintaining affinity) ● Enable cloud software to be more carrier grade ● Ease of Integration – With Both API based and scripts based entities (software, vm, agents, etc)
  • 17. OpenSAF HA – Key Advantages ● Enables reliability for stateful applications ● Application level failure detection and recovery. Enables fault mitigation and milli second failover ● Support for automated rolling upgrades across the cluster involving application and cluster expansion/shrinking ● Pythonic interface for provisioning, status and management of HA entities. (Java mappings also supported)
  • 18. Leveraging existing cloud solutions with OpenSAF
  • 19. OOppeennSSAAFF aanndd VVmmwwaarree ((AA ssttuuddyy)) ● Outage time measured with/without adding OpenSAF capabilities to existing VMware solutions (FT and HA) ● Outage time measurement by running OpenSAF within and outside the VMs and other combinations ● OpenSAF can detect Hardware, OS and Application failures ● The study concluded that outage time significantly reduced when combining OpenSAF with existing Vmware capabilities Reference: Ali Nikzad's thesis: 'OpenSAF and Vmware: From the perspective of HA' http://spectrum.library.concordia.ca/978013/4/Nikzad_MASc_S2014.pdf
  • 20. Leveraging openstack and OpenSAF ● OpenSAF can provide HighAvailability as a service in openstack – Uniform, centralized, automated availability management across openstack ● Openstack's flexible deployment architectures enables easy integration with OpenSAF for all redundancy configurations for any of the OpenStack infrastructure software (distributed and standalone) ● Monitoring (Intrusive and Non-Intrusive) a basic requirement - With/Without Resource agents. ● Provide for a perspective of TRY_AGAIN /TIME_OUT semantics
  • 21. OpenSAF provides for a Unified HA Unified HA from OpenSAF Integrated HA architecture for compute, network, storage, dashboard Application HA VM HA Unified view and/of Availability Management Provides for openstack 'availability architecture, hierarchy' and 'standardized management' (admin, log, notification, upgrade) interface
  • 22. OpenSAF Roadmap ● Enhanced cluster management (quorum/consensus based membership) ● Scaling out even further ● Feature rich CLI ● Container - contained