High Availability Cluster with openSUSE Leap
M. Edwin Zakaria
medwin@opensuse.org
2
• Mohammad Edwin Zakaria
• Linux user since 1998
• openSUSE user since version 6.2 (around 1999)
https://en.opensuse.org/User:Medwin
• openSUSE member
https://connect.opensuse.org/show/Medwin
• openSUSE Indonesia
8
What is
- a cluster?
- high availability?
9
Curious?
• A computer cluster consists of a set of loosely
or tightly connected computers that work
together so that, in many respects, they can be
viewed as a single system. Unlike grid
computers, computer clusters have each node
set to perform the same task, controlled and
scheduled by software.
10
Curious?
• High availability (HA) describes a system that is
designed to avoid the loss of service by
reducing or managing failures as well as
minimizing planned downtime for the system.
We expect a service to be highly available
when life, health, and well-being, including
the economic well-being of a company, depend
on it.
11
Curious?
• Harvard Research Group divides HA into
several Availability Environment Classifications
(AEC): AE4, AE3, AE2, AE1, AE0
• http://www.hrgresearch.com/pdf/AEC%20Defintions.pdf
• Other categories: continuous availability, fault
tolerance, disaster tolerance
12
Once again, what is a cluster?
• High performance computing
• Load balancer (high capacity)
• High availability
‒ 99.99% availability
‒ MTBF (mean time between failures = total operating
time / total number of failures)
‒ No single point of failure
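The availability figures above can be turned into numbers with a short calculation. The sketch below is illustrative only: the operating hours, failure count, and repair time are made-up inputs, and the MTBF / (MTBF + MTTR) availability model is an assumption that the slide does not state.

```python
# Illustrative availability arithmetic; all input figures are made up.

HOURS_PER_YEAR = 365 * 24  # 8760 hours, ignoring leap years


def mtbf(total_operating_hours: float, failures: int) -> float:
    """Mean time between failures = total operating time / number of failures."""
    return total_operating_hours / failures


def availability(mtbf_hours: float, mttr_hours: float) -> float:
    """Steady-state availability under the assumed MTBF / (MTBF + MTTR) model."""
    return mtbf_hours / (mtbf_hours + mttr_hours)


def downtime_minutes_per_year(avail: float) -> float:
    """Yearly downtime budget implied by an availability fraction."""
    return (1.0 - avail) * HOURS_PER_YEAR * 60


if __name__ == "__main__":
    budget = downtime_minutes_per_year(0.9999)
    print(f"99.99% leaves about {budget:.1f} minutes of downtime per year")

    # Hypothetical system: one year of operation with 4 failures, 1 hour per repair.
    example_mtbf = mtbf(total_operating_hours=HOURS_PER_YEAR, failures=4)
    print(f"MTBF: {example_mtbf:.0f} hours")
    print(f"Availability: {availability(example_mtbf, 1.0):.4%}")
```

With these assumptions, "four nines" leaves under an hour of downtime per year, which a single long manual repair can already use up.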
14
Challenges in HA
• Murphy’s Law: “If anything can go wrong, it will”
‒ Loss of data
‒ Service outage
• Flood, fire, earthquake, other natural disasters,
hardware damage
‒ Can you afford downtime?
‒ Can you afford a low-availability system?
‒ What is the cost of downtime?
15
Differences between HA terms
• The term HA is widely used
• VMware vSphere HA
‒ Closed source
‒ Hypervisor level and host hardware level
• openSUSE/SUSE HA
‒ Open source
‒ OS level
‒ Protect critical resources running on VM
‒ HA within Linux OS
16
HA in Linux
• Started with the Heartbeat project in the late 1990s
• Now managed by ClusterLabs http://clusterlabs.org/
• The ClusterLabs stack, incorporating Corosync and
Pacemaker, defines an open source, high
availability cluster offering suitable for both small
and large deployments.
• Pacemaker has been around since 2004 and is
primarily a collaborative effort between Red Hat
and SUSE. They also receive considerable help and
support from the folks at LINBIT and the community
in general.
17
Hardware Consideration
• External network: high traffic, use FO or eth
bonding
• Communication network between cluster
nodes: used for messaging, membership, and
STONITH
• Storage network: use FO or eth bonding
• Managed switch
• STONITH/fencing device
• Shared storage: NAS (NFS/CIFS), SAN (FC/iSCSI)
20
Software Component
• Corosync
‒ messaging and membership
• Pacemaker
‒ Cluster resource management
• Resource Agents
‒ Manage and monitor availability of service
• Fencing device
‒ STONITH to ensure data integrity
• User interface
‒ crmsh and Hawk
21
Other components
• LVS (Linux Virtual Server)
• HAProxy
• Shared file system: OCFS2, GFS2
• Block device replication: DRBD
• Shared storage: SAN
• Geo cluster
22
More details
• Pacemaker :
Pacemaker is a cluster resource manager. It
achieves maximum availability for your cluster
resources by detecting and recovering from
node and resource-level failures by making use
of the messaging and membership capabilities
provided by your preferred cluster
infrastructure (either Corosync or Heartbeat).
23
More details
• Corosync:
‒ provides cluster infrastructure functionality
‒ provides messaging and membership functionality
‒ maintains the quorum information
‒ Pacemaker uses these features to provide a
high availability solution.
24
In short ...
• Corosync: a quorum system that notifies
applications when quorum is achieved or lost
• Pacemaker:
‒ Starts/stops resources on a node according to the
score.
‒ Monitors resources at a configured interval.
‒ Restarts a resource if its monitor fails.
‒ Fences (STONITH) a node if a stop operation fails.
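The behaviour summarised above can be sketched as a toy decision model. This is not Pacemaker code and does not use its API: the node names, the scores, and the function names `pick_node` and `recovery_action` are hypothetical, and real Pacemaker scoring has many more rules (constraints, stickiness, failure counts) than "highest score wins".

```python
from typing import Dict, Optional, Set

# Recovery rules as listed on the slide; other operations are out of scope here.
RECOVERY = {
    "monitor": "restart",  # a failed monitor leads to a restart of the resource
    "stop": "fence",       # a failed stop leads to fencing (STONITH) of the node
}


def pick_node(scores: Dict[str, int], online: Set[str]) -> Optional[str]:
    """Return the online node with the highest score, or None if none is online."""
    candidates = {node: score for node, score in scores.items() if node in online}
    if not candidates:
        return None
    return max(candidates, key=lambda node: candidates[node])


def recovery_action(failed_operation: str) -> str:
    """Map a failed operation to the recovery step named on the slide."""
    return RECOVERY.get(failed_operation, "unspecified")


if __name__ == "__main__":
    scores = {"node1": 100, "node2": 50}  # hypothetical placement scores
    print(pick_node(scores, {"node1", "node2"}))  # preferred node while both are up
    print(pick_node(scores, {"node2"}))           # fallback once node1 is offline
    print(recovery_action("monitor"))
    print(recovery_action("stop"))
```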
25
Pacemaker Corosync Conceptual
Overview
26
Pacemaker Components
• Non-cluster-aware components (illustrated in
green): these pieces include the resources
themselves and the scripts that start, stop, and
monitor them
• Cluster resource manager: provides the brain
that processes and reacts to events regarding
the cluster
• Low-level infrastructure: Corosync provides
reliable messaging, membership, and quorum
information about the cluster
27
Pacemaker Stack
• A Pacemaker + Corosync cluster is called the Pacemaker stack
• The Linux kernel comes with DLM (distributed
lock manager) by default. It provides the locking that
cluster-aware file systems rely on
• GFS2 (Global File System 2) and OCFS2 (Oracle
Cluster File System 2) are cluster-aware
file systems
• To access a single file system from multiple hosts you
need either GFS2 or OCFS2.
• Or you can create a file system on top of cLVM (cluster
logical volume manager)
29
Cluster Filesystem
• If you have a shared disk and want several nodes to
access it, you need a cluster-aware file system
• The open source solutions are GFS2 (Global File System 2)
and OCFS2 (Oracle Cluster File System 2)
30
Cluster Block Device
• DRBD (Distributed Replicated Block Device) allows you
to create a mirror of two block devices that are
located at two different sites across an IP network.
When used with Corosync, DRBD supports
distributed high-availability Linux clusters. It works
like a network-based RAID 1, providing high-performance
data replication over the network
• CLVM2, see https://www.sourceware.org/lvm2/
• Cluster MD RAID 1, see
https://www.kernel.org/doc/Documentation/md-cluster.txt
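The "network-based RAID 1" idea can be illustrated with a toy in-memory model. The sketch below is a simplification under stated assumptions, not DRBD's API or wire protocol: the class `MirroredDevice`, its methods, and the use of dictionaries as block storage are all hypothetical. It models fully synchronous mirroring, where a write counts as replicated only once both copies hold the data.

```python
from typing import Dict, List


class MirroredDevice:
    """Toy model of a two-copy mirror; each 'disk' is a dict of block -> data."""

    def __init__(self) -> None:
        self.local: Dict[int, bytes] = {}
        self.peer: Dict[int, bytes] = {}
        self.peer_connected = True

    def write(self, block: int, data: bytes) -> bool:
        """Write locally, then to the peer. Return True only if both copies match."""
        self.local[block] = data
        if not self.peer_connected:
            return False  # the peer copy of this block is now stale
        self.peer[block] = data
        return True

    def out_of_sync_blocks(self) -> List[int]:
        """Blocks whose peer copy is missing or differs from the local copy."""
        return sorted(b for b, d in self.local.items() if self.peer.get(b) != d)

    def resync(self) -> None:
        """Copy every stale block to the peer after the link comes back."""
        for block in self.out_of_sync_blocks():
            self.peer[block] = self.local[block]


if __name__ == "__main__":
    dev = MirroredDevice()
    dev.write(0, b"index.html v1")
    dev.peer_connected = False      # simulate a replication link failure
    dev.write(1, b"index.html v2")
    print("stale blocks:", dev.out_of_sync_blocks())
    dev.peer_connected = True
    dev.resync()
    print("stale blocks after resync:", dev.out_of_sync_blocks())
```

The point of the model is that the second node always has a usable copy of the data, so a failover does not depend on shared storage.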
32
STONITH
• STONITH is an acronym for
“Shoot-The-Other-Node-In-The-Head”.
• It protects your data from being corrupted by rogue
nodes or concurrent access.
• Just because a node is unresponsive, this doesn’t mean
it isn’t accessing your data. The only way to be 100%
sure that your data is safe, is to use STONITH so we can
be certain that the node is truly offline, before allowing
the data to be accessed from another node.
• STONITH also has a role to play in the event that a
clustered service cannot be stopped. In this case, the
cluster uses STONITH to force the whole node offline,
thereby making it safe to start the service elsewhere.
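The ordering described above (fence first, take over second) can be sketched as follows. This is a minimal illustration, not how Pacemaker implements fencing: `fail_over`, the `fence` callback, and the `start_service` callback are hypothetical names, and a real fencing device would be a power switch, a management board, or similar.

```python
from typing import Callable, List


def fail_over(failed_node: str,
              fence: Callable[[str], bool],
              start_service: Callable[[], None]) -> bool:
    """Start the service elsewhere only after the failed node is confirmed offline."""
    if not fence(failed_node):
        # Fencing was not confirmed, so the node may still be writing data.
        # Starting a second copy of the service now would risk corruption.
        return False
    start_service()
    return True


if __name__ == "__main__":
    events: List[str] = []

    def fake_fence(node: str) -> bool:
        events.append(f"fence {node}")
        return True  # pretend the power switch confirmed the node is off

    def fake_start() -> None:
        events.append("start service on surviving node")

    fail_over("node1", fake_fence, fake_start)
    print(events)
```

If the fencing call cannot confirm success, the function refuses to start the service, which mirrors the rule that an unresponsive node is not assumed to be harmless.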
33
Split brain – the HA problem
• Two nodes run the same service at the same time,
breaking data integrity
• Solution:
‒ Quorum
If the cluster does not have quorum, no action is
taken: fencing and resource management are
disabled without quorum
‒ STONITH
Shoot the other node in the head
• More on STONITH: http://ourobengr.com/ha/
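The quorum rule can be written down as a strict-majority check. The sketch below assumes one vote per node and a plain majority rule; the function names are hypothetical and it does not model any special-case options a real cluster stack may offer.

```python
def quorum_threshold(expected_votes: int) -> int:
    """Smallest number of votes that forms a strict majority."""
    return expected_votes // 2 + 1


def has_quorum(partition_votes: int, expected_votes: int) -> bool:
    """True if a partition holds a strict majority of all expected votes."""
    return partition_votes >= quorum_threshold(expected_votes)


if __name__ == "__main__":
    # A 5-node cluster splits into partitions of 3 and 2 nodes.
    for votes in (3, 2):
        print(f"5 nodes, partition of {votes}: quorum = {has_quorum(votes, 5)}")

    # A 2-node cluster splits into two single nodes.
    print(f"2 nodes, partition of 1: quorum = {has_quorum(1, 2)}")
```

In the five-node split only the three-node side may keep running services. In the two-node split neither side has a majority, so quorum alone cannot decide which node continues, and STONITH is needed to resolve the situation.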
34
Reference
• SUSE HA Extension documentation (can also be used for
openSUSE)
https://www.suse.com/documentation/sle-ha-12/
• ClusterLabs http://clusterlabs.org
• Corosync documentation
http://landley.net/kdocs/ols/2008/ols2008v1-pages-85-100.pdf
• DRBD http://drbd.linbit.org/en/
• OCFS2 https://ocfs2.wiki.kernel.org/
• CLVM2 https://sourceware.org/lvm2/
• Linux iSCSI http://linux-iscsi.org/
Case Study / Hands-on
36
Setting up HA on Leap
• Scenario:
‒ Set up openSUSE Leap 42.1 as the host
‒ Create 2 VMs with QEMU/KVM, install openSUSE Leap
42.1, configure the network, and install all the
required packages
‒ Configure Pacemaker, Corosync, and DRBD
‒ Set up an HA web server
37
Preparation
• Install openSUSE Leap 42.1
• Configure all repositories
• Install all the required software
• Create at least 2 virtual machines with QEMU/KVM
• Configure the cluster
• Create the DRBD device
• Activate the web server (nginx or Apache)
• Test the status
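As a rough idea of what the "configure the cluster" step might look like for the web server, the sketch below only builds and prints crmsh commands; it does not run them. Everything specific in it is an assumption, not taken from the slides: the resource names `vip`, `web`, and `g-web`, the address 192.168.122.100, the netmask, the Apache configuration path, and the monitor intervals. The DRBD resource is left out for brevity.

```python
import shlex
from typing import List


def web_cluster_commands(vip: str, netmask: int = 24,
                         apache_conf: str = "/etc/apache2/httpd.conf") -> List[List[str]]:
    """Build crmsh commands for a virtual IP plus Apache, grouped together.

    All names and values are placeholders chosen for illustration.
    """
    return [
        # Floating IP address that moves to whichever node runs the service.
        ["crm", "configure", "primitive", "vip", "ocf:heartbeat:IPaddr2",
         "params", f"ip={vip}", f"cidr_netmask={netmask}",
         "op", "monitor", "interval=10s"],
        # Apache web server managed by the cluster instead of by systemd.
        ["crm", "configure", "primitive", "web", "ocf:heartbeat:apache",
         "params", f"configfile={apache_conf}",
         "op", "monitor", "interval=30s"],
        # A group keeps both resources on the same node and starts them in order.
        ["crm", "configure", "group", "g-web", "vip", "web"],
    ]


if __name__ == "__main__":
    for command in web_cluster_commands("192.168.122.100"):
        print(shlex.join(command))
```

Printing the commands first makes it possible to review them before applying anything on a cluster node; the resulting state could then be checked with the cluster's status tools as part of the "test the status" step.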
Thank you.
Join the conversation,
contribute & have a lot of fun!
www.opensuse.org
39
Have a Lot of Fun, and Join Us At:
www.opensuse.org

 
Optimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTVOptimizing AI for immediate response in Smart CCTV
Optimizing AI for immediate response in Smart CCTV
 
why an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdfwhy an Opensea Clone Script might be your perfect match.pdf
why an Opensea Clone Script might be your perfect match.pdf
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected WorkerHow To Troubleshoot Collaboration Apps for the Modern Connected Worker
How To Troubleshoot Collaboration Apps for the Modern Connected Worker
 
Salesforce Certified Field Service Consultant
Salesforce Certified Field Service ConsultantSalesforce Certified Field Service Consultant
Salesforce Certified Field Service Consultant
 
Cloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStackCloud Management Software Platforms: OpenStack
Cloud Management Software Platforms: OpenStack
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 

Set Up High Availability Cluster with openSUSE Leap

  • 1. High Availability Cluster with openSUSE Leap M. Edwin Zakaria medwin@opensuse.org
  • 2. 2 • Mohammad Edwin Zakaria • Linux user since 1998 • openSUSE since 6.2 around 1999 https://en.opensuse.org/User:Medwin • openSUSE member https://connect.opensuse.org/show/Medwin • openSUSE Indonesia
  • 3. 3
  • 4. 4
  • 5. 5
  • 6. 6
  • 7. 7
  • 8. 8 What is - Cluster ? - High Availability ?
  • 9. 9 Curious? • A computer cluster consists of a set of loosely or tightly connected computers that work together so that, in many respects, they can be viewed as a single system. Unlike grid computers, computer clusters have each node set to perform the same task, controlled and scheduled by software
  • 10. 10 Curious? • High availability (HA) is a system that is designed to avoid the loss of service by reducing or managing failures as well as minimizing planned downtime for the system. We expect a service to be highly available when life, health, and well-being, including the economic well-being of a company, depend on it.
  • 11. 11 Curious? • Harvard Research Group divides HA into several Availability Environment Classifications (AEC): AE4, AE3, AE2, AE1, AE0 • http://www.hrgresearch.com/pdf/AEC%20Defintions.pdf • Other categories: continuous availability, fault tolerance, disaster tolerance
  • 12. 12 Once again what is cluster? • High performance computing • Load balancer (high capacity) • High availability ‒ 99.99% ‒ MTBF (mean time between failures = total operating time / total number of failures) ‒ Single point of failure
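To make the 99.99% figure and the MTBF formula from the slide concrete, here is a small Python sketch. The function names are illustrative only, and the year is assumed to be a non-leap year of 365 days.

```python
def mtbf(total_operating_hours: float, failures: int) -> float:
    """Mean time between failures: total operating time / number of failures."""
    if failures <= 0:
        raise ValueError("at least one failure is needed to compute MTBF")
    return total_operating_hours / failures


def downtime_minutes_per_year(availability_percent: float) -> float:
    """Downtime budget implied by an availability target (assumes a 365-day year)."""
    minutes_per_year = 365 * 24 * 60
    return minutes_per_year * (1 - availability_percent / 100)


if __name__ == "__main__":
    # Four failures in one year of continuous operation (8760 hours).
    print(mtbf(8760, 4))
    # Downtime allowed per year at "four nines".
    print(round(downtime_minutes_per_year(99.99), 1))
```

Under these assumptions, 99.99% availability leaves a budget of roughly 52.6 minutes of downtime per year.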
  • 13. 13 Once again what is cluster?
  • 14. 14 Challenges in HA • Murphy’s Law “If anything can go wrong, it will” ‒ Loss of data ‒ Service outage • Flood, fire, earthquake, natural disaster, hardware damage ‒ Can you afford downtime? ‒ Can you afford a low-availability system? ‒ Cost of downtime?
  • 15. 15 Differences between HA terms • The term HA is widely used • VMware vSphere HA ‒ Closed source ‒ Hypervisor level and host hardware level • openSUSE/SUSE HA ‒ Open source ‒ OS level ‒ Protects critical resources running on a VM ‒ HA within the Linux OS
  • 16. 16 HA in Linux • Started with the Heartbeat project in the late 1990s • Now managed by ClusterLabs http://clusterlabs.org/ • The ClusterLabs stack, incorporating Corosync and Pacemaker, defines an open source, high availability cluster offering suitable for both small and large deployments. • Pacemaker has been around since 2004 and is primarily a collaborative effort between Red Hat and SUSE. The project also receives considerable help and support from the folks at LINBIT and the community in general.
  • 17. 17 Hardware Considerations • External network, high traffic: use FO or eth bonding • Communication network between cluster nodes, used for messaging, membership, STONITH • Storage network: use FO or eth bonding • Managed switch • STONITH/fencing device • Shared storage: NAS (nfs/cifs), SAN (fc/iscsi)
  • 20. 20 Software Component • Corosync ‒ messaging and membership • Pacemaker ‒ Cluster resource management • Resource Agents ‒ Manage and monitor availability of service • Fencing device ‒ STONITH to ensure data integrity • User interface ‒ Crmsh and Hawk
  • 21. 21 Other components • LVS (Linux Virtual Server) • HAProxy • Shared file system: OCFS2, GFS2 • Block device replication: DRBD • Shared storage: SAN • Geo cluster
  • 22. 22 More details • Pacemaker : Pacemaker is a cluster resource manager. It achieves maximum availability for your cluster resources by detecting and recovering from node and resource-level failures by making use of the messaging and membership capabilities provided by your preferred cluster infrastructure (either Corosync or Heartbeat).
  • 23. 23 More details • Corosync : ‒ provides cluster infrastructure functionality ‒ provides messaging and membership functionality ‒ maintains the quorum information. ‒ Pacemaker uses these features to provide a high availability solution.
  • 24. 24 In short ... • Corosync : A quorum system that notifies applications when quorum is achieved or lost • Pacemaker : ‒ To start/stop resources on a node according to the score. ‒ To monitor resources according to interval. ‒ To restart resources if monitor fails. ‒ To fence/STONITH a node if stop operation fails.
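The four Pacemaker behaviours listed on this slide can be sketched as a tiny decision table in Python. This is a simplified illustration, not Pacemaker's actual policy engine: the `Event` and `Action` names and both functions are hypothetical, and real score handling involves more rules than picking the largest number.

```python
from enum import Enum
from typing import Dict


class Event(Enum):
    MONITOR_OK = "monitor_ok"
    MONITOR_FAILED = "monitor_failed"
    STOP_FAILED = "stop_failed"


class Action(Enum):
    NOTHING = "nothing"
    RESTART_RESOURCE = "restart_resource"
    FENCE_NODE = "fence_node"


def pick_node(scores: Dict[str, int]) -> str:
    """Choose the node with the highest score (ties go to the first name alphabetically)."""
    if not scores:
        raise ValueError("no candidate nodes")
    return max(sorted(scores), key=lambda node: scores[node])


def react(event: Event) -> Action:
    """Map a resource event to the recovery step described on the slide."""
    if event is Event.MONITOR_FAILED:
        return Action.RESTART_RESOURCE  # a failed monitor triggers a restart
    if event is Event.STOP_FAILED:
        return Action.FENCE_NODE  # a failed stop escalates to STONITH
    return Action.NOTHING


if __name__ == "__main__":
    print(pick_node({"node1": 100, "node2": 50}))
    print(react(Event.STOP_FAILED))
```

The escalation order is the point of the sketch: a resource is restarted first, and the whole node is fenced only when the resource cannot even be stopped cleanly.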
  • 26. 26 Pacemaker Components • Non-cluster aware components (illustrated in green). These pieces include the resources themselves, scripts that start, stop and monitor them • Cluster Resource manager, provides the brain that processes and reacts to events regarding the cluster • Low level infrastructure, Corosync provides reliable messaging, membership and quorum information about the cluster
  • 27. 27 Pacemaker Stack • A Pacemaker/Corosync cluster is called the Pacemaker stack • The Linux kernel comes with DLM (distributed lock manager) by default. It provides the locking used by cluster-aware filesystems • GFS2 (Global File System 2) and OCFS2 (Oracle Cluster File System 2) are cluster-aware filesystems • For multiple hosts to access a single filesystem, you need either GFS2 or OCFS2 • Or you can create a file system on top of cLVM (cluster logical volume manager)
  • 28. 28
  • 29. 29 Cluster Filesystem • If you have a shared disk and want several nodes to access it, you need a cluster-aware filesystem • The open source solutions are GFS2 (Global File System 2) and OCFS2 (Oracle Cluster File System 2)
  • 30. 30 Cluster Block Device • DRBD (distributed replicated block device) allows you to create a mirror of two block devices that are located at two different sites across an IP network. When used with Corosync, DRBD supports distributed high-availability Linux clusters. It acts as a network-based RAID 1, providing high-performance data replication over the network • CLVM2, see https://www.sourceware.org/lvm2/ • Cluster md raid1, see https://www.kernel.org/doc/Documentation/md-cluster.txt
  • 32. 32 STONITH • STONITH is an acronym for “Shoot-The-Other-Node-In-The-Head”. • It protects your data from being corrupted by rogue nodes or concurrent access. • Just because a node is unresponsive, this doesn’t mean it isn’t accessing your data. The only way to be 100% sure that your data is safe is to use STONITH, so we can be certain that the node is truly offline before allowing the data to be accessed from another node. • STONITH also has a role to play in the event that a clustered service cannot be stopped. In this case, the cluster uses STONITH to force the whole node offline, thereby making it safe to start the service elsewhere.
  • 33. 33 Split brain – the HA problem • Two nodes run the same service at the same time, which breaks data integrity • Solution: ‒ Quorum: if the cluster does not have quorum, no action is taken; fencing and resource management are disabled without quorum ‒ STONITH: shoot the other node in the head • More on STONITH: http://ourobengr.com/ha/
  • 34. 34 Reference • SUSE HA Extension Doc (can also be used for openSUSE) https://www.suse.com/documentation/sle-ha-12/ • HA clusterlabs http://clusterlabs.org • Corosync doc http://landley.net/kdocs/ols/2008/ols2008v1-pages-85-100.pdf • DRBD http://drbd.linbit.org/en/ • OCFS2 https://ocfs2.wiki.kernel.org/ • CLVM2 https://sourceware.org/lvm2/ • Linux SCSI http://linux-iscsi.org/
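The quorum rule behind the split-brain discussion can be illustrated with a short Python sketch. It assumes a plain majority rule with one vote per node, which is a simplification and not Corosync's actual implementation; the function names are made up for illustration.

```python
from typing import List


def has_quorum(live_nodes: int, total_nodes: int) -> bool:
    """A partition has quorum only if it holds a strict majority of all votes."""
    if total_nodes <= 0 or not 0 <= live_nodes <= total_nodes:
        raise ValueError("invalid node counts")
    return 2 * live_nodes > total_nodes


def partitions_with_quorum(partition_sizes: List[int]) -> List[int]:
    """Given the sizes of the pieces of a split cluster, return the ones that keep quorum."""
    total = sum(partition_sizes)
    return [size for size in partition_sizes if has_quorum(size, total)]


if __name__ == "__main__":
    print(partitions_with_quorum([3, 2]))  # five nodes split 3/2
    print(partitions_with_quorum([1, 1]))  # two nodes that lost contact
```

With a strict majority rule at most one partition can ever hold quorum, so two sides cannot both decide to run the same service. In an even split, such as a two-node cluster that loses its interconnect, neither side has a majority under this simple rule, which is one reason STONITH is listed alongside quorum as part of the solution.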
  • 35. Case Study / Hands-on
  • 36. 36 Setting up HA on Leap • Scenario: ‒ Set up openSUSE Leap 42.1 as the host ‒ Create 2 VMs with QEMU/KVM, install openSUSE Leap 42.1, configure the network and all the required packages ‒ Configure Pacemaker, Corosync, and DRBD ‒ Set up an HA web server
  • 37. 37 Preparation • Install openSUSE Leap 42.1 • Configure all repositories • Install all the required software • Create at least 2 virtual machines with QEMU/KVM • Configure the cluster • Create DRBD • Activate the web server (nginx or Apache) • Test the status
  • 38. Thank you. Join the conversation, contribute & have a lot of fun! www.opensuse.org
  • 39. 39 Have a Lot of Fun, and Join Us At: www.opensuse.org