High availability and fault tolerance of openstack

High Availability and Fault Tolerance (OpenStack) Deepak Mane Cloud Architect

description

A detailed presentation will be uploaded.

Transcript of High availability and fault tolerance of openstack

Page 1: High availability and fault tolerance of openstack

High Availability and Fault Tolerance (OpenStack)

Deepak Mane

Cloud Architect

Page 2: High availability and fault tolerance of openstack

Objective & Motivation

• To build a fault-tolerant and highly available architecture (OpenStack)

• Motivation

– To build a fault-tolerant architecture for OpenStack

– To build a cluster architecture for the MySQL and RabbitMQ components

– To build a high availability architecture for the network

– To build a predictive and reactive model for detecting failures of Nova, Swift and Compute
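The predictive half of the last item could, for example, watch a health metric per service and warn before a hard limit is reached. The sketch below is only an illustration under assumptions that are not in the slides: the metric (say, API response time in milliseconds), the threshold, and the function name `predict_breach` are hypothetical, and a least-squares trend line is just one simple choice of model.

```python
import math
from typing import Optional, Sequence


def predict_breach(samples: Sequence[float], threshold: float) -> Optional[int]:
    """Estimate how many sampling intervals remain until `threshold` is hit.

    Fits a least-squares line to equally spaced samples (hypothetical metric).
    Returns 0 if the latest sample already breaches the threshold, and None
    if there is too little data or the trend is flat or falling.
    """
    n = len(samples)
    if n == 0:
        return None
    if samples[-1] >= threshold:
        return 0
    if n < 2:
        return None

    mean_x = (n - 1) / 2
    mean_y = sum(samples) / n
    sxx = sum((x - mean_x) ** 2 for x in range(n))
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in enumerate(samples))
    slope = sxy / sxx
    if slope <= 0:
        return None  # no upward trend, nothing to predict

    intercept = mean_y - slope * mean_x
    x_hit = (threshold - intercept) / slope  # sample index where the line crosses
    # Always report at least one interval when the latest sample is still below.
    return max(math.ceil(x_hit - (n - 1)), 1)
```

A monitoring loop could call `predict_breach` on a sliding window of recent samples and raise an alert when the returned value drops below some lead time chosen by the operator.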

Page 3: High availability and fault tolerance of openstack

Use cases

• Master-master cluster architecture for MySQL

• Disk-level replication for MySQL using DRBD for Glance, Swift and Cinder

• Session-level replication for RabbitMQ

• High availability for networking

• High availability for Horizon (OpenStack dashboard)

• Predictive model for detecting failure for all components

• Reactive model for recovery for all components
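A reactive recovery model can be pictured as "check, restart a bounded number of times, then fail over". The following is a minimal sketch under that assumption; the slides do not give the actual recovery logic, and `recover`, `is_healthy`, `restart`, `fail_over` and `max_restarts` are hypothetical names. The callbacks are passed in so the logic can be tried without a real OpenStack service.

```python
from typing import Callable


def recover(is_healthy: Callable[[], bool],
            restart: Callable[[], None],
            fail_over: Callable[[], None],
            max_restarts: int = 3) -> str:
    """React to a failed health check.

    Returns "healthy" if nothing had to be done, "restarted" if a local
    restart fixed the service, or "failed_over" if the restarts were
    exhausted and the service was handed to another node.
    """
    if is_healthy():
        return "healthy"
    for _ in range(max_restarts):
        restart()
        if is_healthy():
            return "restarted"
    # Local restarts did not help: escalate to the standby node.
    fail_over()
    return "failed_over"
```

Pacemaker applies a comparable idea through resource monitor operations and a failure threshold after which a resource is moved to another node, so in practice this loop would usually be left to the cluster manager rather than hand-written.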

Page 4: High availability and fault tolerance of openstack

Non Use Cases

• Scenarios not suitable for cloud

– Redundancy of network components, such as switches and routers,

– Redundancy of applications and automatic service migration,

– Redundancy of storage components,

– Redundancy of facility services such as power, air conditioning, fire protection, and others

Page 5: High availability and fault tolerance of openstack

Pacemaker – High availability for OpenStack

• Cluster stack: the state-of-the-art high availability and load balancing stack for the Linux platform

• Storage- and application-agnostic, and is in no way specific to OpenStack

• Pacemaker relies on the Corosync messaging layer for reliable cluster communications.

• Corosync implements the Totem single-ring ordering and membership protocol and provides UDP- and InfiniBand-based messaging, quorum, and cluster membership to Pacemaker.
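The quorum mentioned in the last bullet is, in the common majority-vote case, a simple calculation: a partition may run resources only if it holds more than half of the expected votes. The helper names below are hypothetical and the sketch assumes one vote per node; it illustrates the rule and is not Corosync's own code.

```python
def quorum_threshold(expected_votes: int) -> int:
    """Smallest number of votes that forms a strict majority."""
    if expected_votes < 1:
        raise ValueError("expected_votes must be at least 1")
    return expected_votes // 2 + 1


def has_quorum(votes_present: int, expected_votes: int) -> bool:
    """True if the visible nodes hold a majority of the expected votes."""
    return votes_present >= quorum_threshold(expected_votes)
```

With three nodes, any two that can still see each other keep quorum. With only two nodes, the survivor of a single failure holds one vote out of two and loses quorum, which is why two-node clusters usually need special quorum handling.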

Page 6: High availability and fault tolerance of openstack

Required packages

• pacemaker

• corosync

• cluster-glue

• resource-agents

Page 7: High availability and fault tolerance of openstack

Architecture and Technology Information

Page 8: High availability and fault tolerance of openstack

HA Architecture – Cloud Controller

High availability

Implemented using DRBD, Pacemaker and Corosync

Page 9: High availability and fault tolerance of openstack

DRBD Architecture – MySQL-Cloud Controller

[Diagram: master cloud controller and slave cloud controller]
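When the slave cloud controller takes over a DRBD-backed MySQL, the order of steps matters: the DRBD device has to be promoted before its filesystem can be mounted, and MySQL should start only once its data directory and service address are available. The sketch below models that ordering as a dependency graph. The resource names are hypothetical (the slides do not list them), and in a real cluster Pacemaker expresses the same idea with order and colocation constraints.

```python
from graphlib import TopologicalSorter  # Python 3.9+

# Hypothetical resources: each key starts only after everything in its set.
START_AFTER = {
    "drbd_primary": set(),                  # promote the DRBD device on this node
    "filesystem": {"drbd_primary"},         # mount the replicated volume
    "virtual_ip": set(),                    # floating service address
    "mysql": {"filesystem", "virtual_ip"},  # start the database last
}


def start_order(deps: dict) -> list:
    """Return one valid start sequence that respects every dependency."""
    return list(TopologicalSorter(deps).static_order())


def stop_order(deps: dict) -> list:
    """Stopping is the reverse: the database first, the DRBD role last."""
    return list(reversed(start_order(deps)))
```

Calling `start_order(START_AFTER)` yields a sequence in which `drbd_primary` precedes `filesystem` and both `filesystem` and `virtual_ip` precede `mysql`; the relative position of independent resources is not significant.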

Page 10: High availability and fault tolerance of openstack

DRBD Architecture – RabbitMQ-Cloud Controller

[Diagram: master cloud controller and slave cloud controller]

Page 11: High availability and fault tolerance of openstack

Nova - Recovery mode Approach


Page 12: High availability and fault tolerance of openstack

Cloud Controller – Recovery approach