Webinar: Replication and Replica Sets

Posted on 16-Nov-2014


MongoDB supports replication for failover and redundancy. In this session we will introduce the basic concepts around replica sets, which provide automated failover and recovery of nodes. We'll cover how to set up, configure, and initiate a replica set; methods for using replication to scale reads; and proper architecture for durability.

Transcript of Webinar: Replication and Replica Sets

1

Summer 2013

Replication and Replica Sets

William Zola, Member of Technical Staff, 10gen

2

Why Replication?

To keep your data safe

3

Why Replication?

To keep your data available

4

Why Replication?

Because bad things happen to good data centers

5

What is replication and why do we need it?

[Diagram: replication maintains copies of Important Data on multiple nodes]

6

Agenda

• Using replica sets for high availability
  – PRIMARY, SECONDARY, and ARBITER nodes
  – PRIMARY elections
• Using replica sets for disaster recovery
• Configuring a replica set so there's no single point of failure
• No-downtime maintenance
• Durability in a networked environment

7

Audience

• Not new to DBA or system administration
• New to MongoDB or MongoDB replication

8

Use Cases

9

Stakeholders

10–14

Use Cases

• High Availability (automatic failover)
• Disaster Recovery
• No downtime for maintenance
  – Backups
  – Maintenance (index rebuilds, compaction)
• The replica set is "transparent" to the application
• Read Scaling (extra copies to read from)

15

MongoDB Replication Basics

16

Replica Set Features

• A cluster of N servers
• Any (one) node can be primary
• All writes go to the primary
• Reads go to the primary (default), optionally to a secondary
• Consensus election of the primary
• Automatic failover
• Automatic recovery

[Diagram: Node 1 (primary) takes the writes; reads go to the primary by default, optionally to Node 2 or Node 3; "Pick me!" marks the election]

17

How MongoDB Replication works

• A replica set is two or more nodes

[Diagram: Node 1, Node 2 (P for primary), Node 3]
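In practice, bringing a set like this up is a few shell commands. A minimal sketch, assuming three mongod processes are already running with --replSet my-set (hostnames are placeholders):

rs.initiate()              // run on the first node; creates a one-member set
rs.add("host2:27017")      // add the second data node
rs.add("host3:27017")      // add the third data node
rs.status()                // one member should soon report PRIMARY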

18

How MongoDB Replication works

• An election establishes the PRIMARY
• Data replicates from the PRIMARY to the SECONDARIES

[Diagram: Node 2 (primary) sends data to Nodes 1 and 3]

20

Types of outage

Planned (maintenance w/o downtime):
– Hardware upgrade
– O/S or file-system tuning
– Relocation of data to new file-system / storage
– Software upgrade

Unplanned (automatic failover):
– Hardware failure
– Data center failure
– Region outage
– Human error
– Application corruption

21

Mechanics of Automatic Failover

22

Mechanics of Automatic Failover

• Data replicates from the PRIMARY to the SECONDARIES

[Diagram: Node 2 (primary) sends data to Nodes 1 and 3]

23

Mechanics of Automatic Failover

• An election establishes the PRIMARY
• Data replicates from the PRIMARY to the SECONDARIES
• The PRIMARY might FAIL

[Diagram: replication from Node 2 (primary) to Nodes 1 and 3 is interrupted when the primary fails]

24

Mechanics of Automatic Failover

• Automatic election of a new PRIMARY if a majority exists

[Diagram: Node 2 is DOWN; Nodes 1 and 3 negotiate a new primary]

25

Mechanics of Automatic Failover

• A new PRIMARY is elected

[Diagram: Node 2 is DOWN; the remaining nodes have negotiated and one of them is now the primary]

26

Mechanics of Automatic Failover

• Automatic recovery of the failed node
• It can perform a full resync from a secondary if necessary

[Diagram: Node 2 is RECOVERING while the new primary serves the set]

27

Mechanics of Automatic Failover

• Once caught up, the recovered node resumes syncing from the primary
• The original replica set configuration is re-established

[Diagram: Nodes 1, 2, and 3 are healthy again, with one primary]
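The whole sequence can be watched from the shell; a small sketch that prints each member's current state:

rs.status().members.forEach(function (m) {
  print(m.name + "  " + m.stateStr);   // e.g. "node2:27017  RECOVERING"
});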

28

Cluster Size and Rules of Failover

29

Primary Election

As long as a partition can see a majority (>50%) of the cluster, it will elect a primary.

A node must have a STRICT majority to be elected primary!

[Diagram: one primary and two secondaries]

30

Simple Failure

66% of the cluster visible. A primary is elected.

[Diagram: one node has failed; the two surviving nodes elect a primary]

31

Simple Failure

33% of the cluster visible. Read-only mode.

[Diagram: two nodes have failed; the surviving secondary cannot elect a primary]

32

Network Partition

[Diagram: one primary and two secondaries, about to be partitioned]

33

Network Partition

66% of the cluster visible. A primary is elected.

[Diagram: the old primary is isolated by the partition; the majority side elects a new primary]

34

Network Partition

33% visible. Read-only mode.

[Diagram: the minority side of the partition cannot elect a primary and is read-only]

35

No "Split Brain" Problem

A node must be elected by a strict majority of the set in order to be a primary:
• Only the primary node can accept writes
• A replica set never has two primary nodes

[Diagram: only the majority side holds the single primary]

36

Even Cluster Size

[Diagram: four nodes: one primary, three secondaries]

37

Even Cluster Size

50% of the cluster visible. Read-only mode.

[Diagram: two of the four nodes are unreachable; 50% is not a strict majority, so no primary can be elected]

38

Even Cluster Size ✗ (ODD = good)

50% of the cluster visible. Read-only mode.

[Diagram: a 2–2 split again leaves the set without a primary]

39

Types of Nodes

Regular
• A regular node holds a copy of your data
• Secondary / all data nodes
  – Can have different priorities and other configuration options

Primary
• A data node that won the election

Arbiter
• An arbiter node has no data
• But it can vote! Use it to break ties

[Diagram: two secondaries and an arbiter]

40

Add an Arbiter!

Add an arbiter node to break ties:
• Odd number of votes in the set
• The arbiter is lightweight; it does not store data

[Diagram: a primary, three secondaries, and an arbiter]
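Adding the arbiter is one shell call, assuming a lightweight mongod is already running with --replSet on a placeholder host:

rs.addArb("arbiter-host:27017")   // joins as a voting member that stores no data
rs.status()                       // the new member reports state ARBITER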

42

High Availability


44

No Downtime Maintenance

1. Take a secondary out of the set
2. Perform maintenance
3. Replace the secondary in the set
4. Wait for it to catch up ✓

[Diagram: the primary and remaining secondaries keep serving while one secondary is out for maintenance]

45

No Downtime Maintenance

1. Take a secondary out of the set
2. Perform maintenance
3. Replace the secondary in the set
4. Wait for it to catch up
5. Step down the primary (wait for a new primary to be elected; see the shell sketch below)
6. Repeat steps 1–4 on the former primary ✓

[Diagram: after the step-down, a former secondary is the new primary]
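Step 5 is a single command on the current primary; a minimal sketch (the argument is how many seconds the node will decline re-election):

rs.stepDown(60)   // step down and refuse re-election for 60 seconds;
                  // the shell may briefly disconnect while the election runs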

46

2 Replicas + Arbiter??

Is this a good configuration?

[Diagram: a primary, a secondary, and an arbiter]

47

2 Replicas + Arbiter??

1. Take the secondary out of the set
2. Perform maintenance
3. The primary node crashes
   – Uh-oh!
   – The replica set is down
   – Data from the primary hasn't been replicated

[Diagram: with the secondary out and the primary crashed, only the arbiter is left]

48

Use Three Data Nodes!

Use a minimum of three data nodes to assure high availability.

[Diagram: a primary and two secondaries]

49

Avoid Single Points of Failure


51

Avoid Single Points of Failure

[Diagram: primary and two secondaries in a single rack; single points of failure: the top-of-rack switch fails, or the rack falls over]

52

Better

[Diagram: the nodes are spread within one data center; remaining single points of failure: loss of internet, or the DC burns down]

53

Even Better

[Diagram: the three data nodes are spread across two data centers, San Francisco and Dallas]

54

Priorities

• The San Francisco nodes have priority 1
• The Dallas node has priority 0: the disaster recovery data center; it will never become primary automatically

[Diagram: a primary and a secondary in San Francisco, a priority-0 secondary in Dallas]
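A sketch of how the Dallas node gets priority 0, via a reconfig from the primary; the member index here is an assumption about the order of the members array:

cfg = rs.conf()
cfg.members[2].priority = 0   // assuming members[2] is the Dallas node
rs.reconfig(cfg)              // run on the primary; may trigger an election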

55

Even Better

[Diagram: five data nodes spread across three data centers: San Francisco, Dallas, and New York]

56

Node Priority

[Diagram: the five nodes carry priorities 10, 10, 5, 5, and 0 across San Francisco, New York, and Dallas; the Dallas (disaster recovery) node has priority 0]

57

Node Sizing

Nodes that can become primary should be sized equally:
• RAM
• Disk
• IOPS

[Diagram: the same five-node, three-data-center deployment with its priorities]

58

Recap

59

Replica Set Review

A replica set contains N nodes:
• At most one node is the PRIMARY
• All writes go to the PRIMARY
• SECONDARY nodes contain up-to-date copies of the data
• SECONDARY nodes continually copy data from the PRIMARY

[Diagram: writes flow to the primary, which replicates to the two secondaries]

60

Failover Review

If the PRIMARY fails, the replica set can elect a new PRIMARY:
• A strict (>50%) majority is required for election
• The former PRIMARY will rejoin the set as a SECONDARY when it recovers

[Diagram: writes now flow to the newly elected primary]

61

Partition Review

A network partition prevents the nodes from communicating:
• The replica set treats a partition as a "down node"
• A node must get a strict majority of the votes to be elected PRIMARY
• Even numbers of votes reduce availability
• Use arbiters to break ties
• Spread your nodes across multiple data centers

[Diagram: a partitioned three-node set]

62

Using Applications with Replica Sets

63

Application View

[Diagram: application code talks to MongoDB through the MongoDB driver]

64

Under the Covers

Replica set connection:

my-set/host1:27017,host2:27017,host3:27017

[Diagram: the MongoDB driver holds connections to the primary and both secondaries of the replica set]
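In standard connection-string form the same seed list looks like this; a sketch for a recent mongo shell (older shells use the set-name/host-list form shown above):

db = connect("mongodb://host1:27017,host2:27017,host3:27017/test?replicaSet=my-set")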

65

Secondary Reads

Reads from a secondary are potentially stale!

[Diagram: the driver reads from a secondary that may lag behind the primary]
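Secondary reads are opt-in. A sketch using the shell's read-preference helpers (the collection name is a placeholder):

db.getMongo().setReadPref("secondary")   // route this connection's reads to secondaries
db.users.find().readPref("secondary")    // or opt in per query; results may be stale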

66

Failover

[Diagram: the primary fails; the driver's connection raises a connection exception back to the application code]

67

New Election

[Diagram: a secondary is elected as the new primary; the driver reconnects to it]

68

Durability and Replica Sets

69

Durability

• Wikipedia: "In database systems, durability is the ACID property which guarantees that transactions that have committed will survive permanently."

70

The Lifetime of a Write Operation (single-node)

[Diagram: the application's write goes through the driver as a network write; the server validates the data, updates RAM, and then updates the journal]

71

Get Last Error

[Diagram: after the network write, the driver issues the getLastError command and receives the getLastError result once the server has validated the data]
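From the shell, the round trip looks roughly like this; a sketch with a placeholder collection, using the legacy getLastError style shown here:

db.orders.insert({ _id: 1, status: "new" })
db.runCommand({ getLastError: 1, w: 1, j: 1 })   // wait until the primary has
                                                 // applied the write and synced its journal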

72

Write Concern

• {w:0}: network acknowledgement only
• {w:1}: check for error on the primary
• {j:1}: wait for the journal sync

[Diagram: where each write-concern level sits in the getLastError round trip]

76

Replica Sets and Durability

A write that has replicated to a majority of the nodes is durable:
• The most up-to-date node will be elected primary
• The write will be present on that node

No guarantee of which nodes will have the write:
• Use "tag sets" for finer-grained control (a sketch follows below)

[Diagram: five-node set; the write is marked "Durable!" once a majority of the nodes have it]
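Tag sets are configured in the replica set config; a hedged sketch with illustrative tag names:

cfg = rs.conf()
cfg.members[0].tags = { dc: "sf" }   // illustrative data-center tags
cfg.members[1].tags = { dc: "sf" }
cfg.members[2].tags = { dc: "ny" }
cfg.settings = cfg.settings || {}
cfg.settings.getLastErrorModes = { multiDC: { dc: 2 } }   // require 2 distinct dc values
rs.reconfig(cfg)
// afterwards: db.runCommand({ getLastError: 1, w: "multiDC" })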

77

Network Write Concern

• Specific number of nodes: {w:2}
• Majority of data nodes: {w:'majority'}
• Tag set: {w:'my tag set'}
• Wait with a timeout: {w:2, wtimeout:2000}

[Diagram: the getLastError round trip runs against the replica set]
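Continuing the placeholder collection from before, majority acknowledgement with a timeout looks like:

db.orders.insert({ _id: 2, status: "new" })
db.runCommand({ getLastError: 1, w: "majority", wtimeout: 5000 })
// acknowledged once a majority of the data nodes have the write;
// reports a timeout error if that hasn't happened within 5 seconds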

78

Wrapping it Up

79

Why Replication?

To keep your data safe and available

80

Features

• High Availability (auto-failover)
• Disaster Recovery
• No downtime for maintenance
• The replica set is "transparent" to the application
• Writes are durable with the appropriate write concern

81

Just Use It!

• Easy to set up
  – Try it on a single machine
  – Run multiple nodes on different ports on a single host (sketch below)
• Check the online documentation for replica set tutorials
  – http://docs.mongodb.org/manual/replication/#tutorials
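A minimal single-machine sketch; the ports are conventional, and the dbpath directories are placeholders that must already exist:

// At the OS prompt, start three mongods in the same set:
//   mongod --replSet my-set --port 27017 --dbpath /data/rs0
//   mongod --replSet my-set --port 27018 --dbpath /data/rs1
//   mongod --replSet my-set --port 27019 --dbpath /data/rs2
// Then, from a mongo shell connected to localhost:27017:
rs.initiate({
  _id: "my-set",
  members: [
    { _id: 0, host: "localhost:27017" },
    { _id: 1, host: "localhost:27018" },
    { _id: 2, host: "localhost:27019" }
  ]
})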

82

Questions?

83

Thank You!