Distributed Algorithms

1Distributed Algorithms – Mutual Exclusion

Distributed Algorithms

Distributed Mutual ExclusionLudovic HENRIO

[email protected] from plenty of sources, including Ghosh’s lecture


Distributed Mutual ExclusionMostly from Sukumar Ghosh's book and handsout:

1 – Introduction

2 – Solutions Using Message Passing

3 – Token Passing Algorithms

Also in Ghosh's book (not covered by this lecture):Group based mutual esclusionMutual exclusion using special instruction

Solution using Test-and-SetSolution using DEC LL and SC instruction


Distributed Mutual Exclusion

1 – Introduction2 – Solutions Using Message Passing



Why Do We Need Distributed Mutual Exclusion (DME) ?

Atomicity exists only up to a certain level:

Atomic instructions

Defines the granularity of the computation

Types of possible interleaving

Assembly Language Instruction?

Remote Procedure Call?

1- Introduction



1- Introduction

Some applications are:

• Resource sharing• Avoiding concurrent update on shared data• Controlling the grain of atomicity• Medium Access Control in Ethernet• Collision avoidance in wireless broadcasts



Example: Bank Account Operations

1- Introduction

shared n : integerProcess P

Account receives amount nP

Computation: n = n +nP:

P1. Load Reg_P, n

P2. Add Reg_P, nP

P3. Store Reg_P, n

Process Q

Account receives amount nQ

Computation: n = n +nQ:

Q1. Load Reg_Q, n

Q2. Add Reg_Q, nQ

Q3. Store Reg_Q, n


Why Do We Need DME? (example cont'd)

1- Introduction

Possible Interleaves of Executions of P and Q:2 give the expected result n= n + nP + nQ

P1, P2, P3, Q1, Q2, Q3Q1, Q2, Q3, P1, P2, P3

5 give erroneous result n = n+nQP1, Q1, P2, Q2, P3, Q3P1, P2, Q1, Q2, P3, Q3P1, Q1, Q2, P2, P3, Q3Q1, P1, Q2, P2, P3, Q3Q1, Q2, P1, P2, P3, Q3

5 give erroneous result n = n + nPQ1, P1, Q2, P2, Q3, P3Q1, Q2, P1, P2, Q3, P3Q1, P1, P2, Q2, Q3, P3P1, Q1, P2, Q2, Q3, P3P1, P2, Q1, Q2, Q3, P3


Solutions to the Mutual Exclusion Problem

1- Introduction

CS

CS

CS

CSp0

p1

p2

p3


Principle of the Mutual Exclusion Problem (2)

Each process, before entering the CS acquires the authorizatino to do so.

Critical section should eventually terminate

1- Introduction

Enter CS

<critical section>

Exit CS

Acquire authorisationEnter CS

<critical section>

Exit CS

Acquire authorisation

req=[1,1,1,0,0]


Mutual exclusion in the shared memory model – a solution for 2 processes


CorrectnessConditions...

ME1 : Mutual ExclusionAt most one process can remain in CS at any timeSafety property

ME2 : Freedom from deadlockAt least one process is eligible to enter CSLiveness property

ME3 : Fairness Every process trying to enter must eventually succeedAbsence of starvation

A measure of fairness: bounded waitingSpecifies an upper bound on the number of times a process waits for its turn to enter SC -> n-fairness (n is the MAXIMUM number of rounds)FIFO fairness when n=0

2- Solutions using Message Passing


How to Measure Performance Number of msg’s per CR invocation. Fairness measure (next slide) Synchronization Delay (SD). Response Time (RT). System Throughput (ST): ST = 1/(SD + E) where E is the average CR execution time.

last req exits CR next req enters CRSD

enters CR exits CRCR reqE

RT

Used here


Last words before getting to the algorithms …

Fault tolerance: in the case of failure, the algorithm can reorganize itself so that it continues to function without any disruption. -> not much true for the algorithms presented here

Two kinds of algorithms: logical clock based and token based.



1 – Introduction

2 – Solutions using Message Passing3 – Token Passing Algorithms


Problem formulationAssumptions

n processes (n>1), numbered 0 ... n-1, noted Pi communicating by sending / receiving messagestopology: completely connected grapheach Pi periodically wants:

1. enter the Critical Section (CS)2. execute the CS code3. eventually exit the CS code

Devise a protocol that satisfies:ME1 : Mutual ExclusionME2 : Freedom from deadlockME3 : Progress → Fairness



A trivial solution: centralized solutionTrivial solution consists in having one centralised server distributing authorisations• Queue requests and authorize one by oneDefaults:• Centralized server• Faults, contention

0

2

1

34req=[1,1,1,0,0]

Server

Acquire authorisation


Centralized solution

Use a coordinator processExternal processOne of the Pi-s

Problems:Single point of failureUnable to achieve FIFO fairness (except if CO)

Example:


i

j

coord

request CS

message

request CS

How to anticipatethis late arrival?

clients

busy: boolean

server

queue

req replyrelease

What if a timestamp is given when sending the CS request?


Lamport's SolutionAssumptions:

Each communication channel is FIFOEach process maintains a request queue Q

Algorithm described by 5 rulesLA1. To request entry, send a time-stamped message to every other process and

enqueue to local Q (of sender)LA2. Upon reception place request in Q and send time-stamped ACK but once out

of CS (possibly immediately if already out)LA3. Enter CS when:

1. request first in Q (chronological order)2. all ACK received from others

LA4. To exit CS, a process must:3. delete request from Q4. send time-stamped release message to others

LA5. When receiving a release msg, remove request from Q


Run an example with 3 processes and different interleavings


Analysis of Lamport's Solution2- Solutions using Message Passing

Can you show that it satisfies all the properties(i.e. ME1, ME2, ME3) of a correct solution?

Observation. Processes taking a decision to enter CS must have identical views of their local queues, when all ACKs have been received.

Proof of ME1. At most one process can be in its CS at any time.

Suppose not, and both j,k enter their CS. This implies

j in CS Qj.ts.j < Qk.ts.k k in CS Qk.ts.k < Qj.ts.j

Impossible.

WHY?

WHY?


Analysis of Lamport's Solution (2)2- Solutions Using Message Passing

Proof of ME2. (No deadlock)

The waiting chain is acyclic.

i waits for j i is behind j in all queues

(or j is in its CS) j does not wait for i

Proof of ME3. (progress)New requests join the end of the

queues,

so new requests do not pass

the old ones

WHY? ALWAYS?


Analysis of Lamport's Solution (3)

Proof of FIFO fairness. timestamp (j) < timestamp (k)

j enters its CS before k does so

Suppose not. So, k enters its CS before j. So k did not receive j’s request. But k received the ack from j for its own req.

This is impossible if the channels are FIFO.

Message complexity = 3(N-1) (per trip to CS)(N-1 requests + N-1 ack + N-1 release)

k j

Req(20)

ack

Req (30)

Exercise:IS THIS REALLY FIFO FAIRNESS? (remember there is no global time)If no: is the system n-fair? For which n? why? (note: a lot of messages are

exchanged, which gives the opportunity for different clocks to synchronise)


Ricart & Agrawala’s Solution2- Solutions Using Message Passing

What is new?1. Broadcast a timestamped request to all.2. Upon receiving a request, send ack if

-You do not want to enter your CS, or -You are trying to enter your CS, but your timestamp is higher than that of the sender.(If you are already in CS, then buffer the request)

3. Enter CS, when you receive ack from all.4. Upon exit from CS, send ack to each pending request before making a new request.(No release message is necessary)

Run an example with 3 processes and different interleavings


Analysis of Ricart & Agrawala’s Solution2- Solutions Using Message Passing

ExerciseME1. Prove that at most one process can be in CS.

ME2. Prove that deadlock is not possible.

ME3. Prove that FIFO fairness holds even if channels are not FIFO (note: this is the same

fairness as in Lamport’s solution)

Message complexity = 2(N-1)(N-1 requests + N-1 acks - no release message)

k j

Req(j)

Ack(j)

TS(j) < TS(k)

Req(k)


ExercisesA Generalized version of the mutual exclusion problem in which up to L processes (L ³1) are allowed to be in their critical sections simultaneously is known as the L-exclusion problem. Precisely, if fewer than L processes are in the CS at any time and one more process wants to enter it, it must be allowed to do so. Modify R.-A. algorithm to solve the L-exclusion problem.



1 – Introduction

2 – Solutions Using Message Passing



Token Ring Approach

Operations:Only the process holding the token can

enter the CS. To enter the critical section, wait passively

for the token. When in CS, hold on to the token.

To exit the CS, the process sends the token onto its neighbor.

If a process does not want to enter the CS when it receives the token, it forwards the token to the next neighbor.

P0

P1

P2

P3

PN-1

Previous holder of token

next holder of token

current holder of token

Processes are organized in a logical ring: pi has a communication channel to p(i+1) mod (n).


The basic ring approachFeatures:

Safety & liveness are guaranteed, but ordering is not.Bandwidth: 1 message per exit(N-1) -fairnessDelay between one process’s exit from the CS and the

next process’s entry is between 1 and N-1 message transmissions.


Suzuki-Kasami Solution3- Tokens passing algorithms

A mix of the lamport queue and the token approach

Completely connected network of processes

There is one token in the network. The holder of the token has the permission to enter CS.

Any other process trying to enter CS must acquire that token. Thus the token will move from one process to another based on demand. I want to enter CS

I want to enter CS


Suzuki-Kasami Algorithm3- Tokens passing algorithms

Process i broadcasts (i, num)

Each process maintains-an array req: req[j] denotes the sequence nb of the latest request from process j(Some requests will be stale soon)

Additionally, the holder of the token maintains-an array last: last[j] denotes the sequence number of the latest visit to CS for process j.- a queue Q of waiting processes req: array[0..n-1] of integer

last: array [0..n-1] of integer

Sequence number of the request

req

req

req

reqlast

queue Q


Suzuki-Kasami Algorithm (2)3- Tokens passing algorithms

When a process i receives a request (k, num) from process k, it sets req[k] to max(req[k], num). The holder of the token

--Completes its CS--Sets last[i]:= its own num--Updates Q by retaining each process k only if1+ last[k] = req[k] (This guarantees the freshness of the request)--Sends the token to the head of Q, along withthe array last and the tail of Q

In fact, token (Q, last)

Req: array[0..n-1] of integer

Last: Array [0..n-1] of integer


Suzuki-Kasami Algorithm (3)3- Tokens passing algorithms

{Program of process j}Initially, i: req[i] = last[i] = 0* Entry protocol *

req[j] := req[j] + 1Send (j, req[j]) to allWait until token (Q, last) arrivesCritical Section

* Exit protocol *last[j] := req[j]k ≠ j: k Q req[k] = last[k] + 1 append k to Q;if Q is not empty send (tail-of-Q, last) to head-of-Q fi

* Upon receiving a request (k, num) *req[k] := max(req[k], num)


Example of Suzuki-Kasami Algorithm Execution

3- Tokens passing algorithms

0

2

1

34

req=[1,0,0,0,0]last=[0,0,0,0,0]

req=[1,0,0,0,0]

req=[1,0,0,0,0]

req=[1,0,0,0,0]

req=[1,0,0,0,0]

initial state: process 0 has sent a request to all, and grabbed the token




0

2

1

34

req=[1,1,1,0,0]last=[0,0,0,0,0]

req=[1,1,1,0,0]

req=[1,1,1,0,0]

req=[1,1,1,0,0]

req=[1,1,1,0,0]

1 & 2 send requests to enter CS




0

2

1

34

req=[1,1,1,0,0]last=[1,0,0,0,0]Q=(1,2)

req=[1,1,1,0,0]

req=[1,1,1,0,0]

req=[1,1,1,0,0]

req=[1,1,1,0,0]

0 prepares to exit CS




0

2

1

34

req=[1,1,1,0,0]req=[1,1,1,0,0]last=[1,0,0,0,0]Q=(2)

req=[1,1,1,0,0]

req=[1,1,1,0,0]

req=[1,1,1,0,0]

0 passes token (Q and last) to 1




0

2

1

34

req=[2,1,1,1,0]

req=[2,1,1,1,0]last=[1,0,0,0,0]Q=(2,0,3)

req=[2,1,1,1,0]

req=[2,1,1,1,0]

req=[2,1,1,1,0]

0 and 3 send requests




0

2

1

34

req=[2,1,1,1,0]req=[2,1,1,1,0]

req=[2,1,1,1,0]last=[1,1,0,0,0]Q=(0,3)

req=[2,1,1,1,0]

req=[2,1,1,1,0]

1 sends token to 2


Summary and advantagesToken-based + queue :• Satisfies ME1 to ME3• Less messages: N by CS• Question: is this algorithm fair? All messages

received during the CS are enqueued at the same position, cannot we do better?

• Note: index can be bound• Note 2: A similar algorithm was published by

Ricart and Agrawa at the same period

WHY?

WHY?


Back to message passing: Maekawa’s Solution

First solution with a sublinear O(sqrt N) message complexity.

“Close to” Ricart-Agrawala’s solution, but each process is required to obtain permission from only a subset of peers


Maekawa’s Algorithm2- Solutions Using Message Passing

• With each process i, associate a subset Si.Divide the set of processes into subsets that satisfy the following two conditions:

i Si

i,j : i,j n-1 :: Si Sj ≠

• Main idea. Each process i is required to receive permission from Si only. Correctness requires that multiple processes will never receive permission from all members of their respective subsets.

0,1,2 1,3,5

2,4,5

S0S1

S2


Maekawa’s Algorithm2- Solutions Using Message Passing

Example. Let there be seven processes 0, 1, 2, 3, 4, 5, 6

S0 = {0, 1, 2}S1 = {1, 3, 5}S2 = {2, 4, 5}S3 = {0, 3, 4}S4 = {1, 4, 6}S5 = {0, 5, 6}S6 = {2, 3, 6}


Maekawa’s Algorithm (example cont'd)2- Solutions Using Message Passing

Version 1 {Life of process I}

1. Send timestamped request to each process in Si.2. Request received send ack to process with the

lowest timestamp. Thereafter, "lock" (i.e. commit) yourself to that process, and keep others waiting.

3. Enter CS if you receive an ack from each member in Si.

4. To exit CS, send release to every process in Si.5. Release received unlock yourself. Then send

ack to the next process with the lowest timestamp.

S0 = {0, 1, 2}S1 = {1, 3, 5}S2 = {2, 4, 5}S3 = {0, 3, 4}S4 = {1, 4, 6}S5 = {0, 5, 6}S6 = {2, 3, 6}


Analysis of Maekawa’s Algorithm (version 1)

2- Solutions Using Message Passing

ME1. At most one process can enter its critical section at any time.

Let i and j attempt to enter their Critical SectionsSi Sj ≠ there is a process k Si Sj

Process k will never send ack to both. So it will act as the arbitrator and establishes ME1

S0 = {0, 1, 2}S1 = {1, 3, 5}S2 = {2, 4, 5}S3 = {0, 3, 4}S4 = {1, 4, 6}S5 = {0, 5, 6}S6 = {2, 3, 6}


Analysis of Maekawa’s Algorithm (version 1)


ME2. No deadlock. Unfortunately deadlock is possible! Assume 0, 1, 2 want to enter their critical sections.

From S0= {0,1,2}, 0,2 send ack to 0, but 1 sends ack to 1;



Now, 0 waits for 1 (to send a release), 1 waits for 2 (to send a

release), , and 2 waits for 0 (to send a release), . So

deadlock is possible!

S0 = {0, 1, 2}S1 = {1, 3, 5}S2 = {2, 4, 5}S3 = {0, 3, 4}S4 = {1, 4, 6}S5 = {0, 5, 6}S6 = {2, 3, 6}


Problem: Potential of Deadlock

Without the loss of generality, assume that three sites Si, Sj and Sk simultaneously invoke Mutual Exclusion, and suppose:

Ri Rj = {Su} Rj Rk = {Sv} Rk Ri = {Sw}

Solution: extra msg’s to detect deadlock, maximum number of msg’s = 5(sqtr(n) + 1).

SiSw

Sk

Sv

Sj

SuAckW

W Ack

WAck


Maekawa’s Algorithm, avoiding deadlocks


Avoiding deadlockIf processes always receive messages in

increasing order of timestamp, then deadlock “could be” avoided. But this is too strong an assumption.

Another more complex version exists that should ensure absence of deadlocks (not presented here)

S0 = {0, 1, 2}S1 = {1, 3, 5}S2 = {2, 4, 5}S3 = {0, 3, 4}S4 = {1, 4, 6}S5 = {0, 5, 6}S6 = {2, 3, 6}


EXERCISES


Dekker’s algorithmDesigned for shared memory (oldest)Does it verify ME1 – ME2 – ME3?


Bounded timestamps

Can we apply this idea to algorithms using timestamps shown in this lecture?


Fault toleranceTake the basic token algorithm. (RING)Suppose processes are given a numberSuppose a process crashes but a new ring is automatically

created (keeping the old numbers)What happens if the disappeared process did not have the tokenWhat if it had the tokenWhat information can we add to the token so that it can be regenerated if necessary?

if the disappeared process was neither P0 nor Pn

if it was P0if it was Pn (how to detect it was Pn?)

See Le Lann 77

Distributed Algorithms

Documents

Transcript of Distributed Algorithms