Distributed Algorithms

50
1 Distributed Algorithms – Mutual Exclusion Distributed Algorithms Distributed Mutual Exclusion Ludovic HENRIO [email protected] Borrowed from plenty of sources, including Ghosh’s lecture

description

Distributed Mutual Exclusion Ludovic HENRIO Ludovic.henrio @inria . fr Borrowed from plenty of sources, including Ghosh’s lecture. Distributed Algorithms. Distributed Mutual Exclusion. Mostly from Sukumar Ghosh's book and handsout : 1 – Introduction - PowerPoint PPT Presentation

Transcript of Distributed Algorithms

Page 1: Distributed Algorithms

1Distributed Algorithms – Mutual Exclusion

Distributed Algorithms

Distributed Mutual ExclusionLudovic HENRIO

[email protected] from plenty of sources, including Ghosh’s lecture

Page 2: Distributed Algorithms

2Distributed Algorithms – Mutual Exclusion

Distributed Mutual ExclusionMostly from Sukumar Ghosh's book and handsout:

1 – Introduction

2 – Solutions Using Message Passing

3 – Token Passing Algorithms

Also in Ghosh's book (not covered by this lecture):Group based mutual esclusionMutual exclusion using special instruction

Solution using Test-and-SetSolution using DEC LL and SC instruction

Page 3: Distributed Algorithms

3Distributed Algorithms – Mutual Exclusion

Distributed Mutual Exclusion

1 – Introduction2 – Solutions Using Message Passing

3 – Token Passing Algorithms

Page 4: Distributed Algorithms

4Distributed Algorithms – Mutual Exclusion

Why Do We Need Distributed Mutual Exclusion (DME) ?

Atomicity exists only up to a certain level:

Atomic instructions

Defines the granularity of the computation

Types of possible interleaving

Assembly Language Instruction?

Remote Procedure Call?

1- Introduction

Page 5: Distributed Algorithms

5Distributed Algorithms – Mutual Exclusion

Why Do We Need Distributed Mutual Exclusion (DME) ?

1- Introduction

Some applications are:

• Resource sharing• Avoiding concurrent update on shared data• Controlling the grain of atomicity• Medium Access Control in Ethernet• Collision avoidance in wireless broadcasts

Page 6: Distributed Algorithms

6Distributed Algorithms – Mutual Exclusion

Why Do We Need Distributed Mutual Exclusion (DME) ?

Example: Bank Account Operations

1- Introduction

shared n : integerProcess P

Account receives amount nP

Computation: n = n +nP:

P1. Load Reg_P, n

P2. Add Reg_P, nP

P3. Store Reg_P, n

Process Q

Account receives amount nQ

Computation: n = n +nQ:

Q1. Load Reg_Q, n

Q2. Add Reg_Q, nQ

Q3. Store Reg_Q, n

Page 7: Distributed Algorithms

7Distributed Algorithms – Mutual Exclusion

Why Do We Need DME? (example cont'd)

1- Introduction

Possible Interleaves of Executions of P and Q:2 give the expected result n= n + nP + nQ

P1, P2, P3, Q1, Q2, Q3Q1, Q2, Q3, P1, P2, P3

5 give erroneous result n = n+nQP1, Q1, P2, Q2, P3, Q3P1, P2, Q1, Q2, P3, Q3P1, Q1, Q2, P2, P3, Q3Q1, P1, Q2, P2, P3, Q3Q1, Q2, P1, P2, P3, Q3

5 give erroneous result n = n + nPQ1, P1, Q2, P2, Q3, P3Q1, Q2, P1, P2, Q3, P3Q1, P1, P2, Q2, Q3, P3P1, Q1, P2, Q2, Q3, P3P1, P2, Q1, Q2, Q3, P3

Page 8: Distributed Algorithms

8Distributed Algorithms – Mutual Exclusion

Solutions to the Mutual Exclusion Problem

1- Introduction

CS

CS

CS

CSp0

p1

p2

p3

Page 9: Distributed Algorithms

9Distributed Algorithms – Mutual Exclusion

Principle of the Mutual Exclusion Problem (2)

Each process, before entering the CS acquires the authorizatino to do so.

Critical section should eventually terminate

1- Introduction

Enter CS

<critical section>

Exit CS

Acquire authorisationEnter CS

<critical section>

Exit CS

Acquire authorisation

req=[1,1,1,0,0]

Page 10: Distributed Algorithms

10Distributed Algorithms – Mutual Exclusion

Mutual exclusion in the shared memory model – a solution for 2 processes

Page 11: Distributed Algorithms

11Distributed Algorithms – Mutual Exclusion

CorrectnessConditions...

ME1 : Mutual ExclusionAt most one process can remain in CS at any timeSafety property

ME2 : Freedom from deadlockAt least one process is eligible to enter CSLiveness property

ME3 : Fairness Every process trying to enter must eventually succeedAbsence of starvation

A measure of fairness: bounded waitingSpecifies an upper bound on the number of times a process waits for its turn to enter SC -> n-fairness (n is the MAXIMUM number of rounds)FIFO fairness when n=0

2- Solutions using Message Passing

Page 12: Distributed Algorithms

12Distributed Algorithms – Mutual Exclusion

How to Measure Performance Number of msg’s per CR invocation. Fairness measure (next slide) Synchronization Delay (SD). Response Time (RT). System Throughput (ST): ST = 1/(SD + E) where E is the average CR execution time.

last req exits CR next req enters CRSD

enters CR exits CRCR reqE

RT

Used here

Page 13: Distributed Algorithms

13Distributed Algorithms – Mutual Exclusion

Last words before getting to the algorithms …

Fault tolerance: in the case of failure, the algorithm can reorganize itself so that it continues to function without any disruption. -> not much true for the algorithms presented here

Two kinds of algorithms: logical clock based and token based.

Page 14: Distributed Algorithms

14Distributed Algorithms – Mutual Exclusion

Distributed Mutual Exclusion

1 – Introduction

2 – Solutions using Message Passing3 – Token Passing Algorithms

Page 15: Distributed Algorithms

15Distributed Algorithms – Mutual Exclusion

Problem formulationAssumptions

n processes (n>1), numbered 0 ... n-1, noted Pi communicating by sending / receiving messagestopology: completely connected grapheach Pi periodically wants:

1. enter the Critical Section (CS)2. execute the CS code3. eventually exit the CS code

Devise a protocol that satisfies:ME1 : Mutual ExclusionME2 : Freedom from deadlockME3 : Progress → Fairness

2- Solutions using Message Passing

Page 16: Distributed Algorithms

16Distributed Algorithms – Mutual Exclusion

A trivial solution: centralized solutionTrivial solution consists in having one centralised server distributing authorisations• Queue requests and authorize one by oneDefaults:• Centralized server• Faults, contention

0

2

1

34req=[1,1,1,0,0]

Server

Acquire authorisation

Page 17: Distributed Algorithms

17Distributed Algorithms – Mutual Exclusion

Centralized solution

Use a coordinator processExternal processOne of the Pi-s

Problems:Single point of failureUnable to achieve FIFO fairness (except if CO)

Example:

2- Solutions using Message Passing

i

j

coord

request CS

message

request CS

How to anticipatethis late arrival?

clients

busy: boolean

server

queue

req replyrelease

What if a timestamp is given when sending the CS request?

Page 18: Distributed Algorithms

18Distributed Algorithms – Mutual Exclusion

Lamport's SolutionAssumptions:

Each communication channel is FIFOEach process maintains a request queue Q

Algorithm described by 5 rulesLA1. To request entry, send a time-stamped message to every other process and

enqueue to local Q (of sender)LA2. Upon reception place request in Q and send time-stamped ACK but once out

of CS (possibly immediately if already out)LA3. Enter CS when:

1. request first in Q (chronological order)2. all ACK received from others

LA4. To exit CS, a process must:3. delete request from Q4. send time-stamped release message to others

LA5. When receiving a release msg, remove request from Q

2- Solutions using Message Passing

Run an example with 3 processes and different interleavings

Page 19: Distributed Algorithms

19Distributed Algorithms – Mutual Exclusion

Analysis of Lamport's Solution2- Solutions using Message Passing

Can you show that it satisfies all the properties(i.e. ME1, ME2, ME3) of a correct solution?

Observation. Processes taking a decision to enter CS must have identical views of their local queues, when all ACKs have been received.

Proof of ME1. At most one process can be in its CS at any time.

Suppose not, and both j,k enter their CS. This implies

j in CS Qj.ts.j < Qk.ts.k k in CS Qk.ts.k < Qj.ts.j

Impossible.

WHY?

WHY?

Page 20: Distributed Algorithms

20Distributed Algorithms – Mutual Exclusion

Analysis of Lamport's Solution (2)2- Solutions Using Message Passing

Proof of ME2. (No deadlock)

The waiting chain is acyclic.

i waits for j i is behind j in all queues

(or j is in its CS) j does not wait for i

Proof of ME3. (progress)New requests join the end of the

queues,

so new requests do not pass

the old ones

WHY? ALWAYS?

Page 21: Distributed Algorithms

21Distributed Algorithms – Mutual Exclusion

Analysis of Lamport's Solution (3)

Proof of FIFO fairness. timestamp (j) < timestamp (k)

j enters its CS before k does so

Suppose not. So, k enters its CS before j. So k did not receive j’s request. But k received the ack from j for its own req.

This is impossible if the channels are FIFO.

Message complexity = 3(N-1) (per trip to CS)(N-1 requests + N-1 ack + N-1 release)

k j

Req(20)

ack

Req (30)

Exercise:IS THIS REALLY FIFO FAIRNESS? (remember there is no global time)If no: is the system n-fair? For which n? why? (note: a lot of messages are

exchanged, which gives the opportunity for different clocks to synchronise)

Page 22: Distributed Algorithms

22Distributed Algorithms – Mutual Exclusion

Ricart & Agrawala’s Solution2- Solutions Using Message Passing

What is new?1. Broadcast a timestamped request to all.2. Upon receiving a request, send ack if

-You do not want to enter your CS, or -You are trying to enter your CS, but your timestamp is higher than that of the sender.(If you are already in CS, then buffer the request)

3. Enter CS, when you receive ack from all.4. Upon exit from CS, send ack to each pending request before making a new request.(No release message is necessary)

Run an example with 3 processes and different interleavings

Page 23: Distributed Algorithms

23Distributed Algorithms – Mutual Exclusion

Analysis of Ricart & Agrawala’s Solution2- Solutions Using Message Passing

ExerciseME1. Prove that at most one process can be in CS.

ME2. Prove that deadlock is not possible.

ME3. Prove that FIFO fairness holds even if channels are not FIFO (note: this is the same

fairness as in Lamport’s solution)

Message complexity = 2(N-1)(N-1 requests + N-1 acks - no release message)

k j

Req(j)

Ack(j)

TS(j) < TS(k)

Req(k)

Page 24: Distributed Algorithms

24Distributed Algorithms – Mutual Exclusion

ExercisesA Generalized version of the mutual exclusion problem in which up to L processes (L ³1) are allowed to be in their critical sections simultaneously is known as the L-exclusion problem. Precisely, if fewer than L processes are in the CS at any time and one more process wants to enter it, it must be allowed to do so. Modify R.-A. algorithm to solve the L-exclusion problem.

Page 25: Distributed Algorithms

25Distributed Algorithms – Mutual Exclusion

Distributed Mutual Exclusion

1 – Introduction

2 – Solutions Using Message Passing

3 – Token Passing Algorithms

Page 26: Distributed Algorithms

26Distributed Algorithms – Mutual Exclusion

Token Ring Approach

Operations:Only the process holding the token can

enter the CS. To enter the critical section, wait passively

for the token. When in CS, hold on to the token.

To exit the CS, the process sends the token onto its neighbor.

If a process does not want to enter the CS when it receives the token, it forwards the token to the next neighbor.

P0

P1

P2

P3

PN-1

Previous holder of token

next holder of token

current holder of token

Processes are organized in a logical ring: pi has a communication channel to p(i+1) mod (n).

Page 27: Distributed Algorithms

27Distributed Algorithms – Mutual Exclusion

The basic ring approachFeatures:

Safety & liveness are guaranteed, but ordering is not.Bandwidth: 1 message per exit(N-1) -fairnessDelay between one process’s exit from the CS and the

next process’s entry is between 1 and N-1 message transmissions.

Page 28: Distributed Algorithms

28Distributed Algorithms – Mutual Exclusion

Suzuki-Kasami Solution3- Tokens passing algorithms

A mix of the lamport queue and the token approach

Completely connected network of processes

There is one token in the network. The holder of the token has the permission to enter CS.

Any other process trying to enter CS must acquire that token. Thus the token will move from one process to another based on demand. I want to enter CS

I want to enter CS

Page 29: Distributed Algorithms

29Distributed Algorithms – Mutual Exclusion

Suzuki-Kasami Algorithm3- Tokens passing algorithms

Process i broadcasts (i, num)

Each process maintains-an array req: req[j] denotes the sequence nb of the latest request from process j(Some requests will be stale soon)

Additionally, the holder of the token maintains-an array last: last[j] denotes the sequence number of the latest visit to CS for process j.- a queue Q of waiting processes req: array[0..n-1] of integer

last: array [0..n-1] of integer

Sequence number of the request

req

req

req

reqlast

queue Q

Page 30: Distributed Algorithms

30Distributed Algorithms – Mutual Exclusion

Suzuki-Kasami Algorithm (2)3- Tokens passing algorithms

When a process i receives a request (k, num) from process k, it sets req[k] to max(req[k], num). The holder of the token

--Completes its CS--Sets last[i]:= its own num--Updates Q by retaining each process k only if1+ last[k] = req[k] (This guarantees the freshness of the request)--Sends the token to the head of Q, along withthe array last and the tail of Q

In fact, token (Q, last)

Req: array[0..n-1] of integer

Last: Array [0..n-1] of integer

Page 31: Distributed Algorithms

31Distributed Algorithms – Mutual Exclusion

Suzuki-Kasami Algorithm (3)3- Tokens passing algorithms

{Program of process j}Initially, i: req[i] = last[i] = 0* Entry protocol *

req[j] := req[j] + 1Send (j, req[j]) to allWait until token (Q, last) arrivesCritical Section

* Exit protocol *last[j] := req[j]k ≠ j: k Q req[k] = last[k] + 1 append k to Q;if Q is not empty send (tail-of-Q, last) to head-of-Q fi

* Upon receiving a request (k, num) *req[k] := max(req[k], num)

Page 32: Distributed Algorithms

32Distributed Algorithms – Mutual Exclusion

Example of Suzuki-Kasami Algorithm Execution

3- Tokens passing algorithms

0

2

1

34

req=[1,0,0,0,0]last=[0,0,0,0,0]

req=[1,0,0,0,0]

req=[1,0,0,0,0]

req=[1,0,0,0,0]

req=[1,0,0,0,0]

initial state: process 0 has sent a request to all, and grabbed the token

Page 33: Distributed Algorithms

33Distributed Algorithms – Mutual Exclusion

Example of Suzuki-Kasami Algorithm Execution

3- Tokens passing algorithms

0

2

1

34

req=[1,1,1,0,0]last=[0,0,0,0,0]

req=[1,1,1,0,0]

req=[1,1,1,0,0]

req=[1,1,1,0,0]

req=[1,1,1,0,0]

1 & 2 send requests to enter CS

Page 34: Distributed Algorithms

34Distributed Algorithms – Mutual Exclusion

Example of Suzuki-Kasami Algorithm Execution

3- Tokens passing algorithms

0

2

1

34

req=[1,1,1,0,0]last=[1,0,0,0,0]Q=(1,2)

req=[1,1,1,0,0]

req=[1,1,1,0,0]

req=[1,1,1,0,0]

req=[1,1,1,0,0]

0 prepares to exit CS

Page 35: Distributed Algorithms

35Distributed Algorithms – Mutual Exclusion

Example of Suzuki-Kasami Algorithm Execution

3- Tokens passing algorithms

0

2

1

34

req=[1,1,1,0,0]req=[1,1,1,0,0]last=[1,0,0,0,0]Q=(2)

req=[1,1,1,0,0]

req=[1,1,1,0,0]

req=[1,1,1,0,0]

0 passes token (Q and last) to 1

Page 36: Distributed Algorithms

36Distributed Algorithms – Mutual Exclusion

Example of Suzuki-Kasami Algorithm Execution

3- Tokens passing algorithms

0

2

1

34

req=[2,1,1,1,0]

req=[2,1,1,1,0]last=[1,0,0,0,0]Q=(2,0,3)

req=[2,1,1,1,0]

req=[2,1,1,1,0]

req=[2,1,1,1,0]

0 and 3 send requests

Page 37: Distributed Algorithms

37Distributed Algorithms – Mutual Exclusion

Example of Suzuki-Kasami Algorithm Execution

3- Tokens passing algorithms

0

2

1

34

req=[2,1,1,1,0]req=[2,1,1,1,0]

req=[2,1,1,1,0]last=[1,1,0,0,0]Q=(0,3)

req=[2,1,1,1,0]

req=[2,1,1,1,0]

1 sends token to 2

Page 38: Distributed Algorithms

38Distributed Algorithms – Mutual Exclusion

Summary and advantagesToken-based + queue :• Satisfies ME1 to ME3• Less messages: N by CS• Question: is this algorithm fair? All messages

received during the CS are enqueued at the same position, cannot we do better?

• Note: index can be bound• Note 2: A similar algorithm was published by

Ricart and Agrawa at the same period

WHY?

WHY?

Page 39: Distributed Algorithms

39Distributed Algorithms – Mutual Exclusion

Back to message passing: Maekawa’s Solution

First solution with a sublinear O(sqrt N) message complexity.

“Close to” Ricart-Agrawala’s solution, but each process is required to obtain permission from only a subset of peers

Page 40: Distributed Algorithms

40Distributed Algorithms – Mutual Exclusion

Maekawa’s Algorithm2- Solutions Using Message Passing

• With each process i, associate a subset Si.Divide the set of processes into subsets that satisfy the following two conditions:

i Si

i,j : i,j n-1 :: Si Sj ≠

• Main idea. Each process i is required to receive permission from Si only. Correctness requires that multiple processes will never receive permission from all members of their respective subsets.

0,1,2 1,3,5

2,4,5

S0S1

S2

Page 41: Distributed Algorithms

41Distributed Algorithms – Mutual Exclusion

Maekawa’s Algorithm2- Solutions Using Message Passing

Example. Let there be seven processes 0, 1, 2, 3, 4, 5, 6

S0 = {0, 1, 2}S1 = {1, 3, 5}S2 = {2, 4, 5}S3 = {0, 3, 4}S4 = {1, 4, 6}S5 = {0, 5, 6}S6 = {2, 3, 6}

Page 42: Distributed Algorithms

42Distributed Algorithms – Mutual Exclusion

Maekawa’s Algorithm (example cont'd)2- Solutions Using Message Passing

Version 1 {Life of process I}

1. Send timestamped request to each process in Si.2. Request received send ack to process with the

lowest timestamp. Thereafter, "lock" (i.e. commit) yourself to that process, and keep others waiting.

3. Enter CS if you receive an ack from each member in Si.

4. To exit CS, send release to every process in Si.5. Release received unlock yourself. Then send

ack to the next process with the lowest timestamp.

S0 = {0, 1, 2}S1 = {1, 3, 5}S2 = {2, 4, 5}S3 = {0, 3, 4}S4 = {1, 4, 6}S5 = {0, 5, 6}S6 = {2, 3, 6}

Page 43: Distributed Algorithms

43Distributed Algorithms – Mutual Exclusion

Analysis of Maekawa’s Algorithm (version 1)

2- Solutions Using Message Passing

ME1. At most one process can enter its critical section at any time.

Let i and j attempt to enter their Critical SectionsSi Sj ≠ there is a process k Si Sj

Process k will never send ack to both. So it will act as the arbitrator and establishes ME1

S0 = {0, 1, 2}S1 = {1, 3, 5}S2 = {2, 4, 5}S3 = {0, 3, 4}S4 = {1, 4, 6}S5 = {0, 5, 6}S6 = {2, 3, 6}

Page 44: Distributed Algorithms

44Distributed Algorithms – Mutual Exclusion

Analysis of Maekawa’s Algorithm (version 1)

2- Solutions Using Message Passing

ME2. No deadlock. Unfortunately deadlock is possible! Assume 0, 1, 2 want to enter their critical sections.

From S0= {0,1,2}, 0,2 send ack to 0, but 1 sends ack to 1;

From S1= {1,3,5}, 1,3 send ack to 1, but 5 sends ack to 2;

From S2= {2,4,5}, 4,5 send ack to 2, but 2 sends ack to 0;

Now, 0 waits for 1 (to send a release), 1 waits for 2 (to send a

release), , and 2 waits for 0 (to send a release), . So

deadlock is possible!

S0 = {0, 1, 2}S1 = {1, 3, 5}S2 = {2, 4, 5}S3 = {0, 3, 4}S4 = {1, 4, 6}S5 = {0, 5, 6}S6 = {2, 3, 6}

Page 45: Distributed Algorithms

45Distributed Algorithms – Mutual Exclusion

Problem: Potential of Deadlock

Without the loss of generality, assume that three sites Si, Sj and Sk simultaneously invoke Mutual Exclusion, and suppose:

Ri Rj = {Su} Rj Rk = {Sv} Rk Ri = {Sw}

Solution: extra msg’s to detect deadlock, maximum number of msg’s = 5(sqtr(n) + 1).

SiSw

Sk

Sv

Sj

SuAckW

W Ack

WAck

Page 46: Distributed Algorithms

46Distributed Algorithms – Mutual Exclusion

Maekawa’s Algorithm, avoiding deadlocks

2- Solutions Using Message Passing

Avoiding deadlockIf processes always receive messages in

increasing order of timestamp, then deadlock “could be” avoided. But this is too strong an assumption.

Another more complex version exists that should ensure absence of deadlocks (not presented here)

S0 = {0, 1, 2}S1 = {1, 3, 5}S2 = {2, 4, 5}S3 = {0, 3, 4}S4 = {1, 4, 6}S5 = {0, 5, 6}S6 = {2, 3, 6}

Page 47: Distributed Algorithms

47Distributed Algorithms – Mutual Exclusion

EXERCISES

Page 48: Distributed Algorithms

48Distributed Algorithms – Mutual Exclusion

Dekker’s algorithmDesigned for shared memory (oldest)Does it verify ME1 – ME2 – ME3?

Page 49: Distributed Algorithms

49Distributed Algorithms – Mutual Exclusion

Bounded timestamps

Can we apply this idea to algorithms using timestamps shown in this lecture?

Page 50: Distributed Algorithms

50Distributed Algorithms – Mutual Exclusion

Fault toleranceTake the basic token algorithm. (RING)Suppose processes are given a numberSuppose a process crashes but a new ring is automatically

created (keeping the old numbers)What happens if the disappeared process did not have the tokenWhat if it had the tokenWhat information can we add to the token so that it can be regenerated if necessary?

if the disappeared process was neither P0 nor Pn

if it was P0if it was Pn (how to detect it was Pn?)

See Le Lann 77