Distributed Computing: Principles, Algorithms, and Systems
Introduction
Mutual exclusion: Concurrent access of processes to a shared resource or
data is executed in mutually exclusive manner.
Only one process is allowed to execute the critical section (CS) at any given
time.
In a distributed system, shared variables (semaphores) or a local kernel
cannot be used to implement mutual exclusion.
Message passing is the sole means for implementing distributed mutual
exclusion.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 2 / 93
Distributed Computing: Principles, Algorithms, and Systems
Introduction
Distributed mutual exclusion algorithms must deal with unpredictable
message delays and incomplete knowledge of the system state.
Three basic approaches for distributed mutual exclusion:
1 Token based approach
2 Non-token based approach
3 Quorum based approach
Token-based approach:
◮ A unique token is shared among the sites.
◮ A site is allowed to enter its CS if it possesses the token.
◮ Mutual exclusion is ensured because the token is unique.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 3 / 93
Distributed Computing: Principles, Algorithms, and Systems
Introduction
Non-token based approach:
◮ Two or more successive rounds of messages are exchanged among the sites to
determine which site will enter the CS next.
Quorum based approach:
◮ Each site requests permission to execute the CS from a subset of sites (called
a quorum).
◮ Any two quorums contain a common site.
◮ This common site is responsible to make sure that only one request executes
the CS at any time.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 4 / 93
Distributed Computing: Principles, Algorithms, and Systems
Preliminaries
System Model
The system consists of N sites, S1 , S2 , ..., SN .
We assume that a single process is running on each site. The process at site
Si is denoted by pi .
A site can be in one of the following three states: requesting the CS,
executing the CS, or neither requesting nor executing the CS (i.e., idle).
In the ‘requesting the CS’ state, the site is blocked and can not make further
requests for the CS. In the ‘idle’ state, the site is executing outside the CS.
In token-based algorithms, a site can also be in a state where a site holding
the token is executing outside the CS (called the idle token state).
At any instant, a site may have several pending requests for CS. A site
queues up these requests and serves them one at a time.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 5 / 93
Distributed Computing: Principles, Algorithms, and Systems
Requirements
Requirements of Mutual Exclusion Algorithms
1 Safety Property: At any instant, only one process can execute the critical
section.
2 Liveness Property: This property states the absence of deadlock and
starvation. Two or more sites should not endlessly wait for messages which
will never arrive.
3 Fairness: Each process gets a fair chance to execute the CS. Fairness
property generally means the CS execution requests are executed in the order
of their arrival (time is determined by a logical clock) in the system.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 6 / 93
Distributed Computing: Principles, Algorithms, and Systems
Performance Metrics
The performance is generally measured by the following four metrics:
Message complexity: The number of messages required per CS execution
by a site.
Synchronization delay: After a site leaves the CS, it is the time required
and before the next site enters the CS (see Figure 1).
Last site exits the CS
Next site enters the CS
time
Synchronization delay
Figure 1: Synchronization Delay.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 7 / 93
Distributed Computing: Principles, Algorithms, and Systems
Performance Metrics
Response time: The time interval a request waits for its CS execution to be
over after its request messages have been sent out (see Figure 2).
CS Request arrives
The site enters
Its request the CS The site exits the CS
messages sent out
CS execution time time
Response Time
Figure 2: Response Time.
System throughput: The rate at which the system executes requests for the
CS.
system throughput=1/(SD+E )
where SD is the synchronization delay and E is the average critical section
execution time.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 8 / 93
Distributed Computing: Principles, Algorithms, and Systems
Performance Metrics
Low and High Load Performance:
We often study the performance of mutual exclusion algorithms under two
special loading conditions, viz., “low load” and “high load”.
The load is determined by the arrival rate of CS execution requests.
Under low load conditions, there is seldom more than one request for the
critical section present in the system simultaneously.
Under heavy load conditions, there is always a pending request for critical
section at a site.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 9 / 93
Distributed Computing: Principles, Algorithms, and Systems
Lamport’s Algorithm
Requests for CS are executed in the increasing order of timestamps and time
is determined by logical clocks.
Every site Si keeps a queue, request queuei , which contains mutual exclusion
requests ordered by their timestamps.
This algorithm requires communication channels to deliver messages the
FIFO order.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 10 / 93
Distributed Computing: Principles, Algorithms, and Systems
The Algorithm
Requesting the critical section:
When a site Si wants to enter the CS, it broadcasts a REQUEST(tsi , i)
message to all other sites and places the request on request queuei . ((tsi , i)
denotes the timestamp of the request.)
When a site Sj receives the REQUEST(tsi , i) message from site Si ,places site
Si ’s request on request queuej and it returns a timestamped REPLY message
to Si .
Executing the critical section: Site Si enters the CS when the following two
conditions hold:
L1: Si has received a message with timestamp larger than (tsi , i) from
all other sites.
L2: Si ’s request is at the top of request queuei .
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 11 / 93
Distributed Computing: Principles, Algorithms, and Systems
The Algorithm
Releasing the critical section:
Site Si , upon exiting the CS, removes its request from the top of its request
queue and broadcasts a timestamped RELEASE message to all other sites.
When a site Sj receives a RELEASE message from site Si , it removes Si ’s
request from its request queue.
When a site removes a request from its request queue, its own request may come
at the top of the queue, enabling it to enter the CS.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 12 / 93
Distributed Computing: Principles, Algorithms, and Systems
correctness
Theorem: Lamport’s algorithm achieves mutual exclusion.
Proof:
Proof is by contradiction. Suppose two sites Si and Sj are executing the CS
concurrently. For this to happen conditions L1 and L2 must hold at both the
sites concurrently.
This implies that at some instant in time, say t, both Si and Sj have their
own requests at the top of their request queues and condition L1 holds at
them. Without loss of generality, assume that Si ’s request has smaller
timestamp than the request of Sj .
From condition L1 and FIFO property of the communication channels, it is
clear that at instant t the request of Si must be present in request queuej
when Sj was executing its CS. This implies that Sj ’s own request is at the top
of its own request queue when a smaller timestamp request, Si ’s request, is
present in the request queuej – a contradiction!
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 13 / 93
Distributed Computing: Principles, Algorithms, and Systems
correctness
Theorem: Lamport’s algorithm is fair.
Proof:
The proof is by contradiction. Suppose a site Si ’s request has a smaller
timestamp than the request of another site Sj and Sj is able to execute the
CS before Si .
For Sj to execute the CS, it has to satisfy the conditions L1 and L2. This
implies that at some instant in time say t, Sj has its own request at the top
of its queue and it has also received a message with timestamp larger than
the timestamp of its request from all other sites.
But request queue at a site is ordered by timestamp, and according to our
assumption Si has lower timestamp. So Si ’s request must be placed ahead of
the Sj ’s request in the request queuej . This is a contradiction!
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 14 / 93
Distributed Computing: Principles, Algorithms, and Systems
Performance
For each CS execution, Lamport’s algorithm requires (N − 1) REQUEST
messages, (N − 1) REPLY messages, and (N − 1) RELEASE messages.
Thus, Lamport’s algorithm requires 3(N − 1) messages per CS invocation.
Synchronization delay in the algorithm is T .
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 15 / 93
Distributed Computing: Principles, Algorithms, and Systems
An optimization
In Lamport’s algorithm,REPLY messages can be omitted in certain situations.
For example, if site Sj receives a REQUEST message from site Si after it has
sent its own REQUEST message with timestamp higher than the timestamp
of site Si ’s request, then site Sj need not send a REPLY message to site Si .
This is because when site Si receives site Sj ’s request with timestamp higher
than its own, it can conclude that site Sj does not have any smaller
timestamp request which is still pending.
With this optimization, Lamport’s algorithm requires between 3(N − 1) and
2(N − 1) messages per CS execution.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 16 / 93
Distributed Computing: Principles, Algorithms, and Systems
Token-Based Algorithms
In token-based algorithms, a unique token is shared among the sites.
A site is allowed to enter its CS if it possesses the token.
Token-based algorithms use sequence numbers instead of timestamps. (Used
to distinguish between old and current requests.)
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 59 / 93
Distributed Computing: Principles, Algorithms, and Systems
Suzuki-Kasami’s Broadcast Algorithm
If a site wants to enter the CS and it does not have the token, it broadcasts a
REQUEST message for the token to all other sites.
A site which possesses the token sends it to the requesting site upon the
receipt of its REQUEST message.
If a site receives a REQUEST message when it is executing the CS, it sends
the token only after it has completed the execution of the CS.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 60 / 93
Distributed Computing: Principles, Algorithms, and Systems
continuation..
This algorithm must efficiently address the following two design issues:
(1) How to distinguish an outdated REQUEST message from a current
REQUEST message:
Due to variable message delays, a site may receive a token request message
after the corresponding request has been satisfied.
If a site can not determined if the request corresponding to a token request
has been satisfied, it may dispatch the token to a site that does not need it.
This will not violate the correctness, however, this may seriously degrade the
performance.
(2) How to determine which site has an outstanding request for the CS:
After a site has finished the execution of the CS, it must determine what
sites have an outstanding request for the CS so that the token can be
dispatched to one of them.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 61 / 93
Distributed Computing: Principles, Algorithms, and Systems
continuation..
The first issue is addressed in the following manner:
A REQUEST message of site Sj has the form REQUEST(j, n) where n (n=1,
2, ...) is a sequence number which indicates that site Sj is requesting its nth
CS execution.
A site Si keeps an array of integers RNi [1..N] where RNi [j] denotes the
largest sequence number received in a REQUEST message so far from site Sj .
When site Si receives a REQUEST(j, n) message, it sets RNi [j]:=
max(RNi [j], n).
When a site Si receives a REQUEST(j, n) message, the request is outdated if
RNi [j]>n.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 62 / 93
Distributed Computing: Principles, Algorithms, and Systems
continuation..
The second issue is addressed in the following manner:
The token consists of a queue of requesting sites, Q, and an array of integers
LN[1..N], where LN[j] is the sequence number of the request which site Sj
executed most recently.
After executing its CS, a site Si updates LN[i]:=RNi [i] to indicate that its
request corresponding to sequence number RNi [i] has been executed.
At site Si if RNi [j]=LN[j]+1, then site Sj is currently requesting token.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 63 / 93
Distributed Computing: Principles, Algorithms, and Systems
The Algorithm
Requesting the critical section
(a) If requesting site Si does not have the token, then it increments its
sequence number, RNi [i], and sends a REQUEST(i, sn) message to
all other sites. (‘sn’ is the updated value of RNi [i].)
(b) When a site Sj receives this message, it sets RNj [i] to max(RNj [i],
sn). If Sj has the idle token, then it sends the token to Si if
RNj [i]=LN[i]+1.
Executing the critical section
(c) Site Si executes the CS after it has received the token.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 64 / 93
Distributed Computing: Principles, Algorithms, and Systems
The Algorithm
Releasing the critical section Having finished the execution of the CS, site Si
takes the following actions:
(d) It sets LN[i] element of the token array equal to RNi [i].
(e) For every site Sj whose id is not in the token queue, it appends its
id to the token queue if RNi [j]=LN[j]+1.
(f) If the token queue is nonempty after the above update, Si deletes
the top site id from the token queue and sends the token to the
site indicated by the id.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 65 / 93
Distributed Computing: Principles, Algorithms, and Systems
Correctness
Mutual exclusion is guaranteed because there is only one token in the system and
a site holds the token during the CS execution.
Theorem: A requesting site enters the CS in finite time.
Proof:
Token request messages of a site Si reach other sites in finite time.
Since one of these sites will have token in finite time, site Si ’s request will be
placed in the token queue in finite time.
Since there can be at most N − 1 requests in front of this request in the
token queue, site Si will get the token and execute the CS in finite time.
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 66 / 93
Distributed Computing: Principles, Algorithms, and Systems
Performance
No message is needed and the synchronization delay is zero if a site holds the
idle token at the time of its request.
If a site does not hold the token when it makes a request, the algorithm
requires N messages to obtain the token. Synchronization delay in this
algorithm is 0 or T .
A. Kshemkalyani and M. Singhal (Distributed Computing) Distributed Mutual Exclusion Algorithms 67 / 93