Representing distributed algorithms
How to represent a distributed algorithm?
We will introduce the notions of atomicity, non-
determinism, fairness etc that are important
issues in specifying distributed algorithms.
These concepts are not built into languages
like JAVA, C++, python etc!
Syntax & semantics: guarded actions
<guard G> <action A>
is equivalent to
if G then A
(Borrowed from E.W. Dijkstra: A Discipline of Programming)
Syntax & semantics: guarded actions
• Sequential actions S0; S1; S2; . . . ; Sn
• Alternative constructs if . . . . . . . . . . fi
• Repetitive constructs do . . . . . . . . . od
The specification is useful for representing abstract algorithms, not executable codes.
Syntax & semantics
Alternative construct
if G1 S1
[] G2 S2
…
[] Gn Sn
fi
When no guard is true, skip (do nothing). When multiple guards are true, the choice of the action to be executed is completely arbitrary.
Syntax & semantics
Repetitive construct
do G1 S1
[] G2 S2
.
[] Gn Sn
od
Keep executing the actions until all guards are false
and the program terminates. When multiple guards
are true, the choice of the action is arbitrary.
Example: graph coloring
0 1
0
1
{program for process i}
c[i] = color of process I
do
∃j neighbor(i): c(j) = c(i) → c(i) := 1- c(i)∈
od
Will the above computation terminate?
There are four processes and two colors0, 1. The system has to reach a configuration in which no two neighboring processes have the same color.
Central vs. Distributed Schedulers
Central scheduler implies that at most one process can execute its action at a time. A bit unrealistic, but it simplifies reasoningabout program correctness
Distributed scheduler allows any number of eligible processesto execute their actions at the same time. It is more realistic, but reasoning about correctness is more difficult.
Consider another example
program uncertain;define x : integer;initially x = 0do x < 4 x := x + 1[] x = 3 x := 0od
Question. Will the program terminate?
(Our goal here is to understand fairness)
The adversary
A distributed computation can be viewed as a game between the system and an adversary. The adversary may come up with feasible schedules to challenge the system (and cause “bad things”). A correct algorithm must be able to prevent those bad things from happening.
Non-determinism(Program for a token server - it has a single token}repeat
if (req1 token) ∧ then give the token to client1 else if (req2 token) ∧ then give the token to client2
else if (req3 token) ∧ then give the token to client3 forever
Now, assume that all three requests are sent simultaneously.Client 2 or 3 may never get the token! The outcome could have been different if the server makes a non-deterministic choice.
1 23
Token server
Examples of non-determinism
Determinism caters to a specific order and is a special case of non-determinism.
If there are multiple processes ready to execute actions, then who will execute the action first is nondeterministic.
Message propagation delays are arbitrary and the order of message reception is non-deterministic.
Atomicity (or granularity)
Atomic = all or nothingAtomic actions = indivisible actions
do red message x:= 0 {red action} [] blue message x:=7 {blue action}od
Regardless of how nondeterminism ishandled, we would expect that the value of x will be an arbitrary sequence of 0's and 7's.Right or wrong?
x
Atomicity (continued)
do red message x:= 0 {red action} [] blue message x:=7 {blue action}od
Let x be a 3-bit integer x2 x1 x0, so x:=7 means (x2:=1, x1:= 1, x2:=1), and x:=0 means (x2:=0, x1:= 0, x2:=0)
If the assignment is not atomic, then manyinterleavings are possible, leading to any possible value of x between 0 and 7
x
So, the answer depends on the atomicity of the assignment
Atomicity (continued)
Does hardware guarantee any form of atomicity? Yes! (examples?)
Transactions are atomic by definition (in spite of process failures). Also, critical section codes are atomic.
We will assume that G → A is an “atomic operation.” Does it make a difference if it is not so?
y
x
if x ≠ y → y:= x fi
if x ≠ y → x:= y fi
Atomicity (continued){Program for P}define b: booleaninitially b = true
do b send msg m to Q[] ¬ empty(R,P) receive msg;
b := falseod
Suppose it takes 15 seconds tosend the message. After 5 seconds,P receives a message from R. Will it stop sending the remainder of themessage? NO.
P QR
b
Fairness
Defines the choices or restrictions on the scheduling of actions. No such restriction implies an unfair scheduler. For fair schedulers, the following types of fairness have received attention:
– Unconditional fairness
– Weak fairness
– Strong fairness
Scheduler / demon /adversary
Fairness
Programtest
define x : integer
{initial value unknown}
do true x : = 0
[] x = 0 x : = 1
[] x = 1 x : = 2
od
An unfair scheduler may never schedule the second (or the third actions). So, x may always be equal to zero.
An unconditionally fair scheduler will eventually give every statement a chance to execute without checking their eligibility. (Example: process scheduler in a multiprogrammed OS.)
Weak Fairness
Program test
define x : integer
{initial value unknown}
do true x : = 0
[] x = 0 x : = 1
[] x = 1 x : = 2
od
• A scheduler is weakly fair, when it eventually executes every guarded action whose guard becomes true, and remains true thereafter
• A weakly fair scheduler will eventually execute the second action, but may never execute the third action. Why?
Strong fairness
Programtest
define x : integer
{initial value unknown}
do true x : = 0
[] x = 0 x : = 1
[] x = 1 x : = 2
od
A scheduler is strongly fair, when it eventually executes every guarded action whose guard is true infinitely often.
The third statement will be executed under a strongly fair scheduler. Why?
Study more examples to reinforce these concepts
Program correctness
The State-transition model
Global state
{ = local state of process k}
Each state transition is caused by an action by an eligible process.
A B
C D E
F
G H I
J K
L
state
Initial state
transition
We reason using interleaving semantics, and assume that concurrent actions are serialized in an arbitrary order
A sample computation (or behavior) is ABGHIFL
S0 action⏐ →⏐ ⏐ S1 action⏐ →⏐ ⏐ S2 action⏐ →⏐ ⏐
S =s0 ×s1 ×s2 ×...×smsk
Correctness criteria
• Safety properties• Bad things never happen
• Liveness properties• Good things eventually happen
Testing vs. Proof
Testing: Apply inputs and observe if the outputs
satisfy the specifications. Fool proof testing can be
painfully slow, even for small systems. Most testing
are partial.
Proof: Has a mathematical foundation, and is a
complete guarantee. Sometimes not scalable.
Testing vs. Proof
To test this program, you have to test all possible interleavings. With n processes p0, p1, … pn-1, and m steps per process, the number of interleavings is
(n.m)!
(m!) n
The state explosion problem
p0 p1 p2 p3
step1 step1 step1 step1
step2 step2 step2 step2
step3 step3 step3 step3
Example: Mutual Exclusion
Process 0 Process 1do true do true
Entry protocol Entry protocolCritical section Critical sectionExit protocol Exit protocol
od od
Safety properties (1) There is no deadlock (2) At most one process is in its critical section.
Liveness property A process trying to enter the CS must eventually succeed. (This is also called the progress property)
CS CS
Exerciseprogram mutex 1{two process mutual exclusion algorithm: shared memory model}define busy :shared boolean (initially busy = false}
{process 0} {process 1}do true do true
do busy skip od; do busy skip od;busy:= true; busy:= true;critical section; critical sectionbusy := false; busy := false{remaining codes} {remaining codes}
od odDoes this mutual exclusion protocol satisfy liveness and safety properties?
Safety invariants
Invariant means: something meaningful should always holdExample: Total no. of processes in CS ≤ 1 (mutual exclusion problem)
Another safety property is Partial correctness. It implies that
“If the program terminates then the postcondition will hold.” Consider the following:
Safety invariant: ¬(G0 ∨ G1 ∨ G2 ∨…∨ Gk) ⇒ postcondition
It does not say if the program will terminate.
(termination is a liveness property)Total correctness = partial correctness + termination.
do G0 S1 [] G1 S1 [] … [] Gk Sk od
Exercise
Starting from the given initial state, devise an algorithm to color thenodes of the graph using the colors 0 and 1, so that notwo adjacent nodes have the same color.
program colorme {for process Pi }
define color c {0, 1}∈
Initially colors are arbitrary
do j neighbor(i) : (c[i] = c[j]) → ∃ ∈ c[i] := 1 - c[i] od
Is the program partially correct? YES (why?)Does it terminate? NO (why?)
p1
p3p0
p2
0 0
1 1
Liveness properties
Eventuality is tricky. There is no need to guarantee when the desired thing will happen, as long as it happens..
Some examples The message will eventually reach the receiver. The process will eventually enter its critical section. The faulty process will be eventually be diagnosed Fairness (if an action will eventually be scheduled) The program will eventually terminate. The criminal will eventually be caught.
Absence of liveness cannot be determined from finite prefix of the computation
Proving safetydefine c1, c2 : channel; {init c1 = c2 = null}
r, t : integer; {init r = 5, t = 5}
{program for T} • do t > 0 → send msg along c1; t := t -12 [] ¬empty (c2) → rcv msg from c2; t := t + 1
od {program for R} 3 do ¬empty (c1) → rcv msg from c1; r := r+14 [] r > 0 → send msg along c2; r := r-1
od
We want to prove the safety property P:P ≡ n1 + n2 ≤ 10
T R
c1
c2
transmitter receiver
n1= # of messages in c1n2= # of messages in c2
t r
Proving safetyn1, n2 = # of messages in c1and c2 respectively.We will establish the following invariant:
I ≡ (t ≥ 0) (r ≥ 0) (n1 + t + n2 + r = 10)∧ ∧(I ⇒ P). Check if I holds after every action.
{program for T} • do t > 0 → send msg along c1; t := t -12 [] ¬empty (c2) → rcv msg from c2; t := t+1
od
{program for R} 3 do ¬empty (c1) → rcv msg from c1; r := r+14 [] r > 0 → send msg along c2; r := r-
1od
T R
c1
c2
Use the method of inductionShow that I initially holds, andholds after each action.
r=1t=4
Proving liveness
S1→ S2 → S3 → S4 ↓ f ↓ f ↓ f ↓ f
w1 w2 w3 w4
o w1, w2, w3, w4 WF∈o WF is a well-founded set whose
elements can be ordered by » and
there is a smallest element
If there is no infinite chain like
w1 » w2 » w3 » w4 .., i.e.
f(si) » f(si+1) » f(si+2) ..
Global state Global state
then the computation willdefinitely terminate!
f is called a variant function Example?
Proof of liveness: an example
0
n-1
3
2
1
Clock phase synchronization
System of n clocks ticking at the same rate.
Each clock is 3-valued, i,e it ticks as 0, 1, 2, 0, 1, 2…
A failure may arbitrarily alter the clock phases.
The clocks need to return to the same phase. .
Proof of liveness: an example
Clock phase synchronization
{Program for each clock}(c[k] = phase of clock k, initially arbitrary)
do j: j N(i) :: c[j] = c[i] +1 ∃ ∈ mod 3 → c[i] := c[i] + 2 mod 3
[] j: j N(i) :: c[j] ≠ c[i] +1 ∀ ∈ mod 3 → c[i] := c[i] + 1 mod 3
od
Show that eventually all clocks will return to the same phase (convergence), andcontinue to be in the same phase (closure)
0
n-1
3
2
1
∀k: c[k] {0,1,2}∈
Proof of convergence
Let D = d[0] + d[1] + d[2] + … + d[n-1]
d[i] = 0 if no arrow points towards clock i;= i + 1 if a ← points towards clock i; = n - i if a → points towards clock i;= 1 if both → and ← point towards clock i.
By definition, D ≥ 0.
Also, D decreases after every step in the
system. So the number of arrows must
reduce to 0.
0 2 02 2
1 1 10 1
2 2 22 2
Understand the game of arrows
0 1 2 n-1
Top Related