Lecture #10: Agreement Protocols

These topics are from Chapter 8 (Agreement Protocols) in Advanced Concepts in OS.

Topics for Today

definition of the agreement problem
solution of Lamport et alia

Agreement Problem

all sites must agree on a value, say, 0 or 1
example: decision to commit a DB transaction
just voting is not enough
processors may send inconsistent votes to different sites
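
The point that plain voting is not enough can be illustrated with a minimal sketch. The site names, the votes, and the helper `majority` below are assumptions made up for this illustration; they are not from the text. A single faulty site that sends different votes to different sites is enough to make two correct sites reach different decisions.

```python
from collections import Counter

def majority(votes, default=0):
    """Return the strict-majority value of `votes`, or `default` if there is none."""
    value, count = Counter(votes).most_common(1)[0]
    return value if 2 * count > len(votes) else default

# Hypothetical scenario: sites A and B are correct, site C is faulty.
# A votes 1 (commit) and B votes 0 (abort); C tells A "1" but tells B "0".
votes_seen_by = {
    "A": {"A": 1, "B": 0, "C": 1},
    "B": {"A": 1, "B": 0, "C": 0},
}

# Each correct site tallies the votes it saw and takes the majority.
decision = {site: majority(list(seen.values())) for site, seen in votes_seen_by.items()}
print(decision)  # A and B end up with different decisions
```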

System Model

Assume:

m out of n processors may fail
system is fully connected, pairwise
receiver knows sender's identity
communications are reliable

Synchronous vs. Asynchronous

synchronous: all processors proceed in "lock step"
asynchronous: each processor proceeds at its own pace

The agreement problem is not solvable in an asynchronous system, even if only a single processor may fail.

Failure Modes

crash fault
omission fault
malicious (Byzantine) fault

Synchronous model allows detection of first two kinds of failures.

Byzantine failures may be due to hardware or software failures, or due to malicious attacks.

Other Issues

authentication of messages
metrics: time, message traffic, storage overhead

Taxonomy of Problems

All non-faulty processors must agree on value(s) from a non-faulty processor

Byzantine agreement:
- The source processor broadcasts its initial value to all other processes.
- Agreement: All nonfaulty processors agree on the same value.
- Validity: If the source processor is nonfaulty, then the value agreed upon by all nonfaulty processors must be the initial value of the source.
Consensus:
- Every processor broadcasts its initial value to all other processors.
- Agreement: All nonfaulty processors agree on the same value.
- Validity: If the initial value of every nonfaulty processor is v, then the agreed upon common value by all nonfaulty processors must be v.
Interactive consistency:
- Every processor broadcasts its initial value to all other processors.
- Agreement: All nonfaulty processors agree on the same vector. (v_1, v_2, ..., v_n).
- Validity: If the ith processor is nonfaulty and its initial value is v_i, then the ith value to be agreed on by all nonfaulty processors must be v_i.

Byzantine agreement is the most basic one.

Algorithms to solve the other problems can be constructed from an algorithm to solve the Byzantine agreement problem, though more direct algorithms may also exist.
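
One common way to build the other two problems on top of Byzantine agreement can be sketched as follows. This is only an illustration under stated assumptions: `agree(source, value)` is a hypothetical callable standing for one complete run of a Byzantine agreement algorithm with the given source, and `fault_free_agree` is a trivial stand-in used so that the sketch runs. Interactive consistency runs one agreement per source; consensus then takes the majority of the agreed vector.

```python
from collections import Counter

def interactive_consistency(initial, agree):
    """Run one Byzantine agreement per source and collect the agreed vector."""
    return [agree(source, value) for source, value in enumerate(initial)]

def consensus(initial, agree, default=0):
    """Agree on the vector first, then take its strict majority (or a default)."""
    vector = interactive_consistency(initial, agree)
    value, count = Counter(vector).most_common(1)[0]
    return value if 2 * count > len(vector) else default

def fault_free_agree(source, value):
    """Stand-in for a real agreement run: with no faults, everyone adopts the source's value."""
    return value

print(interactive_consistency([1, 0, 1], fault_free_agree))
print(consensus([1, 0, 1], fault_free_agree))
```

Because every nonfaulty processor obtains the same vector, applying the same deterministic rule (majority plus a shared default) yields the same consensus value everywhere.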

Impossibility Results

Byzantine agreement is impossible if m > ⌊(n-1)/3⌋
- e.g., for n = 3, ⌊(3-1)/3⌋ = 0, so three processors cannot tolerate even one faulty processor
Byzantine agreement is impossible with fewer than m+1 rounds of message exchange
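
The two bounds are easy to tabulate. The function names below are assumptions chosen for this sketch; the formulas are the ones stated above.

```python
def max_faults(n: int) -> int:
    """Largest m for which agreement among n processors is possible: floor((n-1)/3)."""
    return (n - 1) // 3

def min_processors(m: int) -> int:
    """Smallest n that can tolerate m faulty processors: 3m + 1."""
    return 3 * m + 1

def min_rounds(m: int) -> int:
    """Minimum number of rounds of message exchange needed to tolerate m faults."""
    return m + 1

for n in (3, 4, 7, 10):
    m = max_faults(n)
    print(f"n={n}: tolerates m={m}, needs at least {min_rounds(m)} round(s)")
```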

We will see some algorithms for solving the Byzantine agreement problem that fall within these bounds. However, we will also see that the algorithms are fairly complex. This should naturally lead one to think twice when designing a system, to see if there is a way to avoid creating situations that require agreement.

Consider the following simple example with 3 processors, from the text. The arrows indicate state information made available to other nodes. In the first case, processor A initiates the agreement protocol and processor B is maliciously faulty.

C sees that B has decided for 0 and A has decided for 1. To satisfy the Byzantine agreement problem, C must decide for 1, since A is not faulty and A has decided for 1. This implies that the algorithm followed by C (and hence by any non-faulty non-initiating processor) must break ties in favor of the initiating processor.

The next case is where the processor A is a traitor, and reports different values to B and C.

B thinks A has decided for 0 and C thinks A has decided for 1. If the algorithm breaks ties in favor of the initiator, C must decide for 1. However, B must follow the same algorithm, and so it must decide for 0. This means we have no agreement among the two nonfaulty processors.
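
The heart of the argument is that C cannot tell the two cases apart. A minimal sketch, assuming each lieutenant's view is just the pair (value received from A, value the other lieutenant claims A sent); the function names are made up for this illustration:

```python
def lieutenant_views(a_to_b, a_to_c, b_relays, c_relays):
    """Return the (direct, relayed) pair that each lieutenant sees."""
    return {"B": (a_to_b, c_relays), "C": (a_to_c, b_relays)}

# Case 1: A is loyal and sends 1 to both; faulty B tells C that A said 0.
case1 = lieutenant_views(a_to_b=1, a_to_c=1, b_relays=0, c_relays=1)

# Case 2: A is a traitor, sending 0 to B and 1 to C; B and C relay honestly.
case2 = lieutenant_views(a_to_b=0, a_to_c=1, b_relays=0, c_relays=1)

def decide(view):
    """Break ties in favor of the initiator, as the first case requires."""
    direct, _relayed = view
    return direct

print(case1["C"] == case2["C"])                  # C's view is identical in both cases
print(decide(case2["B"]), decide(case2["C"]))    # yet B and C decide differently in case 2
```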

Proof of the full theorem generalizes this reasoning to a larger number of processors.

Lamport-Shostak-Pease Algorithm -- No failures

solves Byzantine agreement for n ≥ 3m+1 processors in the presence of m faulty processors
recursively defined, as OM(m), m ≥ 0

This is called the "Oral Message" algorithm, because the conditions correspond to what we would expect if messages are delivered orally, in person, by pairwise conversations between the parties involved in the consensus.

"Oral" Messages

every message that is sent is delivered exactly
the receiver of a message knows who sent it
the absence of a message can be detected

Lamport Terminology for Byzantine Agreement

every processor is a general
the general who initiates the agreement protocol is the commander
the value suggested by the commander is the order
the other generals, to whom the commander sends the order, are his lieutenants
the faulty processors are traitors
the nonfaulty processors are loyal

OM(0,S)

If there are no traitors, achieving agreement is easy:

The commander i sends the proposed value v to every lieutenant j in S - {i}
Each lieutenant j accepts the value v from i

OM(m,S) for m > 0

S is the set of generals for which we want agreement.

Step 1: The commander i sends a value v directly to every lieutenant j ∈ S - {i}.
Step 2: For each lieutenant j ∈ S - {i}, let v_j be the value lieutenant j receives from the commander i, or else RETREAT if he receives no value. Lieutenant j initiates OM(m-1, S - {i}) (recursively) with value v_j, acting as commander.
The notation v_j here helps us to remember that j received the value v_j from i in the previous round, and j is asking the other generals to agree on this fact. At the end of these recursive executions, every loyal lieutenant j ∈ S - {i} has agreed on a set of pairs (k, v_k), one for each k ∈ S - {i}.
Step 3: When Step 2 has been completed by all lieutenants, each lieutenant j tabulates the pairs it obtained in Step 2 (its own pair, containing the original value from its commander, and the other pairs, containing the values obtained from the other lieutenants through the recursive invocations of OM(m-1, S - {i})) and takes the value v = majority({(k, v_k) | k ∈ S - {i}}) that is in the majority of those pairs to be the result of OM(m, S).

One feature of this algorithm that some people have found confusing is the way in which the results of the recursive algorithms are combined. That is, the values must be retained and then combined, by taking the majority, after the entire round has completed.

Another feature that some people have found confusing is that there must be an arbitrary rule, such as choosing the lower value, to break ties. Since traitors may not send messages, there also must be a default value, such as 0, that is used for all generals from which no pair is received. Likewise, if there is no majority, a default value must be used for the result of OM(m,S). So long as all loyal generals agree on the tie-breaking rule and the default value, there will still be consensus among the loyal generals.
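
The three steps can be simulated in a few lines. This is a sketch under stated assumptions, not the algorithm as the text presents it: generals are plain integers, the whole exchange runs in one process, `DEFAULT = 0` is the assumed shared default, and traitors are modeled by a hypothetical function `lie(sender, receiver, value)` that returns whatever the traitor chooses to send. Note how the recursive results are kept in a table and combined only in Step 3.

```python
from collections import Counter

DEFAULT = 0  # assumed default used when there is no strict majority

def majority(values):
    """Strict majority of `values`, or DEFAULT when no value has one."""
    value, count = Counter(values).most_common(1)[0]
    return value if 2 * count > len(values) else DEFAULT

def om(m, commander, lieutenants, value, traitors, lie):
    """Simulate OM(m, S) with S = {commander} + lieutenants.

    Returns {lieutenant: value that lieutenant ends up with}.
    Entries for traitorous lieutenants are computed but carry no meaning.
    """
    # Step 1: the commander sends a value to every lieutenant.
    received = {
        j: lie(commander, j, value) if commander in traitors else value
        for j in lieutenants
    }
    if m == 0:
        return received
    # Step 2: each lieutenant j acts as commander of OM(m-1, S - {i}) with v_j.
    table = {j: {j: received[j]} for j in lieutenants}
    for j in lieutenants:
        others = [k for k in lieutenants if k != j]
        agreed = om(m - 1, j, others, received[j], traitors, lie)
        for k in others:
            table[k][j] = agreed[k]  # what k concluded that j received
    # Step 3: only now combine the retained values by majority.
    return {j: majority(list(table[j].values())) for j in lieutenants}

def lie(sender, receiver, value):
    """Hypothetical traitor behavior: send 0 or 1 depending on the receiver."""
    return receiver % 2

# Four generals, m = 1: loyal commander 0 with traitor 2, then traitorous commander 0.
loyal_case = om(1, 0, [1, 2, 3], 0, traitors={2}, lie=lie)
traitor_case = om(1, 0, [1, 2, 3], 1, traitors={0}, lie=lie)
print(loyal_case, traitor_case)
```

In the first run the loyal lieutenants 1 and 3 both end with the commander's value 0; in the second run all three lieutenants end with the same value even though the commander sent them different ones.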

To understand this algorithm, it helps to start with the case that the commander i is loyal. In that case, each lieutenant j will receive the same value v from i. The loyal ones can simply accept the value v and it will not matter what the traitors do.

However, since there is no way for a lieutenant j to tell whether the commander i is a traitor, one must assume that he may be a traitor. To protect against the commander sending different values to the different lieutenants, the lieutenants must hold a ballot to reach consensus on what message the commander sent to each one of them. The rest of the algorithm is the procedure for that ballot.

Since the messages are transmitted "orally" (not broadcast), the lieutenants must all exchange information about what they received in the previous round before they can hold the ballot. The ballot would still be easy if we could trust every processor to report accurately what it received. However, we must allow for the possibility that some lieutenants are traitors, and so will report different things to different lieutenants. That is why we need to do a Byzantine agreement on each of the messages that was sent to a lieutenant in the previous round.

When we get to the recursive invocation of OM(m-1,S-{i}), it is not obvious that we have reduced the problem sufficiently to satisfy the preconditions for OM(m-1,S-{i}). There are two possibilities:

The commander i is a traitor. In this case, it is clear that the recursion should work. At most m-1 of the lieutenants are traitors. We are assuming |S| ≥ 3m+1, so |S - {i}| ≥ 3m > 3(m-1)+1. It follows that OM(m-1, S - {i}) can achieve Byzantine agreement on the message "i sent j the value v" among the loyal lieutenants in S - {i}.
The commander i is loyal. In this case, it is not so clear that the recursion should work. We have reduced the number of processors in the consensus by one, but there may remain m traitors in S - {i}. If m is the number of traitors, how can we get away with using OM(m-1,...)?

The second case is dealt with by the Validity Lemma, which is stated and proven below. This lemma guarantees that if the commander is loyal, OM(m,S) can tolerate up to k traitors if |S| > 2k+m. We will explain this lemma in more detail below, using the original theorems and proofs of Lamport, Shostak, and Pease.

Byzantine Agreement Conditions

Agreement: All loyal generals agree on the same value.
Validity: If the commander is loyal, then the common agreed upon value for all loyal lieutenants is the initial one given by the commander.

Validity Lemma

Lemma: For any m and k, OM(m,S) satisfies the Validity Condition if there are more than 2k+m processors and at most k of them are traitors.

Proof:

The proof is by induction on m. As a basis for the induction, we consider the case of OM(0). The Validity Condition only specifies what must happen if the commander is loyal. It is easy to see that if the commander is loyal OM(0) satisfies the Validity Condition, since all the processes get the same value v and agree upon that. We therefore can assume the lemma is true for OM(m-1) and prove that it is true for OM(m), m > 0.

For the induction step, we have m ≥ 1. In Step 1, the loyal commander i sends a value v to all the other processors. At Step 2, each loyal lieutenant j applies OM(m-1, S - {i}). Since we are assuming that |S| > 2k + m, we have |S - {i}| > 2k + (m-1), so we can apply the induction hypothesis to conclude that every loyal lieutenant agrees on the value v_j = v for each invocation of OM(m-1, S - {i}) by a loyal commander j. Since there are at most k traitors, and |S - {i}| > 2k + (m-1) ≥ 2k, a majority of the lieutenants in S - {i} are loyal. Hence, when each loyal lieutenant gets to Step 3 it will find that a majority of the pairs in its table carry the value v, and so it will agree to the value v. This confirms the Validity Condition.

Agreement Theorem

Theorem: For any m, OM(m,S) satisfies the Validity and Agreement Conditions if there are more than 3m generals and at most m of them are traitors.

Proof:

The proof is by induction on m, similar to that of the Validity Lemma. As a basis for the induction, we consider the case of OM(0). If there are no traitors, it is easy to see that OM(0) satisfies the Validity and Agreement Conditions. We therefore can assume the theorem is true for OM(m-1) and prove that it is true for OM(m), m > 0.

For the induction step, we have m ≥ 1. We consider two cases, depending on whether the commander is a traitor.

Suppose the commander i is loyal. By taking k equal to m in the Validity Lemma, we see that OM(m) satisfies the Validity Condition. Moreover, since the commander is loyal, Validity means that every loyal lieutenant agrees on the commander's value, so the Agreement Condition is also satisfied.
Suppose the commander is a traitor. At most m-1 of the lieutenants can be traitors. Since there are more than 3m processes, there are more than 3m-1 > 3(m-1) processes in S - {i}. We may therefore apply the induction hypothesis to conclude that OM(m-1) satisfies the Agreement and Validity Conditions. Hence, any two loyal lieutenants get the same vector of values (v_1, ..., v_{n-1}), and therefore obtain the same value majority(v_1, ..., v_{n-1}) in Step 3, proving the Agreement Condition.

Do an example, for 4 processors, interactively

Four Processor Example: Nonfaulty Commander

Round 1: processor A executes OM(1), where processor C (in red) is faulty.

Round 2: processors B, C, and D execute OM(0). Dashed lines indicate messages sent during the previous round.

Four Processor Example: Faulty Commander

Round 1: processor A executes OM(1), where processor A is faulty.

Round 2: processors B, C, and D execute OM(0).
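
Both four-processor cases can be traced with a small sketch of OM(1). The concrete values sent are assumptions made up for this illustration (the values used in the lecture's figures are not reproduced here), as are the names `om1`, `honest`, and `faulty_c`.

```python
from collections import Counter

def om1(round1, relay):
    """Trace OM(1) with commander A.

    round1[j] is the value A sends to lieutenant j in Round 1.
    relay(j, k, v) is the value lieutenant j forwards to lieutenant k in
    Round 2 after having received v from A.
    """
    lieutenants = sorted(round1)
    decisions = {}
    for k in lieutenants:
        # k's table: its own value from A plus what each other lieutenant relayed.
        seen = [round1[k]] + [relay(j, k, round1[j]) for j in lieutenants if j != k]
        value, count = Counter(seen).most_common(1)[0]
        decisions[k] = value if 2 * count > len(seen) else 0  # assumed default 0
    return decisions

def honest(j, k, v):
    return v

def faulty_c(j, k, v):
    # Hypothetical fault: C flips the value it relays; B and D relay honestly.
    return 1 - v if j == "C" else v

# Nonfaulty commander A sends 1 to everyone; C is faulty.
case1 = om1({"B": 1, "C": 1, "D": 1}, faulty_c)

# Faulty commander A sends inconsistent values; B, C, and D are all loyal.
case2 = om1({"B": 1, "C": 0, "D": 1}, honest)

print(case1["B"], case1["D"])  # loyal lieutenants in the first case
print(case2)                   # all lieutenants in the second case
```

In the first case the loyal lieutenants B and D both keep the commander's value; in the second case B, C, and D all build the same table and therefore reach the same value, whatever the commander intended.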

Message Complexity

T(0,n) = n-1
T(m,n) = (n-1)T(m-1,n-1), for m > 0
T(m,n) = (n-1)(n-2)(n-3)...(n-m-1) ∈ O(n^(m+1)), since the product has m+1 factors
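
The recurrence and its closed form can be checked against each other numerically. The function names are assumptions for this sketch; the closed form is the product of the m+1 factors (n-1) down to (n-m-1).

```python
def messages_recurrence(m: int, n: int) -> int:
    """T(m, n) computed directly from the recurrence above."""
    if m == 0:
        return n - 1
    return (n - 1) * messages_recurrence(m - 1, n - 1)

def messages_product(m: int, n: int) -> int:
    """T(m, n) as the product (n-1)(n-2)...(n-m-1), which has m+1 factors."""
    total = 1
    for d in range(1, m + 2):
        total *= n - d
    return total

for m in range(4):
    n = 3 * m + 1  # smallest system that tolerates m faults
    print(f"m={m}, n={n}: T = {messages_recurrence(m, n)}")
```

Even at the minimum system size n = 3m+1 the count grows quickly with m, which is one reason to avoid designs that require agreement when possible.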
