Probability in Light of Independent Testimony for Mutually Exclusive Possibilities
Abstract.
In this document, Bayes’ Rule is used to find the probability of a proposition in light of multiple testimonies that may conflict with each other or corroborate each other, with the constraint that the testimonies are independent of one another and each attest to one of a set of mutually exclusive alternatives. The probability of the proposition is shown to be a function of the prior probabilities of the alternatives and the probability that the witnesses would claim what they have claimed given each of the alternatives. For cases where the probability of the witness telling the truth is known and the witness’ truthfulness is independent of claims, theorems are stated which allow the probability of a witness telling the truth to be substituted for the probability that the witness would claim what they have claimed given an alternative. The theorems stated are used to solve some example problems, including a problem stated by Augustus de Morgan in Formal Logic.
1. Introduction
There are many factors that complicate the use of testimony to estimate the probability of a proposition. Testimonies may corroborate or contradict each other. One witness may be more or less likely to tell the truth than another. Witnesses may be biased for or against making certain claims. The claims may be more or less likely a priori. Estimating probability in light of this kind of evidence may not always be feasible, but if the testimony is independent and it is limited to a set of mutually exclusive alternatives, there is a formula that can take all of these things into account.
The discussion proceeds as follows: The problem is stated. The basic solution and a corollary are given; both relate the probability of a proposition to the probabilities of witnesses making certain claims and the prior probabilities of the things that are claimed. To extend the solution’s applicability, some additional theorems are stated that relate the probabilities of a witness making claims to the probability that the witness would tell the truth. To illustrate how these formulas and theorems might be used, example problems are presented and solved. In case it is helpful, some notation is explained and axioms, a lemma and a definition are stated. At the end of the discussion, theorems stated and assertions made during the discussion, which require proof, are proven from standard axioms and definitions.
2. Stating the Problem
Let be a set of alternative possibilities. Let be a set of witnesses that have each attested to exactly one of the alternatives in . Let be one of the alternatives in . Let be a predicate on two variables such that “” asserts “witness claims that alternative is true”. Let be a function that maps each witness in to the alternative in to which the witness attested. Then the probability of given the testimony of is
which may be read as “the probability of given the fact that the witnesses in claim what they have claimed”. This is the quantity that is being sought. This quantity should be deduced from these quantities, which might be known or estimated:
- The prior probabilities of the alternatives that the witnesses might have attested to
- The probabilities that witnesses would make their claims, given each of the alternatives
- The probability that each witness would tell the truth
The problem is limited to situations which meet all of these conditions:
1. The claims are constrained to a set of alternatives that are mutually exclusive of one another.
2. The alternatives cover all possibilities.
3. None of the alternatives are known a priori to be false.
4. The witnesses’ claims are conditionally independent of one another given each of the alternatives they might have claimed.
Note that the claims are to be conditionally independent. In general, it would not be expected or desirable for the claims to be unconditionally independent. If two witnesses are inclined to tell the truth, then if one makes a certain claim, it raises the probability that the claim is true, which in turn raises the probability that the other witness makes the same claim. Conditional independence means one witness’s claim may be influenced by many things (hopefully including reality and a will to tell the truth), but not by the other witnesses’ claims.
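The distinction can be checked numerically. The following is a minimal sketch under assumed values (two alternatives with equal priors, and two witnesses who each report the true alternative with probability 9/10); the names `prior`, `truthful` and `p_claim` are illustrative only.

```python
from fractions import Fraction

# Hypothetical setup: two alternatives with equal priors and two witnesses who
# each report the true alternative with probability 9/10, independently of one
# another once the true alternative is fixed.
prior = {"a": Fraction(1, 2), "b": Fraction(1, 2)}
truthful = Fraction(9, 10)

def p_claim(claimed, actual):
    """P(witness claims `claimed` | `actual` is true), the same for both witnesses."""
    return truthful if claimed == actual else 1 - truthful

# Marginal probability that a single witness claims "a".
p_one = sum(prior[x] * p_claim("a", x) for x in prior)

# Joint probability that both witnesses claim "a"; squaring inside the sum is
# exactly where conditional independence is used.
p_both = sum(prior[x] * p_claim("a", x) ** 2 for x in prior)

print(p_one * p_one)  # 1/4
print(p_both)         # 41/100
```

The joint probability exceeds the product of the marginals, so the two claims are conditionally independent by construction and yet unconditionally dependent.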
3. The Basic Solution and a Corollary
Under the conditions stated above, the basic solution is a formula that can be derived from Bayes’ Rule, the Law of Total Probability and the definition of conditional independence. It has already been stated and proven by Hu and Qu as Theorem 4 in “Bayes’ Theorem under Conditional Independence”[3].
Theorem 1 (Hu and Qu’s Theorem 4).
If the alternatives in are mutually exclusive and cover all possibilities, and none are known a priori to be false, and all the claims of are all conditionally independent of one another given any of these alternatives, then
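In notation assumed for this sketch, the conclusion of the theorem can be written as shown here. The symbols are placeholders rather than confirmed notation: $A$ is the set of alternatives, $W$ the set of witnesses, $a \in A$ the alternative of interest, $C(w, x)$ the predicate "witness $w$ claims that alternative $x$ is true", and $f(w)$ the alternative to which witness $w$ attested.

```latex
P\Bigl(a \Bigm| \bigwedge_{w \in W} C(w, f(w))\Bigr)
  = \frac{P(a)\,\prod_{w \in W} P\bigl(C(w, f(w)) \mid a\bigr)}
         {\sum_{x \in A} P(x)\,\prod_{w \in W} P\bigl(C(w, f(w)) \mid x\bigr)}
```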
A corollary that relates the probability of one alternative to another given the claims of witnesses follows straightforwardly from Theorem 1:
Corollary 1.
If the alternatives in are mutually exclusive and cover all possibilities, and none are known a priori to be false, and all the claims of are all conditionally independent of one another given any of these alternatives, then the ratio of the probabilities for any two alternatives and in given the claims of is
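A minimal numerical sketch of Theorem 1 and Corollary 1 follows. The function names, the dictionary-based data layout and the 80% reliability figure are assumptions made for illustration; the only ingredients are the priors and the probability of each claim given each alternative.

```python
from math import prod

def posterior(priors, likelihood, claims):
    """Posterior over alternatives, in the form of Theorem 1.

    priors:     dict alternative -> prior probability (nonzero, summing to 1)
    likelihood: function (witness, claimed, actual) ->
                P(witness claims `claimed` | `actual` is true)
    claims:     dict witness -> alternative that the witness attested to
    """
    weights = {
        x: priors[x] * prod(likelihood(w, c, x) for w, c in claims.items())
        for x in priors
    }
    total = sum(weights.values())
    return {x: weights[x] / total for x in priors}

def posterior_ratio(priors, likelihood, claims, a, b):
    """Ratio of the posteriors of a and b (Corollary 1); the shared denominator cancels."""
    num = priors[a] * prod(likelihood(w, c, a) for w, c in claims.items())
    den = priors[b] * prod(likelihood(w, c, b) for w, c in claims.items())
    return num / den

# Hypothetical usage: two alternatives and two witnesses who are right 80% of the time.
priors = {"x": 0.5, "y": 0.5}
like = lambda w, claimed, actual: 0.8 if claimed == actual else 0.2
claims = {"w1": "x", "w2": "x"}
post = posterior(priors, like, claims)
print(post["x"])
print(posterior_ratio(priors, like, claims, "x", "y"))
```

Working with the ratio avoids the sum over all alternatives, which is convenient when only two alternatives need to be compared.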
4. Equivalences
Let be a predicate on one variable such that “” asserts “witness is telling the truth”. If so, then the predicates and are related logically. For instance, if , then (if alternative is true and claims , then is telling the truth). The theorems in this section state some of these relations. Both theorems follow straightforwardly from what has been postulated about and .
Theorem 2.
These equivalences are true for all propositions and and any witness who has attested to or :
(E1)
(E2)
(E3)
(E4)
(E5)
(E6)
(E7)
(E8)
Additional equivalences are true in circumstances where there are exactly two alternatives that a witness may attest to, e.g., “The accused committed the crime” and “the accused did not commit the crime”.
Theorem 3.
If is a set of alternatives, is a member of and is a witness that has attested to one of the alternatives in , then these equivalences are all true:
(E9)
(E10)
(E11)
(E12)
(E13)
(E14)
(E15)
5. Some Substitutes for
The theorems in this section are true for all in and all in where is a set of alternatives and is a set of witnesses who have each attested to one of the alternatives in . These theorems identify substitutes for , which appears in Theorem 1 and Corollary 1.
Theorem 4.
The probability of a witness claiming an alternative when that alternative is true is equal to the probability of the witness telling the truth;
Theorem 5.
If a witness’s truthfulness is independent of an alternative, then the probability of the witness claiming that alternative when the alternative is true is equal to the prior probability of the witness telling the truth;
Theorem 6.
Given an alternative, if a witness, when not telling the truth, claims another alternative in proportion to its prior probability, then if the witness’s truthfulness is independent of the given alternative being true, the probability that the witness claims the other alternative is equal to the product of the witness not telling the truth and the proportion of the prior probability of the other alternative to the prior probabilities of all of the alternatives that are not the given alternative;
Theorem 6 applies to situations where a witness does not exhibit bias for or against the alternative that they falsely claimed; the claim was made as if it was the result of a random selection among the false alternatives.
Theorem 7.
If there are only two alternatives and a witness’s truthfulness is independent of them, then the probability of a witness claiming the false alternative is equal to the probability of the witness not telling the truth;
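The substitutions described in Theorems 5, 6 and 7 can be sketched as a single function. This is an illustration under the stated assumptions (truthfulness independent of the alternatives, false claims spread in proportion to prior probability); the name `claim_probability` and the example numbers are hypothetical.

```python
from fractions import Fraction

def claim_probability(claimed, actual, truthful, priors):
    """P(witness claims `claimed` | `actual` is true) for an unbiased witness.

    Assumes truthfulness is independent of the alternatives (Theorem 5) and that
    false claims are spread over the other alternatives in proportion to their
    priors (Theorem 6). With only two alternatives the false-claim case reduces
    to 1 - truthful (Theorem 7).
    """
    if claimed == actual:
        return truthful
    others = sum(p for x, p in priors.items() if x != actual)
    return (1 - truthful) * priors[claimed] / others

# Hypothetical check: three alternatives, a witness who is truthful 3/4 of the time.
priors = {"a": Fraction(1, 2), "b": Fraction(1, 3), "c": Fraction(1, 6)}
t = Fraction(3, 4)
row = {x: claim_probability(x, "a", t, priors) for x in priors}
print(row)                # probabilities of each possible claim when "a" is true
print(sum(row.values()))  # 1
```

The probabilities of the possible claims given a fixed true alternative sum to one, which is a quick consistency check on the substitution.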
6. Example Problems
Example 1 (Dishonest accusations).
Alice, Bob and Dan attempt to rob a jewelry store. The only other person present is a clerk, whom one of them threatens with a pistol. Things do not go well and the clerk is shot dead. Alice, Bob and Dan are the only living witnesses to the event. There is no physical evidence to indicate which of them is the shooter. The police apprehend the three and interrogate each of them separately. During the interrogations, each makes an accusation. None of the suspects are willing to go to jail and none are loyal to any of the others. The suspects did not have time to confer beforehand and no plea bargains were offered in such a way as to motivate a false confession. Because of all this, they are expected to testify as follows: The guilty one will almost certainly not confess and will instead accuse one of the others, with equal probability of accusing either. The others, who have no known motive to protect the guilty one, will almost certainly accuse the guilty one. Let be a predicate of one variable that asserts that is the murderer. Let , , , and . Let in the following equations represent small positive numbers that are not necessarily equal to one another. Under these circumstances, the probabilities for and in are as follows:
If a witness is not the murderer, the probability that he or she accuses the murderer is high;
(1)
If a witness is not the murderer, the probability that he or she makes a false accusation is low and it is divided evenly amongst the suspects who are not the murderer;
(2)
If a witness is the murderer, the probability that he or she confesses (and therefore accuses the murderer) is low;
(3)
If a witness is the murderer, the probability that he or she makes a false accusation is high and it is divided evenly amongst the other suspects;
(4)
If, prior to the accusations, the probabilities that one suspect or another committed the murder are all equal, then the approximate probability of any one of the three being the murderer can be calculated from Theorem 1. In most circumstances such as this, no suspect confesses and two of the suspects accuse the same person; there is corroboration between two of the witnesses and it is almost certain that the suspect whom they accuse is the murderer. In the other cases, the probability of a suspect being the murderer is not so obvious. In the case where one suspect confesses and the other two accuse each other, one might think that the suspect who confessed is the murderer, because people usually do not confess to crimes that they did not commit, and the other two witnesses’ accusations cancel each other out. But actually, the one who confessed is probably not the murderer. Suppose Alice confesses, Bob accuses Dan, and Dan accuses Bob. Given these claims and Corollary 1, the ratio of the probability of Alice being the murderer to the probability of Bob being the murderer is
(5)
Making the assumption that and are equal and using (1), (2), (3) and (4) to find appropriate substitutions for the other terms in (5) yields
As can be seen, the probability that Alice is the murderer given these accusations is the product of three small numbers, but the probability that Bob is the murderer given these accusations is the product of one small number and two instances of one minus a small number. Unless some of these small numbers differ by many orders of magnitude, the probability that Alice is the murderer is much smaller than the probability that Bob is the murderer. By the same line of reasoning, the ratio of the probability of Alice being the murderer to Dan being the murderer is also small. Therefore, although Alice confessed, she is probably not the murderer. This can be explained by noting that if Alice is the murderer, three unlikely events have occurred, but if Bob or Dan is the murderer, then only one unlikely event has occurred.
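The argument can be checked with concrete numbers. The sketch below assumes that every "small positive number" equals 1/100 and reads "divided evenly" as an equal split between the two candidates in each case; both choices, and all identifiers, are assumptions made for illustration.

```python
from fractions import Fraction
from math import prod

SUSPECTS = ("Alice", "Bob", "Dan")
EPS = Fraction(1, 100)  # assumed value for every "small positive number"

def likelihood(witness, accused, murderer):
    """P(witness accuses `accused` | `murderer` is the shooter), following (1)-(4)."""
    if witness == murderer:
        # The murderer rarely confesses; otherwise accuses one of the others evenly.
        return EPS if accused == murderer else (1 - EPS) / 2
    # An innocent witness usually accuses the murderer; a false accusation is
    # split evenly over the two suspects who are not the murderer.
    return 1 - EPS if accused == murderer else EPS / 2

# Alice confesses, Bob accuses Dan, Dan accuses Bob.
claims = {"Alice": "Alice", "Bob": "Dan", "Dan": "Bob"}
prior = Fraction(1, 3)

weights = {
    m: prior * prod(likelihood(w, acc, m) for w, acc in claims.items())
    for m in SUSPECTS
}
total = sum(weights.values())
post = {m: weights[m] / total for m in SUSPECTS}

print(post["Alice"] / post["Bob"])  # equals EPS**2 / (1 - EPS)**2
print({m: float(p) for m, p in post.items()})
```

With these values the posterior for Alice is well below one in ten thousand, while Bob and Dan share the remainder equally.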
Example 2 (Unreliable but unbiased witnesses to a roll of a fair die).
Suppose someone rolls a fair, six-sided die. There is a witness and this witness is very unreliable; he misreports half of the rolls that he witnesses. However, he is unbiased in the sense that he is not more likely to be wrong about one number than another and, when he is wrong, he does not choose one number more often than another. He says that a six was rolled. How does his testimony affect the probability that a six was rolled? Is the posterior probability the same because what he says is as often as not false? What if another, similarly unbiased but even less reliable witness, who misreports two thirds of the rolls he witnesses, says it is a six? Does his testimony decrease the probability that a six was rolled because he is more often than not wrong?
To answer these questions, start with Theorem 1. In this scenario, is the set of possible rolls, one through six, is the event that a six was rolled and are the two unreliable witnesses. Since each roll is equally probable, the priors in Theorem 1 cancel out. Since the witnesses are no more or less likely to tell the truth when a six is rolled, Theorem 5 allows the terms that represent the probability that they would claim a six when a six was rolled to be replaced with . Since neither of the witnesses are biased, Theorem 6 can be applied to the terms that represent the probability that they would claim a six when a number other than a six was rolled. The result of these substitutions is
(6)
Since the prior probabilities of all rolls are equal, this can be simplified to
(7)
Considering only the testimony of the first witness, the probability that a six was rolled is 1/2.
The first witness’s testimony increases the probability that a six was rolled because, though he does not tell the truth more often than not, he is not biased and his assertions are true more often than a random guess. If the testimony of both witnesses is taken into account, the probability is 5/7.
The second witness’s testimony, though less reliable than that of the first witness, is still better than a random guess, so it raises the probability of a six even further. The corroboration of the two witnesses raises the probability into the range of more-likely-than-not.
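The two posteriors in this example can be reproduced with exact fractions. This is a sketch under the example's stated assumptions (fair die, unbiased witnesses with truthfulness 1/2 and 1/3); the function names are illustrative.

```python
from fractions import Fraction
from math import prod

ROLLS = range(1, 7)
PRIOR = Fraction(1, 6)  # fair die: every roll has the same prior

def p_claims_six(truthful, actual):
    """P(witness says "six" | actual roll) for an unbiased witness."""
    if actual == 6:
        return truthful  # Theorem 5
    # Theorem 6: wrong reports are spread evenly over the five other numbers.
    return (1 - truthful) * PRIOR / (1 - PRIOR)

def posterior_six(truthfulness):
    """P(six | every listed witness says "six"), in the form of Theorem 1."""
    weights = {
        r: PRIOR * prod(p_claims_six(t, r) for t in truthfulness) for r in ROLLS
    }
    return weights[6] / sum(weights.values())

print(posterior_six([Fraction(1, 2)]))                  # 1/2
print(posterior_six([Fraction(1, 2), Fraction(1, 3)]))  # 5/7
```

Both values exceed the prior of 1/6, and only the combined testimony pushes the probability past one half.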
Example 3 (Independent testimony, according to de Morgan).
In chapter X of Formal Logic, titled “On Probable Inference”[2], Augustus de Morgan discusses several problems having to do with “independent testimonies to the truth of an assertion”. For the first problem, he presents some formulae as solutions for certain circumstances. The first formula is a solution to circumstances where there are several independent witnesses for a single assertion and the probability of each telling the truth is known. He accounts for the prior probability of the assertion by including, in his words, “the initial testimony of the mind itself which is to form the judgement” as one of the testimonies. The formula, expressed in the notation of the present discussion, is
(8)
He apparently intends the testimonies to be independent in the sense that the probabilities of each being true are independent of one another. Also, because the prior estimate of probability counts as testimony, the probability of each testimony being true is independent of the prior probability of the assertion. In symbolic form, this independence is
(9)
from which the formula can be derived straightforwardly. Interestingly, the same formula can be derived from Theorem 1 if the testimonies are independent in a different sense, where the witnesses are conditionally independent of one another given each possible alternative, and the witnesses are unbiased in the sense that the probability of each witness telling the truth is independent of the probability of the assertion. In symbolic form, this is
(10)
The formula follows from both pairs of assumptions even though they are not equivalent – (9) does not entail (10) and (10) does not entail (9).
The second formula is an equation of two ratios, expressed as an analogy, that applies to cases where there is independent testimony for and against an assertion and the credibility of all witnesses is equal. The formula, expressed in the notation of the present discussion, with being the number of testimonies in favor of an assertion and being the number of testimonies against , is
(11)
As with (8), de Morgan inferred it from (9). He did not, however, consider the prior probability of the assertion. By omitting it from the formula, he effectively assumed that . Equation (11) can also be derived from Corollary 1 if (10) is assumed and .
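Both formulas can be sketched numerically. The forms used below are a reading of (8) and (11) that is consistent with the derivations from Theorem 1 and Corollary 1 given in the proofs (two alternatives, with the substitutions of Theorems 5 and 7); the function names and sample values are assumptions.

```python
from fractions import Fraction
from math import prod

def de_morgan_posterior(prior, truthfulness):
    """Formula (8) as read here: every witness attests to the assertion, and the
    prior probability is treated as one more testimony."""
    for_a = prior * prod(truthfulness)
    against_a = (1 - prior) * prod(1 - t for t in truthfulness)
    return for_a / (for_a + against_a)

def de_morgan_odds(t, m, n):
    """Formula (11) as read here: odds in favor of the assertion with m witnesses
    for it, n against it, equal credibility t, and a prior of 1/2."""
    return (t ** m * (1 - t) ** n) / ((1 - t) ** m * t ** n)

print(de_morgan_posterior(Fraction(1, 2), [Fraction(9, 10), Fraction(4, 5)]))  # 36/37
print(de_morgan_odds(Fraction(9, 10), 2, 1))                                    # 9
```

In the second call one testimony for and one against cancel, leaving the odds contributed by a single witness of credibility 9/10.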
7. Notation, Axioms and a Definition
(There is nothing novel in this section; it is included to make the intended meanings of symbols and terms explicit and to show that this discussion is grounded in modern probability theory).
Throughout this discussion, an expression of the form “”, where and are Boolean expressions, signifies a logical conjunction of the two, i.e. “ AND ”. A line over a symbol denotes negation, e.g. “” asserts “witness did not claim ”. An expression of the form “” asserts that the expression is true for each member of . An expression of the form “” asserts that the expression is true for some member of . An expression of the form “” asserts that the probabilities of and are independent. An expression of the form “” denotes the result of applying function to each member of and multiplying the results together. An expression of the form “” denotes the result of applying function to each member of and adding the results together.
The Kolmogorov Axioms, stated in terms of a set of mutually exclusive propositions such that , are
(K1)
(K2)
(K3)
This consequence of (K2) and (K3) is useful for combining probability with logic:
(K4)
It will be invoked several times in the following proofs.
Definition 1 (Conditional independence).
Propositions and are conditionally independent given proposition if and only if
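In standard notation, the defining condition is usually written as follows. The symbols $A$, $B$ and $C$ for the three propositions, and $\wedge$ for conjunction, are assumed here for illustration.

```latex
P(A \wedge B \mid C) = P(A \mid C)\, P(B \mid C)
```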
8. Proofs
Proof of (K4).
Proof of Theorem 1.
Consider the probability of an alternative that is in . According to Bayes’ Rule,
(20)
Applying the Law of Total Probability to the denominator on the right-hand side of (20) yields
(21)
Since the claims made by members of are independent given any of the alternatives in , Theorem 1 follows from (21) and the definition of conditional independence:
∎
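The three steps of this proof can be sketched as one chain of equalities. The notation is assumed for illustration: $A$ is the set of alternatives, $W$ the set of witnesses, $C(w, x)$ asserts that witness $w$ claims alternative $x$, $f(w)$ is the alternative that $w$ attested to, and $E$ abbreviates the conjunction $\bigwedge_{w \in W} C(w, f(w))$. The block uses the `align*` environment from `amsmath`.

```latex
\begin{align*}
P(a \mid E)
  &= \frac{P(a)\,P(E \mid a)}{P(E)}
  && \text{(Bayes' Rule, cf. (20))} \\
  &= \frac{P(a)\,P(E \mid a)}{\sum_{x \in A} P(x)\,P(E \mid x)}
  && \text{(Law of Total Probability, cf. (21))} \\
  &= \frac{P(a)\,\prod_{w \in W} P\bigl(C(w, f(w)) \mid a\bigr)}
          {\sum_{x \in A} P(x)\,\prod_{w \in W} P\bigl(C(w, f(w)) \mid x\bigr)}
  && \text{(conditional independence)}
\end{align*}
```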
Proof of Theorem 4.
Proof of Theorem 5.
Proof of Theorem 6.
Proof of Theorem 7.
Proof that (7) follows from (6) in Example 2.
Consider this subexpression of (6):
Since and in this subexpression are rolls of a die and the prior probabilities of all rolls are equal, this is true:
(39)
Since the content of the summation on the right-hand side of (39) is the same for any value of ,
(40)
Since, in this scenario, prior probability is equally divided amongst alternatives in ,
which, in combination with the laws of exponents, entails
(41)
Multiplying the leftmost and rightmost parts of (41) by yields
(42)
and so by (39), (40), (42) and the transitive property of equality,
(43)
In (6), substituting the right-hand side of (43) for the left-hand side of (43) yields
∎
Proof that (8) follows from (10) in Example 3.
The example supposes that each of attest to and . With assumption (10), the scenario meets the antecedent conditions of Theorem 1. Since all witnesses attest to , can be substituted for in Theorem 1, yielding
(44)
Since , the sum in (44) can be expanded and therefore
(45)
Since , the substitutions described in Theorem 5 and Theorem 7 are allowable for (45) and therefore
∎
Proof that (9) does not entail (10) and (10) does not entail (9).
Table 1 (columns #1 and #2): sets of probabilities for the two scenarios.
Consider scenarios where there is a condition and two witnesses and who must testify for or against . In these scenarios, and . If so, then there are 32 ways to combine the propositions , , , , and their negations. Of these combinations, 24 describe impossible scenarios such as , where is true and claims but does not tell the truth. Since they are impossible, their probabilities are always 0. For the remaining 8 combinations, there exist nonzero probabilities that add up to 1 while making condition (9) true and condition (10) false. One such set of probabilities is shown in column #1 of Table 1. There also exist nonzero probabilities that add up to 1 while making condition (10) true and condition (9) false. One such set of probabilities is shown in column #2 of Table 1.
The first eight rows of Table 1 are probabilities of combinations of conditions. The next seven rows of Table 1 are probabilities that can be calculated by summing quantities in the first eight rows. The last five rows are the probabilities involved in (9) and (10), which can be derived from the middle rows using (K4) and the definitions of conditional probability and independence, noting that, per (E1) and (E2), is equivalent to and is equivalent to .
Proof that (11) follows from (10) in Example 3 if .
With assumption (10), the scenario meets the antecedent conditions for Corollary 1. Applying Corollary 1 to alternatives and yields
(46)
Since by hypothesis , these two terms cancel each other out, and
(47)
Let function , in addition to mapping each witness in to the alternative in to which the witness attested, also map each alternative in to the set of witnesses in which attested to it. With , the products on the right-hand side of (47) can be split into two parts each, one for witnesses who claim and another for witnesses who claim :
(48)
Since , the substitutions described in Theorem 5 and Theorem 7 are allowable:
(49)
Since all witnesses are equally credible,
(50)
By the definitions of and given in the example,
(51)
By the definition of complement,
(52)
Therefore, by (46), (47), (48), (49), (50), (51), (52), the transitive property of equality and the laws of exponents,
∎
References
- [1] A. De Morgan (1847) Formal logic. Taylor and Walton, London.
- [2] A. De Morgan (2003) Formal logic. University Press of the Pacific, Honolulu, Hawaii. Note: Originally published as [1].
- [3] Hu and Qu (2020) Bayes’ theorem under conditional independence. arXiv preprint arXiv:2003.03970.