Two-Envelope Paradox. Two Indistinguishable Envelopes, 1 and 2, Are
Two-Envelope Paradox. Two Indistinguishable Envelopes, 1 and 2, Are
Two-Envelope Paradox. Two Indistinguishable Envelopes, 1 and 2, Are
1. INTRODUCTION
The snake in the grass is that equal probabilities or chances are assigned
to winning and losing. Kraitchik writes: In reality, however, the probabil-
ity is not an objectively given fact, but depends on ones knowledge of
the circumstances. In the present case it is wise not to try to estimate the
probability.
A similar paradox is the two-envelope paradox. It is unclear who gave
the problem its modern form; Kraitchik already formulated a problem
where two people compare the number of pennies in their purses. Gardner
(1982) called this the wallet game. Zabell (1988a, b) heard it from Budrys
(see Nalebuff 1989). Our formulation is a bit technical, this is useful later
on.
Two-envelope paradox. Two indistinguishable envelopes, 1 and 2, are
submitted to you (the decision maker; later the designation we is used
as the combination of you and the statistician who tries to assist you).
Envelope 2 contains a check worth twice the unknown value, say y, in
Envelope 1. You choose one of the envelopes, say Envelope z, at random
and, after opening it, you observe the value x = zy contained by it. Finally
you are allowed to decide between
a1: keep the envelope you have.
a2: return this envelope and take the other one.
Two variants will be discussed. In Variant 1, the discrete case, you are told
that y N; Variant 2, the continuous case, tells you y R+ . A paradox
appears if you would argue that choosing at random implies that, as the
other envelope contains either 12 x (namely if z = 2) or 2x (when z = 1),
the expectation of the value contained by it is 12 12 x + 12 2x = 54 x, and that,
hence, swapping is advantageous on the average. The snake in the grass
is that you should not use the marginal or prior probability P(Z = 1),
but the conditional or posterior probability P(Z = 1|X = x), given the
knowledge you have, namely that X = Z Y has the outcome x observed.1
Unfortunately, the relevant posterior probabilities P(Z = z|X = x) (z =
1, 2) are unknown to you. Moreover you might question whether y can
really be regarded as the outcome of a random variable Y . The paradox
has been explained, but you are still in need of a solution to the problem.
Should you decide upon a1 or upon a2 , given the outcome X = x? That is
the question.
This problem has been discussed extensively in the literature, see, e.g.,
Zabell (1988a, b), Nalebuff (1989), and the (probabilistic) references cited
in Section 3. The literature shows that there are many ways to solve
this problem but, in the absence of additional information, these solutions
cannot be regarded as satisfactory.
Remark. One could suggest that the problem becomes more interesting
from the philosophical point of view if we discuss the contents of the
envelope chosen first without opening it. It is, of course, very natural for
the philosophical mind to keep thinking and to avoid the confrontation with
actual data. Note, however, that our formulation of the two-envelope prob-
lem is in some ways deeper and of more epistemological interest than that
where the envelopes remain unopened. From the perspective of our some-
what technical formulation it is clear that in the case of non-availability
of x, the paradox has its origin in the same fallacious assignment of
probabilities as before.
Note that choosing a1 leads to an expected amount E X = E (Z Y ) =
E Z E Y = 32 E Y which coincides with the amount E (3 Z )Y to be
expected from a2 . A fallacy appears if the latter amount is computed by
conditioning with respect to X and the relevant conditional probability
P(Z = 1|X ) is identified with the marginal probability P(Z = 1) = 12 .
This is erroneous because X = Y Z and Z are not statistically independent.
TRYING TO RESOLVE THE TWO-ENVELOPE PROBLEM 91
2. EXPLORATIONS
If you read the proofs you tend to interpret these terms in each proof in
such a way that the proof is correct. But if you take the interpretation that
makes one of the proofs correct and use that interpretation in the other
proof than the other proof is not correct.
The ambiguous terms are the amount you will gain by trading, if you
do gain and the amount you will lose by trading if you do lose. What
do these terms refer to? This all depends on what you mean by if you
do gain for example. Under what circumstances do you gain? In the first
proof this is the case if the amount in the envelope you did not pick is
double the amount you are now actually holding. In the second proof this
is the case if you picked the envelope which contains the highest amount
actually available.
The difference between these interpretation becomes clear if we look
at the case where you lose by trading. Now in the case of proof one and
the first interpretation, the amount you gain by trading, if you do gain
refers to twice the amount that is actually in your envelope, which is four
times what is in the other envelope. In case of proof two, on the other hand,
the amount you gain by trading, if you do gain refers to the amount you
would win by trading if you had picked the other envelope, which is one
times what is in the other envelope. A similar analysis is provided in Chase
(2002).
Regardless of which interpretation you choose, this does not resolve
the problem.
To make the choice between a1 and a2 additional knowledge will be
needed. In this respect it is natural to refer, as we shall do in Section 3,
to the probabilistic knowledge that P(Z = 1) = P(Z = 2) = 12 . It will
turn out that this factual knowledge does not yet settle the issue. That is
why we will look for more additional knowledge, perhaps of a less factual
kind, in the hope that, after all, some reasonable solution will appear. It
is in this respect that we will refer to the following logical paradox. The
point we shall make is that the context of this paradox enforces a solution
(at least temporarily).
Euathlus, a wealthy young man, was desirous of instruction in oratory and the pleading of
causes. He became a pupil of Protagoras, the keenest of all sophists, and promised to pay
him a large sum of money, as much as Protagoras had demanded. He paid him half of the
amount at once, before beginning his lessons, and agreed to pay the remaining half on the
day when he first pleaded before jurors and won his case. Afterwards, when he had been
for some little time a pupil and follower of Protagoras, and had in fact made considerable
progress in the study of oratory, he nevertheless did not undertake any cases. And when the
time was already getting long, and he seemed to be acting thus in order not to pay the rest
TRYING TO RESOLVE THE TWO-ENVELOPE PROBLEM 93
of the fee, Protagoras formed what seemed to him at the time a wily scheme; he determined
to demand his pay according to the contract, and brought suit against Euathlus.
And when they had appeared before the jurors to bring forward and to contest the case,
Protagoras began as follows: Let me tell you, most foolish of youths, that in either event
you will have to pay what I am demanding, whether judgment be pronounced for or against
you. For if the case goes against you, the money will be due me in accordance with the
verdict, because I have won; but if the decision be in your favour, the money will be due
me according to our contract, since you will have won a case.
To this Euathlus replied: I might have met this sophism of yours, tricky as it is, by not
pleading my own cause but employing another as my advocate. But I take greater satis-
faction in a victory in which I defeat you, not only in the suit, but also in this argument
of yours. So let me tell you in turn, wisest of masters, that in either event I shall not have
to pay what you demand, whether judgment be pronounced for or against me. For if the
jurors decide in my favour, according to their verdict nothing will be due you, because I
have won; but if they give judgment against me, by the terms of our contract I shall owe
you nothing, because I have not won a casex.
This is the paradox. But the context is such that Protagoras went to court.
This had the following consequences according to Gellius:
Then the jurors, thinking that the plea on both sides was uncertain and insoluble, for fear
that their decision, for whichever side it was rendered, might annul itself, left the matter
undecided and postponed the case to a distant day. Thus a celebrated master of oratory
was refuted by his youthful pupil with his own argument, and his cleverly devised sophism
failed.
We conclude that, due to the context within which it appeared, this paradox
is solved, from a practical point of view, in favor of Euathlus. Later authors,
e.g., Stewart (2000), missed this point. Dealing with the two-envelope
paradox we shall look for similar sources of additional information, us-
ing probability theory (Section 3), mathematical statistics (Section 4), and
game theory (Section 5).
Remark. One may not be satisfied by our presentation of the Protagoras
paradox which, indeed, is nothing but a translation of something archaic.
In consequence, one may see no reason to put faith in the moral we draw
from the paradox and which is that as the purely rational approaches of
Section 3 and part of Section 4 fail to be applicable, we may still try to
obtain something practical. With respect to the two-envelope problem
we completely agree that there is not much reason to put faith in this
moral because the factual information (the outcome x and the knowledge
that P(Z = 1) = 12 ) is very weak. In the beginning of Section 4 some
reference will be made to a situation where relevant additional information
is available. In Section 5 other additional information, only partly factual,
will be incorporated.
94 CASPER J. ALBERS ET AL.
We shall not analyze the Protagoras paradox here more carefully, log-
ically and mathematically: the mere reason of mentioning this paradox
was for making the statement that sometimes a temporary solution exists
suffices.
3. PROBABILITY THEORY
U (x, y, z; a1 ) = x = yz
2x if z = 1
U (x, y, z; a2 ) = 2 32z
x = y(3 z) = 1
2
x if z = 2
P(Z = 1, X = x)
P(Z = 1|X = x) =
P(X = x)
P(Z = 1, Y = x)
=
P(Z = 1, Y = x) + P(Z = 2, Y = 12 x)
f (x)
= .
f (x) + f ( 12 x)
(see Broome 1995). Now the optimal procedure prescribes to decide upon
a2 if f ( 12 x)/ f (x) is smaller than 4.
The Achilles heel of this (personalist) Bayesian solution is the assump-
tion that y is the outcome of a random variable Y with known density f .
Concerning f one can distinguish three cases: (i) that where f is fully
known (and above mathematics are directly applicable), (ii) that where
f is not completely known, but not complete unknown either (and some
other solution has to be found), and (iii) that where it is not reasonable to
96 CASPER J. ALBERS ET AL.
4. MATHEMATICAL STATISTICS
d( ) a2
E (2X 3Y )(X ) = 1
2
E (2Y 3Y )(Y ) + 12 E (4Y 3Y )(2Y )
= 1
2
E Y ((2Y ) (Y ))
5. GAME THEORY
If the reader has the idea that this game-theoretic perspective will not
be very helpful in the present context, then he is right, as we shall see in the
rest of this section. He is invited to continue with Section 6 where impor-
tant interpretations are made. (The decision-theoretic NeymanPearson
Wald approach is, of course, of considerable interest elsewhere; the point
is that a sufficient amount of additional information should be available to
have faith in any approach.)
A game being defined as a triple, the first game (A, Y, U ) to consider
is that where Player 1 chooses an action from A = {a1 , a2 }, Player 2 (the
rich eccentric or Nature) chooses y from Y (either N or R+ ) and
yz if a = a1
U (y, z; a) =
y(3 z) if a = a2
goes from Player 2 to Player 1 (you/we). This payoff depends on the
outcome z of Z . This formulation is inadequate in the sense that Player 2
cannot be regarded as the minimizing player and also in the sense that
the information x = yz available to Player 1 has not been used. To allow
for the last mentioned statistical input, the game (A, Y, U ) is replaced by
the game (D, Y, U ) where we now have that Player 1 chooses a decision
procedure d : X A from the class D of all (nonrandomized) rules of
this kind. The payoff is defined as
U (y; d) = E U (y, Z ; d(x, Z ))
y if d(y) = a1 ,d(2y) = a2
= 2y if d(y) = a2 ,d(2y) = a1
3
2
y if d(y) = d(2y)
This formulation still is irrelevant in the sense that Player 2 will choose
y as small as possible. If in the discrete case the specification Y =
{1, 2, . . . , n} is made (for some given n) then taking y = 1 is the min-
imax strategy of Player 2 and any rule with d(1) = a2 and d(2) = a1
is such that the minimum payoff is maximum. The two-person zero-sum
game (D, Y, U ) is determined in the sense that maxd min y U (y; d) =
min y maxd U (y; d) = 2 while the saddle points are characterized in the
above. Randomization can be dispensed with.
In his review of Wald (1964), Savage (1951) explained and criticized
the preoccupation by losses, risks, errors, etc., in the NeymanPearson
Wald approach to statistics. In this approach we are not discussing utilities
but regrets; shortcomings with respect to the best. Given y, the best we can
do is to swap if we observe y and to keep what we have if we observe 2y.
Hence
L(y, z; a) = max U (y, z; ah ) U (y, z; a)
h=1,2
TRYING TO RESOLVE THE TWO-ENVELOPE PROBLEM 101
= 2y U (y, z; a)
(2 z)y if a = a1
=
(z 1)y if a = a2
If the reader has the idea that this loss-function reformulation is too manip-
ulative to be satisfactory, then we agree. The reader, again, might continue
with Section 6. The analysis, however, is not uninteresting, as we shall see
below.
After this Umwertung aller Werte, the rich eccentric has become
Player 1 while we are now Player 2, the minimizing player in the two-
person zero-sum game where we are allowed to choose from the set D and
pay the amount
where X = y Z and this risk depends on the value y chosen by Player 1, the
equivalent of Nature in Walds approach to the problems of statistics, ini-
tiated nicely in his review of Neumann and Morgenstern (1944; see Wald
1947). The game (Y, D, R) is not determined because the lower value of
the game
and that, hence, sup y[0,n] R(y, d ) = 14 n. Next we concentrate the atten-
tion on the two possibilities y = 12 n and y = n having x 12 n, n, 2n
as consequence. Any nonrandomized rule d assigns values d( 12 n), d(n),
d(2n) to these outcomes. The most appropriate assignments (a2 , a1 , a1 )
and (a2 , a2 , a1 ) lead to maximum risks equal to 12 n and 14 n, respectively.
Hence sup y R(y, d) 14 n holds for all d. Equality holds for d which,
hence, is minimax in the sense that
= v.
Note that minimaxity holds with respect the the class D of nonrandomized
mixed rules. If randomization is allowed, the upper value v of the game
can be decreased to n6 by taking
1
3 , x n2
(x) = 23 , n2 < x n
1, x > n
TRYING TO RESOLVE THE TWO-ENVELOPE PROBLEM 103
R(y, ) = E (y Z )(3y 2y Z ) + yE (Z 1)
= 12 y(1 + (y) (2y)).
R( n2 , ) = 1n
22
(1 + ( n2 ) (n)) = n6
R(n, ) = 1
2
n(1 + (n) (2n)) = n6 ,
n
r (, ) = R(y, ) f (y)
y=1
= 1
2
f T (a + A)
104 CASPER J. ALBERS ET AL.
If n is odd, then ( n1
2
) = 0, (n 1) = 1
3
is necessary to have
max R(, ) n16
. If we choose such that
0 if 1 i n1
1 2
if n1 <i n1
(i) = 3 2
0 if i = n
1 if n < i 2n
TRYING TO RESOLVE THE TWO-ENVELOPE PROBLEM 105
n1
6
}.
R(, ) =1
(1 + () (2))
2
12 if n4
= 1
if n4 < n2
31
6
if n2 < n
1 if = 1
R(, ) =
R(, ) elsewhere
R(, ) = 1
(1 + () (2))
1
2
if n1
12 4
if n1 < n1
= 3 4 2
1
if n1 <n 1
6 2
0 if =n
Analogue to the case n even, the solution and its non-uniqueness are trivial.
or 2x. Issues like the one discussed in the two-envelope problem cannot be
settled unless something additional is incorporated. The Protagoras para-
dox is of particular interest in this respect because, there, the context was
such that a (temporary) solution was enforced. With respect to the two-
envelope problem one has to go beyond the logic essentials because the
knowledge that P(Z = 1) = P(Z = 2) = 12 has to be incorporated.
The Bayesian solutions obtained, however, are not applicable because they
depend on the unknown function f . The really interesting element of this
two-envelope problem is, however, that nothing is known about f (even not
if such an f actually exists). In Section 4 an attempt was made to settle the
issue mathematically. These attempts were not successful. In Section 5 we
tried to incorporate arguments from the theory of games and from Walds
theory of statistical decision functions. It was only after the specification of
the upper bound n for y, that we could arrive at something not completely
unreasonable.
We conclude that, if almost no information exists with respect to f , then
it is wise not to try to estimate the posterior probability P(Z = 1|X = x).
Note that already more than half a century ago, Kraitchik (1943) made
a similar statement, see Section 1. If an optimal decision or optimal pro-
cedure is advocated then a scrutiny will reveal that it largely is based on
something fictitious. It would be a fallacy of misplaced concreteness if one
accepts such result as sufficiently compelling or valid.
We are interested in the two-envelope problem, because there are some
consequences for Statistical Science at large. It often happens that the sta-
tistician is asked to use data in order to compute some posterior probability,
to make a distributional inference, or to suggest an optimal decision. Some,
perhaps many, of these situations are such that the lack of relevant infor-
mation is so large that it is wise not to try to settle the issue. This leaves
us with the problem to draw a distinction between those situations where
the information is too weak to say something and those where the informa-
tion is sufficiently overwhelming. The difficulty is, of course, in the area
between. (See also the beginning of Section 4 where reasonable Bayesian
or non-Bayesian solutions will emerge if the value of x is available for a
sufficiently large number of similar games.)
The criticism mentioned here does not only refer to the Bayesian ap-
proach where it has to be considered unwise to specify a prior distribution
unless relevant information is sufficiently abundant. Criticism refers also to
the Neyman-Pearson-Wald approach where one is trying to discuss many
possible worlds, one for every value of the parameter . Usually we as-
sume that exactly one of these worlds is factually true, but we dont know
which one. We, perhaps, have to admit that our approaches are consid-
TRYING TO RESOLVE THE TWO-ENVELOPE PROBLEM 107
erably manipulative and, as Karl Pearson noticed in his paper about the
Fundamental Problem of Practical Statistics (1920), in the nearness of
an abyss.
In our approach to the two-envelope problem, the economist, psy-
chologist and logician did not get as much attention as the probabilist,
mathematical statistician and game theorist, because we considered P(Z =
1) = P(Z = 2) = 12 to be knowledge which should not be ignored.
The two-envelope problem is an extremely simple example of the com-
plicated socio-economic and other issues to be settled in practice. Sciences
like economy, sociology, and psychology try to contribute to the settlement
of these issues by emphasizing various aspects of the managerial use of
information. The basis of everything is, of course, that data are collected
representing the state of reality. Unfortunately, such statistical data leave
room for uncertainties of various kinds. The sciences indicated have much
more to say than is suggested in the context of the two-envelope problem.
What they have to say, however, is largely of a qualitative existential
nature. The economist may infer from empirical data that assessments of
utilities and probabilities are considerably subjective, i.e., different from
person to person. The psychologist may use his data to characterize people
as systematically risk averse or risk prone. The sociologist will argue that
economic behaviour is much less rational than a decision-theorist might
hope. The two-envelope problem provides a nice illustration of the limits
of reason and the need of information to make inferences and decisions
reasonable.
ACKNOWLEDGEMENTS
The physicist L. K. ter Veld taught us about the Protagoras paradox. The
decision procedure E (2X 3Y )(X ) on page 99 arose during discus-
sions with W. Oudshoorn and R. Auer. The reviewers made very helpful
suggestions that led to improvements of the manuscript.
N OTES
1 We use standard probabilistic terminology; random variables are denoted by capital
letters and regarded as functions on some underlying probability space (, F , P ). Factual
outcomes are denoted by lowercase letters like x, y, z. If a priori possible outcomes are
discussed, then we use , , to denote the possibilities for x, y, z.
2 In his book Smullyan uses the letter n in his proof of proposition 1. We have replaced it
with x to keep a uniform notation.
3 See later in this section, is used as notation for a randomized rule.
108 CASPER J. ALBERS ET AL.
R EFERENCES