Homepage › Solution manuals › Joseph Blitzstein › Introduction to Probability › Exercise 1.62 (Birthday problem)
Exercise 1.62 (Birthday problem)
In the birthday problem, we assumed that all 365 days of the year are equally likely (and excluded February 29). In reality, some days are slightly more likely as birthdays than others. For example, scientists have long struggled to understand why more babies are born 9 months after a holiday. Let be the vector of birthday probabilities, with the probability of being born on the th day of the year (February 29 is still excluded, with no offense intended to Leap Dayers). The kth elementary symmetric polynomial in the variables is defined by
This just says to add up all of the
terms we can get by choosing and multiplying
of the variables.
For example,
, and
Now
let be
the number of people.
(a) Find a simple expression for the probability that there is at least one birthday match,
in terms of
and an elementary symmetric polynomial.
(b) Explain intuitively why it makes sense that
(at least one birthday
match) is minimized when
for all ,
by considering simple and extreme cases.
(c) The famous arithmetic mean-geometric mean inequality says that for
,
This inequality follows from adding to both sides of . Define by for Using the arithmetic mean-geometric mean bound and the fact, which you should verify, that
show that at least one birthday at least one birthday match , with strict inequality if , where the "given notation means that the birthday probabilities are given by . Using this, show that the value of that minimizes the probability of at least one birthday match is given by for all .
Answers
- (a)
- (b)
- Consider the extreme case where and for . Then, the probability that there is at least one birthday match is . In general, if for a particular , then a birthday match is more likely, since that particular day is more likely to be sampled multiple times. Thus, it makes intuitive sense that the probability of at least one birthday match is minimized when .
- (c)
- First, consider .
We can break up this sum into the sum of three disjoint cases.
- (a)
- Sum of terms that contain both and . This sum is given by
- (b)
- Sum of terms that contain either or but not both. This sum is given by
- (c)
- Sum of terms that don’t contain either or . This sum is give by
Thus,
Next, compare and . Expanding the elementary symmetric polynomials, it is easy to see that the only difference between the two are the terms that contain either the first, the second or both terms from and respectively.
Notice that because , the sum of the terms with only and only but not both is exactly equal to . Thus, the only difference between and are the terms and .
By the arithmetic geometric mean inequailty, . Hence, .
In other words, given birthday probabilities , we can potentially reduce the probability of having at least one birthday match by taking any two birthday probabilities and replacing them with their average. For a minimal probability of at least one birthday match then, all values in must be equal, so that averaging any and does not change anything.