If you have played Blackjack before, you may have heard about the story of the MIT Blackjack Team to “beat the dealer” by counting cards. The strategy used by MIT Blackjack Team originated from the book “Beat the dealer” by Dr. Edward Thorp. The strategy generally estimate the probability of having “large cards” and “small cards” next round from the remaining cards in deck by counting cards already shown. This set of rules helps the Blackjack Team won millions from casinos. Today, I would like to put our focus on part of the strategy to decide our betting size.
Recalling what we have learned in Markov Chain Lessons, the Gambler’s Ruin tells us that if a gambler enters a negative expected value game, eventually this gambler would go bankruptcy.
So we must choose a game with positive expected value if we don’t want to lose all our money. In Blackjack, card-counting skills enables gamblers to decide when to hit and when not to by estimating the probability of winning. But we have only solved half of the question. Now suppose we have entered a gamble with positive expected value, how much should we bet each time? It would be irrational to bet all our wealth one time. So, the next question is: how much we should bet each time so as to maximize our profit?
Let’s do some very simple Maths.
Denote f as the fraction of our current wealth we bet, as the random variable of outcome of the i-th bet,
and
as mean and variance of
respectively. Now we will maximize the expected logarithm long-term growth rate of N period of investment.
If we assume each bet is independent, then we would maximize
By Taylor Expansion, we have
Approximately, we have
By differentiation, we got the Kelly’s Criterion:
With this f*, the expected growth rate in each bet would be
As we can see from the diagram above, if we bet twice Kelly, we would have an expected growth rate of zero, and further increasing our bet would be a suicide in long run, even if we have the advantage in the gamble. The expected long-term growth rate will also decrease when the gambler bets aggressively, yet with higher risk than Kelly. This is why some gamblers would use a “Half Kelly”, half betting size of Kelly’s Criterion, to protect them from being over-aggressive. The size is halved while the expected growth rate is 75% of Kelly’s, which makes “Half Kelly” a welcoming choice.
Now, let’s look at the expected long-term growth rate = which we tried to maximize.
Have you seen something similar before?
In our Corporate Finance Course on Mean-Variance Optimization, we aim at minimizing
for vector w as portfolio we have, vector as mean return of each portfolio,
is the covariance matrix for the returns on the assets in the portfolio
It is equivalent to Kelly’s criterion with
We may in fact consider f* as the proportion of asset we invest in market portfolio, while the remaining (1-f*) to be invested in risk-free rate. We would dynamically rebalance f* by the change of expected return and variance of market portfolio.
Hence, Kelly’s criterion is one of the candidates for multi-period portfolio optimization.
For more details, you may refer to http://www.bjmath.com/bjmath/thorp/paper.htm
Reference: Paul Wilmott on Quantitative Finance