In this case the function P(m) will represent the number of games played and T(m) will represent the potential number of games. This ration will be expressed as X and for all value of X less than threshold z we will consider the game unique.
Chess is about 500 years old. Let's assume that 1% of all the people who have lived since then have played a game of chess every other day for 50 years. That means around 10 trillion games have been played (10^12). For starters that means an unimaginably small percentage of the total playable games of chess have been played (over 100 zeroes after the decimal point). But that's when m gets up to 80. how about for the first few moves.
If we model out how many potential combinations we get of the first, let's say 10 moves, we get the following table.
m | Theoretical Combos |
---|---|
1 | 30 |
2 | 900 |
3 | 27,000 |
4 | 810,000 |
5 | 24 million |
6 | 729 million |
7 | 22 billion |
Next we have to model how many games have been played for each value of m. Lets use a standard normal distribution for this function such that:
For N we can use our previously calculated estimate of played games at 10^12. We can use Professor Shannon's mean number of moves at 80, and we can use a standard deviation of 20 moves. From this we can calculate the probability that your game is unique:
m | Theoretical Combos | P(Unique) |
---|---|---|
1 | 30 | <1% |
2 | 900 | <1% |
3 | 27,000 | <1% |
4 | 810,000 | <1% |
5 | 24 million | <1% |
6 | 729 million | <1% |
7 | 22 billion | <1% |
As you can see nearly all of of the 10 trillion games have been played have made it to move 7 so the likelihood that your game has been played is high. But what's interesting is the ballooning denominator. It'll hit that 10 trillion number soon. So what if we look at the next 7 moves?
m | Theoretical Combos | P(Unique) |
---|---|---|
8 | 656 billion | <1% |
9 | 20 trillion | <1% |
10 | 590 trillion | 5% |
11 | 17 quadrillion | >99% |
12 | 531 quadrillion | >99% |
13 | 15 quintillion | >99% |
14 | 478 quintillion | >99% |
By the time you get to move 10 there are 590 trillion possible games to have played, against 10 trillion played all time, so there's now a high likelihood that your game is unique. Once you take the 11th and 12th move of the game it is a virtual certainty that your game of chess has never been played before!
Now obviously this isn't entirely true, if you're playing at high levels of competitive chess you may know theoretical openings and defenses that extend well into the 11th and 12th moves. But probabilistically, games that reach that point have an extremely high likelihood of being unique.
I decided to test my statistics by using chess.com's analysis engine. The engine has over a billion stored games from its players and you can see how many times your specific game has been played, and what the most common next move is. I decided to play the most theoretically common sequence of moves, the Sicilian Defense, and see how far I got before it was something new. I got 36 moves in before the most common line of moves became something the engine had never seen before, so this was an extreme upper bound. When I went back and decided to go with even the second most common next move, it was only 12 moves before it was a unique game. Confirming that even slightly varying from the common theoretical line almost always puts a chess player in the realm of unique games by move 12.
So for all you non chess grandmasters out there - next time you're playing chess, and you're 15 moves in, stop to think for a moment that you're probably facing a specific problem that has never been seen before, and may well never be seen again! It kind of makes the moment more interesting.