ansaurus

Question

What is the optimal winning strategy for this modified blackjack game?

Answer 1

+1 A:

Here are some thoughts about the data you've collected:

It's mildly useful for telling you what your "hit limit" should be, but only if you know that your opponent is following a similar "hit limit" strategy.
Even then, It's only really useful if you know what your opponent's "hit limit" is or is likely to be. You can just choose a limit that gives you more wins than them.
You can more or less ignore the actual values in the table. It's whether they're positive or negative that matters.

To show your data another way, the first number is your opponent's limit, and the second group of numbers are the limits you can choose and win with. The one with an asterisk is the "winningest" choice:

12:   13, 14, 15, 16*, 17, 18
13:   14, 15, 16*, 17, 18, 19
14:   15, 16, 17*, 18, 19
15:   16, 17*, 18, 19
16:   17, 18*, 19
17:   18*, 19
18:   19*, 20
19:   12, 20*
20:   12*, 13, 14, 15, 16, 17
21:   12*, 13, 14, 15, 16, 17, 18, 19, 20

From that, you can see that a hit limit of 17 or 18 is the safest option if the opponent is following a random "hit limit" selection strategy because 17 and 18 will beat 7/10 opponent "hit limits".

Of course if your opponent is human, you can't reply on them self-imposing a "hit limit" of under 18 or over 19, so that completely negates the previous calculations. I still think these numbers are useful however:

I agree that for any individual hand, you can be reasonably confident that your opponent will have a limit after which they will stop hitting, and they'll stay. If you can guess at that limit, you can choose your own limit based on that estimate.

If you think they're being optimistic or they're happy to risk it, choose a limit of 20 - you'll beat them in the long run provided their limit is above 17. If you're really confident, choose a limit of 12 - that will win if their limit is above 18 and there are much more frequent winnings to be had here.

If you think they're being conservative or risk averse, choose a limit of 18. That will win if they're staying anywhere below 18 themselves.

For neutral ground, maybe think about what your limit would be without any outside influence. Would you normally hit on a 16? A 17?

In short, you can only guess at what your opponent's limit is, but if you guess well, you can beat them over the long term with those statistics.

Damovisa 2010-02-20 08:52:42

@Damovisa: You seem to come to some of the same conclusions that I do. ( My net column more or less indicates the same as your table ). My natural instinct would be to hit on 16 always, sometimes hit on 17, and never hit on 18.

hobodave 2010-02-20 09:24:03

What I surmise is that a player _not_ following the same strategy of a hard hit limit would _always_ stay on 19, 20, and 21, and _always_ hit on 12 through 15. Thus for each hand they'd probably have a hit limit of 16, 17, 18, or 19 depending on whatever. This makes me thing that 19 might be the best hit limit to have, as it will tend to beat all of those hit limits in the long run, tying the 19. The _only_ apparent way to beat this would be for the opponent to adopt this strategy and always use a hit limit of 12 or 20.

hobodave 2010-02-20 09:29:02

So, basically over say, 400,000 hands, it doesn't matter if they randomly decide to stay on any of 16 through 19. If they did each equally then I'd win roughly 2867, 3314, 2080, and 0 more games than them respectively. Is this flawed?

hobodave 2010-02-20 09:33:44

No, I think that's pretty accurate. You're assuming they're not choosing "bad" options; 16-19. If you choose 19 as your limit, you'll beat them in the long run.

Damovisa 2010-02-20 09:44:19

Answer 2

+2 A:

Two comments:

It looks like there isn't a single dominating strategy based on a "hit limit":
- If you choose 16 your opponent can choose 17
- if you choose 17 your opponent can choose 18
- if you choose 18 your opponent can choose 19
- if you choose 19 your opponent can choose 20
- if you choose 20 your opponent can choose 12
- if you choose 12 your opponent can choose 16.

~~2. You do not mention whether players can see how many cards their opponent has drawn (I would guess so). I would expect this information to be incorporated into the "best" strategy.~~ (answered)

With no information about the other players decisions, the game gets simpler. But since there is clearly no dominant "pure" strategy, the optimal strategy will be a "mixed" strategy. That is: a set of probabilities for each score from 12 to 21 for whether you should stop or draw another card (EDIT: you will need different probabilities for a given score with no aces vs the score with aces.) Executing the strategy then requires you to randomly choose (according to the probabilities) whether to stop or continue after each new draw. You can then find the Nash equilibrium for the game.

Of course, if you are only asking the simpler question: what is the optimal winning strategy against suboptimal players (for example ones that always stop on 16, 17, 18 or 19) you are asking an entirely diiferent question, and you will have to specify exactly in which way the other player is limited compared to you.

Rasmus Faber 2010-02-20 09:10:58

That's a good point, although the number of cards drawn might not tell you very much, particularly as the deck is shuffled every hand. Two cards could be anywhere from a score of 3 to a score of 21. That said, I see your point - if the player has only two cards, it's less likely they'll have a higher hand, but you can't rule it out. It'd be interesting to add that into the mix!

Damovisa 2010-02-20 09:17:10

@Rasmus: 2: I updated the question. You do not know.

hobodave 2010-02-20 09:19:03

Answer 3

+3 A:

I'm suspicious of your results. For example, if the opponent aims for 19, your data says that the best way to beat him is to hit until you reach 20. This does not pass a basic smell test. Are you sure you don't have a bug? If my opponent is striving for 19 or better, my strategy would be to avoid busting at all costs: stay on anything 13 or higher (maybe even 12?). Going for 20 has to wrong -- and not just by a small margin, but by a lot.

How do I know that your data is bad? Because the blackjack game you are playing isn't unusual. It's the way a dealer plays in most casinos: the dealer hits up to a target and then stops, regardless of what the other players hold in their hands. What is that target? Stand on hard 17 and hit soft 17. When you get rid of the bugs in your script, it should confirm that the casinos know their business.

When I make the following replacements to your code:

# Replace scoring method.
def score
  s = inject(0) { |sum, c| sum + c }
  return s if s < 21
  n_aces = find_all { |c| c == 11 }.size
  while s > 21 and n_aces > 0
      s -= 10
      n_aces -= 1
  end
  return s
end

# Replace section of code determining hand outcome.
my_score  = my_hand.score
opp_score = opp_hand.score
my_score  = 0 if my_score  > 21
opp_score = 0 if opp_score > 21
if my_score == opp_score
  draw += 1
elsif my_score > opp_score
  win += 1
else
  lose += 1
end

The results agree with the behavior of casino dealers: 17 is the optimal target.

n=10000
                                                       opponent_limit
my_limit |        12        13        14        15        16        17        18        19        20        21       net
-------- |  --------  --------  --------  --------  --------  --------  --------  --------  --------  --------  --------
      12 |        --      -843     -1271     -1380     -1503     -1148      -137      1234      3113      6572
      13 |       843        --      -642     -1041     -1141      -770       -93      1137      2933      6324
      14 |      1271       642        --      -498      -784      -662        93      1097      2977      5945
      15 |      1380      1041       498        --      -454      -242      -100       898      2573      5424
      16 |      1503      1141       784       454        --      -174        69       928      2146      4895
      17 |      1148       770       662       242       174        --        38       631      1920      4404
      18 |       137        93       -93       100       -69       -38        --       489      1344      3650
      19 |     -1234     -1137     -1097      -898      -928      -631      -489        --       735      2560
      20 |     -3113     -2933     -2977     -2573     -2146     -1920     -1344      -735        --      1443
      21 |     -6572     -6324     -5945     -5424     -4895     -4404     -3650     -2560     -1443        --

Some miscellaneous comments:

The current design is inflexible. With a just little refactoring, you could achieve a clean separation between the operation of the game (dealing, shuffling, keeping running stats) and player decision making. This would allow you to test various strategies against each other. Currently, your strategies are embedded in loops that are all tangled up in the game operation code. Your experimentation would be better served by a design that allowed you to create new players and set their strategy at will.

FM 2010-02-20 15:37:07

hobodave 2010-02-20 18:44:48

@FM: "Note, in particular, that 21 + 21 = 42, not 44." Yes. If both hands total 44 then both have busted (a draw). If they both have 21 then it is a draw as well.

hobodave 2010-02-20 18:46:22

hobodave 2010-02-20 18:49:15

@FM: _(1) Have each player compute his own score after receiving a new card and then remember it, rather than recalculating it several times_: If I do this I lose information about whether any existing cards are an Ace.

hobodave 2010-02-20 18:58:11

@hobodave I agree my comment about 21+21 was a little off the mark. But there was a bug nonetheless. :) Regarding the scoring, if you play a hand, then set the scores to zero for players who busted, your logic to decide who won will be easier to grasp. As far as soft aces go, each player needs to keep track of 2 elements: a hard score and a tally for N of soft aces. If a player exceeds 21, convert a soft ace (11) to a hard ace (1) and reduce the score by 10.

FM 2010-02-20 19:17:13

(3) Use the Fisher-Yates shuffle rather than sort_by { rand }.: I've tested with this and it is slower than my current implementation.

hobodave 2010-02-20 19:34:53

@hobodave Had to run an errand.... Regarding this: "What you're not seeing in this data is the ridiculously high number of draws for both players if they hit until 19 and 20, due to both busting." That's further evidence that the results you posted are off by a lot. If 2 players using high targets (19 or 20) are busting at high rates, this suggests that one of them could reap **large** rewards by adopting a different strategy. But your data suggest that targets of 12 and 20 are roughly equally good against an opponent aiming for 19. It doesn't add up.

FM 2010-02-20 20:45:34

@FM: Why did you change the scoring? You broke it. [10,11,11].score shows a score of 22 with your method. The Array.score method in my example works for all cases and doesn't need to be changed.

hobodave 2010-02-21 10:41:38

@hobodave Doh! Fixed it now. The general outcome is the same: 17 is the way to go.

FM 2010-02-21 15:02:34

@hobodave At least on my Ruby (ruby 1.8.6, i386-mswin32) the `score` method that you posted does not run, and it seemed like a puzzle anyway. That's why I changed it.

FM 2010-02-21 15:10:23

@hobodave In your recent bug fix you have `if my_hand.score == opp_hand.score` **after** manually setting busts to zero. You need to use this instead: `if my_score == opp_score`. That's why your data are still off.

FM 2010-02-21 15:37:43

@FM: Ah, let me fix that :)

hobodave 2010-02-21 22:13:49

ansaurus

tags:

views:

answers:

What is the optimal winning strategy for this modified blackjack game?

Questions

Background

Modified Blackjack Rules

What is a deck of cards?

Algorithm

Output (N = 100 000)

Interpretation

related questions