ansaurus

Question

Answer 1

+2 A:

This stuff makes my brain hurt, so I'm not positive that this answer is correct, but here goes.

In negamax, the score is always evaluated relative to the player currently on move. If it's white's move, then a high score is good for white. If it's black's move, then a high score is good for black. So if you have a leaf node, whether the score is +inf or -inf is determined not by whether the node is a win for white or black, but whether it's a win for the player you're currently evaluating. Replace this:

return winner == Player.AI ? (10000 / depth) : (-10000 / depth);

with this:

return winner == player ? (10000 / depth) : (-10000 / depth);

There is a similar problem in your evaluation function. Replace this:

return player == Player.AI ? score : -score;

with this:

return score;

Again, I'm not sure this is right. But I hope you try those two changes and let me know if it works. I'm very curious!

Brennan Vincent 2010-07-03 02:03:59

Will give it a try tomorrow, but already thanks for the effort :)

JulianR 2010-07-03 02:31:39

I gave it a try and got it to work. :). Also, double-check that your lookup table is set up correctly. And you didn't post all of your code, so it's hard to know for sure what you're doing with bestColumn, but if that's what you're using to choose the next move for the AI, don't you only want to set it when depth = 0?

Brennan Vincent 2010-07-03 12:36:52

@Brennan Vincent - Thanks Brennan, your suggestions definitely improved the performance of the AI. But still, it has some curiosities, see the update of my question.

JulianR 2010-07-03 18:50:24

Answer 2

+1 A:

If it's not blocking certain combinations it sounds like you have a flaw in your table of possible wins.

I also see a problem in your evaluation function: It gives value to moves that have NO hope of winning. Suppose you have xoo.x, you're playing o. Your routine says it's worth 15 points to play here when in reality it's worth 0. Any win pattern that already contains tiles from both players is of no value to anyone.

I have found that when debugging this sort of thing the debugger is of little value as it doesn't let you see the big picture very well. Try writing to a log file each pattern it's checking--put an actual drawing in the log.

Loren Pechtel 2010-07-03 19:47:55

Well, it does block them, but only when I search 2, 4 or 6 deep, but not when I search any other depths. I've double checked the win combinations table. I've added some logging to the application now, and at first sight there are some odd things going on. For example, the log size of search depth 7 is 900 KB, while the one for depth 6 is over 12 MB. The same for depth 2 and 3. Your comment about the evaluation function is correct, I would have to see if fixing it is worth having a slightly slower eval function.

JulianR 2010-07-03 20:40:02

You certainly have some sort of bug--increasing the depth should never make a smaller log. Long ago I did an AI for Reversi and I found the evaluation function was critical. Putting more muscle into it even at the expense of one layer of depth made it much stronger.

Loren Pechtel 2010-07-04 00:44:00

The evaluation function is certainly important, but since he's got a bug that obviously goes beyond imperfections in the evaluation function, I don't think that's where he should focus his efforts at the moment.

Brennan Vincent 2010-07-04 03:07:23

ansaurus

tags:

views:

answers:

Adverserial search troubles

related questions