ansaurus

Question

Answer 1

A:

Create a list.
Create a deep copy of that list.
Shuffle the first list.
Assert list1 != list2

Add more descriptive tests as you require. Quality of the shuffle, randomness, etc...

As Alex Martelli pointed out, you can perform a statistical analysis of the sorting to confirm it is sorted to the degree you expect.

The best outcome is that every card position has changed. Meaning, every one of the 52 cards is now in a new position. You could take my above approach, record the number of elements that are different and set a threshold for the test.

Expect at least 20 cards to be in new positions. Create the list, copy it, sort it and then compare it. If the result is less than 20, fail, otherwise pass.

Ty 2009-09-18 04:42:24

There's some small chance that the order is the same after shuffling.

wrang-wrang 2009-09-18 04:47:31

Say the chance of the shuffle returning the same list it was given is p. Then 100% - p of the time, this test will work correctly. Now matter what you do, there will always be a p. You can make it smaller by re-running on failure, but that will cost you more testing time. Which is generally considered a bad thing.

Ty 2009-09-18 04:52:50

Answer 2

+1 A:

One approach is to make statistical tests; after each shuffle check for correctness (the set of cards must not have been changed, only the order), and collect statistics about some random variables ("position of the 7 of diamonds", "is the 5 of clubs before or after the 8 of hearts", and the like) to be tested after a suitable number of shuffles by Student's t-test and other statistical hypothesis-testing approaches.

Alex Martelli 2009-09-18 04:43:47

Good recommendation.

Ty 2009-09-18 04:44:56

Tx @Ty, so if you like it why no upvote?-)

Alex Martelli 2009-09-18 05:27:24

Answer 3

+1 A:

A good question. First, an absolute pass/fail test: after shuffling, the multiset (e.g. compare after sorting) must be unchanged.

To test the randomness, you'll need to do sufficient shuffles that the probability of a false "not sufficiently random" failure is vanishingly small. For instance:

There is a .0000001% chance that in 10000 shuffles, that a particular card is in one of the given 52 slots less than (1-e)/52 or greater than (1+e)/52. (for some small e, I don't know offhand how to compute it).

A correct program may "fail" such a test, but shouldn't fail very often.

With respect to shuffling; one common failure is to do this:

for i from 1..52: choose a random j from 1..52 and swap card i with card j (wrong)

That doesn't give you any permutation with equal probability; this, however, does:

for i from 1..52: choose a random j from i..52 and swap card i with card j (right)

wrang-wrang 2009-09-18 04:47:03

Answer 4

+1 A:

John Kugelman 2009-09-18 04:54:41

Answer 5

A:

Since I haven't tried this I can't say immediately how practical it is, but it should be possible to make unit tests with small decks and a special deterministic random-number-generator, run exhaustively such that the shuffler should produce each possible permutation once.

Darius Bacon 2009-09-18 05:00:57

Answer 6

+1 A:

There was the subject of an exhaustive (or exhausting) thread on the Yahoo Groups TDD list a few years ago. Ron Jeffries has some useful insights but but it is best to start at the top.

APC 2009-09-18 05:01:58

Answer 7

+2 A:

Your goal is to test shuffle(). Since you know how you constructed shuffle(), it would be a deterministic unit test of your initial deck vs. the shuffled deck if you were able to know the series of numbers generated.

This is a case where injecting a method into your Deck() class during testing can make your shuffle function deterministic.

Build your class to use the random() function by default but to use a predetermined, number generating function when injected. For example, in Python you can do:

class Deck():
   def __init__(self, rand_func = random.random):
      self._rand = rand_func
   def rand(self):
      return self._rand()

When simply using Deck with no arguments, you get the random number expected. But if you craft your own random number function, you can generate your predetermined sequence of numbers.

With this construction you can now build an initial deck (whatever size you want) and a list of random numbers (again, whatever size you need) and you will know what to expect as output. Because shuffle() doesn't change between the injected version and the truly random version you can unit test shuffle() deterministically and yet have random behavior at runtime. You can even generate multiple different number sequences if there are corner cases you want to test.

With respect to the other's answers involving statistical modeling: I think those are acceptance level tests to prove the correctness of the "shuffle" algorithm, but it doesn't deterministically unit test the function shuffle()'s implementation.

Michael Groner 2009-09-18 06:01:23

ansaurus

tags:

views:

answers:

Testing a card deck shuffler

related questions