Edit: to clarify, the problem is with the second algorithm.

I have a bit of C++ code that samples cards from a 52 card deck, which works just fine:

void sample_allcards(int table[5], int holes[], int players) {
    int temp[5 + 2 * players];
    bool try_again;
    int c, n, i;

    for (i = 0; i < 5 + 2 * players; i++) {
        try_again = true;
        while (try_again == true) {
            try_again = false;
            c = fast_rand52();
            // reject collisions
            for (n = 0; n < i + 1; n++) {
                try_again = (temp[n] == c) || try_again;
            }
            temp[i] = c;
        }
    }
    copy_cards(table, temp, 5);
    copy_cards(holes, temp + 5, 2 * players);
}

I am implementing code to sample the hole cards according to a known distribution (stored as a 2d table). My code for this looks like:

void sample_allcards_weighted(double weights[][HOLE_CARDS], int table[5], int holes[], int players) {
    // weights are distribution over hole cards
    int temp[5 + 2 * players];
    int n, i;

    // table cards
    for (i = 0; i < 5; i++) {
        bool try_again = true;
        while (try_again == true) {
            try_again = false;
            int c = fast_rand52();
            // reject collisions
            for (n = 0; n < i + 1; n++) {
                try_again = (temp[n] == c) || try_again;
            }
            temp[i] = c;
        }
    }

    for (int player = 0; player < players; player++) {
        // hole cards according to distribution
        i = 5 + 2 * player;
        bool try_again = true;
        while (try_again == true) {
            try_again = false;
            // weighted-sample c1 and c2 at once
            // h is a number < 1325
            int h = weighted_randi(&weights[player][0], HOLE_CARDS);
            // i2h uses h and sets temp[i] to the 2 cards implied by h
            i2h(&temp[i], h);
            // reject collisions
            for (n = 0; n < i; n++) {
                try_again = (temp[n] == temp[i]) || (temp[n] == temp[i+1]) || try_again;
            }
        }
    }

    copy_cards(table, temp, 5);
    copy_cards(holes, temp + 5, 2 * players);
}

My problem? The weighted sampling algorithm is a factor of 10 slower. Speed is very important for my application.

Is there a way to improve the speed of my algorithm to something more reasonable? Am I doing something wrong in my implementation?

Thanks.

Edit: I was asked about this function, which I should have posted, since it is key:

inline int weighted_randi(double *w, int num_choices) {
    double r = fast_randd();
    double threshold = 0;
    int n;

    for (n = 0; n < num_choices; n++) {
        threshold += *w;
        if (r <= threshold) return n;
        w++;
    }
    // shouldn't get this far
    cerr << n << "\t" << threshold << "\t" << r << endl;
    assert(n < num_choices);
    return -1;
}

...and i2h() is basically just an array lookup.
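
For illustration, a lookup of that kind could be built roughly like this (a sketch only, not my exact code; it assumes cards are numbered 0-51 and that HOLE_CARDS is the number of distinct two-card combinations, C(52,2) = 1326):

// Illustrative only: a precomputed table mapping a hole index h to its two cards.
static int hole_table[1326][2];

void init_hole_table() {
    int h = 0;
    for (int c1 = 0; c1 < 52; c1++)
        for (int c2 = c1 + 1; c2 < 52; c2++) {
            hole_table[h][0] = c1;
            hole_table[h][1] = c2;
            h++;
        }
}

// i2h: write the two cards implied by index h into dest[0] and dest[1].
inline void i2h(int *dest, int h) {
    dest[0] = hole_table[h][0];
    dest[1] = hole_table[h][1];
}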

+1  A: 

Your collision-rejection loops are turning an O(n) algorithm into (I think) an O(n^2) operation.

There are two ways to select cards from a deck: shuffle and pop, or pick repeatedly until the elements of the set are unique; you are doing the latter, which requires a considerable amount of backtracking.

I didn't look at the details of the code, just a quick scan.
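
A rough sketch of the shuffle-and-pop idea, in case it helps (untested; it draws only as many cards as it needs via a partial Fisher-Yates shuffle, and uses std::mt19937 as a stand-in for whatever RNG you prefer):

#include <algorithm>
#include <random>
#include <utility>

void sample_allcards_shuffled(int table[5], int holes[], int players) {
    static int deck[52];
    static bool deck_init = false;
    if (!deck_init) {
        for (int i = 0; i < 52; i++) deck[i] = i;
        deck_init = true;
    }

    static std::mt19937 rng(12345);   // stand-in RNG; seed/replace as appropriate
    const int k = 5 + 2 * players;

    // Partial Fisher-Yates: after step i, deck[0..i] holds i+1 distinct cards.
    for (int i = 0; i < k; i++) {
        std::uniform_int_distribution<int> pick(i, 51);
        std::swap(deck[i], deck[pick(rng)]);
    }

    std::copy(deck, deck + 5, table);
    std::copy(deck + 5, deck + k, holes);
}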

msw
Indeed, `std::random_shuffle` should be both simpler and unbiased (insofar as the underlying random number generator - `rand() % x` by default? - is unbiased).
visitor
Thanks; this is a good observation about the first algorithm, and I may well incorporate your insight. However, it entirely avoids the question: how do I weight the selection of cards?
supert
Just to clarify, I'm using a Mersenne twister RNG, not rand(), but I could implement my own shuffle of course.
supert
Using random_shuffle resulted in a ~30% improvement for the uniformly sampled code (the first routine).
supert
A: 

My guess would be the memcpy(1326*sizeof(double)) within the retry loop. It doesn't seem to change, so does it need to be copied each time?

stefaanv
Good spot, thanks, not sure how I missed that. I just tried moving it out of the loop, it didn't help much.
supert
Did you move it out of both nested loops (for and while) or is that not possible?
stefaanv
I eliminated the memcpy by simply doing `int h = weighted_randi(&weights[player][0], HOLE_CARDS);`, as in the code above.
supert
Too bad that didn't make a difference (but it did simplify the code a bit). My best bet would be to also have a look at weighted_randi() and i2h(). When nothing apparent can be changed, measure (profile, or do as Mike suggested).
stefaanv
+1  A: 

You could gain some speed by replacing all the loops that check whether a card is taken with a bit mask, e.g. for a pool of 52 cards, we prevent collisions like so:

DWORD dwMask[2] = {0}; // 64 bits, one bit per card
//...
int nCard;
while(true)
{
    nCard = rand_52();
    // nCard >> 5 selects the DWORD, 1 << (nCard & 31) the bit within it
    if(!(dwMask[nCard >> 5] & 1 << (nCard & 31)))
    {
        dwMask[nCard >> 5] |= 1 << (nCard & 31);
        break;
    }
}
//...
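
The same mask can also replace the inner collision loop of the weighted routine. A hypothetical adaptation, reusing the names from the question (an untested sketch, not drop-in code):

// mark the 5 table cards in dwMask first, then for each player at offset i:
while(true)
{
    int h = weighted_randi(&weights[player][0], HOLE_CARDS);
    i2h(&temp[i], h);
    int c1 = temp[i], c2 = temp[i + 1];
    // accept only if neither card has been dealt yet
    if(!(dwMask[c1 >> 5] & 1 << (c1 & 31)) && !(dwMask[c2 >> 5] & 1 << (c2 & 31)))
    {
        dwMask[c1 >> 5] |= 1 << (c1 & 31);
        dwMask[c2 >> 5] |= 1 << (c2 & 31);
        break;
    }
}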
Necrolis
A: 

Rather than tell you what the problem is, let me suggest how you can find it. Either 1) single-step it in the IDE, or 2) randomly halt it to see what it's doing.

That said, sampling by rejection, as you are doing, can take an unreasonably long time if you are rejecting most samples.

Mike Dunlavey
A: 

Your inner "try_again" for loop should stop as soon as it sets try_again to true - there's no point in doing more work after you know you need to try again.

for (n = 0; n < i && !try_again; n++) {
    try_again = (temp[n] == temp[i]) || (temp[n] == temp[i+1]);
}
Mark B
A: 

The second question, about picking from a weighted set, also has an algorithmic replacement that should be less time complex. It is based on the principle that whatever is pre-computed does not need to be re-computed.

In an ordinary (uniform) selection you have equally sized bins, which makes picking a bin an O(1) operation. Your weighted_randi function has bins of real-valued width, so selection in your current version runs in O(n) time. You don't say (but do imply) that the vector of weights w is constant, so I'll assume that it is.

You aren't interested in the widths of the bins per se; you are interested in the locations of their edges, which you re-compute on every call to weighted_randi via the running variable threshold. If w really is constant, pre-computing the list of edges (that is, the value of threshold at each element of w) is your O(n) step, and it need only be done once. Since the results form a naturally ordered (non-decreasing) list, a binary search on all future calls yields O(log n) time complexity, at an increase in space of only sizeof w / sizeof w[0] extra values.
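
A minimal sketch of that idea (it assumes the per-player weight vector really is constant between calls; std::lower_bound does the binary search, and fast_randd is the questioner's own RNG):

#include <algorithm>
#include <vector>

// Pre-compute the bin edges (cumulative sums) once per constant weight vector.
std::vector<double> make_edges(const double *w, int num_choices) {
    std::vector<double> edges(num_choices);
    double threshold = 0;
    for (int n = 0; n < num_choices; n++) {
        threshold += w[n];
        edges[n] = threshold;
    }
    return edges;
}

// O(log n) replacement for weighted_randi: index of the first edge >= r.
inline int weighted_randi_edges(const std::vector<double> &edges, double r) {
    return static_cast<int>(std::lower_bound(edges.begin(), edges.end(), r) - edges.begin());
}

// usage, with edges built once up front for each player:
//   int h = weighted_randi_edges(player_edges, fast_randd());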

msw