ansaurus

Question

How to get checksums for strided patterns

Answer 1

+2 A:

dionadar 2009-02-28 19:38:36

Nice. Clean. OTOH, I need something really blazing fast, as this is part of the innermost loop of a tree search algorithm.

BCS 2009-02-28 19:58:15

This doesn't work. He only wants four bits in each sum. Also, the % operator is best avoided, as it is usually terribly slow.

UncleO 2009-02-28 20:06:04

I do not see anything that indicates that he would only want 4 bits per sum: "[...] compute the sum of the bits at [...]". Yeah, % may be rather slow, and "c[i] ...; if(++i == m) i = 0;" will probably be faster (although you then need to zero i out between the loops).

dionadar 2009-02-28 21:43:36
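
For reference, the wrap-around counter suggested in the comment above might look roughly like this in C. This is only a sketch: the name `residue_sums` and its parameters are made up for illustration, bit 0 is assumed to be the least significant bit, and it adds up every bit in each residue class rather than exactly four, which is the objection raised elsewhere in this thread.

```c
#include <stdint.h>

/* Adds each of the low nbits bits of x into c[k mod m].
   The index i tracks k mod m with a compare-and-reset instead of %.
   Hypothetical helper, not code from the original answer. */
static void residue_sums(uint64_t x, int nbits, int m, unsigned *c)
{
    int i = 0;                              /* always equal to k mod m */
    for (int k = 0; k < nbits; k++) {
        c[i] += (unsigned)((x >> k) & 1u);  /* bit k as 0 or 1 */
        if (++i == m)
            i = 0;                          /* wrap around without % */
    }
}
```

The caller is expected to zero `c[0..m-1]` first; with `x = 0x2B61`, `nbits = 16` and `m = 3` the three totals come out as 3, 1 and 3.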

% is now removed; the whole thing is now pointer-based instead of array-based.

dionadar 2009-02-28 21:55:04

A little oops... after your rework of the question I understood what you wanted the first time >.>

dionadar 2009-02-28 23:23:40

In looking at that it still has the problem that c[0] is the sum of 6 bits and I want exactly 4.

BCS 2009-03-01 01:45:50

I probably have an obnoxious amount of fun with this^^

dionadar 2009-03-01 04:55:25

I have the same vice. :b

BCS 2009-03-01 07:48:59

could you post all 3 solutions? Just the inner code would do

BCS 2009-03-01 07:51:11

I posted the whole "application" over there: http://pastebin.com/f5ce069b2

dionadar 2009-03-01 17:58:53

The full code of the updated test application can be found over there: http://pastebin.com/f573c0c2

dionadar 2009-03-03 02:12:40

If you are willing to spend ~0.5kb of memory, you can speed your perfect hash solution up to be the fastest of all (~1500 ms): you just need to precompile all 586 possible solutions and leave out your modulo 30.

dionadar 2009-03-03 23:51:18

how do you get 586 solutions?

BCS 2009-03-05 06:54:14

take your perfect hash solution, remove the "% 30" and make arr large enough to allow for arr[0x1001001001]

dionadar 2009-03-05 18:13:01

Answer 2

+1 A:

A suggestion that I don't want to code right now is to use a loop, an array to hold partial results, and constants to pick up the bits m at a time.

loop 
   s[3*i] += !!(x & (1 << 0));
   s[3*i+1] += !!(x & (1 << 1));
   s[3*i+2] += !!(x & (1 << 2));
   x >>= 3;

This will pick too many bits in each sum. But you can also keep track of the intermediate results and subtract from the sums as you go, to account for the bit that may not be there anymore.

loop 
   s[3*i] += p[3*i]   = !!(x & (1 << 0));
   s[3*i+1] += p[3*i+1] = !!(x & (1 << 1));
   s[3*i+2] += p[3*i+2] = !!(x & (1 << 2));

   s[3*i] -= p[3*i-10];
   s[3*i+1] -= p[3*i-9];
   s[3*i+2] -= p[3*i-8];
   x >>= 3;

with the appropriate bounds checking, of course.
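
One way the add-then-subtract idea could be written out in full is sketched below. It is not the answerer's code: it assumes that sum n covers bits n, n+3, n+6 and n+9 (bit 0 being the least significant), that only the low `nbits` bits of `x` are in use, and the names `strided_sums_sliding`, `STRIDE` and `TAPS` are invented for this example. It keeps one running sum per residue class and drops the bit that falls out of the four-bit window.

```c
#include <assert.h>
#include <stdint.h>

#define STRIDE 3  /* m, fixed at 3 in this thread */
#define TAPS   4  /* bits per sum */

/* Writes s[n] = bit(n) + bit(n+3) + bit(n+6) + bit(n+9) for every n whose
   four taps fit in the low nbits bits of x. Returns the number of sums. */
static int strided_sums_sliding(uint64_t x, int nbits, unsigned char *s)
{
    const int span = STRIDE * (TAPS - 1);  /* first tap to last tap: 9 */
    unsigned char run[STRIDE] = {0};       /* one running sum per residue */
    int r = 0;                             /* k mod STRIDE, kept without % */

    assert(nbits >= 0 && nbits <= 64);
    for (int k = 0; k < nbits; k++) {
        /* bit k enters the window of its residue class */
        run[r] += (unsigned char)((x >> k) & 1u);
        /* the bit STRIDE * TAPS positions back leaves the window */
        if (k >= STRIDE * TAPS)
            run[r] -= (unsigned char)((x >> (k - STRIDE * TAPS)) & 1u);
        /* once four taps are in, the window ending at k belongs to sum k - span */
        if (k >= span)
            s[k - span] = run[r];
        if (++r == STRIDE)
            r = 0;
    }
    return nbits > span ? nbits - span : 0;
}
```

For `x = 0x2B61` and `nbits = 16` this returns 7 and fills `s` with 3, 0, 3, 2, 1, 3, 2. The caller has to provide room for `nbits - 9` results.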

The fastest approach is to just hardcode the sums themselves.

s[0] = !!(x & (1<<0)) + !!(x & (1<<3)) + !!(x & (1<<6)) + !!(x & (1<<9));

etc. (The shifts occur at compile time.)
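
Written out completely, the hardcoded variant might look like the sketch below. To keep it short it assumes a 16-bit input, which gives seven sums; the width, the `BIT` macro and the function name `strided_sums_unrolled16` are choices made for this example, not part of the answer. The masks are built from a 64-bit constant so that the same pattern would still be valid for bit positions above 31.

```c
#include <stdint.h>

/* 1 if bit k of x is set, else 0. The mask is a compile-time constant. */
#define BIT(x, k) ((unsigned)(((x) & (UINT64_C(1) << (k))) != 0))

/* Unrolled sums for a 16-bit input: s[n] = bits n, n+3, n+6 and n+9.
   Hypothetical helper; a wider input just needs more lines like these. */
static void strided_sums_unrolled16(uint64_t x, unsigned char s[7])
{
    s[0] = (unsigned char)(BIT(x, 0) + BIT(x, 3) + BIT(x, 6) + BIT(x, 9));
    s[1] = (unsigned char)(BIT(x, 1) + BIT(x, 4) + BIT(x, 7) + BIT(x, 10));
    s[2] = (unsigned char)(BIT(x, 2) + BIT(x, 5) + BIT(x, 8) + BIT(x, 11));
    s[3] = (unsigned char)(BIT(x, 3) + BIT(x, 6) + BIT(x, 9) + BIT(x, 12));
    s[4] = (unsigned char)(BIT(x, 4) + BIT(x, 7) + BIT(x, 10) + BIT(x, 13));
    s[5] = (unsigned char)(BIT(x, 5) + BIT(x, 8) + BIT(x, 11) + BIT(x, 14));
    s[6] = (unsigned char)(BIT(x, 6) + BIT(x, 9) + BIT(x, 12) + BIT(x, 15));
}
```

There is no loop and no index arithmetic at run time here, at the cost of one line per sum. For `x = 0x2B61` the seven results are 3, 0, 3, 2, 1, 3, 2.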

UncleO 2009-02-28 20:03:30

Similar to the initial idea, this is not a very generic answer; if implemented, it needs two loops. One is the one you wrote; the other goes from 0 to m to do the adding and subtracting.

dionadar 2009-02-28 20:05:56

It doesn't have to be generic. m is fixed at 3. What do you mean by the second loop?

UncleO 2009-02-28 20:11:31

ok, dump that, i just read the comment on the question

dionadar 2009-02-28 22:18:44

The hard-coded version is fast for each sum by itself. What about all of them together?

BCS 2009-03-01 07:53:08
