ansaurus

Question

How to define and work with an array of bits in C?

Answer 1

+2 A:

You can use & (bitwise and) and << (left shift).

For example, (1 << 3) results in "00001000" in binary. So your code could look like:

char eightBits = 0;

//Set the 5th and 6th bits from the right to 1
eightBits &= (1 << 4);
eightBits &= (1 << 5);
//eightBits now looks like "00110000".

Then just scale it up with an array of chars and figure out the appropriate byte to modify first.

For more efficiency, you could define a list of bitfields in advance and put them in an array:

#define BIT8 0x01
#define BIT7 0x02
#define BIT6 0x04
#define BIT5 0x08
#define BIT4 0x10
#define BIT3 0x20
#define BIT2 0x40
#define BIT1 0x80

char bits[8] = {BIT1, BIT2, BIT3, BIT4, BIT5, BIT6, BIT7, BIT8};

Then you avoid the overhead of the bit shifting and you can index your bits, turning the previous code into:

eightBits &= (bits[3] & bits[4]);

Alternatively, if you can use C++, you could just use an std::vector<bool> which is internally defined as a vector of bits, complete with direct indexing.

David 2010-03-26 17:34:10

Using `std::vector<bool>` won't get him optimal performance, since he'll end up having two lookups to get one pair of bits. Whether this penalty is sufficient to justify creating his own variation of `std::vector<bool>` is dependent upon whether the lookups (and assignments) themselves are a bottleneck.

Brian 2010-03-26 18:13:18

Assuming C++ were an option (the OP only mentioned C) I'd not hesitate to start off with an `std::vector<bool>`, simply for conciseness and readability. If I then needed better performance, I'd profile to find out where the bottleneck was. (It could very well be in rand() and not the vector lookup).

David 2010-03-26 18:30:52

Instead of `char bits[8] = { ... };` you could do `#define bits(x) BIT##x`.

Chris Lutz 2010-03-26 19:55:19

Eddy 2010-03-27 17:40:06

I need to create a very large array, with more than 'max_size of int' boolean values/bits. Is this possible with vector<bool> or bitset?

Eddy 2010-03-27 17:41:47

Answer 2

A:

It's a trade-off:

(1) use 1 byte for each 2 bit value - simple, fast, but uses 4x memory

(2) pack bits into bytes - more complex, some performance overhead, uses minimum memory

If you have enough memory available then go for (1), otherwise consider (2).

Paul R 2010-03-26 17:35:31

@Paul: No, it uses 4x as much memory, since he would be storing 2bit numbers in 1 byte. However, I think from the OP's question that he has already made a decision to go with (2).

Brian 2010-03-26 17:44:33

@Brian: Thanks - I missed that part - I'll update my answer accordingly.

Paul R 2010-03-26 19:12:00

Answer 3

A:

Useful links to have around when dealing with bits:

Bit Twiddling hacks
Using unions with bitfields

Martin Beckett 2010-03-26 18:02:08

Answer 4

+1 A:

typedef unsigned long bfield_t[ size_needed/sizeof(long) ];
// long because that's probably what your cpu is best at
// The size_needed should be evenly divisable by sizeof(long) or
// you could (sizeof(long)-1+size_needed)/sizeof(long) to force it to round up

Now, each long in a bfield_t can hold sizeof(long)*8 bits.

You can calculate the index of a needed big by:

bindex = index / (8 * sizeof(long) );

and your bit number by

b = index % (8 * sizeof(long) );

You can then look up the long you need and then mask out the bit you need from it.

result = my_field[bindex] & (1<<b);

or

result = 1 & (my_field[bindex]>>b); // if you prefer them to be in bit0

The first one may be faster on some cpus or may save you shifting back up of you need to perform operations between the same bit in multiple bit arrays. It also mirrors the setting and clearing of a bit in the field more closely than the second implemention. set:

my_field[bindex] |= 1<<b;

clear:

my_field[bindex] &= 1<<b;

You should remember that you can use bitwise operations on the longs that hold the fields and that's the same as the operations on the individual bits.

You'll probably also want to look into the ffs, fls, ffc, and flc functions if available. ffs should always be avaiable in . It's there just for this purpose -- a string of bits. Anyway, it is find first set and essentially:

int ffs(int x) {
    int c = 0;
    while (!(x&1) ) {
        c++;
        x>>=1;
    }
    return c; // except that it handles x = 0 differently
}

This is a common operation for processors to have an instruction for and your compiler will probably generate that instruction rather than calling a function like the one I wrote. x86 has an instruction for this, by the way. Oh, and ffsl and ffsll are the same function except take long and long long, respectively.

nategoose 2010-03-26 19:52:32

ansaurus

tags:

views:

answers:

How to define and work with an array of bits in C?

related questions