Problem: I cannot understand the number 256 (2^8) in this extract from the IBM article:

On the other hand, if it's a big-endian system, the high byte is 1 and the value of x is 256.

Assume each element in the array consumes 4 bits; then the processor should somehow read 1000 0000. If it is big-endian, it is 0001 0000, because endianness does not affect bits inside bytes. [2] That contradicts the 256 in the article!?

Question: Why is the number 256_dec (=1000 0000_bin) and not 32_dec (=0001 0000_bin)?

[2] Endian issues do not affect sequences that have single bytes, because "byte" is considered an atomic unit from a storage point of view.

+1  A: 

Because a byte is 8 bits, not 4. The 9th least significant bit in an unsigned int has value 2^(9-1) = 256 (the least significant bit has value 2^(1-1) = 1).
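To see those bit values concretely, here is a tiny C sketch (nothing is assumed beyond what the standard guarantees, namely that an unsigned int has at least 16 bits):

#include <stdio.h>

int main(void)
{
    /* Bit k, counting from 0 at the least significant end, has value 2^k,
       so bit 0 is worth 1 and bit 8 (the 9th least significant bit) is worth 256. */
    printf("%u\n", 1u << 0);   /* prints 1   */
    printf("%u\n", 1u << 8);   /* prints 256 */
    return 0;
}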

From the IBM article:

unsigned char endian[2] = {1, 0};
short x;

x = *(short *) endian;

They're correct; the value is (short)256 on big-endian, or (short)1 on little-endian.

Writing out the bits, the array is {0000 0001, 0000 0000} in binary. A big-endian system interprets those bytes read left to right, with the first byte as the most significant; a little-endian system swaps the two bytes.
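If you want to try this on your own machine, here is a minimal, self-contained sketch of the same test. It assumes a 2-byte short (true on essentially every current platform) and uses memcpy rather than the article's pointer cast, purely to sidestep aliasing concerns; the result is the same.

#include <stdio.h>
#include <string.h>

int main(void)
{
    unsigned char endian[sizeof(short)] = {1, 0};  /* the article's two bytes */
    short x;

    memcpy(&x, endian, sizeof x);   /* reinterpret the raw bytes as a short */

    if (x == 1)
        printf("little-endian: low byte stored first, x = %d\n", x);
    else if (x == 256)
        printf("big-endian: high byte stored first, x = %d\n", x);
    else
        printf("unexpected value (short is probably not 2 bytes), x = %d\n", x);

    return 0;
}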

wrang-wrang
Thank you! My initial assumption about C arrays was clearly poor (dunno whether it varies?). Is the default size of an element in an array always a byte in the most popular programming languages?
Masi
A "char" is often 1,2,or 4 bytes, depending on the language. An "integer" is most often 4 but could be 8. Languages that use a "char" of more than 1 byte will usually have a "byte" type you can refer to instead.
wrang-wrang
In statically typed languages, the size of an array's elements depends on their type. In dynamic languages (e.g. Python, Lisp), the actual memory used to store even the smallest individual value is a minimum of 8-20 bytes, depending on the platform. And in C, a char is always 1 byte, and that byte is almost always (but not guaranteed to be) 8 bits.
wrang-wrang
Actual computer hardware these days almost always addresses the physical memory in units of bytes (chunks of 8 bits). Maybe that's what you were thinking of.
wrang-wrang
+1  A: 

Answering your follow-up question: briefly, there is no "default size of an element in an array" in most programming languages.

In C (perhaps the most popular programming language), the size of an array element -- or anything, really -- depends on its type. For an array of char, each element is exactly 1 byte (sizeof(char) is 1 by definition). For other types, the size of each element is whatever the sizeof() operator gives. For example, many C implementations give sizeof(short) == 2, so an array of short occupies 2*N bytes of memory, where N is the number of elements.
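As a rough illustration, here is a sketch that prints a few of those sizes on whatever system compiles it. Apart from sizeof(char), which is 1 by definition, the numbers are implementation-defined, so the comments give typical values rather than guarantees.

#include <stdio.h>

int main(void)
{
    short a[10];   /* an array of N = 10 shorts */

    printf("sizeof(char)  = %zu\n", sizeof(char));   /* always 1           */
    printf("sizeof(short) = %zu\n", sizeof(short));  /* commonly 2         */
    printf("sizeof(int)   = %zu\n", sizeof(int));    /* commonly 4         */
    printf("sizeof(a)     = %zu\n", sizeof(a));      /* sizeof(short) * 10 */

    return 0;
}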

Many high-level languages discourage you from even attempting to discover how many bytes an element of an array requires. Committing to a fixed number of bytes ties the designers' hands: it is good for transparency and for code that relies on the binary representation, but it makes it hard to change the representation later without breaking that code.

Hope that helps. (I didn't see the other comments until after I wrote the first version of this.)

Brian C. Wells
+1 for various perspectives.
Masi
+1  A: 

256_dec is not 1000_0000_bin; it is 0000_0001_0000_0000_bin.

With swapped bytes (1 byte = 8 bits), this looks like 0000_0000_0000_0001_bin, which is 1_dec.
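To make that concrete, here is a small sketch (assuming a 2-byte unsigned short) that prints the bytes of 256 as they sit in memory, and the value you get back after swapping those two bytes:

#include <stdio.h>
#include <string.h>

int main(void)
{
    unsigned short x = 256;          /* 0000_0001_0000_0000 in binary */
    unsigned char bytes[sizeof x];
    unsigned short swapped;

    memcpy(bytes, &x, sizeof x);     /* look at the raw bytes of x in memory */
    printf("256 is stored as the bytes: %u %u\n",
           (unsigned)bytes[0], (unsigned)bytes[1]);
    /* a little-endian machine prints "0 1"; a big-endian machine prints "1 0" */

    swapped = (unsigned short)((x >> 8) | (x << 8));   /* swap the two bytes */
    printf("byte-swapped value: %u\n", (unsigned)swapped);   /* prints 1 */

    return 0;
}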

sth
True, I should have noticed that. Thank you for your concise reply. +1
Masi