ansaurus

Question

What does the compiler do with a[i] when a is an array? And what if a is a pointer?

Answer 1

+7 A:

a is a pointer to an array of chars. p is a pointer to a char which, in this case, happens to point at a string literal.

movl    $1819043176, -14(%ebp)
movw    $111, -10(%ebp)

Initializes the local "hello" on the stack (that's why it is referenced through ebp). Since "hello" takes more than 4 bytes, it takes two instructions.

movzbl  -11(%ebp), %eax
movsbl  %al,%eax

References a[3]: the two-step process is because of a limitation in terms of access to the memory referenced through ebp (my x86-fu is a bit rusty).

movl -8(%ebp), %eax does indeed reference the p pointer.

LC0 references a "relative memory" location: a fixed memory location will be allocated once the program is loaded in memory.

movsbl %al,%eax means "move with sign extension, byte to long": it copies the byte in al into eax and sign-extends it to 32 bits (give or take... I am a bit rusty on this front). al represents the low byte of the register eax.

jldupont 2010-01-15 16:40:08

So you mean a is also a pointer? But I was told that the type of a is array.

ibread 2010-01-15 17:28:41

@ibread: when it gets down to the assembly level, there is no concept of array really, just accessible memory through pointers etc.

jldupont 2010-01-15 17:33:08

... and why the "drive-by down-vote without comment", please??

jldupont 2010-01-15 18:00:21

@jldupont But I think maybe it's better to describe "a" as the name of the block of memory which contains "hello"? After all, "a" is never dereferenced at the assembly level.

ibread 2010-01-15 18:15:14

I'm confused.... what does "drive-by down-vote without comment" mean? I did not do anything but up-vote your answer...

ibread 2010-01-15 18:16:33

@jldupont I think it's clearer to me after reading your explanation. I've added another 2 questions to the original post; would you please show me the answers? Thank you in advance~

ibread 2010-01-15 18:21:39

@ibread: I don't think the "drive-by down-vote" comment was directed at you. It was directed at whoever downvoted this answer.

Fred Larson 2010-01-15 18:22:46

@ibread: somebody down-voted my contribution **without** providing an explanation as to why. This is unfortunately quite common on SO... and childish. We are here as a community, trying to better ourselves. If folks do not explain **why** my contribution is faulty, then we/I cannot learn.

jldupont 2010-01-15 18:24:04

@ibread: you cannot keep adding sub-questions to your post. This isn't how SO works. Please post another question to let others have the chance to contribute further. Also, it is good practice to accept an answer. Cheers.

jldupont 2010-01-15 18:25:25

@jldupont Thank you very much for the reminder. But... my further question is related to this one, and ... should I copy all the content in this post to another one? And I'm still confused about what "movsbl %al,%eax" does. Is it necessary, since p[3] has already been retrieved via "movzbl (%eax), %eax"?

ibread 2010-01-15 18:34:49

@ibread: I'll make an exception ;-) (just kidding of course). "movsbl %al,%eax" transfers a single 8-bit byte to the "eax" register and sign-extends it into the rest of the register (if memory serves me right)... in other words, a char widened to an int. It facilitates working on this particular "char" from an assembly/machine code point of view. Was that the question?

jldupont 2010-01-15 18:38:01

Answer 2

+2 A:

While it is true that arrays are not pointers, they behave very similarly. In both cases the compiler internally stores an address to a typed element, and in both cases there can be one, or more than one element.

In both arrays and pointers, when dereferenced by the [] operator, the compiler evaluates the address of the element you are indexing to by multiplying the index by the size of the data type and adding it to the address of the pointer or array.

The fundamental difference between pointers and arrays is that an array is essentially a reference. Whereas it is legal to initialize a pointer to null, or to change the value that a pointer stores, arrays cannot be null, and they cannot be set to other arrays; they are in essence constant pointers that cannot be set to null.

Additionally it is possible for arrays to be allocated on the stack, and that is not possible for pointers (although pointers can be set to addresses on the stack, but that can get ugly).

Chris 2010-01-15 16:45:32

"I wanted the structure not merely to characterize an abstract object but also to describe a collection of bits that might be read from a directory. Where could the compiler hide the pointer to name that the semantics demanded? Even if structures were thought of more abstractly, and the space for pointers could be hidden somehow, how could I handle the technical problem of properly initializing these pointers when allocating a complicated object, perhaps one that specified structures containing arrays containing structures to arbitrary depth? ...

Johannes Schaub - litb 2010-01-15 16:50:27

... The solution constituted the crucial jump in the evolutionary chain between typeless BCPL and typed C. It eliminated the materialization of the pointer in storage, and instead caused the creation of the pointer when the array name is mentioned in an expression. The rule, which survives in today's C, is that values of array type are converted, when they appear in expressions, into pointers to the first of the objects making up the array." (Dennis M. Ritchie on History of C: http://cm.bell-labs.com/cm/cs/who/dmr/chist.html)

Johannes Schaub - litb 2010-01-15 16:52:37

(Your answer sounds like a `char p[1];` needs more than 1 byte of storage, because you say the compiler would store the address of `p` into a "constant pointer", which is wrong indeed).

Johannes Schaub - litb 2010-01-15 17:07:37

You: "array is essentially a reference", "constant pointers that cannot be set to null". Wikipedia: "Once a reference is created, it cannot be later made to reference another object; it cannot be reseated. This is often done with pointers. References cannot be null, whereas pointers can"... humour abounds.

Alex Brown 2010-01-15 17:27:00

Answer 3

+2 A:

Getting on the language side of this, since the assembler side has already been handled:

Note this sentence: "an expression of the form a[i] causes the array to decay into a pointer, following the rule above, and then to be subscripted just as would be a pointer variable in the expression p[i] (although the eventual memory accesses will be different," I'm pretty confused by this: since a has decayed to a pointer, what does he mean by "memory accesses will be different"?

This is because, after decaying, access is the same for the array (now a pointer value) and the pointer. The difference is how that pointer value is obtained in the first place. Let's look at an example:

char c[1];

char cc;
char *pc = &cc;

Now, you have an array. This array does not take any storage other than one char! There is no pointer stored for it. And you have a pointer that points to a char. The pointer takes the size of one address, and you have one char that the pointer points to. Now let's look at what happens in the array case to get the pointer value:

c[0] = 'A';
// #1: equivalent: *(c + 0) = 'A';
// #2: => 'c' is not the operand of address-of or sizeof, so it decays
// #3: => take the address of 'c': this is the pointer value P1

The pointer case is different:

pc[0] = 'A';
// #1: equivalent: *(pc + 0) = 'A';
// #2: => pointer value is stored in 'pc'
// #3: => thus: read address stored in 'pc': This is the pointer value P1

As you see, in the array case we don't need to read from memory to get the pointer value to which the index (in this case a boring 0) is added, because the address of the array is already the pointer value needed. But in the pointer case, the pointer value we need is stored in the pointer: we need one read from memory to get that address.

After this, the path is equal for both:

// #4: add "0 * sizeof(char)" to P1. This is the address P2
// #5: store 'A' to address P2

Here is the assembler code generated for the array and the pointer case:

        add     $2, $0, 65  ; write 65 into r2
        stb     $2, $0, c   ; store r2 into address of c
# pointer case follows
        ldw     $3, $0, pc  ; load value stored in pc into r3
        add     $2, $0, 65  ; write 65 into r2
        stb     $2, $3, 0   ; store r2 into address loaded to r3

We can just store 65 (ASCII for 'A') at the address of c (which will be known already at compile or link time when it is global). For the pointer case, we will first have to load the address stored by it into register 3, and then write the 65 to that address.

Johannes Schaub - litb 2010-01-15 18:26:20
