views:

1007

answers:

6

I'm a beginner C programmer, and I assumed that this would be the case, but would like some affirmation if possible.

If they are the same, why not just take one argument instead?

+1  A: 

It is the same. The reason is because most of the time you want to use a sizeof operator as one of the arguments. If passing two parameters bother you, call malloc() which has a single argument.

Otávio Décio
You should note that malloc() won't zero the memory as calloc() does.
Eduard - Gabriel Munteanu
Well, that is true. Thanks for the heads up.
Otávio Décio
Is it true they're the same when you take packing into consideration? Is calloc(16,3) the same as calloc(3,16) on a machine which needs any data element to start on a 4-byte boundary?
ChrisW
@ChrisW: Yes, see http://www.opengroup.org/onlinepubs/000095399/functions/calloc.html: "The pointer returned if the allocation succeeds shall be suitably aligned so that it may be assigned to a pointer to any type of object and then used to access such an object or an array of such objects...".
paxdiablo
Doesn't that implies that the answer is "no: they're not the same"? That on a machine with 4-byte packing, for example, calloc(4,3) would actually allocate 16 bytes, whereas calloc(3,4) would allocate only 12 bytes?
ChrisW
ChrisW. calloc has to return a pointer that's suitable aligned for any type of object. that means it will always return a pointer to an address maximally aligned. the order, therefor, won't necessarily be changed because of aligning, at least i don't see any reason why so.
Johannes Schaub - litb
@ChrisW: calloc gives you the address of an array. The entire array may be aligned but not the individual elements.
PolyThinker
and given that an array can't have "holes" between its elements, you will end up having a maximally aligned array starting somewhere and having 4 * 3 == 3 * 4 bytes. so both cases are identically in that regard, i think.
Johannes Schaub - litb
I can't give this an upvote until it distinguishes between malloc() not initializing the memory and calloc() initializing the memory. That is crucial - if it wasn't, you'd use malloc() because it doesn't do the initialization.
Jonathan Leffler
@Jonathan - if you read the question you'll see that it refers solely to calloc. I just said if he wanted to use a single argument, to use malloc.
Otávio Décio
@ocdecio - Actually I think it could be misleading. Perhaps would be slightly better to say "Use a single argument wrapper on calloc" or "Use malloc followed by memset". Although clearly its not that big a problem.
Ali
+12  A: 

People mostly use allocation routines to allocate space for a set number of items, so calloc() allows that to be specified nicely. So, for example, if you want space for 100 integers or 20 of your own structure:

int *pInt = calloc (100, sizeof(int));
tMyStruct *pMyStruct = calloc (20, sizeof(tMyStruct));

This code actually looks slightly "nicer" than the equivalent malloc() calls:

int *pInt = malloc (100 * sizeof(int));
tMyStruct *pMyStruct = malloc (20 * sizeof(tMyStruct));

although, to seasoned C coders, there's no real distinction (other than the zero initialization of course).

I have to say I have never used calloc in the wild, since I'm almost always creating a struct where zero's don't make sense. I prefer to initialize all the fields manually to ensure I get the values I want.

paxdiablo
Thanks. Btw you need to change the second calloc in the bottom code to a malloc.
Ali
Ta, Damnable cut'n'paste operations ! :-)
paxdiablo
I agree - I've almost never used calloc(). I do sometimes create a static structure that is correctly default initialized, and then assign that to newly allocated structures - and do any residual fixups.
Jonathan Leffler
i've also never used it. i find it better to explicitly initialize elements too :)
Johannes Schaub - litb
+1  A: 

There is a slight distinction: Calloc can decide to zero-out the memory only as it is needed and then there's the advantage of knowing the size of the elements.

I can't name any implementations doing this, but it was thought for this.

As an example:

One callocates 4GB of memory, but the system has only 2GB: it wouldn't make sense to write 2GBs of zero into the virtual memory, therefore the system could set a dirty-flag on this memory to zero it out as it gets loaded into memory.

Georg
+4  A: 
mat_geek
Most programmers I have worked with assume they are the same. One project wrote a custom allocator with that assumption. When it was proven wrong, the team lead decided to fix it by adding a comment stating the second argument must be a sizeof or it'll cause a crash.
mat_geek
i believe you are wrong. both malloc and calloc must return a pointer suitable aligned for any object type. as it happens, that's also exactly what you have quoted there from the standard. assuming my believing is wrong, can you provide a link for further reading about this please?
Johannes Schaub - litb
anyway, if you change the order of the arguments, you never are guaranteed for any function whatsoever that exactly the same happens. implementations could do optimizations on their behalf always - as soon as the constraints put by the standard are not violated so that using code fails to work.
Johannes Schaub - litb
I agree with litb; this is an interesting theory, but not correct. The sizeof() operator deals with alignment issues in arrays. If sizeof() returns 6, then the type can be allocated at successive 6 byte slots; if it couldn't (because of an alignment restriction on a float), then sizeof() says 8...
Jonathan Leffler
i so hate it that we can't edit comments in place :) anyway - s,as soon,as long, :) and thanks Jonathan that's exactly what i mean. padding at the end of the struct deals with aligning issues for elements so they can be put next after each other.
Johannes Schaub - litb
`calloc()` need not factor in alignment of objects within the array. As Jonathan mentions - that's taken into account by the sizeof() the object. If an implementation did do something like that, it would just end up reserving more memory than needed. If you depend on that behavior, you have a bug.
Michael Burr
See also the comp.lang.c FAQ, http://c-faq.com/malloc/calloc.html
Lars Wirzenius
+3  A: 

Despite the accepted answer (which I believe to be correct), there seems to be confusions about how many bytes are allocated due to alignment. So here's a little test on my 32-bit Linux with gcc-4.3:

#include <stdio.h>
#include <stdlib.h>

int main()
{
  char* p1 = calloc(6, 4);
  char* p2 = calloc(4, 6);
  char* p3 = calloc(1,1);
  printf("%p, %p, %p\n", p1, p2, p3);
  return 0;
}

The result is:

0x826b008, 0x826b028, 0x826b048

which shows that both calloc(6,4) and calloc(4,6) allocate the same amount of memory, which is rounded to 32 bytes on my system. Changing the numbers to calloc(3,4) and calloc(4,3) will give the following result:

0x95d2008, 0x95d2018, 0x95d2028

which shows that 16 bytes are reserved when 12 are requested and allocated to the program. In either case, both calloc(a,b) and calloc(b,a) calls have the same effect on the memory usage.


Added by Jonathan Leffler because 300 characters is never going to be enough.

Consider this program, which leaks memory like a veritable sieve, but demonstrates a point:

#include <stdlib.h>
#include <stdio.h>

int main()
{
    int i, j, k;

    for (i = 1; i < 17; i++)
        for (j = 1; j < 9; j++)
            for (k = 0; k < 4; k++)
                printf("(%2d,%d)%d: 0x%08X\n", i, j, k, calloc(i, j));
    return(0);
}

On Windows, under Cygwin, this starts by allocating blocks that are 16 bytes apart (actually, the second block is 24 bytes after the first, but thereafter, they are 16 bytes apart). When allocating (2,7), the block addresses start incrementing by 24 bytes; likewise, (3,4) allocates blocks 16 bytes apart, but (3,5) allocates blocks 24 bytes apart. And, for the record, both (4,6) and (6,4) return pointers 32 bytes apart.

This simply demonstrates that there is some overhead associated with an allocation call. If you look at the archetypal implementation of malloc() et al in K&R, you will see that the size of the block is stored ahead of the memory that you're entitled to use. Different implementations do these things differently; those worried about memory trampling will avoid storing control data near where the user can wreak havoc.

When you calloc(4,6), you only have reliable access to 24 bytes of data. Even if your implementation gives you return values that are 32 bytes apart, you may not safely use any more than the 24 bytes you requested. And debugging versions of malloc() will observe if you write out of the bounds you requested.

PolyThinker
That just shows that there is at least 4 bytes of overhead and that allocations are quantized. Each memory allocation normally includes some overhead (for example, the size of the block that was allocated is written just before the returned pointer - details vary by implementation).
Jonathan Leffler
Points taken. I guess I didn't state it clearly but I wanted to show that the `effect' of calloc(a,b) and calloc(b,a) is exactly the same.
PolyThinker
+2  A: 

To the excellent responses posted, I want to add one more point of difference between using calloc(nelem, elsize) versus malloc(nelem * elsize): quality implementations of calloc will ensure that if your nelem and elsize were big enough to cause an integer overflow when multiplied together, it will fail rather than cause an undersized allocation, as a naive malloc invocation would.

Just this feature alone would be enough for me to prefer calloc to malloc. Background reading.

Chris Jester-Young