views:

180

answers:

8

I was advised a while ago that is was common place to use std::vector as exception safe dynamic array in c++ rather than allocating raw arrays... for example

{
    std::vector<char> scoped_array (size);
    char* pointer = &scoped_array[0];

    //do work

} // exception safe deallocation

I have used this convention multiple times with no problems, however I have recently ported some code to Win32 VisualStudio2010 (previously it was only on MacOS/Linux) and my unit tests are breaking (stdlib throws an assert) when the vector size happens to be zero.

I understand that writing to an such an array would be a problem, but this assumption breaks this solution as a replacement to raw pointers. Consider the following functions with n = 0

void foo (int n) {
   char* raw_array = new char[n];
   char* pointer = raw_array;
   file.read ( pointer , n );
   for (int i = 0; i < n; ++i) {
      //do something
   }
   delete[] raw_array;
}

While arguably redundant, the above code is perfectly legal (I believe), while the below code will throw an assertion on VisualStudio2010

void foo (int n) {
   std::vector<char> scoped_array (n);
   char* pointer = &scoped_array[0];
   file.read ( pointer , n );
   for (int i = 0; i < n; ++i) {
  //do something
   }
}

Was I using undefined behavior all along? I was under the impression operator[] did no error checking, and this was a valid use of std::vector<>. Has anyone else encountered this problem?

--edit: Thanks for all the useful responses, in reply to people saying it is undefined behavior. Is there a way to replace the raw array allocation above which will work with n = 0?

While saying that checking for n=0 as an exceptional case will solve the problem (it will). There are many patters where no special case is needed (such as the raw pointer example above) so maybe using something other than std::vector<> would be needed?

A: 

MVS does range checking in operator[] even in release builds. I don't know if it's standard compliant. (I actually found debug code in their implementation which made their implementation break correct code). There is a switch to disable it though.

ybungalobill
It's not per the standard but it's very helpful in debugging unfinished code quickly. Unfortunately this resulted in the need to tune a tangle of macros to get desired behaviour - I don't know if this is any cleaner in later versions (VC9/VC10).
Steve Townsend
ybungalobill
+4  A: 

With n==0 it is undefined for both array of char and vector.

For new char[0], according to 3.7.3.1/2

The effect of dereferencing a pointer returned as a request for zero size is undefined.

EDIT: It is of course, defined until dereferencing actually takes place, if file here is std::istream, then file.read() won't attempt to dereference it since n is zero.

For the vector, a[n] is equivalent to *(a.begin() + n), according to 23.1.1/12, If the container is empty, then begin() == end(), according to 23.1/7, which means scoped_array[0] attempted to dereference a past-the-end iterator.

Cubbi
Akusete
Akusete is correct, there is not lvalue-to-rvalue conversion in `raw[0]`.
avakar
@Akusete @avakar: you're both right, edited to clarify.
Cubbi
A: 

If you want to get cleaner behaviour in this scenario you could replace use of a[0] with use a.at(0), which will throw if the index is invalid.

A pragmatic solution would be to init vector with n+1 entries and constrain access to 0..n-1 (as this code already does).

void foo (int n) {
   std::vector<char> scoped_array (n+1);
   char* pointer = &scoped_array[0];
   file.read ( pointer , n );
   for (int i = 0; i < n; ++i) {
  //do something
   }
}
Steve Townsend
That's the opposite of what he actually wants, though. He wants it not to throw, just like the array/pointer code doesn't throw.
Steve Jessop
@Steve Jessop - I read the goal as 'exception-safe dynamic array in C++'. Code as it stands will assert (at best) and fault (more typically).
Steve Townsend
As far as the example code goes, I think the key word is *array*. An array provides a one-past-the-end pointer, which you can legally pass to `read` as long as the number of bytes read is 0. A vector doesn't give you that pointer, just a one-past-the-end *iterator*. So in that respect is not an exception-safe dynamic array. I think that Akusete's vector code will either assert or typically work. I don't know quite what the implementation looks like on which it will actually fault, although we're pretty sure it's allowed to.
Steve Jessop
`at` has an `[]` operator?
Default
@Steve Jessop - I see. Then perhaps a pragmatic solution would be to init vector with n+1 entries and constrain access to 0..n-1 (as this code already does).
Steve Townsend
@Steve: good call, that's more concise code than my solution.
Steve Jessop
A: 

This brought an interesting question to my mind, which I promptly asked here. In your case, you can avoid using pointers the following way:

template<class InputIterator, class OutputIterator>
OutputIterator copy_n( InputIterator first, InputIterator last, OutputIterator result, std::size_t n)
{
    for ( std::size_t i = 0; i < n; i++ ) {
        if (first == last)
            break;
        else
            *result++ = *first++;
    }
    return result;
}

std::ifstream file("path_to_file");
std::vector<char> buffer(n);
copy_n(std::istream_iterator<char>(file), 
       std::istream_iterator<char>(),
       std::back_insert_iterator<vector<char> >(buffer),
       n);

This will copy the contents of the file to a buffer n chars at a time. When you iterate over the buffer, use:

for (std::vector<char>::iterator it = buffer.begin(); it != buffer.end(); it++)

instead of a counter.

Space_C0wb0y
+1  A: 

As far as the C++ standard is concerned, operator[] isn't guaranteed not to check, it's just that (unlike at()) it's not guaranteed to check.

You'd expect that in a non-checking implementation, &scoped_array[scoped_array.size()] would result in a legal pointer either within or one-off-the-end of an array allocated by the vector. This isn't explicitly guaranteed, but for a given implementation you could verify by looking at its source. For an empty vector, there might not be an allocation at all (as an optimisation), and I don't see anything in the vector part of the standard which defines the result of scoped_array[0] other than table 68.

Going from table 68, you might say that the result of your expression is &*(a.begin() + 0), which illegally dereferences an off-the-end iterator. If your implementation's vector iterator is just a pointer then you probably get away with this - if not you might not, and obviously yours isn't.

I forget the results of the argument as to whether &*, on a pointer that must not be dereferenced, is a no-op, or not. IIRC it's not clear from the standard (some ambiguity somewhere), which provoked requests to fix the standard to make it explicitly legal. This suggests that it does in fact work on all or most known implementations.

Personally I wouldn't rely on this, and I wouldn't disable the checking. I'd rewrite your code:

char* pointer = (scoped_array.size() > 0) ? &scoped_array[0] : 0;

Or in this case just:

char* pointer = (n > 0) ? &scoped_array[0] : 0;

It just looks wrong to me to use index n of a vector without knowing that the size is at least n+1, regardless of whether it actually works in your implementation once you've disabled the checking.

Steve Jessop
I agree, better rewrite this code than try to work around the check, it's there for a reason.
Matthieu M.
A: 

operator [] returns a reference, and therefor invoking it on an empty vector must be undefined.

After all, which item should the reference refer to, when there are no items? operator [] would have to return a null-reference or a totally invalid reference. Both of which would result in undefined behavior.

So yes, you were using undefined behavior all along. Visual Studio's not-mandatory-but-still-comformant checks in operator [] just revealed that fact.

pgroke
+4  A: 

See LWG issue 464. This is a known issue. C++0x (which is partially implemented by MSVC 2010) solves it by adding a .data() member.

MSalters
.data() would be perfect, too bad its only in C++0x
Akusete
I'm accepting this answer simply because it references the discussion as to why this type of operation is wanted, and what will (eventually) be a standard way to do it.
Akusete
A: 

Could you use iterators instead of pointers?

{
    std::vector<char> scoped_array (size);
    std::vector<char>::iterator pointer = scoped_array.begin();

    //do work

} // exception safe deallocation
Arne
In general no, I was looking for a way to allocate normal pointer arrays in an exception safe way (sort of like auto_ptr for an array). I was under the impression it was good practice to use a std::vector for such a purpose.
Akusete