ansaurus

Question

C++ STL: How to iterate vector while requiring access to element and its index?

Answer 1

+8 A:

for (size_t i = 0; i < vec.size(); i++)
    elem = vec[i];

Vectors are a thin wrapper over C arrays; whether you use iterators or indexes, it's just as fast. Other data structures are not so forgiving though, for example std::list.

iconiK 2010-05-25 03:32:44

Actually this is not necessarily the best. Random access is less prone to cache optimization then iteration -- iteration carries the additional information that data will be accessed sequentially what allows modern optimizers to prepare the cache and save us from the oh-so-costly cache misses. Run a test comparing non-trivial vector iteration on different architectures -- you might be surprised.

Kornel Kisielewicz 2010-05-25 03:54:27

That is interesting, but can you provide a source (link or keywords)? For which compiler is that the case? Generally recognizing sequential access in loop shouldn't be harder for compiler than recognizing std iterators. I can think of couple other reasons to explain why iteration can be faster, depending on vector implementation.

ima 2010-05-25 06:06:45

In fact, I just checked Intel compiler documentation and, while it lists plenty of cache access optimizations for loops, there is not a single mention (under any name) of stl flavor of iteration. In the best case, stl iteration will be optimized into loop, then the same techniques will apply.

ima 2010-05-25 06:17:57

Actually assuming sequential access with iterators is not possible as I can easily advance by any number of element I want with a random access iterator. It is no more optimizable than an index; a vector iterator is practically a plain pointer. Whether you increment the pointer each iteration or you add i to it, it's still a simple add instruction.

iconiK 2010-05-25 10:54:59

Answer 2

+6 A:

You can always compute the index in the loop:

std::size_t index = std::distance(fooVec.begin(), i);

For a vector, this is quite likely to be implemented as a single pointer subtraction operation, so it's not particularly expensive.

James McNellis 2010-05-25 03:33:12

However, for other data structures it can get quite expensive.

iconiK 2010-05-25 03:34:49

@iconiK: I don't disagree with that.

James McNellis 2010-05-25 03:35:37

@James: I think it's better to increment the index in the loop if you iterate over all the elements of a generic container. That way you don't loop around a list 3 million times calculating an index.

iconiK 2010-05-25 03:37:17

@iconiK: Accessing an element via subscript should not be faster than computing the distance from the beginning of the container to a given iterator. There might be corner cases, but I can't think of any off the top of my head.

James McNellis 2010-05-25 04:11:10

`std::distance` for *random access iterators* like `std::vector` is O(1), being just a pointer subtraction, but for other less flexible iterators (e.g. those of `std::list`) it's an O(N) operation, and it is better to not compute `index` again and again in the loop.

KennyTM 2010-05-25 06:07:13

Answer 3

+4 A:

Elegance is in the eye of the beholder, however do remember pointer/iterator arithmetics :)

for (FooVecIter i = fooVec.begin(); i != fooVec.end(); ++i)
{
    Foo& foo = *i;
    if (foo.somethingIsTrue()) // True for most elements
        std::cout << i - fooVec.begin() << ": " << foo << std::endl;
}

The up-side compared to the distance method is that you won't mistakenly do this for a non-random_access_iterator, so you'll always be in O(1).

Kornel Kisielewicz 2010-05-25 03:42:10

You mean i = fooVec.begin(); right?

iconiK 2010-05-25 03:44:56

@iconiK, sure I do, thanks.

Kornel Kisielewicz 2010-05-25 03:45:36

Why you you subtracting a FooVecIter from a Foo in i - fooVec.begin() // makes no sense.

iconiK 2010-05-25 03:45:54

Okay, it works, though I don't understand how exactly, as it's weird (but then, I'm no iterator wizard): http://codepad.org/ycqlJxU5

iconiK 2010-05-25 03:52:20

@iconiK, both are FooVectIter

Kornel Kisielewicz 2010-05-25 03:52:33

@Kornel: I agree; enforcing that the iterator is random-accessible is often a better idea than using distance.

James McNellis 2010-05-25 04:02:15

@Kornel, ahh, my bad, I thought I saw foo - fooVec.begin(0). Being tired is not good for programming, I tell you that!

iconiK 2010-05-25 10:50:18

Answer 4

+1 A:

For the specific question:

Is there a better or more elegant way to iterate over STL vectors
when both reference to the vector element and its index is required?

IMHO,

for( size_t i = 0; i < fooVec.size(); ++i ) {
    Foo & foo = fooVec[ i ];
}

is the most simple and elegant solution. Per the requirements of the question, using an iterator is not required.

I know iterators are there, they are useful, and using them is the preferred approach. But, we also have to put the solution after the problem (as in putting the carriage after the horse, nor the other way). I don't see how something becomes elegant just by using iterator, at least for this problem. But, I am not an expert, so I look forward what others have to say.

ArunSaha 2010-05-25 05:31:54

You can't rebind a reference.

Matt Curtis 2010-05-25 05:42:11

Yes, corrected.

ArunSaha 2010-05-25 05:52:25

ansaurus

tags:

views:

answers:

C++ STL: How to iterate vector while requiring access to element and its index?

related questions