ansaurus

Question

Why is it so slow iterating over a big std::list ?

Answer 1

+3 A:

It is due to the large amounts of cache misses you get when using a list. With a vector the surrounding elements are stored in the processors cache.

Yacoby 2009-09-09 22:36:53

Answer 2

+1 A:

Have a look at the following stackoverflow thread.

ChristopheD 2009-09-09 22:37:21

Answer 3

A:

[Edit: I stand corrected. std::list doesn't have operator[]. Sorry.]

It's hard to tell from your description, but I suspect you were trying to access the items randomly (i.e., by index):

for(int i = 0; i < mylist.size(); ++i) { ... mylist[i] ... }

Instead of using the iterators:

for(list::iterator i = mylist.begin(); i != mylist.end(); ++i) { ... (*i) ... }

Both "vector" & "deque" are good at random access, so either will perform adequately for those types---O(1) in both cases. But "list" is not good at random access. Accessing the list by index would take O(n^2) time, versus O(1) when using iterators.

ijprest 2009-09-09 22:37:53

Last time I checked, the stl didn't define a list as having random access functions?

Yacoby 2009-09-09 22:44:01

Why is this voted up? You *can't* access a list by index.

jalf 2009-09-09 22:45:14

Answer 4

+1 A:

There is a cache issue: all data in vector are stored in a contiguous chunk, and each list element is allocated separately and may happen to be stored in quite a random place of memory, which leads to more cache misses. However, I bet that you encounter one of the issues described in the other answers.

Pavel Shved 2009-09-09 22:42:16

Answer 5

+13 A:

Lists have terrible (nonexistent) cache locality. Every node is a new memory allocation, and may be anywhere. So every time you follow a pointer from one node to the next, you jump to a new, unrelated, place in memory. And yes, that hurts performance quite a bit. A cache miss may be two orders of magnitudes slower than a cache hit. In a vector or deque, pretty much every access will be a cache hit. A vector is one single contiguous block of memory, so iterating over that is as fast as you're going to get. A deque is several smaller blocks of memory, so it introduces the occasional cache miss, but they'll still be rare, and iteration will still be very fast as you're getting mostly cache hits.

A list will be almost all cache misses. And performance will suck.

In practice, a linked list is hardly ever the right choice from a performance point of view.

Edit: As a comment pointed out, another problem with lists is data dependencies. A modern CPU likes to overlap operations. But it can't do that if the next instruction depends on the result of this one.

If you're iterating over a vector, that's no problem. You can compute the next address to read on the fly, without ever having to check in memory. If you're reading at address x now, then the next element will be located at address x + sizeof(T) where T is the element type. So there are no dependencies there, and the CPU can start loading the next element, or the one after it, immediately, while still processing an earlier element. That way, the data will be ready for us when we need it, and this further helps mask the cost of accessing data in RAM.

In a list, we need to follow a pointer from node i to node i+1, and until i+1 has been loaded, we don't even know where to look for i+2. We have a data dependency, so the CPU is forced to read nodes one at a time, and it can't start reading future nodes ahead of time, because it doesn't yet know where they are.

If a list hadn't been all cache misses, this wouldn't have been a big problem, but since we're getting a lot of cache misses, these delays are costly.

jalf 2009-09-09 22:44:16

Even STL vectors?

Paul Nathan 2009-09-09 22:46:04

Okey, that does make sence when you explain it.. Cache locality, didnt think of that. Ty a bunch

Martin Andersson 2009-09-09 22:46:40

Even STL vectors what?

jalf 2009-09-09 22:47:02

Even STL vectors have a continuous memory access characteristic?

Paul Nathan 2009-09-09 22:49:52

*only* STL vectors have contiguous memory. None of the other STL data structures do. That's what makes vector so useful.

jalf 2009-09-09 22:50:29

That is good to know. Thanks. :-)

Paul Nathan 2009-09-09 22:53:22

not just cache penalties, don't forget about data dependencies

fortran 2009-09-09 23:04:50

true, although I suspect the cache hits are quite a bit more expensive. But you're right, I'll update my answer

jalf 2009-09-09 23:07:43

@jalf, doesn't std::string also use contiguous memory?

StackedCrooked 2009-09-10 12:53:01

I don't think it's technically required to. All implementations I know of do, yes, but I think it's legal to use another memory layout. I could be wrong though, I've never looked it up.

jalf 2009-09-10 15:36:44

jalf is correct; there's no requirement that std::string be contiguous -- that's why the data() and c_str() member functions return char_type const *, not char_type *.

me22 2009-09-23 15:29:02

ansaurus

tags:

views:

answers:

Why is it so slow iterating over a big std::list ?

related questions