ansaurus

Question

Mysterious pointer-related multithreading slowdown

Answer 1

+1 A:

My first guess is that you are running into false-sharing. If you have multiple threads both modifying memory in the same cache line, the hardware is going to spend a lot of time passing ownership of the cache line between processors.

R Samuel Klatchko 2009-11-06 18:49:07

but my Nodes and Models are both block-allocated... hmm.. how would false sharing happen in this case?

int3 2009-11-06 18:54:48

As I understand it though, neither thread modifies the data in question, which would rule out false sharing. It was my first thought too though.

jalf 2009-11-06 20:31:52

What about read contention for reading the same cache-line during the same bus cycle? I think the L1 cache is single-ported.

Heath Hunnicutt 2009-11-09 19:50:45

Answer 2

A:

The biggest thing I'd look for is some improper initializtion that either makes duplicate data or has improper shared data. It's not evident in the code but it's the obvious mistake to make when going from ** to *.

Charles Eli Cheese 2009-11-09 07:16:55

Answer 3

+1 A:

Another has questioned whether the slow-down comes from the added indirection, or the change in how you allocate the struct Model. Because you are now allocating the Model structs as a contiguous region of memory, it is possible that adjacent structs could share the same cache-line. If your threads are simultaneously accessing adjacent structs, the will contend for access. One read access will stall for a bus cycle whilst awaiting the other.

What is the sizeof(class Model)? You might try expanding it with dummy variables until the class is the sizeof your cache line.

Another possibility is that you have changed the alignment of member variables you are accessing. If your sizeof(class Model) is not a multiple of your machine's word size (e.g., 8-bytes) then an array of such objects will have some members aligned to the word size and some not. Misalignment causes double-fetching on the memory bus, as the fetch unit reads the machine words from aligned memory locations and composites the addressed value out of those two fetches.

Heath Hunnicutt 2009-11-09 19:58:42

ansaurus

tags:

views:

answers:

Mysterious pointer-related multithreading slowdown

related questions