views: 89
answers: 1

I just started using OpenMP with C++. My serial code looks something like this:

#include <iostream>
#include <string>
#include <sstream>
#include <vector>
#include <fstream>
#include <stdlib.h>

int main(int argc, char* argv[]) {
    std::string line;
    std::ifstream inputfile(argv[1]);

    if(inputfile.is_open()) {
        while(getline(inputfile, line)) {
            // Line gets processed and written into an output file
        }
    }
}

Because each line is processed pretty much independently and the input file is on the order of gigabytes, I am attempting to use OpenMP to parallelize this. I'm guessing that I first need to get the number of lines in the input file and then parallelize the code along these lines. Can someone please help me out here?

#include <iostream>
#include <string>
#include <sstream>
#include <vector>
#include <fstream>
#include <stdlib.h>

#ifdef _OPENMP
#include <omp.h>
#endif

int main(int argc, char* argv[]) {
    std::string line;
    std::ifstream inputfile(argv[1]);

    if(inputfile.is_open()) {
        //Calculate number of lines in file?
        //Set an output filename and open an ofstream
        #pragma omp parallel num_threads(8)
        {
            #pragma omp for schedule(dynamic, 1000)
            for(int i = 0; i < lines_in_file; i++) {
                // What do I do here? I cannot just read line i directly, because that requires random access
            }
        }
    }
}
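
One workaround I can think of is to skip counting lines altogether: read every line into a std::vector<std::string> up front, then parallelize over the vector. Roughly like this sketch (the real per-line work is replaced by a character count, and it assumes that holding all the lines in memory is acceptable, which may not be true for a multi-gigabyte file):

#include <iostream>
#include <string>
#include <vector>
#include <fstream>

#ifdef _OPENMP
#include <omp.h>
#endif

int main(int argc, char* argv[]) {
    if(argc < 2) return 1;

    std::ifstream inputfile(argv[1]);
    std::vector<std::string> lines;
    std::string line;

    // Read sequentially in a single thread; only the processing is parallel
    while(std::getline(inputfile, line))
        lines.push_back(line);

    long long total = 0; // stand-in result, e.g. total characters seen

    // Each iteration touches a different element, so no locking is needed
    #pragma omp parallel for schedule(dynamic, 1000) reduction(+:total)
    for(long i = 0; i < (long)lines.size(); i++) {
        // Placeholder for the real per-line work
        total += (long long)lines[i].size();
    }

    std::cout << "Processed " << lines.size() << " lines, "
              << total << " characters" << std::endl;
    return 0;
}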

EDIT:

Important Things

  1. Each line is independently processed
  2. The order of the results doesn't matter
+1  A: 

Not a direct OpenMP answer, but what you are probably looking for is a Map/Reduce approach. Take a look at Hadoop; it's written in Java, but there is at least some C++ API.

In general, you want to process this amount of data on different machines, not in multiple threads in the same process (virtual address space limitations, lack of physical memory, swapping, etc.). Also, the kernel will have to read the file from disk sequentially anyway, which is what you want; otherwise the hard drive would just have to do extra seeks for each of your threads.
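
If you do stay inside a single process anyway, one way to respect that is to keep a single sequential reader and hand fixed-size batches of buffered lines to OpenMP worker threads, so the disk never sees competing readers and memory stays bounded. A rough sketch (the batch size is an arbitrary pick, and the per-line work is again just a character count):

#include <cstddef>
#include <iostream>
#include <string>
#include <vector>
#include <fstream>

#ifdef _OPENMP
#include <omp.h>
#endif

int main(int argc, char* argv[]) {
    if(argc < 2) return 1;

    std::ifstream inputfile(argv[1]);
    std::string line;
    const std::size_t batch_size = 100000; // arbitrary; tune for available memory
    std::vector<std::string> batch;
    batch.reserve(batch_size);

    long long total = 0; // stand-in for the real per-line result
    bool more = true;

    while(more) {
        // One sequential reader fills the batch, so there are no extra seeks
        batch.clear();
        while(batch.size() < batch_size) {
            if(!std::getline(inputfile, line)) { more = false; break; }
            batch.push_back(line);
        }

        // The in-memory batch is processed in parallel; memory stays bounded
        #pragma omp parallel for schedule(dynamic, 1000) reduction(+:total)
        for(long i = 0; i < (long)batch.size(); i++) {
            total += (long long)batch[i].size(); // placeholder work
        }
    }

    std::cout << total << std::endl;
    return 0;
}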

Nikolai N Fetissov
@Nikolai: Thanks for the explanation. What you said makes perfect sense now.
Legend