I am busy writing something to test the read speeds for disk IO on Linux.

At the moment I have something like this to read the files:

Edited to change the code to this:

  const int segsize = 1048576;               // read in 1MB chunks
  char buffer[segsize];
  ifstream file;
  file.open(sFile.c_str());                  // sFile holds the path to the file
  while(file.readsome(buffer,segsize)) {}    // discard the data; loop ends when readsome() returns 0

For foo.dat, which is 150GB, the first read takes around 2 minutes. However, if I run it again within 60 seconds of the first run, it takes around 3 seconds. How is that possible? Surely the only place it could be read from that fast is the buffer cache in RAM, and the file is far too big to fit in RAM.

The machine has 50GB of RAM, and the drive is an NFS mount with all the default settings. Where could I look to confirm whether this file is actually being read at this speed? Is my code wrong? It appears to take a correct amount of time the first time the file is read.

Edited to add: I found that my files were only being read up to a seemingly random point. Changing segsize down from 1048576 to 1024 fixed this, though I have no idea why a smaller segment size lets the ifstream read the whole file instead of stopping partway through.

Thanks for the answers.

+1  A: 

in_avail returns a lower bound on how much is available to read in the stream's read buffer, not the size of the file. To read the whole file via the stream, just keep calling the stream's readsome() method and checking how much was read with the gcount() method - when that returns zero, you have read everything.
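A minimal sketch of that loop (the file name and buffer size here are placeholders):

  #include <fstream>
  #include <iostream>

  int main() {
      std::ifstream file("foo.dat", std::ios::binary);
      char buffer[1024];
      long long total = 0;
      for (;;) {
          file.readsome(buffer, sizeof buffer);
          std::streamsize n = file.gcount();  // bytes obtained by the last readsome()
          if (n == 0) break;                  // stop when gcount() reports zero bytes
          total += n;
      }
      std::cout << "read " << total << " bytes\n";
  }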

anon
That algorithm will stop if showmanyc() (which is the virtual member called by in_avail if the buffer is empty) returns 0. But showmanyc() can return 0 without reaching the end of the file. (Linux's version for filebuf returns the file size; Solaris and AIX return 0.)
AProgrammer
+3  A: 
  • in_avail doesn't give the length of the file, but a lower bound on what is available (in particular, if the buffer has already been filled, it returns the size available in the buffer). Its goal is to tell you what can be read without blocking.

  • unsigned int is most probably unable to hold a length of more than 4GB, so what is actually read may well be small enough to fit in the cache (see the sketch after this list).

  • C++0x Stream Positioning may be interesting to you if you are working with large files.
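A minimal sketch of reading a large file's length into a 64-bit std::streamoff rather than an unsigned int (the file name is a placeholder):

  #include <fstream>
  #include <iostream>

  int main() {
      std::ifstream file("foo.dat", std::ios::binary | std::ios::ate);  // open positioned at the end
      std::streamoff size = file.tellg();  // streamoff is wide enough for >4GB files on LFS-enabled systems
      std::cout << "length: " << size << " bytes\n";
  }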

AProgrammer
Hi, good point about the unsigned int. I've edited my code to reflect what you've said, but I am still getting the same problem. In fact, I've also written a Perl script that just reads in the whole file and gives similar (slower in both cases) results: still just over 2 minutes for the first run and 3 seconds for the second.
Salgar
+4  A: 

On Linux, you can do this for a quick throughput test:

$ dd if=/dev/md0 of=/dev/null bs=1M count=200
200+0 records in
200+0 records out
209715200 bytes (210 MB) copied, 0.863904 s, 243 MB/s

$ dd if=/dev/md0 of=/dev/null bs=1M count=200
200+0 records in
200+0 records out
209715200 bytes (210 MB) copied, 0.0748273 s, 2.8 GB/s

$ sync && echo 3 > /proc/sys/vm/drop_caches

$ dd if=/dev/md0 of=/dev/null bs=1M count=200
200+0 records in
200+0 records out
209715200 bytes (210 MB) copied, 0.919688 s, 228 MB/s

echo 3 > /proc/sys/vm/drop_caches will flush the cache properly

Nicolas Viennot
Note that syncing before dropping caches isn't strictly necessary if the data isn't being modified - drop_caches won't remove dirty (or mmapped) data.
bdonlan
Note that this requires superuser privileges.
Peter Mortensen
+1  A: 

"It appears to take a correct amount of time the first time the file is read."

On that first read, you're reading 150GB in about 2 minutes. That works out to about 10 gigabits per second. Is that what you're expecting (based on the network to your NFS mount)?
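(Working the arithmetic: 150 GB × 8 bits/byte = 1,200 gigabits; 1,200 Gbit ÷ 120 s = 10 Gbit/s.)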

Keith Smith
It turns out this is the correct answer. The reason there is such a difference is because everything I am reading is making it into the cache. The problem is I'm not reading 'everything' into the cache. For instance, when reading in a 10GB file, if I read the size and add up everything .readsome() has read, I get a big difference: Size = 10819423232, Read = 2229490000. Anyone know why readsome() would be stopping early?
Salgar
It's always good to sanity check your numbers. Do you need to write your own program to do this? If I wanted to measure the time to read a large file in 1MB chunks, I'd just use dd...
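(For example, something like dd if=foo.dat of=/dev/null bs=1M, with foo.dat standing in for your file.)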
Keith Smith
+1  A: 

One possibility is that the file could be at least in part sparse. A sparse file has regions that are truly empty - they don't even have disk space allocated to them. These sparse regions also don't consume much cache space, and so reading the sparse regions will essentially only require time to zero out the userspace pages they're being read into.

You can check with ls -lsh. The first column will be the on-disk size - if it's less than the file size, the file is indeed sparse. To de-sparse the file, just write to every page of it.
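For illustration, here is what that looks like on a file deliberately created sparse (the name, date, and owner are made up for the example):

$ truncate -s 1G sparse.dat
$ ls -lsh sparse.dat
0 -rw-r--r-- 1 user user 1.0G Jun  1 12:00 sparse.dat

The leading 0 is the on-disk size; the 1.0G is the nominal file length.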

If you would like to test for true disk speeds, one option would be to use the O_DIRECT flag to open(2) to bypass the cache. Note that all IO using O_DIRECT must be page-aligned, and some filesystems do not support it (in particular, it won't work over NFS). Also, it's a bad idea for anything other than benchmarking. See some of Linus's rants in this thread.
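As a rough sketch of what an O_DIRECT read loop could look like (the path, chunk size, and alignment here are placeholder choices, and error handling is minimal):

  #include <fcntl.h>
  #include <unistd.h>
  #include <cstdio>
  #include <cstdlib>

  int main() {
      // O_DIRECT bypasses the page cache; buffer, offset, and size must be
      // suitably aligned - page alignment (4096) is the safe choice.
      // Compile with g++, which defines _GNU_SOURCE (needed for O_DIRECT).
      int fd = open("foo.dat", O_RDONLY | O_DIRECT);
      if (fd < 0) { perror("open"); return 1; }

      const size_t chunk = 1048576;            // 1MB per read, like the original code
      void* buf;
      if (posix_memalign(&buf, 4096, chunk) != 0) { close(fd); return 1; }

      long long total = 0;
      ssize_t n;
      while ((n = read(fd, buf, chunk)) > 0)
          total += n;                          // count the bytes actually read

      printf("read %lld bytes\n", total);
      free(buf);
      close(fd);
      return 0;
  }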

Finally, to drop all caches on a Linux system for testing, you can do:

echo 3 > /proc/sys/vm/drop_caches

If you do this on both client and server, you will force the file out of memory. Of course, this will have a negative performance impact on anything else running at the time.

bdonlan