OK, so I'm reading a binary file into a char array I've allocated with malloc. (By the way, the code here isn't the actual code; I just wrote it on the spot to demonstrate, so any mistakes here are probably not mistakes in the actual program.) This method reads at about 50 million bytes per second.

main

char *buffer = (char*)malloc(file_length_in_bytes*sizeof(char));
memset(buffer, 0, file_length_in_bytes*sizeof(char));
//start time here
read_whole_buffer(buffer);   // defined below
//end time here
free(buffer);

read_whole_buffer

void read_whole_buffer(char* buffer)
{
  //file already opened
  fseek(_file_pointer, 0, SEEK_SET);                      // rewind to the start
  size_t a = sizeof(buffer[0]);                           // element size (1 for char)
  fread(buffer, a, file_length_in_bytes, _file_pointer);  // size * count = whole file
}

I've written something similar in Managed C++ that uses FileStream, I believe, and its ReadByte() function to read the entire file byte by byte, and it also reads at around 50 million bytes per second.

Also, I have a SATA and an IDE drive in my computer, and I've tried loading the file off both; it doesn't make any difference at all (which is weird, because I was under the assumption that SATA reads much faster than IDE).

Question

Maybe you can all understand why this doesn't make any sense to me. As far as I know, it should be much faster to fread a whole file into an array than to read it byte by byte. On top of that, through testing I've discovered that Managed C++ is slower (though that's only noticeable if you are benchmarking your code and you require speed).

SO

Why in the world am I reading at the same speed with both applications? Also, is 50 million bytes per second from a file into an array quick?

Maybe my motherboard is bottlenecking me? That just doesn't seem to make much sense either.

Is there maybe a faster way to read a file into an array?

Thanks.

My 'script timer'

Records start and end times with millisecond resolution... most importantly, it's not really a timer, it just records two timestamps.

#pragma once
#ifndef __Script_Timer__
#define __Script_Timer__

#include <sys/timeb.h>

extern "C"
{
    struct Script_Timer
    {
        unsigned long milliseconds;
        unsigned long seconds;
        struct timeb start_t;
        struct timeb end_t;
    };

    // Record the end timestamp and compute the elapsed seconds/milliseconds.
    void End_ST(Script_Timer *This)
    {
        ftime(&This->end_t);
        This->seconds = This->end_t.time - This->start_t.time;
        This->milliseconds = (This->seconds * 1000) + (This->end_t.millitm - This->start_t.millitm);
    }

    // Record the start timestamp.
    void Start_ST(Script_Timer *This)
    {
        ftime(&This->start_t);
    }
}
#endif
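
For reference, this is roughly how the timer wraps the read; treat it as a sketch rather than the exact benchmark code (the header file name Script_Timer.h, the unsigned long type of file_length_in_bytes, and the helper name timed_read are assumptions):

#include <cstdio>
#include "Script_Timer.h"                    // assumed name for the header above

extern unsigned long file_length_in_bytes;   // set when the file is opened (as in the question)
void read_whole_buffer(char *buffer);        // from the snippet above

void timed_read(char *buffer)
{
    Script_Timer st;
    Start_ST(&st);                           // record the start timestamp
    read_whole_buffer(buffer);               // the fread() being measured
    End_ST(&st);                             // record the end timestamp, compute elapsed ms
    double mb_per_sec = st.milliseconds
        ? (file_length_in_bytes / 1000.0) / st.milliseconds
        : 0.0;                               // avoid dividing by zero for very fast reads
    printf("read %lu bytes in %lu ms (%.1f MB/s)\n",
           file_length_in_bytes, st.milliseconds, mb_per_sec);
}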

Read buffer thing

char face = 0;
char comp = 0;
char nutz = 0;
for(int i = 0; i < (int)(_length*sizeof(char)); ++i)
{
    face = buffer[i];                  // touch every byte of the buffer
    if(face == comp)
        nutz = (face + comp)/(i + 1);  // throwaway arithmetic; i+1 avoids dividing by zero at i==0
    comp++;
}
A: 

I've done some tests on this, and beyond a certain point the benefit of a larger buffer tails off. There is usually an optimum buffer size you can find with a bit of trial and error.

Note also that fread() (or, more specifically, the C or C++ I/O library) will probably be doing its own buffering. If your system supports it, a plain read() may (or may not) be a bit faster.
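
For concreteness, a minimal sketch of reading in fixed-size chunks, with the chunk size as the knob to tune by trial and error (the helper name read_in_chunks is made up for illustration):

#include <stdio.h>
#include <stddef.h>

#define CHUNK 65536   // try 4096, 65536, 1 MB, ... and measure

// Fill 'buffer' from 'fp' in CHUNK-sized pieces; returns the number of bytes actually read.
size_t read_in_chunks(FILE *fp, char *buffer, size_t total)
{
    size_t done = 0;
    while (done < total) {
        size_t want = total - done;
        if (want > CHUNK)
            want = CHUNK;
        size_t got = fread(buffer + done, 1, want, fp);
        if (got == 0)          // EOF or read error
            break;
        done += got;
    }
    return done;
}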

anon
So in a sense I'd have to split the buffer into smaller buffers, such as buffer[5][4096]?
kelton52
I think you are right about the buffering, but is there a way to read a file without the read function buffering?
kelton52
As I said, try using the read() function rather than fread(), or try using other OS features, such as the Win32 ReadFile() API.
anon
I did see a slight improvement of about 1-5 million characters per second.
kelton52
+1  A: 

Transfers to or from main memory run at speeds of gigabytes per second; inside the CPU, data flows even faster. It is not surprising that, whatever you do on the software side, the hard drive itself remains the bottleneck.

Here are some numbers from my system, using PerformanceTest 7.0:

  • hard disk: Samsung HD103SI 5400 rpm: sequential read/write at 80 MB/s
  • memory: 3 * 2 GB at 400 MHz DDR3: read/write around 2.2 GB/s

So if your system is a bit older than mine, a hard drive speed of 50 MB/s is not surprising. The connection to the drive (IDE/SATA) is not all that relevant; it's mainly about the number of bits passing the drive heads per second, purely a hardware thing.

Another thing to keep in mind is your OS's filesystem cache. It could be that the second time round, the hard drive isn't accessed at all.

The 180 MB/s memory read speed that you mention in your comment does seem a bit on the low side, but that may well depend on the exact code. Your CPU's caches come into play here. Maybe you could post the code you used to measure this?

Thomas
Even though I have one SATA drive and one IDE drive? I should notice SOME difference if the hard drive were the bottleneck, correct?
kelton52
Also, the fastest I've been able to read bytes from an array was about 180 million bytes per second, far from gigabytes. I feel like I'm missing a lot of speed I could be using, and I need all that I can find. Any suggestions for that?
kelton52
Too much to say for a comment. I'll edit my answer, hang on...
Thomas
I'm pretty sure you're right about the hard drives being the bottleneck now. Thank you. And I rewrote a 'read from buffer' and it doesn't even seem to register on my timer. The biggest file I opened was 700 megs. Didn't even see a millisecond tick. Doesn't seem right. I'll post my code for that also.
kelton52
The bottleneck is not the hard drive, but the communication channel between the program and the hard drive. There are USB, SATA and other protocols to set up, as well as data and address bus sharing on the PC. Also, if the data file is fragmented, the drive will have to make more than one access.
Thomas Matthews
+1  A: 

The FILE* API uses buffered streams, so even if you read byte by byte, the API internally reads buffer by buffer. So your comparison will not show a big difference.

The low-level I/O API (open, read, write, close) is unbuffered, so using that will make a difference.

It may also be faster for you if you do not need the automatic buffering of the FILE* API!
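
For example, a minimal sketch of a whole-file read with the low-level API; this assumes a POSIX-style environment (with MSVC on Windows the rough equivalents are _open/_read/_close from <io.h>), and error handling is kept minimal:

#include <fcntl.h>     // open, O_RDONLY
#include <unistd.h>    // read, close, ssize_t

// Read up to 'length' bytes of 'path' into 'buffer' without stdio buffering.
// Returns the number of bytes read, or -1 if the file could not be opened.
long read_unbuffered(const char *path, char *buffer, long length)
{
    int fd = open(path, O_RDONLY);
    if (fd < 0)
        return -1;
    long done = 0;
    while (done < length) {
        ssize_t got = read(fd, buffer + done, length - done);
        if (got <= 0)          // 0 = end of file, negative = error
            break;
        done += got;
    }
    close(fd);
    return done;
}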

frunsi
Yeah, I tested that, and it gave me an extra 1-5 million bytes per second... relatively not much of a gain. The biggest gain I've had so far is about 10-15 MB/s, by slicing my buffer into smaller pieces, around 4096 bytes apiece. I also squared the size and noticed no definite change.
kelton52
4k is a good buffer size for various reasons. Also, you should use gettimeofday() for your timer on Unix/Linux and QueryPerformanceCounter on Windows. An upvote would be nice :)
frunsi
I explored both QueryPerformanceCounter and gettimeofday, and neither would fit my specific needs. As far as I know the solution I came up with for the timing is actually pretty sound and reliable.
kelton52
Well, it may be sufficient here, but the resolution of ftime() is about 10 ms on Windows (even though it looks like it should be 1 ms).
frunsi
Well, does gettimeofday() catch milliseconds? I've only run across examples for getting seconds.
kelton52
gettimeofday() should have _microsecond_ accuracy, even in practice. AFAIK it somewhat depends on CPU speed, but with a CPU of a few hundred MHz you already get microsecond accuracy. In any case, it's all much more accurate than ftime()! ;)
frunsi
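
To make the suggestion in this comment thread concrete, here is a minimal sketch of both higher-resolution timers, returning elapsed milliseconds (this is an illustration, not code from the thread; sample with QueryPerformanceCounter(&start) or gettimeofday(&start, NULL) before the read, sample again after it, and pass both to elapsed_ms()):

#ifdef _WIN32
#include <windows.h>

// Elapsed milliseconds between two QueryPerformanceCounter samples.
double elapsed_ms(LARGE_INTEGER start, LARGE_INTEGER end)
{
    LARGE_INTEGER freq;
    QueryPerformanceFrequency(&freq);   // ticks per second
    return (end.QuadPart - start.QuadPart) * 1000.0 / freq.QuadPart;
}
#else
#include <sys/time.h>

// Elapsed milliseconds between two gettimeofday() samples.
double elapsed_ms(struct timeval start, struct timeval end)
{
    return (end.tv_sec - start.tv_sec) * 1000.0
         + (end.tv_usec - start.tv_usec) / 1000.0;
}
#endif
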
A: 

The issue with block reading is the overhead between your program and the hard drive. Most of this overhead exists to maintain portability and ease of development, although some of it is there to manage multiple tasks or programs running concurrently.

Managed C++, or C#, has another layer between your program and the hard disk. This, I believe, is called the CLI. The CLI is written to be language-generic so that programs written in other languages (such as Visual Basic) can easily share data and resources. Supposedly, this should also shrink the size of the executable, since the OS now contains more of the code.

The read operation can be further optimized by diving deeper into either the C-style code or the C++ streams. If you really need performance, you will have to sacrifice portability and use platform-specific technologies. If you can tell the I/O card to dump a quantity of bytes directly into your array, you're doing great. Some computers can delegate I/O operations away from the main processor; however, there may be data bus or address bus sharing issues that slow things down.

I optimized a utility that processes 1 GB data files from 1 hour down to 2 minutes, primarily by reading data into huge buffers (5 MB to 10 MB). The performance bottleneck is in the I/O system. For example, when the data file was on the network, the processing time often doubled.
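
As a rough illustration of the platform-specific route (and of the Win32 ReadFile() API mentioned in an earlier comment), here is a sketch of a single large read with a sequential-access hint; the helper name read_win32 is made up and error handling is kept minimal:

#include <windows.h>

// Read up to 'length' bytes of 'path' into 'buffer' with one large ReadFile() call,
// hinting to the cache manager that the file will be read sequentially.
DWORD read_win32(const char *path, char *buffer, DWORD length)
{
    HANDLE h = CreateFileA(path, GENERIC_READ, FILE_SHARE_READ, NULL,
                           OPEN_EXISTING, FILE_FLAG_SEQUENTIAL_SCAN, NULL);
    if (h == INVALID_HANDLE_VALUE)
        return 0;
    DWORD got = 0;
    ReadFile(h, buffer, length, &got, NULL);   // one big read instead of many small ones
    CloseHandle(h);
    return got;
}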

Thomas Matthews
Yeah, but my current read can read a gigabyte in 16.6 seconds. It also sounds a lot simpler than what you're doing.
kelton52
I can also read through >45 MB worth of source code and remove the comments (line and block) in ~1-10 ms (I don't know exactly, because it doesn't register with my timer, and my timer, as stated earlier, has a 10 ms margin of error)... When I actually get a reading, though, I'll translate it into how many gigabytes I can parse.
kelton52
Done. I can remove all the comments and copy the buffer to a new buffer for a 1 GB file in 8.31 seconds.
kelton52
Got it down to 5.75 seconds.
kelton52
A: 

Hello kelton,

Could you publish the piece of code you used to fill your buffer for a 1 GB file in 5.75 s?

Thanks in advance.

Sasfepu
You can check it out here: http://blog.skylabsonline.com/?p=53
kelton52