Cross-platform and cross-process atomic int writes on file

Hello!

I'm writing an application that will have to handle many concurrent accesses, whether by threads or by processes, so I'd like to avoid mutexes and locks as far as possible.

To keep locking to a minimum, I'm designing the file to be "append-only": all data is first appended to disk, and then the address pointing to the record it updates is changed to refer to the new one. So I will only need a small lock system to change this one int so that it refers to the new address. What is the best way to do it?

I was thinking about putting a flag before the address; while it is set, readers would spin until it is released. But I'm afraid that isn't atomic at all, is it? E.g.:

a reader reads the flag, and it is unset
at the same time, a writer sets the flag and changes the value of the int
the reader may read an inconsistent value!

I'm looking for locking techniques, but all I find is either thread locking techniques or ways to lock an entire file, not individual fields. Is it not possible to do this? How do append-only databases handle this?

Edit: I was looking at how append-only DBs (CouchDB) do it, and it seems they use a dedicated thread to serialize the writes to the file. Does that mean it isn't possible to make them embeddable, like SQLite, without locking the entire file with file system locks?

Thanks! Cauê

Thanks for your answer! But still, if a reader happens to be reading the data the writer process is writing to - even if it's only an int pointer - it can still catch it in an inconsistent state, can't it? Also, I'd like to avoid using this kind of process communication, since I'd like to make this database prototype embeddable, like SQLite.

Waneck 2010-05-17 03:29:20

Thanks for your answer! I didn't know mmap could also write to a file; I thought it would only read from it. It could very much be an option, if the Windows equivalent (MapViewOfFile) also works as expected. But I don't know if there is a cross-processor way to use a compare-and-swap function. About the append semantics, I think a normal filesystem file lock would work fine in this case, wouldn't it? A rename would be out of the question. It's a database prototype I'm working on, and it will have to handle very big files and constant writes.

Waneck 2010-05-17 03:20:02

Is it a good workflow to mmap pages to arrays and then use compare-and-swap on them when a write is needed? Would mmap'd pages be slow? Is mmap guaranteed to be atomic? I haven't found any reference that says it is, but none that says it isn't either!

Waneck 2010-05-17 18:51:20

Sounds like a fine workflow to me. mmap'd pages aren't slow (at least on Linux); in fact they should be faster than using read/write, as they avoid any copying (the virtual memory is mapped directly to the page in the filesystem cache). It has to be the same on Windows for the scheme to work: the two mmap'd regions in the two processes need to map to the same physical memory for compare-and-swap to work at all. Whether the mmap call itself is atomic is irrelevant to your situation. The only atomicity you need is that of the compare-and-swap operation on already mmap'd memory.

Keith Randall 2010-05-17 19:34:27
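
For illustration only, here is a minimal sketch of the mmap plus compare-and-swap idea discussed above. It assumes POSIX and C11 atomics, and it assumes a hypothetical layout in which the first 8 bytes of the file hold the offset of the newest record. The names `map_head`, `publish` and `HEAD_MAP_LEN` are made up for this sketch, and the Windows side (MapViewOfFile) is not covered.

```c
#define _POSIX_C_SOURCE 200809L  /* for ftruncate under strict C modes */
#include <fcntl.h>
#include <stdatomic.h>
#include <stdint.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

/* Assumption: 64-bit atomics are lock-free here, so the CAS is a plain
 * CPU instruction on the shared page and not a hidden per-process lock. */
_Static_assert(ATOMIC_LLONG_LOCK_FREE == 2, "need lock-free 64-bit atomics");

#define HEAD_MAP_LEN 4096  /* hypothetical: size of the mapped header region */

/* Map the start of `path` shared and return a pointer to the 8-byte
 * "newest record offset" field (hypothetical layout). NULL on failure. */
static _Atomic uint64_t *map_head(const char *path) {
    int fd = open(path, O_RDWR | O_CREAT, 0644);
    if (fd < 0) return NULL;

    /* Grow the file if it is too short; never shrink an existing one. */
    struct stat st;
    if (fstat(fd, &st) != 0 ||
        (st.st_size < HEAD_MAP_LEN && ftruncate(fd, HEAD_MAP_LEN) != 0)) {
        close(fd);
        return NULL;
    }

    void *p = mmap(NULL, HEAD_MAP_LEN, PROT_READ | PROT_WRITE,
                   MAP_SHARED, fd, 0);
    close(fd);  /* the mapping stays valid after the descriptor is closed */
    return p == MAP_FAILED ? NULL : (_Atomic uint64_t *)p;
}

/* Swing the head from `expected` to `new_off`. Returns 1 on success and
 * 0 if another writer changed the head first (the caller would retry). */
static int publish(_Atomic uint64_t *head, uint64_t expected,
                   uint64_t new_off) {
    return atomic_compare_exchange_strong(head, &expected, new_off);
}
```

A writer would append its record first, then call `publish(head, old_offset, new_offset)`; readers only ever do `atomic_load(head)`, so they see either the old offset or the new one.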

But can't it happen that I use a compare-and-swap on the mmap'd memory, and then the time it takes to actually propagate the update from memory to the hard disk causes some sync issues?

Waneck 2010-05-17 19:53:20
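
On the disk side of this question, a small sketch, assuming POSIX: visibility between mappings and durability on disk are separate concerns, and `msync` with `MS_SYNC` is the call that asks for the mapped bytes to be written back. The function name `store_flush_readback` and the scratch-file layout are hypothetical, and the function is meant to be run on a throwaway file because it resizes it.

```c
#define _POSIX_C_SOURCE 200809L  /* for ftruncate and pread under strict C modes */
#include <fcntl.h>
#include <stdatomic.h>
#include <stdint.h>
#include <sys/mman.h>
#include <unistd.h>

#define HEAD_MAP_LEN 4096  /* hypothetical: size of the mapped region */

/* Store `value` through a shared mapping, flush it with msync, then read
 * the same bytes back through the file descriptor with pread.
 * Returns 0 on success, -1 on any failure. Resizes `path` (scratch file). */
static int store_flush_readback(const char *path, uint64_t value,
                                uint64_t *out) {
    int fd = open(path, O_RDWR | O_CREAT, 0644);
    if (fd < 0) return -1;
    if (ftruncate(fd, HEAD_MAP_LEN) != 0) { close(fd); return -1; }

    void *p = mmap(NULL, HEAD_MAP_LEN, PROT_READ | PROT_WRITE,
                   MAP_SHARED, fd, 0);
    if (p == MAP_FAILED) { close(fd); return -1; }

    /* The store lands in the shared page; other mappings see it at once. */
    atomic_store((_Atomic uint64_t *)p, value);

    /* Ask the kernel to write the dirty page back before continuing. */
    int rc = msync(p, HEAD_MAP_LEN, MS_SYNC);

    /* Read the field back via ordinary file I/O to confirm it is there. */
    if (rc == 0 && pread(fd, out, sizeof *out, 0) != (ssize_t)sizeof *out)
        rc = -1;

    munmap(p, HEAD_MAP_LEN);
    close(fd);
    return rc;
}
```

How often to flush is a durability trade-off and has no bearing on whether concurrent readers of the mapping see a consistent int.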

Oh, I see, they will all be in memory! It could actually be a pretty neat way to solve this! The problem is that it isn't really cross-process... But I think I can live with that limitation: no two processes will be able to modify the same file.

Waneck 2010-05-17 20:00:50

It should work cross-process as well. As long as you mmap it shared, the same physical memory page will back all of the mmap'd regions.

Keith Randall 2010-05-17 21:15:37
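
To make the cross-process claim concrete, here is a hedged sketch, assuming POSIX `fork` and C11 atomics: parent and child each create their own `MAP_SHARED` mapping of the same file and bump one counter with a compare-and-swap loop. The names `map_counter`, `cas_increment` and `two_process_count` are invented for this example, and the file is treated as a scratch file.

```c
#define _POSIX_C_SOURCE 200809L  /* for ftruncate under strict C modes */
#include <fcntl.h>
#include <stdatomic.h>
#include <sys/mman.h>
#include <sys/wait.h>
#include <unistd.h>

/* Map one int at the start of a scratch file, shared. NULL on failure. */
static atomic_int *map_counter(const char *path) {
    int fd = open(path, O_RDWR | O_CREAT, 0644);
    if (fd < 0) return NULL;
    if (ftruncate(fd, sizeof(atomic_int)) != 0) { close(fd); return NULL; }
    void *p = mmap(NULL, sizeof(atomic_int), PROT_READ | PROT_WRITE,
                   MAP_SHARED, fd, 0);
    close(fd);
    return p == MAP_FAILED ? NULL : (atomic_int *)p;
}

/* Classic CAS loop: retry until our old+1 replaces an unchanged old. */
static void cas_increment(atomic_int *c) {
    int old = atomic_load(c);
    while (!atomic_compare_exchange_weak(c, &old, old + 1)) {
        /* on failure, `old` has been refreshed with the current value */
    }
}

/* Parent and child each increment the counter n times through separate
 * mappings. Returns the final count, or -1 on error. */
static int two_process_count(const char *path, int n) {
    atomic_int *c = map_counter(path);
    if (!c) return -1;
    atomic_store(c, 0);

    pid_t pid = fork();
    if (pid < 0) return -1;
    if (pid == 0) {
        /* Child: open and map the file again, as an unrelated process would. */
        atomic_int *mine = map_counter(path);
        if (!mine) _exit(1);
        for (int i = 0; i < n; i++) cas_increment(mine);
        _exit(0);
    }

    for (int i = 0; i < n; i++) cas_increment(c);

    int status = 0;
    if (waitpid(pid, &status, 0) < 0) return -1;
    if (!WIFEXITED(status) || WEXITSTATUS(status) != 0) return -1;

    int total = atomic_load(c);
    munmap(c, sizeof(atomic_int));
    return total;
}
```

If the two mappings were not backed by the same physical page, increments would be lost and the result would fall short of `2 * n`.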

Really?? Very Cool!!! Thank you very much!

Waneck 2010-05-18 11:19:14
