ansaurus

Question

Are there some cases where Python threads can safely manipulate shared state?

Answer 1

+3 A:

Appending to a list is thread-safe, yes. You can only append to a list while holding the GIL, and the list takes care not to release the GIL during the append operation (which is, after all, a fairly simple operation.) The order in which different thread's append operations go through is of course up for grabs, but they will all be strictly serialized operations because the GIL is never released during an append.

The same is not necessarily true for other operations. Lots of operations in Python can cause arbitrary Python code to be executed, which in turn can cause the GIL to be released. For example, i += 1 is three distinct operations, "get i', "add 1 to it" and "store it in i". "add 1 to it" would translate (in this case) into it.__iadd__(1), which can go off and do whatever it likes.

Python objects themselves guard their own internal state -- dicts won't get corrupted by two different threads trying to set items in them. But if the data in the dict is supposed to be internally consistent, neither the dict nor the GIL does anything to protect that, except (in usual thread fashion) by making it less likely but still possible things end up different than you thought.

Thomas Wouters 2010-04-29 20:17:03

Answer 2

+1 A:

In CPython, thread switching is done when sys.getcheckinteval() bycodes have been executed. So a context switch can never occur during the execution of a single bytecode, and operations that are encoded as a single bytecode are inherently atomic and threadsafe, unless that bytecode executes other Python code or calls C code that releases the GIL. Most operations on the built-in collection types (dict, list etc) fall into the 'inherently threadsafe' category.

However this is an implementation detail that is specific to the C implementation of Python, and should not be relied upon. Other versions of Python (Jython, IronPython, PyPy etc) may not behave in the same way. There is also no guarantee that future versions of CPython will keep this behaviour.

Dave Kirby 2010-04-29 20:34:41

This followed my intuition about what was happening, but I would never have thought to examine the other implementations. I am only using CPython. I did consider that, to make code future-proof against alternate implementations, this detail should not be relied upon.

Erik Garrison 2010-04-29 20:50:12

The bytecode distinction is mostly not very interesting, because almost all bytecodes have the potential for executing more Python code, and some that don't have the potential for releasing the GIL themselves. `list.append()`, for example, is not one bytecode, and the actual `append` work is executed by the `CALL_FUNCTION` opcode, which is *extremely likely* to execute more code :-)

Thomas Wouters 2010-04-29 22:56:25

ansaurus

tags:

views:

answers:

Are there some cases where Python threads can safely manipulate shared state?

related questions