ansaurus

Question

Are C++ exceptions sufficient to implement thread-local storage?

Answer 1

A:

void *kid( void *local_v ) try {
    thlocal &local = * static_cast< thlocal * >( local_v );
    throw local;
} catch( thlocal & ) {
    print_thread();

    return NULL;
}

==

void *kid (void *local_v ) { print_thread(local_v); }

I might be missing something here, but it's not a thread local storage, just unnecessarily complicated argument passing. Argument is different for each thread only because it is passed to pthread_create, not because of any exception juggling.

It turned out that I indeed was missing that GCC is producing actual thread local storage calls in this example. It actually makes the issue interesting. I'm still not quite sure whether it is a case for other compilers, and how is it different from calling thread storage directly.

I still stand by my general argument that the same data can be accessed in a more simple and straight-forward way, be it arguments, stack walking or thread local storage.

ima 2010-03-21 15:52:04

Thread local storage *is* just complicated argument passing. `get_thread` retrieves the argument from any unknown location up the call stack. `pthread_create` isn't special, cf `main`. The significance here is that it works for functions without arguments, such as destructors or ill-designed callbacks.

Potatoswatter 2010-03-21 15:56:17

Exactly - but it doesn't work for function without arguments! You have to explicitly pass it to function by throw-ing before call. It's just that function(arguments) becomes throw arguments ... function.

ima 2010-03-21 16:00:48

I also don't think it's entirely correct to call pthread_getspecific "argument passing", no more than stack pointer itself.

ima 2010-03-21 16:12:42

@ima: `print_thread` is as without-arguments as functions get in C++. The intent is only one `throw local;` at the topmost scope. In terms of the functional language formalism, *all* data movement is argument passing. Can you give an example of something that should be possible but isn't?

Potatoswatter 2010-03-21 16:31:38

One? Your example has 3 'throw local's, one for each call. Consider: you can declare function without arguments, and then push and pop arguments in stack manually - would it make a parameter-less function? Obviously no, just one with a custom calling convention. You are using exception objects as a custom stack implementation, and store parameters there. And why are we talking about functional language formalism here?

ima 2010-03-21 16:48:32

Providing example is tricky, because thread local storage itself is never required, it's just a convenience.

ima 2010-03-21 16:59:05

But I think if you rewrite your example to use a more complex function call graph, with trees- and loops-, you'll see that it's equivalent to passing (void*) to every function.

ima 2010-03-21 17:02:08

There are clearly two `throw local`, one for each *thread*. I can retrieve the exception object any number of times with `get_thread`, so it's not like a stack. The functional formalism is relevant because it relates data flow to argument passing. `get_thread` pulls an argument "out of thin air". Copy my code and try it yourself.

Potatoswatter 2010-03-21 17:20:08

My mistake, two throw local and one throw - it doesn't matter much here though. "So it's not like a stack" - it's not "like", it "is" a stack. Where do you think thread_exception_base objects are stored? Check disassembly if you doubt. You are using exception handlers to access stack instead of more conventional means, it doesn't give any new capabilities, just a stack.

ima 2010-03-21 19:38:53

@ima; the point is, 'throw' on its own, picks up the exception currently being handled, regardless how far up the stack it was "caught", so you don't need to pass it, there may very well be frames on the stack without this 'void*' argument.

roe 2010-03-21 19:44:43

In a sense, what you have here is platform-independent way of stack walking. I can even imagine where it might be useful (rarely, since practical threads api's themselves are not cross-platform), but it's quite a different thing from thread local storage.

ima 2010-03-21 19:47:27

@ima: I did check the disassembly and found it was calling `pthread_getspecific`. See the edit to the question. There is an exception stack but I'm only using one level of it. The call stack is not traversed by `get_thread`, at least not under GCC.

Potatoswatter 2010-03-22 05:46:41

Looks like I were wrong here, sorry for being stubborn. Completely unexpected though, why would GCC use thread local storage for exception? I'll have to look into it.It begs another question though - if it is the case, why not just call getspecific directly?

ima 2010-03-22 09:16:23

It's probably combination, it might be calling getspecific to limit stack walking to the function call that started process. Because exception object itself is definitely stored on stack, and the same exception can be thrown deep inside thread calls.

ima 2010-03-22 09:26:12

@ima: I agree, it's possible to implement exceptions without `getspecific` by walking the stack. `getspecific` is simply faster. By the same token, `getspecific` could also be implemented by stack-walk… the functionality is equivalent. However, I don't know another way to do it portably. (If you do, I'll probably select your answer.) One direct call to `getspecific` would of course be a lot faster, but this way I can write code that still works on single-threaded or non-pthread platforms.

Potatoswatter 2010-03-23 02:59:35

No worries about stubbornness. I'm glad you articulated your thoughts and arguments and hope you don't delete this.

Potatoswatter 2010-03-23 03:00:52

Answer 2

A:

Accessing data on the current function call stack is always thread safe. That's why your code is thread safe, not because of the clever use of exceptions. Thread local storage allows us to store per-thread data and reference it outside of the immediate call stack.

Paul Keister 2010-03-21 17:37:40

Data on the call stack is quite unsafe if used in an unsafe manner. Thread local storage is not inherently safer than anything else, it's simply keyed by thread.

Potatoswatter 2010-03-21 17:47:18

This has nothing to do with thread safety, only thread local storage.

roe 2010-03-21 19:47:24

*If* you declared a bit of data on the stack and then passed it around to every function as an extra argument, you'd effectively have thread-local storage. The advantage of TLS is that you don't have to add the extra argument. The clever thing about this idea is noting that you can say `void foo() { throw; }`, where `throw` has no local context to determine what it should be throwing, so the runtime has to keep one per call stack, which is to say, one per thread. Hence this is a way of sharing a value only within a thread without using parameters to pass it around. Just like TLS.

Daniel Earwicker 2010-03-26 11:20:06

My original comment did contain a serious error which I consider a typo, but you be the judge. In any case I apologize for the confusion; I've corrected the error. I still don't understand how exceptions can take the place of thread local storage. There's no scenario I can thing of where passing stack variables to functions on or off the current thread will create per-thread variable instance, which is what thread local storage is.

Paul Keister 2010-03-28 20:27:03

This isn't stack variables. It's a storage slot provided by the C++ runtime library, which is used to store the most recently caught exception so that `throw;` will be able to retrieve it and rethrow it. The gag here is to store arbitrary data in an exception object, use `throw`/`catch` to get that exception stored in the slot, and then use `throw;` to retrieve it elsewhere. Hence it is a "global" storage slot...

Daniel Earwicker 2010-07-02 12:04:48

... But in a C++ implementation that is extended to support threads, it must be changed to a thread-local storage slot, otherwise thread 1 might inadvertantly rethrow something that has just been caught in thread 2, instead of rethrowing what was just caught in thread 1.

Daniel Earwicker 2010-07-02 12:05:22

Answer 3

+2 A:

I think you're onto something here. This might even be a portable way to get data into callbacks that don't accept a user "state" variable, as you've mentioned, even apart from any explicit use of threads.

So it sounds like you've answered the question in your subject: YES.

Ben Voigt 2010-03-21 18:01:33

Answer 4

+1 A:

In the playful spirit of the question, I offer this horrifying nightmare creation:

class tls
{
    void push(void *ptr)
    {
        // allocate a string to store the hex ptr 
        // and the hex of its own address
        char *str = new char[100];
        sprintf(str, " |%x|%x", ptr, str);
        strtok(str, "|");
    }

    template <class Ptr>
    Ptr *next()
    {
        // retrieve the next pointer token
        return reinterpret_cast<Ptr *>(strtoul(strtok(0, "|"), 0, 16));
    }

    void *pop()
    {
        // retrieve (and forget) a previously stored pointer
        void *ptr = next<void>();
        delete[] next<char>();
        return ptr;
    }

    // private constructor/destructor
    tls() { push(0); }
    ~tls() { pop(); }

public:
    static tls &singleton()
    {
        static tls i;
        return i;
    }

    void *set(void *ptr)
    {
        void *old = pop();
        push(ptr);
        return old;
    }

    void *get()
    {
        // forget and restore on each access
        void *ptr = pop();
        push(ptr);
        return ptr;
    }
};

Taking advantage of the fact that according to the C++ standard, strtok stashes its first argument so that subsequent calls can pass 0 to retrieve further tokens from the same string, so therefore in a thread-aware implementation it must be using TLS.

example *e = new example;

tls::singleton().set(e);

example *e2 = reinterpret_cast<example *>(tls::singleton().get());

So as long as strtok is not used in the intended way anywhere else in the program, we have another spare TLS slot.

Daniel Earwicker 2010-07-02 15:09:37

`strtok` is defined in terms of a sequence of calls, so it's ambiguous whether a thread-aware implementation keeps a TLS pointer or a global with a mutex. I am horrified, though.

Potatoswatter 2010-07-05 19:17:04

I'm glad it had the desired effect! :) If `strtok` was only protected by a mutex that would be locked for each call, that wouldn't really help, so TLS is the only likely solution. Though I suppose it could have some perverted use as a semi-reliable communication channel between threads! And on top of that I could build a stream interface, like TCP over IP... Now *there's* a challenge!

Daniel Earwicker 2010-07-06 07:46:46

ansaurus

tags:

views:

answers:

Are C++ exceptions sufficient to implement thread-local storage?

related questions