I'd like to be able to see how "expensive" each thread in my application is using callgrind. I profiled with the --separate-thread=yes
option which gives you a callgrind file for the whole app and then one per-thread.
This is useful for viewing the profile of any given thread, but what I really want is just a sorted list of CPU time from each thread so I can see which threads are the the biggest hogs.