How does Intel TBB's scalable_allocator work? | ansaurus

+3 Q:

How does Intel TBB's scalable_allocator work?

What does the tbb::scalable_allocator in Intel Threading Building Blocks actually do under the hood?

It can certainly be effective. I've just used it to take 25% off an app's execution time (and seen CPU utilization rise from ~200% to 350% on a 4-core system) by changing a single std::vector<T> to std::vector<T,tbb::scalable_allocator<T> >. On the other hand, in another app I've seen it double an already large memory consumption and send things to swap city.
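For illustration, the one-line change described above might look like the following sketch. The `USE_TBB_SCALABLE_ALLOCATOR` switch, the `Alloc` and `IntVector` aliases, and the `iota_sum` helper are hypothetical names invented for this example. Only `tbb::scalable_allocator` and its header come from TBB. Without the switch the code falls back to `std::allocator`, so it builds even where TBB is not installed.

```cpp
#include <cstddef>
#include <memory>
#include <numeric>
#include <vector>

// Hypothetical build switch (not part of TBB): define it and link against the
// tbbmalloc library to use TBB's allocator; otherwise std::allocator is used.
#ifdef USE_TBB_SCALABLE_ALLOCATOR
#include <tbb/scalable_allocator.h>
template <typename T>
using Alloc = tbb::scalable_allocator<T>;
#else
template <typename T>
using Alloc = std::allocator<T>;
#endif

// The only difference from a plain std::vector<int> is the second template argument.
using IntVector = std::vector<int, Alloc<int>>;

// Hypothetical helper: builds a vector holding 1..n and returns the sum of its elements.
long long iota_sum(std::size_t n) {
    IntVector v(n);                    // storage comes from Alloc<int>
    std::iota(v.begin(), v.end(), 1);  // fill with 1, 2, ..., n
    return std::accumulate(v.begin(), v.end(), 0LL);
}
```

Calling `iota_sum(100)` exercises one allocation and one deallocation through whichever allocator was selected. This sketch is single-threaded, so it only shows the type change, not the speedup; the gains reported above presumably depend on several threads allocating at the same time.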

Intel's own documentation doesn't give a lot away (e.g. a short section at the end of this FAQ). Can anyone tell me what tricks it uses before I go and dig into its code myself?

UPDATE: I've just used TBB 3.0 for the first time and seen my best speedup from scalable_allocator yet. Changing a single vector<int> to a vector<int,scalable_allocator<int> > reduced one program's runtime from 85s to 35s (Debian Lenny, Core2, with TBB 3.0 from testing).

+4 A:

There is a good paper on the allocator: download.intel.com/technology/itj/2007/v11i4/5-foundations/5-Foundations_for_Scalable_Multi-core_Software.pdf

My limited experience: I overloaded the global new/delete with the tbb::scalable_allocator for my AI application. But there was little change in the time profile. I didn't compare the memory usage though.
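A minimal sketch of what such a global replacement could look like is below. It is not the code from that application. It routes the global operators through TBB's C-style `scalable_malloc` and `scalable_free` rather than through the `tbb::scalable_allocator` class template, which is a swapped-in technique chosen here for brevity. The `USE_TBB_SCALABLE_ALLOCATOR` switch and the `raw_alloc` and `raw_free` helpers are hypothetical names. Without the switch the code falls back to `std::malloc` and `std::free`.

```cpp
#include <cstddef>
#include <cstdlib>
#include <new>

// Hypothetical build switch (not part of TBB): define it and link against the
// tbbmalloc library to route allocations through TBB.
#ifdef USE_TBB_SCALABLE_ALLOCATOR
#include <tbb/scalable_allocator.h>
static void* raw_alloc(std::size_t n) { return scalable_malloc(n); }
static void raw_free(void* p) { scalable_free(p); }
#else
static void* raw_alloc(std::size_t n) { return std::malloc(n); }
static void raw_free(void* p) { std::free(p); }
#endif

// Replacement for the global single-object operator new. The default
// operator new[] forwards to this function, so arrays are covered too.
void* operator new(std::size_t n) {
    if (n == 0) n = 1;  // operator new must return a unique non-null pointer
    if (void* p = raw_alloc(n)) return p;
    throw std::bad_alloc();  // simplified: a full version would consult the new-handler
}

// Matching replacements for operator delete (unsized and sized forms).
void operator delete(void* p) noexcept {
    if (p) raw_free(p);
}
void operator delete(void* p, std::size_t) noexcept {
    if (p) raw_free(p);
}
```

With these definitions linked into a program, every ordinary `new` and `delete` expression goes through the selected allocator. Over-aligned allocations use separate operators and are not touched by this sketch.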

Amit Kumar 2009-03-19 06:22:33

Thanks! Article contains exactly the sort of information I was looking for.

timday 2009-03-19 09:17:06
