Question about Compute Visual Profiler and number of blocks for profiling | ansaurus

tags:

views:

24

answers:

1

Q:

Question about Compute Visual Profiler and number of blocks for profiling

On page 51 of the Compute Visual Profiler User Guide it states that:

" Note that in case the number blocks in a kernel is less than or not a multiple of the number of multiprocessors the counters values across multiple runs will not be consistent. "

Is that an inclusive or exclusive "or" statement? Does it always have to be a multiple?

+1 A:

The inconsistency mentioned in the docs is causes by load imbalance between multiprocessors.

For instance, if you are running a kernel with 15 blocks on a Tesla C2050 which has 14 multiprocessors, one of the multiprocessors will end up running threads from the one "extra" block. If the profiler happens to be collecting data from this multiprocessor running threads of two blocks in one profiling run, but from one running only threads from a single block in another one, the results will obviously deviate.

To answer the very question you asked, the "or" is inclusive, as is usually in natural languages.

Although I do not remember being mentioned in the documentation, I can image that even if these conditions are both false, profiling inconsistency can also occur when the data itself causes imbalance (amount of arithmetics/data or memory addressing patters conditional on some data).

pszilard 2010-10-27 02:19:46

related questions

SQL Server Profiler - How to filter trace to only display events from one database?

Understanding Firebug Profiler Output

C++ Code Profiler

Profiler for Sql CE

Sharepoint custom user and document library specific properties

Is there any similar tool for Linux that works like Shark on Mac OS X?

Is there any way I can get .net stack traces in Sql Profiler, or a similar tool?

Profiler/Analyzer for Erlang?

Need a good tool to explore a process and threads

Which Java Profiling tool do you use and which tool you think is the best?

What utilities can provide database hits/duration per page?

Tool or code for Cache and Memory Bus performances

Does a Silverlight memory profiler exist?

Oracle: is there a tool to trace queries, like Profiler for sql server?

Great resources for using the CLR Profiler APIs

Flex profiler gives "Socket timeout " error. Why for?

Profiling C# / .NET applications

Java profiler for IBM JVM 1.4.2 (WebSphere 6.0.2)

Find out how much memory is being used by an object in C#?

Best .NET memory and performance profiler?

CLR Profiler - Attaching to existing process

How do I profile a Maven Application in Netbeans?

Please recommend a Java profiler

Any decent C# profilers out there?

What Are Some Good .NET Profilers?