ansaurus

Question

How do I make different threads execute different parts of the code in CUDA?

Answer 1

A:

You need to use the thread ID to control what is executed, e.g.

if (thread_ID == 0)
{
  // do single thread stuff
}

// do common stuff on all threads

if (thread_ID == 0)
{
  // do single thread stuff
}
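To make the pseudocode above concrete, here is a minimal sketch of where thread_ID usually comes from in a 1D launch. The kernel name, the output array, and the values 100 and thread_ID are made up for illustration; they are not from the question.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: thread 0 does extra work, then every thread does the common part.
__global__ void branchOnThreadId(int *out)
{
    // Global index of this thread across the whole 1D grid.
    int thread_ID = blockIdx.x * blockDim.x + threadIdx.x;

    int value = 0;
    if (thread_ID == 0)
    {
        value = 100;          // single-thread part (illustrative value)
    }

    value += thread_ID;       // common part, executed by all threads
    out[thread_ID] = value;
}

int main()
{
    const int blocks = 2, threadsPerBlock = 4, n = blocks * threadsPerBlock;
    int host[n];
    int *dev = nullptr;

    cudaMalloc(&dev, n * sizeof(int));
    branchOnThreadId<<<blocks, threadsPerBlock>>>(dev);
    // cudaMemcpy waits for the kernel to finish before copying.
    cudaMemcpy(host, dev, n * sizeof(int), cudaMemcpyDeviceToHost);
    cudaFree(dev);

    for (int i = 0; i < n; ++i) printf("%d ", host[i]);
    printf("\n");             // on a working device: 100 1 2 3 4 5 6 7
    return 0;
}
```

If you want "thread 0 of each block" instead of "thread 0 of the whole grid", test threadIdx.x rather than the global index.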

Paul R 2010-05-06 09:44:48

Answer 2

+2 A:

You can only synchronize threads within a single block. Synchronizing across multiple blocks is possible, but only under very specific circumstances. If you need global synchronization between all threads, the way to do that is to launch a new kernel.
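As a rough sketch of the "launch a new kernel" approach: two kernels launched one after the other on the same (default) stream run in order, so the second one sees everything the first one wrote, across all blocks. The kernel names phaseOne and phaseTwo and the neighbour-sum computation are assumptions made up for this example.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Phase 1: every thread squares its own element.
__global__ void phaseOne(float *a, int n)
{
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if (idx < n) a[idx] = a[idx] * a[idx];
}

// Phase 2: every thread reads a neighbour that may belong to another block,
// so it needs all of phase 1 to be finished first.
__global__ void phaseTwo(const float *a, float *b, int n)
{
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if (idx < n) b[idx] = a[idx] + a[(idx + 1) % n];
}

int main()
{
    const int n = 8, threads = 4, blocks = (n + threads - 1) / threads;
    float hostA[n], hostB[n];
    for (int i = 0; i < n; ++i) hostA[i] = (float)i;

    float *a = nullptr, *b = nullptr;
    cudaMalloc(&a, n * sizeof(float));
    cudaMalloc(&b, n * sizeof(float));
    cudaMemcpy(a, hostA, n * sizeof(float), cudaMemcpyHostToDevice);

    phaseOne<<<blocks, threads>>>(a, n);
    // The kernel boundary acts as the grid-wide barrier.
    phaseTwo<<<blocks, threads>>>(a, b, n);

    cudaMemcpy(hostB, b, n * sizeof(float), cudaMemcpyDeviceToHost);
    for (int i = 0; i < n; ++i) printf("%g ", hostB[i]);
    printf("\n");             // on a working device: 1 5 13 25 41 61 85 49

    cudaFree(a);
    cudaFree(b);
    return 0;
}
```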

Within a block, you can synchronize threads using __syncthreads(). For example:

__global__ void F(float *A, int N)
{
    int idx = threadIdx.x + blockIdx.x * blockDim.x;

    if (threadIdx.x == 0) // thread 0 of each block does this:
    {
         // Whatever
    }
    __syncthreads();

    if (idx < N) // prevent buffer overruns
    {
        A[idx] = A[idx] * A[idx];  // "real work"
    }

    __syncthreads();

    if (threadIdx.x == 0) // thread 0 of each block does this:
    {
         // Whatever
    }
}

mch 2010-05-06 15:00:53

This is a simple solution, but be aware of branching: when threads in the same warp take different paths, the warp executes each path in turn (it is serialised). Where possible, try to make all threads in a warp follow the same execution path.
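To illustrate the point about branching, here is a small sketch contrasting a branch that splits every warp with one that keeps each warp on a single path. The kernel names are hypothetical, and for a branch this small the compiler may well use predication instead of a real branch, so treat it as an illustration of the pattern rather than a benchmark.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Even and odd threads take different paths, so every warp contains both paths.
__global__ void divergentBranch(int *out)
{
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if (threadIdx.x % 2 == 0) out[idx] = 1;
    else                      out[idx] = 2;
}

// The condition depends only on the warp number, so all threads of a warp agree.
__global__ void warpUniformBranch(int *out)
{
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    int warpInBlock = threadIdx.x / warpSize;   // warpSize is a CUDA built-in
    if (warpInBlock % 2 == 0) out[idx] = 1;
    else                      out[idx] = 2;
}

// Count how many elements were written by the "1" path.
static int countOnes(const int *v, int n)
{
    int c = 0;
    for (int i = 0; i < n; ++i) c += (v[i] == 1);
    return c;
}

int main()
{
    const int n = 64;         // one block of 64 threads (two warps of 32)
    int host[n];
    int *dev = nullptr;
    cudaMalloc(&dev, n * sizeof(int));

    divergentBranch<<<1, n>>>(dev);
    cudaMemcpy(host, dev, n * sizeof(int), cudaMemcpyDeviceToHost);
    printf("divergent:    %d threads took path 1\n", countOnes(host, n));

    warpUniformBranch<<<1, n>>>(dev);
    cudaMemcpy(host, dev, n * sizeof(int), cudaMemcpyDeviceToHost);
    printf("warp-uniform: %d threads took path 1\n", countOnes(host, n));

    cudaFree(dev);
    return 0;
}
```

Both kernels send the same number of threads down each path; the difference is only in how those threads are grouped into warps.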

Laurence Dawson 2010-05-14 23:16:00
