cuda kernel parameter | ansaurus

tags:

cuda

+1 Q:

cuda kernel parameter

Say I have a CUDA kernel:

__global__ void foo(int a, int b)
{
    ... ...
}

Where are a and b stored? Do they take up register space for each thread?

+2 A:

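No, a and b do not take up register space in each thread merely by being parameters. On devices of compute capability 2.x and higher they are stored once per kernel launch in constant memory, a read-only space that all threads can read. (On compute capability 1.x devices, kernel parameters are passed in shared memory instead.) The compiler may still load a parameter into a register at the point where a thread uses it.

Note that constant memory is served by a constant cache that is shared by all threads: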

A read-only constant cache that is shared by all scalar processor cores and speeds up reads from the constant memory space, which is a read-only region of device memory [PTX ISA Version 2.1 Chapter 3].
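
To make this concrete, here is a minimal sketch of a kernel where every thread reads the same two scalar parameters. The output buffer `out`, the length `n`, and the host helper `run_foo` are assumptions added for illustration and are not part of the original question. Only the launch passes `a` and `b`; no per-thread copy is made by the host code.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Every thread reads the same kernel parameters a and b.
// out and n are hypothetical additions so the result can be inspected.
__global__ void foo(int a, int b, int *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        out[i] = a + b + i;  // a and b are identical for all threads
    }
}

// Hypothetical host helper: launches foo and copies the results back.
// Returns an empty vector if any CUDA call fails.
std::vector<int> run_foo(int a, int b, int n)
{
    std::vector<int> host(n, 0);
    int *dev = nullptr;
    if (cudaMalloc(&dev, n * sizeof(int)) != cudaSuccess) {
        return {};
    }

    int threads = 128;
    int blocks = (n + threads - 1) / threads;  // round up to cover all n elements
    foo<<<blocks, threads>>>(a, b, dev, n);    // a and b are passed once per launch

    // cudaMemcpy waits for the kernel to finish before copying.
    cudaError_t err = cudaMemcpy(host.data(), dev, n * sizeof(int),
                                 cudaMemcpyDeviceToHost);
    cudaFree(dev);
    if (err != cudaSuccess) {
        return {};
    }
    return host;
}

int main()
{
    std::vector<int> r = run_foo(3, 4, 256);
    if (r.empty()) {
        printf("CUDA error\n");
        return 1;
    }
    printf("out[0] = %d, out[255] = %d\n", r[0], r[255]);
    return 0;
}
```

On a machine with a working CUDA device this should print `out[0] = 7, out[255] = 262`, since each thread adds its own index to the shared values 3 and 4. Compiling with `nvcc -ptx` and looking for `ld.param` instructions is one way to see where the compiler actually reads the parameters from.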

Stringer Bell 2010-10-08 23:42:00
