opencl

Why does the OpenCL vector addition Nvidia SDK example use async writes?

The vector addition example has this code:

    // Asynchronous write of data to GPU device
    ciErr1 = clEnqueueWriteBuffer(cqCommandQueue, cmDevSrcA, CL_FALSE, 0, sizeof(cl_float) * szGlobalWorkSize, srcA, 0, NULL, NULL);
    ciErr1 |= clEnqueueWriteBuffer(cqCommandQueue, cmDevSrcB, CL_FALSE, 0, sizeof(cl_float) * szGlobalWorkSize, srcB, 0, NULL,...

Multiple OpenCL Kernels

Hey, I just wanted to ask if somebody can give me a heads-up on what to pay attention to when using several simple kernels one after another. Can I use the same CommandQueue? Can I just run clCreateProgramWithSource + cl_program several times, each with a different cl_program? What did I forget? Thanks! ...

What is with the Disable Profiling option button on the Compute Profiler?

Why is there a button for enabling and disabling profiling on the Compute Profiler? If I disable profiling, then I can't launch my application for profiling. So why does profiling need to be disabled at all? ...

Is it possible to compare more than two kernel executions at a time in Compute Prof (OpenCL/CUDA)?

Is it possible to compare more than two kernel executions at a time in Compute Prof? ...

Question about Compute Visual Profiler and number of blocks for profiling

On page 51 of the Compute Visual Profiler User Guide it states: "Note that in case the number blocks in a kernel is less than or not a multiple of the number of multiprocessors the counters values across multiple runs will not be consistent." Is that an inclusive or exc...

CL_OUT_OF_RESOURCES for 2 million floats with 1 GB VRAM?

It seems like 2 million floats should be no big deal: only 8 MB of the 1 GB of GPU RAM. At times I am able to allocate that much, and sometimes more, with no trouble. I get CL_OUT_OF_RESOURCES when I do a clEnqueueReadBuffer, which seems odd. Is there a way to find out where the trouble really started? OpenCL shouldn't be failing like thi...

Mix OpenCL with OpenGL

Hello! Is it OK to use both OpenGL and OpenCL in one program? Both operate on the GPU, and I'm worried about how switching between OpenCL and OpenGL is handled in the background (e.g. whether registers are overwritten). I'm using Linux on a computer with some NVIDIA 8k graphics card, so I can use OpenCL implemented on top of CUDA. ...

cl_out_of_resources error in atom_inc/atomic_inc

Hi everyone, I am using atom_inc in one of my kernels, more or less like this:

    kernel exampleKernel( global volatile int * x, global const int maxX, global const int * buf1, global const int * buf2 )
    {
        if(x < maxX) {
            int y = atom_inc(x);
            buf2[y] = buf1[get_global_id(0)];
        }
    }

But it gives me a CL_OUT_OF_RE...

Compiling OpenCL on Ubuntu

My programming experience is about 1 year of C/C++ from high school, but I did my research and wrote a simple program with OpenCL a few months ago. I was able to compile and run it on an Apple computer relatively easily with g++ and the -framework option. Now I'm on my Ubuntu machine and I have no idea how to compile it. Th...

OpenCL for Python

I'm looking for a good OpenCL wrapper/library for Python with good documentation. I tried searching but didn't find anything good enough. Can anybody help? ...