ansaurus

Question

Problem with OpenCL

Answer 1

+1 A:

Your OpenCL code is very simple and the results very weird. I think that the problem can come from the setup part. Buffer creation, call to EnqueueNDRange, etc. Could you post the setup part? I guess the problem can be there.

EDIT: After seeing your code and testing it I realized that at first I didn't fully understand your problem. As you comented the mask updating part my mind just got rid of that line. I should have been able to answer correctly the first time.

The problem is that you CAN'T synchronize different work groups. The CLK_GLOBAL_MEM_FENCE affects the memory ordering access of the work group (making sure that writes to global memory are done before reading back). The real solution to your problem is execute your code in two calls, first you update the mask in parallel and then do the rest of the stuff in another kernel that will be executed when the first one finishes. You need the whole operation to finish before continuing, so you have to use barriers at command-queue level. There is no other way.

Verbatim from the specification:

There are two domains of synchronization in OpenCL:

Work-items in a single work-group

Commands enqueued to command-queue(s) in a single context

Josep 2010-06-23 12:35:12

Okay, as I wrote, I managed to get it working that way (split the kernel in two different ones and execute them one after the other). I just though it's also possible to do it in one kernel, since I wasn't aware of the mentioned 'limitation'. Thank you very much for clearing this up!

VHristov 2010-07-06 10:21:00

Answer 2

A:

http://github.com/ <--- Throw it in a repo so we can see the whole program :)

Chad Brewbaker 2010-07-01 07:05:06

ansaurus

tags:

views:

answers:

Problem with OpenCL

related questions