ansaurus

Question

Why does the OpenCL vector addition Nvidia SDK example use async writes?

Answer 1

+1 A:

I while the writes are asynchroneous from a hosts point of view, they aren't necessarily asynchroneous from thee devices point of view. I'd assume that the commandqueue is created without CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE, so it's an in-order commandqueue.

The opemcl specification says the following about in-order execution:

In-order Execution: Commands are launched in the order they appear in the command- queue and complete in order. In other words, a prior command on the queue completes before the following command begins. This serializes the execution order of commands in a queue.

Therefore the writes should complete before the kernel is executed on the device.

Grizzly 2010-10-21 11:47:39

ansaurus

tags:

views:

answers:

Why does the OpenCL vector addition Nvidia SDK example use async writes?

related questions