08-31-2017, 10:50 AM
(08-31-2017, 10:09 AM)atom Wrote: That sounds like you did not make sure to fill the shared memory before exiting the kernel early (returning too soon).
Thanks for the reply.
How would I go about doing that?
The code for populating s_teN and s_tdN variables already has a "barrier" after the for-loop:
Code:
barrier (CLK_LOCAL_MEM_FENCE);
Is that sufficient? Most of the other kernels I've seen do pretty much the same thing: fill the variables and perform AES immediately after.
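If I understand the hint correctly, the point is the order of things: the table copy and the barrier have to come before any early return, so that every work-item in the group does its share of the copy and reaches the barrier. Here is a rough sketch of that ordering. The kernel name, te0, out and gid_max are made up for illustration and are not taken from my actual kernel:
Code:
```c
// Sketch only: aes_fill_then_exit, te0, out and gid_max are placeholder names.
__kernel void aes_fill_then_exit (__global const uint *te0, __global uint *out, const uint gid_max)
{
  const size_t gid = get_global_id (0);
  const size_t lid = get_local_id (0);
  const size_t lsz = get_local_size (0);

  __local uint s_te0[256];

  // Every work-item in the group copies its share of the table,
  // including work-items whose gid is at or past gid_max.
  for (size_t i = lid; i < 256; i += lsz)
  {
    s_te0[i] = te0[i];
  }

  // All work-items of the group must reach this barrier.
  barrier (CLK_LOCAL_MEM_FENCE);

  // Early exit only after the fill and the barrier.
  if (gid >= gid_max) return;

  // The AES rounds reading s_te0 would go here; this line is just a stand-in.
  out[gid] = s_te0[gid & 0xff];
}
```
If the "gid >= gid_max" check sat above the for-loop instead, the work-items that return early would skip their part of the copy and never reach the barrier, so parts of s_te0 could be left unfilled.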
Sorry, I'm a n00b at GPU/OpenCL programming.