Yeah it's the barrier I mean. It's just important that there's no return statement executed before the kernel reaches this point in code. If it does, it's not guaranteed all data from constant buffer is copied to shared memory.
For example see here: https://github.com/hashcat/hashcat/blob/...#L502-L510
This is the typical use of an early return, when no __local memory is used.
On the contrary, see here: https://github.com/hashcat/hashcat/blob/...#L490-L524
Make sure your's is like this, where the first return comes after the barrier.
For example see here: https://github.com/hashcat/hashcat/blob/...#L502-L510
This is the typical use of an early return, when no __local memory is used.
On the contrary, see here: https://github.com/hashcat/hashcat/blob/...#L490-L524
Make sure your's is like this, where the first return comes after the barrier.