10-30-2015, 03:24 AM
The claimed 967 MH/s (on a 980 Ti at +250 MHz) was measured using CUDA. Did you try that code with your toolchain? BTW, the version that doesn't need a kernel per salt is not that much slower: 826 MH/s under the same conditions.
With that speed you'd get 24 GH/s for LM, that would be some record...
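For anyone wondering where a figure like 24 GH/s could come from, here is a rough back-of-the-envelope sketch. It assumes the 967 MH/s number is for descrypt, which runs 25 DES iterations per hash, while LM needs a single DES encryption per half, and it ignores key setup and any other overhead. The function name and the constant are just for illustration.

```python
# Assumption: the benchmarked hash is descrypt (25 DES iterations per candidate),
# and LM costs roughly one DES encryption per candidate. Overhead is ignored.
DESCRYPT_DES_ITERATIONS = 25

def estimate_lm_ghs(descrypt_mhs: float) -> float:
    """Scale a descrypt rate in MH/s to a rough LM estimate in GH/s."""
    return descrypt_mhs * DESCRYPT_DES_ITERATIONS / 1000.0

if __name__ == "__main__":
    # 967 MH/s is the claimed per-salt-kernel figure from the post.
    print(f"{estimate_lm_ghs(967):.1f} GH/s")  # prints "24.2 GH/s"
```

So the estimate is just the descrypt rate multiplied by the iteration count, which lands close to the 24 GH/s quoted above.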