在CUDA,当我们谈论 平行线 执行相同的代码 是否有命令 执行它们?
For-example:
If, I have 4 threads,for a 1D array of 4 elements.All four threads perfom some operation on some index of the array. Will thread 4 always execute after thread 3 or there is no specific order in the execution?
谢谢!