I have posted my problem in the CUDA forums, but I'm not sure whether it's appropriate to post a link here for more ideas, in case the two forums have significantly different audiences. The link is here. I apologize for any inconvenience and would appreciate any comments on this question, as I haven't heard back yet on some specifics of a particular CUDA memory access and management problem. Thanks in advance.
There has been a significant shift towards data-parallel programming via systems like OpenCL and CUDA over the last few years, and yet books published even within the last six months never even ...