by Joe_H » Wed Jan 30, 2019 6:34 am
Probably not. The reason is that a GPU is not a general purpose processor like a CPU, for the calculations being done it is essentially being used as an attached vector processor for the data contained in the WU. Essentially a block of data is passed to the GPU with a specific set of operations to be performed on it. Each block then has to be coordinated with the adjacent blocks. After a pass through all of the data the results become the inputs for the next pass. I don;t really see an opportunity to buffer any of that data.
At no time is all of the WU data sent to the GPU, for larger WU's it might not fit on the largest cards anyways. On top of that, VRAM is holding the output from the shaders in a video format, I don't think the shaders can read from it.

iMac 2.8 i7 12 GB smp8, Mac Pro 2.8 quad 12 GB smp6
MacBook Pro 2.9 i7 8 GB smp3