NVIDIA explains OpenCL issue with GPUCapsViewer

Started by Stefan, November 02, 2010, 05:03:51 PM


Quote:
...The problem with the OpenCL test in GPU caps viewer is that the code generation for the vector store is trying to optimize the case where the RHS completely overwrites the LHS. When checking for this, it compares the sizes of the destination and source vectors and goes down the optimized path if the sizes match. However, it uses the actual size of the vector3 (=4), rather than the accessible size (=3) and compares it to the size of the destination vector (=4).

Source: NZONE
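
To make the quoted explanation concrete, here is a rough sketch in plain C of the size check it describes. This is not NVIDIA's compiler code: the names `vec_type`, `VEC3`, `full_overwrite_buggy`, `full_overwrite_fixed` and `store_vec` are all made up for illustration, and the only facts taken from the quote are that a vector3 has an actual size of 4, an accessible size of 3, and is compared against a destination of size 4.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical model of a vector type (illustrative names only). */
typedef struct {
    size_t accessible; /* components the program may read or write */
    size_t storage;    /* components actually allocated */
} vec_type;

/* Per the quote: a vector3 occupies 4 components but exposes only 3. */
static const vec_type VEC3 = {3, 4};

/* The check as the quote describes it: uses the source's actual size. */
static bool full_overwrite_buggy(size_t dst_size, vec_type src)
{
    return src.storage == dst_size;
}

/* Assumed fix: use the source's accessible size instead. */
static bool full_overwrite_fixed(size_t dst_size, vec_type src)
{
    return src.accessible == dst_size;
}

/* Optimized path copies the whole destination width; otherwise only the
 * accessible source components are written and the rest is left alone. */
static void store_vec(float *dst, size_t dst_size,
                      const float *src, vec_type src_type,
                      bool full_overwrite)
{
    size_t n = full_overwrite ? dst_size : src_type.accessible;
    for (size_t i = 0; i < n; ++i) {
        dst[i] = src[i];
    }
}
```

With a 4-component destination `{1, 2, 3, 4}` and a vector3 source stored as `{10, 20, 30, 99}` (99 standing in for whatever sits in the padding slot), the buggy check returns true because 4 == 4, so the optimized path also overwrites the fourth component with 99. The fixed check returns false because 3 != 4, so only the first three components are written and the fourth keeps its value of 4.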