Author Topic: NVIDIA explains OpenCL issue with GPUCapsViewer  (Read 4423 times)

0 Members and 1 Guest are viewing this topic.


  • Global Moderator
  • Hero Member
  • *****
  • Posts: 4222
    • View Profile
NVIDIA explains OpenCL issue with GPUCapsViewer
« on: November 02, 2010, 05:03:51 PM »
...The problem with the OpenCL test in GPU caps viewer is that the code generation for the vector store is trying to optimize the case where the RHS completely overwrites the LHS. When checking for this, it compares the sizes of the destination and source vectors and goes down the optimized path if the sizes match. However, it uses the actual size of the vector3 (=4), rather than the accessible size (=3) and compares it to the size of the destination vector (=4).

Source: NZONE