NVIDIA CUDA SDK 10 released

Started by Stefan, September 20, 2018, 04:02:15 AM

Previous topic - Next topic

0 Members and 1 Guest are viewing this topic.


  • CUDA 10.0 adds support for the Turing architecture (compute_75 and sm_75).
  • CUDA 10.0 adds support for new programming constructs called CUDA Graphs, a new asynchronous task-graph programming model that enables more efficient launch and execution. See the API documentation for more information.
  • Warp matrix functions now support additional matrix shapes 32x8x16 and 8x32x16. Warp matrix functions also include the ability (experimental in CUDA 10.0) to perform sub-byte operations (4-bit unsigned, 4-bit signed and 1-bit) using the Tensor Cores.
  • Added support for CUDA-Vulkan and CUDA-DX12 interoperability APIs.
  • Added support for a new instruction nanosleep that suspends a thread for a specified duration.
  • Added 6.3 version of the Parallel Thread Execution instruction set architecture (ISA). For more details on new (sm_75 target, wmma, nanosleep, FP16 atomics) and deprecated instructions, see this section in the PTX documentation.
  • Starting with CUDA 10.0, the CUDA runtime is compatible with specific older NVIDIA drivers. A new package called "cuda-compat-<version>" is included in the toolkit installer packages. For more information on compatibility, see the section in the Best Practices Guide.
Download now