Geeks3D Forums

Tech Forums => 3D-Tech News Around The Web => Topic started by: Stefan on March 04, 2011, 05:10:50 PM

Title: NVIDIA CUDA Toolkit 4.0 RC available to Registered Developers
Post by: Stefan on March 04, 2011, 05:10:50 PM
Quote
CUDA Toolkit 4.0 RC (March 2011) (http://developer.nvidia.com/object/cuda_4_0_RC_downloads.html)

Release Highlights

Easier Application Porting

    * Share GPUs across multiple threads
    * Use all GPUs in the system concurrently from a single host thread
    * No-copy pinning of system memory, a faster alternative to cudaMallocHost()
    * C++ new/delete and support for virtual functions
    * Support for inline PTX assembly
    * Thrust library of templated performance primitives such as sort, reduce, etc.
    * NVIDIA Performance Primitives (NPP) library for image/video processing
    * Layered Textures for working with same size/format textures at larger sizes and higher performance

Faster Multi-GPU Programming

    * Unified Virtual Addressing
    * GPUDirect v2.0 support for Peer-to-Peer Communication

New & Improved Developer Tools

    * Automated Performance Analysis in Visual Profiler
    * C++ debugging in cuda-gdb
    * GPU binary disassembler for Fermi architecture (cuobjdump)

Public download: CUDA Toolkit 4.0 Overview (http://developer.download.nvidia.com/compute/cuda/4_0/CUDA_Toolkit_4.0_Overview.pdf)
Title: Re: NVIDIA CUDA Toolkit 4.0 RC available to Registered Developers
Post by: Stefan on March 05, 2011, 03:23:44 PM
CUDA Open64 (http://en.wikipedia.org/wiki/Open64) 4.0 is available to the public:

http://download.nvidia.com/CUDAOpen64/nvopencc_4.0_src_8883668.tar.gz
http://download.nvidia.com/CUDAOpen64/cuda-gdb_4.0_src_8883668_linux.tar.bz2
http://download.nvidia.com/CUDAOpen64/cuda-gdb_4.0_src_8883668_darwin.tar.bz2