Shadertoy Multipass Demopack for GeeXLab
0 Members and 1 Guest are viewing this topic.
New FeaturesCUDA Tools‣CUDA Compilers. The CUDA compiler now supports Xcode 8.2.1.‣NVRTC. NVRTC is no longer considered a preview feature.CUDA Libraries‣cuBLAS. The cuBLAS library added a new function cublasGemmEx(), which isan extension of cublas<t/>gemm(). It allows the user to specify the algorithm,as well as the precision of the computation and of the input and output matrices.The function can be used to perform matrix-matrix multiplication at lowerprecision.Resolved IssuesGeneral CUDA‣CUDA Installer. On some SLES or openSUSE system configurations, theNVIDIA GL library package may need to be locked before the steps for a GL-lessinstallation are performed. The NVIDIA GL library package can be locked withthis command:sudo zypper addlock nvidia-glG04‣Unified memory. On GP10x systems, applications that usecudaMallocManaged() and attempt to use cuda-gdb will incur randomspurious MMU faults that will take down the application.‣Unified memory. Functions cudaMallocHost() and cudaHostRegister()don't work correctly on multi-GPU systems with the IOMMU enabled onLinux. The only workaround is to disable unified memory support with theCUDA_DISABLE_UNIFIED_MEMORY=1 environment variable.‣Unified memory. Fixed an issue where cuda-gdb or cuda-memcheck wouldcrash when used on an application that calls cudaMemPrefetchAsync().‣Unified memory. Fixed a potential issue that can cause an application to hangwhen using cudaMemPrefetchAsync().