Posts Tagged ‘cudaMalloc’


Programming a Matrix Multiplication for GPUs with CUDA

Posted by JeGX - 2008/10/14 at 10:45

Categories: NVIDIA CUDA, Programming

CUDA makes it possible to program the GPU in C. This article walks through the steps needed to write a matrix multiplication routine in CUDA:

  • allocate memory on the GPU with cudaMalloc or cudaMallocPitch (for aligned memory allocation)
  • move data to the GPU with cudaMemcpy2D
  • choose the kernel's execution domain (grid and block dimensions), write the kernel and launch it
  • move results back from the GPU to the host with cudaMemcpy2D
  • free resources with cudaFree
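
The steps listed above can be sketched end to end as follows. This is a minimal illustration and not the article's own code: the matrix sizes, the 16×16 block size, and the names matMulKernel, runMatMulDemo and CUDA_CHECK are assumptions chosen for the example. It uses a naive kernel (one thread per output element, no shared-memory tiling) and pitched allocations, so each row is addressed through its pitch in bytes.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical helper: report a CUDA error and bail out of the calling function.
// (A sketch only: device buffers are not released on the error path.)
#define CUDA_CHECK(call)                                              \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            fprintf(stderr, "CUDA error: %s (%s:%d)\n",               \
                    cudaGetErrorString(err_), __FILE__, __LINE__);    \
            return 1;                                                 \
        }                                                             \
    } while (0)

// Naive kernel: C = A * B with A (M x K), B (K x N), C (M x N), row-major.
// Pitches are in bytes, as returned by cudaMallocPitch.
__global__ void matMulKernel(const float* A, size_t pitchA,
                             const float* B, size_t pitchB,
                             float* C, size_t pitchC,
                             int M, int N, int K)
{
    int row = blockIdx.y * blockDim.y + threadIdx.y;
    int col = blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= M || col >= N) return;  // threads outside the matrix do nothing

    const float* rowA = (const float*)((const char*)A + row * pitchA);
    float acc = 0.0f;
    for (int k = 0; k < K; ++k) {
        const float* rowB = (const float*)((const char*)B + k * pitchB);
        acc += rowA[k] * rowB[col];
    }
    float* rowC = (float*)((char*)C + row * pitchC);
    rowC[col] = acc;
}

// Runs the whole sequence; returns 0 if the GPU result matches a CPU reference.
int runMatMulDemo()
{
    const int M = 64, K = 48, N = 32;  // arbitrary example sizes
    std::vector<float> hA(M * K), hB(K * N), hC(M * N, 0.0f);
    for (int i = 0; i < M * K; ++i) hA[i] = (float)(i % 7);
    for (int i = 0; i < K * N; ++i) hB[i] = (float)(i % 5);

    // 1. Allocate pitched (row-aligned) memory on the GPU.
    float *dA = nullptr, *dB = nullptr, *dC = nullptr;
    size_t pitchA = 0, pitchB = 0, pitchC = 0;
    CUDA_CHECK(cudaMallocPitch((void**)&dA, &pitchA, K * sizeof(float), M));
    CUDA_CHECK(cudaMallocPitch((void**)&dB, &pitchB, N * sizeof(float), K));
    CUDA_CHECK(cudaMallocPitch((void**)&dC, &pitchC, N * sizeof(float), M));

    // 2. Move the input matrices to the GPU.
    CUDA_CHECK(cudaMemcpy2D(dA, pitchA, hA.data(), K * sizeof(float),
                            K * sizeof(float), M, cudaMemcpyHostToDevice));
    CUDA_CHECK(cudaMemcpy2D(dB, pitchB, hB.data(), N * sizeof(float),
                            N * sizeof(float), K, cudaMemcpyHostToDevice));

    // 3. Choose the execution domain and launch the kernel.
    dim3 block(16, 16);
    dim3 grid((N + block.x - 1) / block.x, (M + block.y - 1) / block.y);
    matMulKernel<<<grid, block>>>(dA, pitchA, dB, pitchB, dC, pitchC, M, N, K);
    CUDA_CHECK(cudaGetLastError());
    CUDA_CHECK(cudaDeviceSynchronize());

    // 4. Move the result back to the host.
    CUDA_CHECK(cudaMemcpy2D(hC.data(), N * sizeof(float), dC, pitchC,
                            N * sizeof(float), M, cudaMemcpyDeviceToHost));

    // 5. Free GPU resources.
    CUDA_CHECK(cudaFree(dA));
    CUDA_CHECK(cudaFree(dB));
    CUDA_CHECK(cudaFree(dC));

    // Compare against a CPU reference (inputs are small integers, so sums are exact).
    for (int r = 0; r < M; ++r)
        for (int c = 0; c < N; ++c) {
            float ref = 0.0f;
            for (int k = 0; k < K; ++k) ref += hA[r * K + k] * hB[k * N + c];
            if (hC[r * N + c] != ref) return 1;
        }
    return 0;
}

int main()
{
    int status = runMatMulDemo();
    printf(status == 0 ? "matmul OK\n" : "matmul FAILED\n");
    return status;
}
```

Assuming the file is saved as matmul.cu, it should build with `nvcc matmul.cu -o matmul`. The pitch returned by cudaMallocPitch may be larger than the logical row width, which is why the kernel steps between rows through byte pointers instead of indexing with `row * K + k`.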