NVIDIA has published (GTC 2012) a whitepaper that details some computing aspects of the upcoming high-end Kepler GPU, the GK110. This GPU is clearly focused on computing with its 7.1 billion transistors, 15 SMX, 2880 CUDA cores (192 CUDA cores per SMX) and 240 texture units (16 TU per SMX).
The GK110 GPU supports the new CUDA Compute Capability 3.5:
Among the new features available with the GK110, GPUDirect sounds really interesting:
When working with a large amount of data, increasing the data throughput and reducing latency is vital to increasing compute performance. Kepler GK110 supports the RDMA feature in NVIDIA GPUDirect, which is designed to improve performance by allowing direct access to GPU memory by third‐party devices such as IB adapters, NICs, and SSDs. When using CUDA 5.0, GPUDirect provides the following important features:
- Direct memory access (DMA) between NIC and GPU without the need for CPU side data buffering.
- Significantly improved MPISend/MPIRecv efficiency between GPU and other nodes in a network.
- Eliminates CPU bandwidth and latency bottlenecks
- Works with variety of 3rd party network, capture, and storage devices
You can download the GK110 whitepaper HERE.
15 thoughts on “NVIDIA Kepler GK110 Architecture Whitepaper: 2880 CUDA Cores and Compute Capability 3.5”
Nice. Yet another possible HPC bottleneck eliminated. More and more gpu clusters in top500, more and more of them tesla based 😉
I think the ability to launch a GPU kernel from a GPU kernel is the most interesting feature.
You should read this first guys:
So, Kepler’s power is rubbish compared with HD 7970 !
Ra, you obviously can’t read, they’re talking about GK104 in that article.
GK110 is the real compute GPU.
Kepler: Yes, but can it run Crysis ??? 😉
No, actually it is based on very same idea of SMX (so 3x number of ALUs but half speed and more software schedule & dispatch than hw). It should beat Tahiti by raw power but it’s not as efficient as Fermi or Tahiti (so needs a lot more transistors (SP) to achieve same performance). Still – AMD hasn’t show us SINGLE compute specific card while NV has already paved the way with their Teslas
This is the craziest thing I’ve ever see in my life. 7.1 billion transistors!
samsi you’re so stupid… but hey this is fanboism.
[quote]You should read this first guys:
So, Kepler’s power is rubbish compared with HD 7970 ![/quote]I ignore it!!! >:( Because OpenCL 1.1 driver (since 280.xx) for GeForce has performance PROBLEM about 6 months without recover!! CALM until OpenCL 1.1 driver fixed!!
@nuninho1980: yeah, maybe in 2018…
@Ra: too bad HD7970 sucks on real GPGPU
programs like Amber 11.
P.S. sorry for double post. :/
LuxRender is GPGPU too, tahiti doesn’t sucks there. Doesn’t sucks in AES Encrypt/Decrypt tools either. Doesn’t in winzip16.5, doesn’t in SL3 brute force. Doesn’t in milkyway, poem etc. etc.
Unmanufactuable halo product (nVidia already has problems with GK104 which is smaller than AMD 7xxx series) which will only ship to a few halo customers and then will be forgotten don’t expect to be able to buy it anywhere
New WHQL drivers out from Nvidia.
Comments are closed.