This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.
SiSoftware Sandra 2014 Released:
Updated Device Performance Certification, New Benchmarks, Windows 8.1 support
Updated February 17th 2014 : SP1a released nV CUDA 5.x devices - aka "Maxwell".
New CPU Scientific Analysis benchmark
3 algorithms, 2 precision (FP32/FP64), 3 instruction sets
New GP / HC (GPU/APU/CPU) Scientific Analysis benchmark
3 algorithms, 2 precision (FP32/FP64), 2 interfaces
Updated GP / HC (GPU/APU/CPU) Financial Analysis benchmark
3 models, 2 precision (FP32/FP64), 2 interfaces
CUDA 6, Available as Free Download, Makes Parallel Programming Easier, Faster
We’re always striving to make parallel programming better, faster and easier for developers creating next-gen scientific, engineering, enterprise and other applications.
With the latest release of the CUDA parallel programming model, we’ve made improvements in all these areas.
Available now to all developers on the CUDA website, the CUDA 6 Release Candidate is packed with several new features that are sure to please developers.
A few highlights:
Unified Memory – This major new feature lets CUDA applications access CPU and GPU memory without the need to manually copy data from one to the other. This is a major time saver that simplifies the programming process, and makes it easier for programmers to add GPU acceleration in a wider range of applications.
Drop-in Libraries – Want to instantly accelerate your application by up to 8X? The new drop-in libraries can automatically accelerate your BLAS and FFTW calculations by simply replacing the existing CPU-only BLAS or FFTW library with the new, GPU-accelerated equivalent.
Multi-GPU Scaling – Re-designed BLAS and FFT GPU libraries automatically scale performance across up to eight GPUs in a single node. This provides over nine teraflops of double-precision performance per node, supporting larger workloads than ever before (up to 512GB).
And there’s more.
The following are known issues with the CUDA 6.0 Release Candidate that will be resolved in the production release:
‣ The minBlocksPerMultiprocessor parameter for the launch_bounds() qualifier only accepts values up to 16 when used in compiling for sm_50, even
though values up to 32 are possible on that architecture.
‣ There is a performance issue with the new SIMD video intrinsics __v*2() and __v*4() when used in compiling for the sm_50 architecture.
‣ The sm_50 architecture supports 48 KB of shared memory per block; however, the check for this limit is not functioning properly in the compiler. This can allow
programs that use more than 48 KB of shared memory per block to compile successfully, although they will fail to run because the driver component does check
the limit properly.
‣ The MT19937 random number generator in the cuRAND library generates non-deterministic results for curandGenerateUniformDouble().
‣ The NPP library function nppiAlphaComp_8u_AC4R() generates incorrect results when used with the NPPI_OP_ALPHA_ATOP_PREMUL option.
‣ The NPP library functions FilterSobelHorizSecondBorder() and FilterSobelVertSecondBorder() may generate incorrect results.
NVIDIA GameWorks™ pushes the limits of gaming by providing a more interactive and cinematic game experience and thus enabling next gen gaming for current games. We provide technologies e.g. PhysX and VisualFX, which are easy to integrate into games as well as tutorials and tools to quickly generate game content. In addition we also provide tools to debug, profile and optimize your code.
Beta Intel® Iris™ and HD Graphics Beta Driver for Windows* 7/8/8.1 for TITANFALL* and Thief*
In an effort to keep Intel HD graphics compatible with
the latest games and applications, Intel will occasionally post a “Beta” driver
for user feedback on compatibility and performance. This beta driver provides
benefit for users playing Titan Fall and Thief Games. We strive for
the best possible experience for users of Intel HD graphics and will greatly
appreciate you feedback for these beta drivers. Download the 32-bit or 64-bit
beta drivers to play Titan Fall and Thief 2 Games.
Link: Iris™ and HD Graphics Driver for Windows* 7/8/8.1
Added support for the following GPUs:
GeForce GTX 750 Ti
GeForce GTX 750
GeForce GTX 745
GeForce GTX TITAN Black
Fixed a regression in the NVIDIA kernel module which caused it to improperly dereference a userspace pointer. This potential security issue was initially reported to the public at:
The regression did not affect NVIDIA GPU drivers before release 334.
Fixed a bug that could cause OpenGL programs to hang after calling fork(2).
Added support for GPUs with VDPAU Feature Set E. See the README for details.
On GPUs with VDPAU Feature Set E, VDPAU now supports more robust decode error handling at the cost of a minor performance impact.
This can be disabled by setting the VDPAU_NVIDIA_DISABLE_ERROR_CONCEALMENT environment variable to 1.
Added support for application profile rule patterns which are logical operations of subpatterns. See the README for details.
Added support for a "findfile" application profile feature which allows the driver to apply profiles based on matching files in the same directory as the process executable. See the README for details.
Improved performance of OpenGL applications when used in conjunction with the X driver's composition pipeline. The composition pipeline may be explicitly enabled by using the ForceCompositionPipeline or ForceFullCompositionPipeline MetaMode options, or implicitly enabled when certain features such as some XRandR transformations, rotation,Warp & Blend, PRIME, and NvFBC are used.
Fixed a bug that could cause nvidia-settings to compute incorrect gamma ramps when adjusting the color correction sliders.
Updated the nvidia-settings control panel to allow the selection of display devices using RandR and target ID names when makingqueries targeted towards specific display devices.
Fixed a bug that prevented some dropdown menus in the nvidia-settings control panel from working correctly on olderversions of GTK+ (e.g. 2.10.x).
Updated the nvidia-settings control panel to provide help text for application profile keys and suggestions for valid key nameswhen configuring application profiles.
Updated the nvidia-settings control panel to populate the dropdown menu of stereo modes with only those modes which are available.
Fixed a bug that could cause applications using the OpenGL extension ARB_query_buffer_object to crash under Xinerama.
Fixed a bug that caused high pixelclock HDMI modes (e.g. as used with 4K resolutions) to be erroneously reported as dual-link inthe nvidia-settings control panel.
Fixed a bug that prevented some DisplayPort 1.2 displays from being properly restored after a VT switch.
Renamed per GPU proc directories in /proc/driver/nvidia/gpus/ with GPU's bus location represented in "domain:bus:device.function" format.
Added 64-bit EGL and OpenGL ES libraries to 64-bit driver packages.
Changed format of "Bus Location" field reported in the
/proc/driver/nvidia/gpus/0..N/information files from "domain:bus.device.function" to "domain:bus:device.function" to matchthe lspci format.
Fixed a bug in the GLX_EXT_buffer_age extension where incorrect ages would be returned unless triple buffering was enabled.
Changed the driver's default behavior to stop deleting RandR 1.2 outputs corresponding to unused DisplayPort 1.2 devices. Deleting these outputs can confuse some applications. Added a new option,DeleteUnusedDP12Displays, which can be used to turn this behavior back on.This option can be enabled by running sudo nvidia-xconfig --delete-unused-dp12-displays
Improved support for the __GL_SYNC_DISPLAY_DEVICE and VDPAU_NVIDIA_SYNC_DISPLAY_DEVICE environment variables in certain configurations. Both environment variables will now recognize all supported display device names. See "Appendix C. Display Device Names" and "Appendix G. VDPAU Support" in the README for more details.
Improved performance of the X driver when handling large numbers of surface allocations.
Blender 2.69.11 r61310 Hash eb4f2b4
Added support for the new Maxwell architecture.
Compile with VS2008, Scons
Has Cuda Kernels 2.0, 2.1, 3.0, 3.5, 5.0
i'm unable to build kernels for 1.0, 1.1, 1.2, 1.3, sry
Broadcom is releasing the full source of the OpenGL ES 1.1 and 2.0 driver stack for the Broadcom VideoCore® IV 3D graphics subsystem used in the BCM21553 3G integrated baseband SoC. VideoCore IV is used in many Broadcom products, including the BCM2835 application processor, which runs the popular Raspberry Pi microcomputer.
Announcing Intel® Graphics Performance Analyzers 2014 R1
Intel® Graphics Performance Analyzers (Intel® GPA) is a powerful, agile developer tool suite for analyzing and optimizing games, media, and other graphics-intensive applications. The product supports applications intended for the Windows* OS platforms or Intel® Atom™ based phones running the Android* OS. The toolset is a free download from the Intel GPA Home Page.
Feature Highlights of The AMD Catalyst™ 14.2 Beta V1.3 Driver for Windows®
Thief: Crossfire Profile update and performance improvements for single GPU configurations
Mantle: Multi-GPU configurations (up to 4 GPUs) running Battlefield 4 are now supported
Frame Pacing for Dual Graphics and non-XDMA configurations above 2560x1600 are now supported with Battlefield 3 and Battlefield 4
Dual graphics DirectX 9 application issues have been resolved
Minecraft: Missing textures have been resolved
3D applications no longer see intermittent hangs or application crashes
Resolves corruption issues seen in X-plane
New in GeForce 335.04 Certified drivers
Performance Boost – Increases performance by up to 19% for GeForce 400/500/600/700 series GPUs in several PC games vs. the previous NVIDIA Certified drivers (332.21). Results will vary depending on your GPU and system configuration. Here are some examples of measured gains:
GeForce GTX 770/780/TITAN/780Ti:
Up to 19% in F1 2013
Up to 18% in Sleeping Dogs
Up to 16% in Hitman Absolution
Up to 15% in Company of Heroes 2
Up to 10% in Assassin’s Creed 3
Up to 7% in BioShock Infinite
Up to 6% in Sniper Elite V2
Up to 5% in Total War: Rome 2
Assassin’s Creed Liberation HD – created profile
Assassin’s Creed: Freedom Cry – created profile
Deus Ex: Human Revolution Director’s Cut – created profile
The Crew – created profile
Supports GeForce ShadowPlay™ technology
Supports GeForce ShadowPlay™ Twitch Streaming
Supports NVIDIA GameStream™ technology
Shadow Warrior – rating now “Excellent”
The Stanley Parable – rated “Excellent”
Walking Dead 2 – rated “Good”
World Rally Championship 4 – rated “Good”
LEGO Marvel Super Heroes – rated “Good”
Far Cry 3 Blood Dragon – rated “Fair”
G-SYNC Pendulum Demo
With the Pendulum demo, see how NVIDIA G-SYNC changes gaming by eliminating tearing and minimizing stutter and lag, giving you the smoothest, fastest gaming experience.
Simple to install and run, the interactive Pendulum demo lets you see the stutter caused by V-SYNC, the tearing caused with V-SYNC OFF, and then the tear-free, super smooth presentation with G-SYNC. Experiment with different GPU frame rates, camera angles and scene rotation settings to get a real feel for G-SYNC and start to see gaming differently.
With VSYNC selected, change the GPU frame rate. As the frames per second drop below the maximum refresh rate of the monitor, notice the stutter in the pendulum’s swing. With “noVSYNC” selected, try different GPU frame rates and immediately notice tearing on the screen. And with G-SYNC selected, you’ll see a smooth, tear-free image.
New in Release 334.95:
This driver is from the most recent driver release branch, and contains the latest new features and functionality. It has been tested for workstation environments, and is recommended only for those users who need specific features mentioned in the release highlights or release notes. Also known as the Quadro New Feature driver.
For the most stable and fully supported enterprise driver please see the main Quadro download section.
HD Audio Power Saving
Added the ability for the audio codec to go to a low-power state when not in use.
Quadro Sync Swapgroup Support
Added support for swapgroups in full-screen OpenGL and Direct3D applications using Quadro Sync-capable Quadro cards. Windows 8
Limited support to two simultaneous NVENC streamed sessions per GPU on low-end Quadro cards for unlicensed applications.
NVIDIA’s first-generation “Maxwell” architecture implements a number of architectural enhancements
designed to extract even more performance per watt consumed. The first Maxwell-based GPU is
codenamed “GM107” and designed for use in power-limited environments like notebooks and small
form factor (SFF) PCs. These SFF systems are often used for gaming and home entertainment, with the
most recent example being Valve Software’s recently announced Steam Machines initiative. The first
graphics card that is based on the GM107 GPU is the GeForce GTX 750 Ti. Because of GM107’s
remarkable architectural efficiency, at 1080p resolution a GeForce GTX 750 Ti will frequently match the
performance of our flagship GPU from four years ago, the GeForce GTX 480, but with only a 60W TDP,
consumes a fourth of the power.
Dramatically simplify parallel programming with CUDA 6.0 .
Simplifies programming by enabling applications to access CPU and GPU memory without the need to manually copy data. Read more about unified memory.
Automatically accelerate applications’ BLAS and FFTW calculations by up to 8X by simply replacing the existing CPU libraries with the GPU-accelerated equivalents.
cublasXT - a new BLAS GPU library that automatically scales performance across up to eight GPUs in a single node, delivering over nine teraflops of double precision performance per node, and supporting larger workloads than ever before (up to 512GB). The re-designed FFT GPU library scales up to 2 GPUs in a single node, allowing larger transform sizes and higher throughput.