Topics - Stefan

Version 1.1, 2014-03-11
1. Modified the Makefile to use the new -arch flags.
2. Added an option to run the test on a 64bit GPU.
3. Added a check for whether the current architecture is compatible with the specified test option (cpu32|cpu64|gpu32|gpu64). If there is a mismatch, the test now prints a warning.
4. Added a '-h/--help' argument.
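
The compatibility check and help argument from the changelog above could look roughly like the sketch below. This is only an illustration in Python: the function names, the warning text, and the use of the interpreter's pointer width as "current architecture" are all assumptions, not the tool's actual code.

```python
import argparse
import struct

# Test options as listed in the changelog: cpu32|cpu64|gpu32|gpu64
TEST_OPTIONS = ("cpu32", "cpu64", "gpu32", "gpu64")


def host_bits():
    """Pointer width of the running interpreter (assumed stand-in for the build architecture)."""
    return struct.calcsize("P") * 8


def check_compat(option, bits):
    """Return a warning string if the option's bitness differs from `bits`, else None."""
    if option not in TEST_OPTIONS:
        raise ValueError(f"unknown test option: {option}")
    wanted = int(option[-2:])  # trailing "32" or "64"
    if wanted != bits:
        # Hypothetical wording; the changelog only says that a warning is printed.
        return f"warning: test option {option} does not match {bits}-bit architecture"
    return None


def build_parser():
    """argparse adds -h/--help automatically."""
    parser = argparse.ArgumentParser(description="hypothetical test runner")
    parser.add_argument("option", choices=TEST_OPTIONS)
    return parser
```

A caller would parse the command line with `build_parser().parse_args()`, pass the chosen option and `host_bits()` to `check_compat`, and print the result if it is not `None`.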

GPU PerfStudio 2 gives developers control with seamless workflow integration. Identify performance and algorithm issues early in the development cycle, and meet your quality and performance goals. GPU PerfStudio 2 is the second generation GPU Performance Analysis and Debugging Tool from AMD. It offers clear advantages to developers by cutting software development time and improving graphics quality.

GPU PerfStudio 2.14 build 1668
Build 1668 fixes a profiler issue where the selection of more than just the GPUTime would cause a crash when profiling.

GPU PerfStudio 2.14 Improvements
Profiler support for Hawaii GPUs
Pipeline stage specific counters for OpenGL are now supported
Improved stability for multi-threaded apps

EOL driver support for legacy products

After Release 340, any subsequent driver release starting with Release 343 will cease to support the products listed in this section.

The Release 340 drivers will continue to support these products until April 1, 2016, and the NVIDIA support team will continue to address driver issues for these products in driver branches up to and including Release 340. However, future driver enhancements and optimizations in driver releases after Release 340 will not support these products.

3D-Tech News Around The Web / (WebGL) Ô Green by SPECIAL.T
« on: March 08, 2014, 03:15:29 PM »
Now you can dive right into the magical 3D world of Ô Green, the new Limited Edition  green tea by SPECIAL.T.

Usually I ignore adverts, but cucumber-flavored tea is bizarre enough to post here  :P

Depending on your device or browser, you see either a school of fish or mangled geometry.

SiSoftware Sandra 2014 Released:
Updated Device Performance Certification, New Benchmarks, Windows 8.1 support
Updated February 17th 2014: SP1a released with support for nV CUDA 5.x devices, aka "Maxwell".

New CPU Scientific Analysis benchmark
3 algorithms, 2 precisions (FP32/FP64), 3 instruction sets

New GP / HC (GPU/APU/CPU) Scientific Analysis benchmark
3 algorithms, 2 precisions (FP32/FP64), 2 interfaces

Updated GP / HC (GPU/APU/CPU) Financial Analysis benchmark
3 models, 2 precisions (FP32/FP64), 2 interfaces

CUDA 6, Available as Free Download, Makes Parallel Programming Easier, Faster

We’re always striving to make parallel programming better, faster and easier for developers creating next-gen scientific, engineering, enterprise and other applications.

With the latest release of the CUDA parallel programming model, we’ve made improvements in all these areas.

Available now to all developers on the CUDA website, the CUDA 6 Release Candidate is packed with several new features that are sure to please developers.

A few highlights:

    Unified Memory – This major new feature lets CUDA applications access CPU and GPU memory without the need to manually copy data from one to the other. This is a major time saver that simplifies the programming process, and makes it easier for programmers to add GPU acceleration in a wider range of applications.
    Drop-in Libraries – Want to instantly accelerate your application by up to 8X? The new drop-in libraries can automatically accelerate your BLAS and FFTW calculations by simply replacing the existing CPU-only BLAS or FFTW library with the new, GPU-accelerated equivalent.
    Multi-GPU Scaling – Re-designed BLAS and FFT GPU libraries automatically scale performance across up to eight GPUs in a single node. This provides over nine teraflops of double-precision performance per node, supporting larger workloads than ever before (up to 512GB).

And there’s more.

The following are known issues with the CUDA 6.0 Release Candidate that will be resolved in the production release:
‣ The minBlocksPerMultiprocessor parameter for the __launch_bounds__() qualifier only accepts values up to 16 when used in compiling for sm_50, even though values up to 32 are possible on that architecture.
‣ There is a performance issue with the new SIMD video intrinsics __v*2() and __v*4() when used in compiling for the sm_50 architecture.
‣ The sm_50 architecture supports 48 KB of shared memory per block; however, the check for this limit is not functioning properly in the compiler. This can allow programs that use more than 48 KB of shared memory per block to compile successfully, although they will fail to run because the driver component does check the limit properly.
‣ The MT19937 random number generator in the cuRAND library generates non-deterministic results for curandGenerateUniformDouble().
‣ The NPP library function nppiAlphaComp_8u_AC4R() generates incorrect results when used with the NPPI_OP_ALPHA_ATOP_PREMUL option.
‣ The NPP library functions FilterSobelHorizSecondBorder() and FilterSobelVertSecondBorder() may generate incorrect results.
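
Since the compiler check for the 48 KB shared memory limit is not reliable in the Release Candidate, a build script could do the arithmetic itself before compiling. A minimal sketch in Python, assuming you know the element count and element size of each shared array a kernel declares; the helper names are made up for illustration and are not part of the CUDA toolkit.

```python
# Limit quoted in the known-issues list for sm_50: 48 KB of shared memory per block.
SM50_SHARED_BYTES_PER_BLOCK = 48 * 1024


def shared_bytes(arrays):
    """Total bytes for a list of (element_count, element_size_in_bytes) pairs."""
    return sum(count * size for count, size in arrays)


def fits_in_shared(arrays, limit=SM50_SHARED_BYTES_PER_BLOCK):
    """True if the arrays together stay within the per-block limit."""
    return shared_bytes(arrays) <= limit
```

For example, 12288 floats (4 bytes each) use exactly 48 KB and still fit, while adding a single extra float pushes the block over the limit. The sketch ignores alignment padding, so treat it as a lower bound on actual usage.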

3D-Tech News Around The Web / Introducing NVIDIA GameWorks
« on: March 04, 2014, 07:23:42 PM »
NVIDIA GameWorks™ pushes the limits of gaming by providing a more interactive and cinematic game experience and thus enabling next gen gaming for current games. We provide technologies e.g. PhysX and VisualFX, which are easy to integrate into games as well as tutorials and tools to quickly generate game content. In addition we also provide tools to debug, profile and optimize your code.

Read more

Intel® Iris™ and HD Graphics Beta Driver for Windows* 7/8/8.1 for TITANFALL* and Thief*

In an effort to keep Intel HD graphics compatible with the latest games and applications, Intel will occasionally post a “Beta” driver for user feedback on compatibility and performance. This beta driver provides benefits for users playing Titanfall and Thief. We strive for the best possible experience for users of Intel HD graphics and will greatly appreciate your feedback on these beta drivers. Download the 32-bit or 64-bit beta drivers to play Titanfall and Thief.

Link: Iris™ and HD Graphics Driver for Windows* 7/8/8.1

    Added support for the following GPUs:
        GeForce GTX 750 Ti
        GeForce GTX 750
        GeForce GTX 745
        GeForce GTX TITAN Black
    Fixed a regression in the NVIDIA kernel module which caused it to improperly dereference a userspace pointer. This potential security issue was initially reported to the public at:
    The regression did not affect NVIDIA GPU drivers before release 334.
    Fixed a bug that could cause OpenGL programs to hang after calling fork(2).
    Added support for GPUs with VDPAU Feature Set E. See the README for details.
    On GPUs with VDPAU Feature Set E, VDPAU now supports more robust decode error handling at the cost of a minor performance impact.
    This can be disabled by setting the VDPAU_NVIDIA_DISABLE_ERROR_CONCEALMENT environment variable to 1.
    Added support for application profile rule patterns which are logical operations of subpatterns. See the README for details.
    Added support for a "findfile" application profile feature which allows the driver to apply profiles based on matching files in the same directory as the process executable. See the README for details.
    Improved performance of OpenGL applications when used in conjunction with the X driver's composition pipeline. The composition pipeline may be explicitly enabled by using the ForceCompositionPipeline or ForceFullCompositionPipeline MetaMode options, or implicitly enabled when certain features such as some XRandR transformations, rotation, Warp & Blend, PRIME, and NvFBC are used.
    Fixed a bug that could cause nvidia-settings to compute incorrect gamma ramps when adjusting the color correction sliders.
    Updated the nvidia-settings control panel to allow the selection of display devices using RandR and target ID names when making queries targeted towards specific display devices.
    Fixed a bug that prevented some dropdown menus in the nvidia-settings control panel from working correctly on older versions of GTK+ (e.g. 2.10.x).
    Updated the nvidia-settings control panel to provide help text for application profile keys and suggestions for valid key names when configuring application profiles.
    Updated the nvidia-settings control panel to populate the dropdown menu of stereo modes with only those modes which are available.
    Fixed a bug that could cause applications using the OpenGL extension ARB_query_buffer_object to crash under Xinerama.
    Fixed a bug that caused high pixelclock HDMI modes (e.g. as used with 4K resolutions) to be erroneously reported as dual-link in the nvidia-settings control panel.
    Fixed a bug that prevented some DisplayPort 1.2 displays from being properly restored after a VT switch.
    Renamed per GPU proc directories in /proc/driver/nvidia/gpus/ with GPU's bus location represented in "domain:bus:device.function" format.
    Added 64-bit EGL and OpenGL ES libraries to 64-bit driver packages.
    Changed format of "Bus Location" field reported in the /proc/driver/nvidia/gpus/0..N/information files from "domain:bus.device.function" to "domain:bus:device.function" to match the lspci format.
    Fixed a bug in the GLX_EXT_buffer_age extension where incorrect ages would be returned unless triple buffering was enabled.
    Changed the driver's default behavior to stop deleting RandR 1.2 outputs corresponding to unused DisplayPort 1.2 devices. Deleting these outputs can confuse some applications. Added a new option, DeleteUnusedDP12Displays, which can be used to turn this behavior back on. This option can be enabled by running sudo nvidia-xconfig --delete-unused-dp12-displays
    Improved support for the __GL_SYNC_DISPLAY_DEVICE and VDPAU_NVIDIA_SYNC_DISPLAY_DEVICE environment variables in certain configurations. Both environment variables will now recognize all supported display device names. See "Appendix C. Display Device Names" and "Appendix G. VDPAU Support" in the README for more details.
    Improved performance of the X driver when handling large numbers of surface allocations.
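
Two items in the list above are easy to sketch for anyone scripting around the driver: the "Bus Location" format change and the VDPAU error concealment switch. The Python below is only an illustration. The helper names are invented, and the field widths in the regular expression (four hex digits for the domain, two each for bus and device, one digit for the function) are an assumption based on typical lspci output, not something stated in the release notes.

```python
import os
import re

# Old format from the release notes: "domain:bus.device.function".
# Field widths are assumed, e.g. "0000:01.00.0".
_OLD_BUS_LOCATION = re.compile(
    r"^([0-9a-fA-F]{4}):([0-9a-fA-F]{2})\.([0-9a-fA-F]{2})\.([0-7])$"
)


def to_lspci_format(bus_location):
    """Convert "domain:bus.device.function" to "domain:bus:device.function"."""
    match = _OLD_BUS_LOCATION.match(bus_location.strip())
    if match is None:
        raise ValueError(f"not an old-style bus location: {bus_location!r}")
    domain, bus, device, function = match.groups()
    return f"{domain}:{bus}:{device}.{function}"


def vdpau_env_without_concealment(base_env=None):
    """Copy an environment and set VDPAU_NVIDIA_DISABLE_ERROR_CONCEALMENT to 1."""
    env = dict(os.environ if base_env is None else base_env)
    env["VDPAU_NVIDIA_DISABLE_ERROR_CONCEALMENT"] = "1"
    return env
```

The returned dictionary can be passed as the `env` argument of `subprocess.run` when launching a VDPAU application, which leaves the parent process environment untouched.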

NVIDIA Driver Downloads

BlenderArtist Rolf compiled Blender with CUDA 6.0 RC SDK

Blender 2.69.11 r61310 Hash eb4f2b4
Added support for the new Maxwell architecture.
Compiled with VS2008, SCons
Includes CUDA kernels for 2.0, 2.1, 3.0, 3.5, 5.0
I'm unable to build kernels for 1.0, 1.1, 1.2, 1.3, sorry

Reminder: you must enable  GPU Rendering  manually in Blender as seen in screenshot below.

The Barcelona Pavilion benchmark takes 2:27 minutes, slightly faster than Rolf's result thanks to factory overclocking

Broadcom is releasing the full source of the OpenGL ES 1.1 and 2.0 driver stack for the Broadcom VideoCore® IV 3D graphics subsystem used in the BCM21553 3G integrated baseband SoC. VideoCore IV is used in many Broadcom products, including the BCM2835 application processor, which runs the popular Raspberry Pi microcomputer.


3D-Tech News Around The Web / OpenGL Extensions Viewer 4.15 released
« on: February 28, 2014, 05:54:14 PM »
You can download the latest version of OpenGL Extensions Viewer:

Release 4.15 2014-02-28
No changelog yet.
Reminder: deselect the forward context option to display OpenGL 4.4 info

3D-Tech News Around The Web / NVIDIA GeForce Linecards 2014
« on: February 28, 2014, 05:15:08 PM »

New in Release 331.01

Graphics driver updated for Mac OS X Mavericks 10.9.2 (13C64)
Graphics driver updated for Mac OS X Mountain Lion 10.8.5 (12F45). 
Contains performance improvements and bug fixes for a wide range of applications.
Includes NVIDIA Driver Manager preference pane.

libclh.dylib supports:
gt206 gt200 g98 g96 g94 g92 g86 g84 g80 cudaMoneyInTheBananaStand gf119 gf117 gf108 gf106 gf116 gf104 gf114 gf110 gf100 gk208 gk110 gk107 gk106 gk104


Announcing Intel® Graphics Performance Analyzers 2014 R1

Intel® Graphics Performance Analyzers (Intel® GPA) is a powerful, agile developer tool suite for analyzing and optimizing games, media, and other graphics-intensive applications. The product supports applications intended for the Windows* OS platforms or Intel® Atom™ based phones running the Android* OS. The toolset is a free download from the Intel GPA Home Page.

General Discussion / EVGA GeForce GTX 750 Ti FTW hands-on review
« on: February 26, 2014, 06:55:18 PM »
Just upgraded my old rig with EVGA GeForce GTX 750 Ti FTW w/ EVGA ACX Cooling

I won't waste your time by repeating benchmarks already shown on major news sites.

Fan-speed detection in Geeks3D programs needs to be updated.
Maxwell withstands FurMark easily: only 55°C with 8xMSAA

AMD LEO, FluidMark, TessMark, NVIDIA Alien vs. Triangles and NVIDIA Endless City reach at most 46°C

GPU-Z always reads DirectX 11.1, even under Windows XP.

NVFLASH support for GM107 begins with v5.142

If you would like me to test some CUDA-accelerated apps, let me know.
I suggest compiling them against the CUDA 6.0 RC SDK to exploit Maxwell's capabilities.

Feature Highlights of The AMD Catalyst™ 14.2 Beta V1.3 Driver for Windows®

        Thief:  Crossfire Profile update and performance improvements for single GPU configurations
        Mantle:  Multi-GPU configurations (up to 4 GPUs) running Battlefield 4 are now supported
        Frame Pacing for Dual Graphics and non-XDMA configurations above 2560x1600 is now supported with Battlefield 3 and Battlefield 4
        Dual graphics DirectX 9 application issues have been resolved
        Minecraft:  Missing textures have been resolved
        3D applications no longer see intermittent hangs or application crashes
        Resolves corruption issues seen in X-plane

3D-Tech News Around The Web / NVIDIA GeForce driver 335.04 certified
« on: February 25, 2014, 01:46:47 AM »
Download: XP64, XP32; WDDM not yet available

release notes

    New in GeForce 335.04 Certified drivers

        Performance Boost – Increases performance by up to 19% for GeForce 400/500/600/700 series GPUs in several PC games vs. the previous NVIDIA Certified drivers (332.21). Results will vary depending on your GPU and system configuration. Here are some examples of measured gains:

        GeForce GTX 770/780/TITAN/780Ti:
            Up to 19% in F1 2013
            Up to 18% in Sleeping Dogs
            Up to 16% in Hitman Absolution
            Up to 15% in Company of Heroes 2
            Up to 10% in Assassin’s Creed 3
            Up to 7% in BioShock Infinite
            Up to 6% in Sniper Elite V2
            Up to 5% in Total War: Rome 2
        SLI Technology
            Assassin’s Creed Liberation HD – created profile
            Assassin’s Creed: Freedom Cry – created profile
            Deus Ex: Human Revolution Director’s Cut – created profile
            The Crew – created profile
        Gaming Technology
            Supports GeForce ShadowPlay™ technology
            Supports GeForce ShadowPlay™ Twitch Streaming
            Supports NVIDIA GameStream™ technology
        3D Vision
            Shadow Warrior – rating now “Excellent”
            The Stanley Parable – rated “Excellent”
            Walking Dead 2 – rated “Good”
            World Rally Championship 4 – rated “Good”
            LEGO Marvel Super Heroes – rated “Good”
            Far Cry 3 Blood Dragon – rated “Fair”

3D-Tech News Around The Web / (WebGL) Eye texture raytracing demo
« on: February 21, 2014, 07:22:49 PM »
Eye shader main features:

    Texture raytracing to simulate cornea refractions
    Loads of parameters to customize the eye
    Cornea vertex displacement
    Normal maps
    Simple image based 'lighting'

