Show Posts

This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.

Messages - Stefan

Pages: 1 2 3 [4] 5 6 ... 139
3D-Tech News Around The Web / AMD releases Chrome browser with WebCL
« on: March 15, 2014, 08:38:49 AM »
Chrome web Browser with WebCL™ now Available for Download
WebCL™ 1.0, soon to be published by the Khronos group, is a JavaScript API providing access to OpenCL™ acceleration, a sibling to the now quite popular WebGL™ API. AMD has been working on implementing WebCL™ inside a Chrome browser, enabling web programmer access to compute acceleration, including interop between WebCL™ and WebGL™.

It seems to be incompatible with NOKIA's WebCL?

3D-Tech News Around The Web / ASUS GPUTweak 2.5.7
« on: March 15, 2014, 08:24:49 AM »
ASUS GPUTweak 2.5.7 for WinXP32/64, Win7 32/64, Win8 32/64, Win8.1 32/64

No changelog available, but it contains ROMs for unreleased GPUs.
"LN2" = liquid nitrogen or marketing goof?


edit: here is a PR that explains it
Meet Your Matrix! (Pick: R9 290X or GTX 780 Ti)
"LN2 Mode / STD Mode Jumper Switch: Unleash the maximum GPU power."

Quote from: Manuel Guzman
I verified with our software team and indeed CSAA has been removed from Maxwell GPUs. Any mixed GPU systems that have both a Maxwell and an older NVIDIA GPU (i.e. Kepler) will also not display CSAA modes.

Source: NVIDIA forum

3D-Tech News Around The Web / Blender - Fibonacci fractal
« on: March 14, 2014, 07:48:50 PM »
It’s fractal time again! Reynante M. Martinez shared this Fibonacci fractal and also offers the .blend file for download.

Reynante writes:

    I was just able to finish this piece that’s been lurking in my space for quite awhile now. During the past weekend (can’t remember which one), I had the pleasure of creating a procedurally- inspired (sort of) Blender piece, with great influence from nature and mathematics.

    Attached here is the result of that experiment.

    The .blend file is also available, giving the community a chance to tinker with it as well and would love to see what they can come up with using the technique illustrated in the file.

    Fibonacci .blend and Hi-res image download

FYI: This formula is also "used" by the Romanesco broccoli (sic!)

OTOY’s acclaimed OctaneRender™ software now free for students

Technology used by the world’s best professional artists, animators,
and designers now available at no cost to emerging talents

LOS ANGELES, Calif. – March 5, 2014 – Cloud graphics pioneer, OTOY Inc., today announced that its renowned 3D rendering software, OctaneRender™, along with OctaneRender™ plugins for Blender™ and Autodesk’s 3ds MAX® and shortly, Maya®, are now free for students. The move puts the same cutting-edge software that’s used by hundreds of thousands of professional artists, animators, and designers, in the hands of today’s up-and-coming talents at no cost if purchased using a .edu email address.

Don't know if .edu email addresses are used outside the USA

3D-Tech News Around The Web / Intel HD Graphics Driver
« on: March 14, 2014, 06:03:05 PM »
Date: 03/12/2014

This beta graphics driver provides support for two games: "TITANFALL" and "Thief". Do not update to this driver if you do not plan to play either of of these games.

This Beta graphics driver is specifically intended to test compatibility with "TITANFALL" and "Thief" games.

Engineering a Static Verification Tool for GPU Kernels

GPUVerify is a tool for verifying race- and divergence-freedom of OpenCL and CUDA kernels.
... we evaluate our optimisations over a set of 564 GPU kernels. For reproducibility, we give binaries and code for our toolchain and experimental evaluation.

3D-Tech News Around The Web / (Android) YEBIS OpenGL ES 3.0 Tech Demo
« on: March 13, 2014, 06:26:32 PM »
This is a technical demo previewing the capabilities of YEBIS 2 in OpenGL ES 3.0 (Android 4.3 and later).
YEBIS 2 is a post process library that enhances the realism and the beauty of rendered images in real-time.
Our previously released MOBILE GPUMARK application used only the OpenGL ES 2.0 feature set. With the new features provided in OpenGL ES 3.0, we can now deliver a higher level of detail and realism that was not possible with OpenGL ES 2.0.
The level of quality we once could only achieve on desktop machines can now be realized on Mobile devices. Try it and get a feel of the most cutting edge graphics on smartphones.
For more information about YEBIS 2, please check out our website:

Note: Please check our end-user license agreement before using this application

Note: This application has not yet been tested on all available GLES 3.0 devices. If you encounter any problems with it, we would appreciate your feedback via email.

What I am about to present here, may be one of the most important features in unbiased rendering ever. :-) Not kidding, probably.

This feature enters in the field of cheating, but the negative impact in the quality and purity of the render is often 0, while the benefit of removing fireflies entirely is probably the long lasting dream of every unbiased render engine user.

In most practical uses, the following is true:

With this feature, you can get rid of most (often ALL) of the fireflies in your render without (generally) affecting the render or losing visual components.

read on...

Version 1.1, 2014-03-11
1. Modified the Makefile to use the new -arch flags. 2. Added an option to run the test on a 64bit GPU. 3. Added a check for whether the current architecture is compatible with the specified test option (cpu32|cpu64|gpu32|gpu64). If there is a mismatch, the test now prints a warning. 4. Added a '-h/--help' argument.

GPU PerfStudio 2 gives developers control with seamless workflow integration. Identify performance and algorithm issues early in the development cycle, and meet your quality and performance goals. GPU PerfStudio 2 is the second generation GPU Performance Analysis and Debugging Tool from AMD. It offers clear advantages to developers by cutting software development time and improving graphics quality.

GPU PerfStudio 2.14 build 1668
Build 1668 fixes a profiler issue where the selection of more than just the GPUTime would cause a crash when profiling.

GPU PerfStudio 2.14 Improvements
Profiler support for Hawaii GPUs
Pipeline stage specific counters for OpenGL are now supported
Improved stability for multi-threaded apps

EOL driver support for legacy products

After Release 340, any subsequent driver release starting with Release 343 will cease to support the products listed in this section.

The Release 340 drivers will continue to support these products until April 1, 2016, and the NVIDIA support team will continue to address driver issues for these products in driver branches up to and including Release 340. However, future driver enhancements and optimizations in driver releases after Release 340 will not support these products.

General Discussion / Re: EVGA GeForce GTX 750 Ti FTW hands-on review
« on: March 08, 2014, 10:02:59 PM »
EVGA GTX 750 Ti FTW BIOS Update (Reduced Fan Noise)

We now have an update available that will reduce the fan noise level while the card is idle. This update is ONLY for the EVGA GTX 750 Ti FTW model (with ACX Cooler), part number: 02G-P4-3757-KR. This update is already being applied to new cards that are shipping now.
The update to fan acoustics is the only change in this BIOS. Also please keep in mind it will not lower the minimum allowed fanspeed (42%), but the acoustic level will be lower. To update please follow the below instructions:

    Download the following .zip file:
    Extract the .zip file to a temporary location.
    It is recommended to UNINSTALL your NVIDIA display drivers.
    After display drivers are removed, run update.exe located in the folder that was extracted earlier.
    Reboot and reinstall the NVIDIA display drivers.

A successful flash procedure looks like this

If you don't uninstall GeForce driver a backup ends with an error and the system becomes unresponsive (i guess a flash will fail the same way, but i wasn't dumb enough to try)

Heads-up from GPUGrid

Just a line to say we've got our first GTX750Ti card up and running. The performance out-of-the-box matches a GTX480. We'll have a public app for GPUGRID just as soon as CUDA 6.0 is publicly released, which will probably happen at GTC at the end of the month.

3D-Tech News Around The Web / (WebGL) Ô Green by SPECIAL.T
« on: March 08, 2014, 03:15:29 PM »
Now you can dive right into the magical 3D world of Ô Green, the new Limited Edition  green tea by SPECIAL.T.

Usually i ignore adverts, but cucumber flavored tea is bizarre enough to post it here  :P

Depending on your device or browser you see either a school of fishes or mashed geometry.

SiSoftware Sandra 2014 Released:
Updated Device Performance Certification, New Benchmarks, Windows 8.1 support
Updated February 17th 2014 : SP1a released nV CUDA 5.x devices - aka "Maxwell".

New CPU Scientific Analysis benchmark
3 algorithms, 2 precision (FP32/FP64), 3 instruction sets

New GP / HC (GPU/APU/CPU) Scientific Analysis benchmark
3 algorithms, 2 precision (FP32/FP64), 2 interfaces

Updated GP / HC (GPU/APU/CPU) Financial Analysis benchmark
3 models, 2 precision (FP32/FP64), 2 interfaces

Quote from: Device query.exe
devicequery Starting...

 CUDA Device Query (Runtime API) version (CUDART static linking)

Detected 1 CUDA Capable device(s)

Device 0: "GeForce GTX 750 Ti"
  CUDA Driver Version / Runtime Version          6.0 / 6.0
  CUDA Capability Major/Minor version number:    5.0
  Total amount of global memory:                 2048 MBytes (2147483648 bytes)
  ( 5) Multiprocessors, (128) CUDA Cores/MP:     640 CUDA Cores
  GPU Clock rate:                                1268 MHz (1.27 GHz)
  Memory Clock rate:                             2700 Mhz
  Memory Bus Width:                              128-bit
  L2 Cache Size:                                 2097152 bytes
  Maximum Texture Dimension Size (x,y,z)         1D=(65536), 2D=(65536, 65536), 3D=(4096, 4096, 4096)
  Maximum Layered 1D Texture Size, (num) layers  1D=(16384), 2048 layers
  Maximum Layered 2D Texture Size, (num) layers  2D=(16384, 16384), 2048 layers
  Total amount of constant memory:               65536 bytes
  Total amount of shared memory per block:       49152 bytes
  Total number of registers available per block: 65536
  Warp size:                                     32
  Maximum number of threads per multiprocessor:  2048
  Maximum number of threads per block:           1024
  Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
  Max dimension size of a grid size    (x,y,z): (2147483647, 65535, 65535)
  Maximum memory pitch:                          2147483647 bytes
  Texture alignment:                             512 bytes
  Concurrent copy and kernel execution:          Yes with 1 copy engine(s)
  Run time limit on kernels:                     No
  Integrated GPU sharing Host Memory:            No
  Support host page-locked memory mapping:       Yes
  Alignment requirement for Surfaces:            Yes
  Device has ECC support:                        Disabled
  CUDA Device Driver Mode (TCC or WDDM):         WDDM (Windows Display Driver Model)
  Device supports Unified Addressing (UVA):      Yes
  Device PCI Bus ID / PCI location ID:           1 / 0
  Compute Mode:
     < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 6.0, CUDA Runtime Version = 6.0, NumDevs = 1, Device0 = GeForce GTX 750 Ti
Result = PASS

Quote from: NBody.exe
Run "nbody -benchmark [-numbodies=<numBodies>]" to measure perfomance.

> Single precision floating point simulation
> 1 Devices used for simulation
GPU Device 0: "GeForce GTX 750 Ti" with compute capability 5.0

> Compute 5.0 CUDA device: [GeForce GTX 750 Ti]
5120 bodies, total time for 10 iterations: 8.208 ms
= 31.936 billion interactions per second
= 638.723 single-precision GFLOP/s at 20 flops per interaction

Run "nbody -benchmark [-numbodies=<numBodies>]" to measure perfomance.
   -fp64             (use double precision floating point values for simulation)

> Double precision floating point simulation
> 1 Devices used for simulation
GPU Device 0: "GeForce GTX 750 Ti" with compute capability 5.0

> Compute 5.0 CUDA device: [GeForce GTX 750 Ti]
5120 bodies, total time for 10 iterations: 220.679 ms
= 1.188 billion interactions per second
= 35.637 double-precision GFLOP/s at 30 flops per interaction

CUDA 6, Available as Free Download, Makes Parallel Programming Easier, Faster

We’re always striving to make parallel programming better, faster and easier for developers creating next-gen scientific, engineering, enterprise and other applications.

With the latest release of the CUDA parallel programming model, we’ve made improvements in all these areas.

Available now to all developers on the CUDA website, the CUDA 6 Release Candidate is packed with several new features that are sure to please developers.

A few highlights:

    Unified Memory – This major new feature lets CUDA applications access CPU and GPU memory without the need to manually copy data from one to the other. This is a major time saver that simplifies the programming process, and makes it easier for programmers to add GPU acceleration in a wider range of applications.
    Drop-in Libraries – Want to instantly accelerate your application by up to 8X? The new drop-in libraries can automatically accelerate your BLAS and FFTW calculations by simply replacing the existing CPU-only BLAS or FFTW library with the new, GPU-accelerated equivalent.
    Multi-GPU Scaling – Re-designed BLAS and FFT GPU libraries automatically scale performance across up to eight GPUs in a single node. This provides over nine teraflops of double-precision performance per node, supporting larger workloads than ever before (up to 512GB).

And there’s more.

The following are known issues with the CUDA 6.0 Release Candidate that will be resolved in the production release:
‣ The minBlocksPerMultiprocessor parameter for the launch_bounds() qualifier only accepts values up to 16 when used in compiling for sm_50, even
though values up to 32 are possible on that architecture.
‣ There is a performance issue with the new SIMD video intrinsics __v*2() and __v*4() when used in compiling for the sm_50 architecture.
‣ The sm_50 architecture supports 48 KB of shared memory per block; however, the check for this limit is not functioning properly in the compiler. This can allow
programs that use more than 48 KB of shared memory per block to compile successfully, although they will fail to run because the driver component does check
the limit properly.
‣ The MT19937 random number generator in the cuRAND library generates non-deterministic results for curandGenerateUniformDouble().
‣ The NPP library function nppiAlphaComp_8u_AC4R() generates incorrect results when used with the NPPI_OP_ALPHA_ATOP_PREMUL option.
‣ The NPP library functions FilterSobelHorizSecondBorder() and FilterSobelVertSecondBorder() may generate incorrect results.

Thx to GTX 750 TI i don't need 10 GB RAM. However NVIDIA Optix is not yet Maxwell compatible  :P

Pages: 1 2 3 [4] 5 6 ... 139