Show Posts

This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.


Messages - Stefan

Pages: 1 ... 126 127 [128] 129 130 ... 187
2541
3D-Tech News Around The Web / GPU Observer 3.7 – Sidebar Gadget
« on: November 19, 2010, 04:54:01 PM »

2542
3D-Tech News Around The Web / Fluid Simulation for Video Games (Part 8)
« on: November 18, 2010, 05:19:28 PM »
Quote
This article, the eighth in a series, explains how a vortex-based fluid simulation handles variable density in a fluid.
...
This article introduces features to the simulations presented in previous articles: The fluid flow includes motion because of buoyancy (heavier fluid sinks, and lighter fluid rises). These new features facilitate visual effects of variable density or (with additional rendering work) multiple fluids, such as liquid-gas mixtures like water and air. It also lays the groundwork for upcoming features like thermal convection (hot air rises) and combustion (generating heat from chemical processes).

2543
3D-Tech News Around The Web / AMD GPUPerfAPI 2.5.990
« on: November 17, 2010, 07:28:46 PM »
Quote
GPUPerfAPI is AMD's library for accessing GPU performance counters on ATI Radeon graphics cards. It is used by GPU PerfStudio 2 and the ATI Stream Profiler and is now available to third party developers who wish to incorporate it within their own applications. GPUPerfAPI supports DirectX10, DirectX11, OpenGL, and OpenCL applications.

Quote
Release Notes

Version 2.5 (11/17/10)

    * Adds Linux support for OpenGL and OpenCL.
    * More consistent naming between OpenCL and DX Compute Shader counters.
    * Improved methods for identifying existing hardware.
    * DirectX 10/11:
          o Improved accuracy of DepthAndStencil counters.
          o Improved accuracy of ColorBuffer counters.
    * OpenCL:
          o New counters: FetchSize, CacheHit, LDSFetchInsts, LDSWriteInsts, FastPath, CompletePath, PathUtilization.
    * OpenGL:
          o Fixed an issue that caused counters to not be available if the application is using queries.
          o Adds tessellation related counters.

GPUPerfAPI-2.5.990.1.zip (2.01MB)   11/17/2010   Windows®   32-bit/64-bit    AMD's library for accessing GPU performance counters.
GPUPerfAPI-2.5.990.1-lnx.tgz (942KB)   11/17/2010   Linux®   32-bit/64-bit    Linux libraries for accessing OpenGL and OpenCL GPU performance counters.

2544
3D-Tech News Around The Web / ATI Catalyst 10.11 (8.791) WHQL
« on: November 17, 2010, 07:16:38 PM »
This driver is the same as the one seen @ MS Update earlier.
It does not expose OpenGL 4.1, and the profiles in UMD are older than in Hotfix E.

Hotlinking to AMD sometimes works, sometimes not. If not, wait until AMD's driver website has been updated  ::)

Desktop Vista/7 64 bit with APP
Mobility Vista/7 64 bit


To get OpenGL 4.1 support, replace atioglxx.dll with the one from Hotfix E.


2545
Quote
Woody3D™ is a Tree Engine™ that generates 3D trees for real-time rendering. Dynamic libraries (x86, x64) for Windows and OSX operating systems are included. Trees are animated using the CPU or with the GPU using the included vertex shaders (GLSL / HLSL).


2546
Quote
CUDA Toolkit 3.2 (November 2010) Release Highlights

New and Improved CUDA Libraries

    * CUBLAS performance improved 50% to 300% on Fermi architecture GPUs, for matrix multiplication of all datatypes and transpose variations
    * CUFFT performance tuned for radix-3, -5, and -7 transform sizes on Fermi architecture GPUs, now 2x to 10x faster than MKL
    * New CUSPARSE library of GPU-accelerated sparse matrix routines for sparse/sparse and dense/sparse operations delivers 5x to 30x faster performance than MKL
    * New CURAND library of GPU-accelerated random number generation (RNG) routines, supporting Sobol quasi-random and XORWOW pseudo-random routines at 10x to 20x faster than similar routines in MKL
    * H.264 encode/decode libraries now included in the CUDA Toolkit



NVIDIA's oclDeviceQuery still ignores multiple OpenCL vendors:

Code:
oclDeviceQuery.exe Starting...

OpenCL SW Info:

 CL_PLATFORM_NAME: NVIDIA CUDA
 CL_PLATFORM_VERSION: OpenCL 1.0 CUDA 3.2.1
 OpenCL SDK Revision: 7027912


OpenCL Device Info:

 1 devices found supporting OpenCL:

 ---------------------------------
 Device GeForce GTX 465
 ---------------------------------
  CL_DEVICE_NAME: GeForce GTX 465
  CL_DEVICE_VENDOR: NVIDIA Corporation
  CL_DRIVER_VERSION: 263.06
  CL_DEVICE_VERSION: OpenCL 1.0 CUDA
  CL_DEVICE_TYPE: CL_DEVICE_TYPE_GPU
  CL_DEVICE_MAX_COMPUTE_UNITS: 11
  CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3
  CL_DEVICE_MAX_WORK_ITEM_SIZES: 1024 / 1024 / 64
  CL_DEVICE_MAX_WORK_GROUP_SIZE: 1024
  CL_DEVICE_MAX_CLOCK_FREQUENCY: 810 MHz
  CL_DEVICE_ADDRESS_BITS: 32
  CL_DEVICE_MAX_MEM_ALLOC_SIZE: 248 MByte
  CL_DEVICE_GLOBAL_MEM_SIZE: 993 MByte
  CL_DEVICE_ERROR_CORRECTION_SUPPORT: no
  CL_DEVICE_LOCAL_MEM_TYPE: local
  CL_DEVICE_LOCAL_MEM_SIZE: 48 KByte
  CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE: 64 KByte
  CL_DEVICE_QUEUE_PROPERTIES: CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ENABLE
  CL_DEVICE_QUEUE_PROPERTIES: CL_QUEUE_PROFILING_ENABLE
  CL_DEVICE_IMAGE_SUPPORT: 1
  CL_DEVICE_MAX_READ_IMAGE_ARGS: 128
  CL_DEVICE_MAX_WRITE_IMAGE_ARGS: 8
  CL_DEVICE_SINGLE_FP_CONFIG: denorms INF-quietNaNs round-to-nearest round-to-zero round-to-inf fma

  CL_DEVICE_IMAGE <dim> 2D_MAX_WIDTH 4096
                        2D_MAX_HEIGHT 32768
                        3D_MAX_WIDTH 2048
                        3D_MAX_HEIGHT 2048
                        3D_MAX_DEPTH 2048

  CL_DEVICE_EXTENSIONS: cl_khr_byte_addressable_store
                        cl_khr_icd
                        cl_khr_gl_sharing
                        cl_nv_d3d9_sharing
                        cl_nv_d3d10_sharing
                        cl_khr_d3d10_sharing
                        cl_nv_d3d11_sharing
                        cl_nv_compiler_options
                        cl_nv_device_attribute_query
                        cl_nv_pragma_unroll
                        cl_khr_global_int32_base_atomics
                        cl_khr_global_int32_extended_atomics
                        cl_khr_local_int32_base_atomics
                        cl_khr_local_int32_extended_atomics
                        cl_khr_fp64


  CL_DEVICE_COMPUTE_CAPABILITY_NV: 2.0
  NUMBER OF MULTIPROCESSORS: 11
  NUMBER OF CUDA CORES: 352
  CL_DEVICE_REGISTERS_PER_BLOCK_NV: 32768
  CL_DEVICE_WARP_SIZE_NV: 32
  CL_DEVICE_GPU_OVERLAP_NV: CL_TRUE
  CL_DEVICE_KERNEL_EXEC_TIMEOUT_NV: CL_TRUE
  CL_DEVICE_INTEGRATED_MEMORY_NV: CL_FALSE
  CL_DEVICE_PREFERRED_VECTOR_WIDTH_<t> CHAR 1, SHORT 1, INT 1, LONG 1, FLOAT 1, DOUBLE 1


  ---------------------------------
  2D Image Formats Supported (71)
  ---------------------------------
  #     Channel Order   Channel Type         

  1     CL_R            CL_FLOAT             
  2     CL_R            CL_HALF_FLOAT         
  3     CL_R            CL_UNORM_INT8         
  4     CL_R            CL_UNORM_INT16       
  5     CL_R            CL_SNORM_INT16       
  6     CL_R            CL_SIGNED_INT8       
  7     CL_R            CL_SIGNED_INT16       
  8     CL_R            CL_SIGNED_INT32       
  9     CL_R            CL_UNSIGNED_INT8     
  10    CL_R            CL_UNSIGNED_INT16     
  11    CL_R            CL_UNSIGNED_INT32     
  12    CL_A            CL_FLOAT             
  13    CL_A            CL_HALF_FLOAT         
  14    CL_A            CL_UNORM_INT8         
  15    CL_A            CL_UNORM_INT16       
  16    CL_A            CL_SNORM_INT16       
  17    CL_A            CL_SIGNED_INT8       
  18    CL_A            CL_SIGNED_INT16       
  19    CL_A            CL_SIGNED_INT32       
  20    CL_A            CL_UNSIGNED_INT8     
  21    CL_A            CL_UNSIGNED_INT16     
  22    CL_A            CL_UNSIGNED_INT32     
  23    CL_RG           CL_FLOAT             
  24    CL_RG           CL_HALF_FLOAT         
  25    CL_RG           CL_UNORM_INT8         
  26    CL_RG           CL_UNORM_INT16       
  27    CL_RG           CL_SNORM_INT16       
  28    CL_RG           CL_SIGNED_INT8       
  29    CL_RG           CL_SIGNED_INT16       
  30    CL_RG           CL_SIGNED_INT32       
  31    CL_RG           CL_UNSIGNED_INT8     
  32    CL_RG           CL_UNSIGNED_INT16     
  33    CL_RG           CL_UNSIGNED_INT32     
  34    CL_RA           CL_FLOAT             
  35    CL_RA           CL_HALF_FLOAT         
  36    CL_RA           CL_UNORM_INT8         
  37    CL_RA           CL_UNORM_INT16       
  38    CL_RA           CL_SNORM_INT16       
  39    CL_RA           CL_SIGNED_INT8       
  40    CL_RA           CL_SIGNED_INT16       
  41    CL_RA           CL_SIGNED_INT32       
  42    CL_RA           CL_UNSIGNED_INT8     
  43    CL_RA           CL_UNSIGNED_INT16     
  44    CL_RA           CL_UNSIGNED_INT32     
  45    CL_RGBA         CL_FLOAT             
  46    CL_RGBA         CL_HALF_FLOAT         
  47    CL_RGBA         CL_UNORM_INT8         
  48    CL_RGBA         CL_UNORM_INT16       
  49    CL_RGBA         CL_SNORM_INT16       
  50    CL_RGBA         CL_SIGNED_INT8       
  51    CL_RGBA         CL_SIGNED_INT16       
  52    CL_RGBA         CL_SIGNED_INT32       
  53    CL_RGBA         CL_UNSIGNED_INT8     
  54    CL_RGBA         CL_UNSIGNED_INT16     
  55    CL_RGBA         CL_UNSIGNED_INT32     
  56    CL_BGRA         CL_UNORM_INT8         
  57    CL_BGRA         CL_SIGNED_INT8       
  58    CL_BGRA         CL_UNSIGNED_INT8     
  59    CL_ARGB         CL_UNORM_INT8         
  60    CL_ARGB         CL_SIGNED_INT8       
  61    CL_ARGB         CL_UNSIGNED_INT8     
  62    CL_INTENSITY    CL_FLOAT             
  63    CL_INTENSITY    CL_HALF_FLOAT         
  64    CL_INTENSITY    CL_UNORM_INT8         
  65    CL_INTENSITY    CL_UNORM_INT16       
  66    CL_INTENSITY    CL_SNORM_INT16       
  67    CL_LUMINANCE    CL_FLOAT             
  68    CL_LUMINANCE    CL_HALF_FLOAT         
  69    CL_LUMINANCE    CL_UNORM_INT8         
  70    CL_LUMINANCE    CL_UNORM_INT16       
  71    CL_LUMINANCE    CL_SNORM_INT16       

  ---------------------------------
  3D Image Formats Supported (71)
  ---------------------------------
  #     Channel Order   Channel Type         

  1     CL_R            CL_FLOAT             
  2     CL_R            CL_HALF_FLOAT         
  3     CL_R            CL_UNORM_INT8         
  4     CL_R            CL_UNORM_INT16       
  5     CL_R            CL_SNORM_INT16       
  6     CL_R            CL_SIGNED_INT8       
  7     CL_R            CL_SIGNED_INT16       
  8     CL_R            CL_SIGNED_INT32       
  9     CL_R            CL_UNSIGNED_INT8     
  10    CL_R            CL_UNSIGNED_INT16     
  11    CL_R            CL_UNSIGNED_INT32     
  12    CL_A            CL_FLOAT             
  13    CL_A            CL_HALF_FLOAT         
  14    CL_A            CL_UNORM_INT8         
  15    CL_A            CL_UNORM_INT16       
  16    CL_A            CL_SNORM_INT16       
  17    CL_A            CL_SIGNED_INT8       
  18    CL_A            CL_SIGNED_INT16       
  19    CL_A            CL_SIGNED_INT32       
  20    CL_A            CL_UNSIGNED_INT8     
  21    CL_A            CL_UNSIGNED_INT16     
  22    CL_A            CL_UNSIGNED_INT32     
  23    CL_RG           CL_FLOAT             
  24    CL_RG           CL_HALF_FLOAT         
  25    CL_RG           CL_UNORM_INT8         
  26    CL_RG           CL_UNORM_INT16       
  27    CL_RG           CL_SNORM_INT16       
  28    CL_RG           CL_SIGNED_INT8       
  29    CL_RG           CL_SIGNED_INT16       
  30    CL_RG           CL_SIGNED_INT32       
  31    CL_RG           CL_UNSIGNED_INT8     
  32    CL_RG           CL_UNSIGNED_INT16     
  33    CL_RG           CL_UNSIGNED_INT32     
  34    CL_RA           CL_FLOAT             
  35    CL_RA           CL_HALF_FLOAT         
  36    CL_RA           CL_UNORM_INT8         
  37    CL_RA           CL_UNORM_INT16       
  38    CL_RA           CL_SNORM_INT16       
  39    CL_RA           CL_SIGNED_INT8       
  40    CL_RA           CL_SIGNED_INT16       
  41    CL_RA           CL_SIGNED_INT32       
  42    CL_RA           CL_UNSIGNED_INT8     
  43    CL_RA           CL_UNSIGNED_INT16     
  44    CL_RA           CL_UNSIGNED_INT32     
  45    CL_RGBA         CL_FLOAT             
  46    CL_RGBA         CL_HALF_FLOAT         
  47    CL_RGBA         CL_UNORM_INT8         
  48    CL_RGBA         CL_UNORM_INT16       
  49    CL_RGBA         CL_SNORM_INT16       
  50    CL_RGBA         CL_SIGNED_INT8       
  51    CL_RGBA         CL_SIGNED_INT16       
  52    CL_RGBA         CL_SIGNED_INT32       
  53    CL_RGBA         CL_UNSIGNED_INT8     
  54    CL_RGBA         CL_UNSIGNED_INT16     
  55    CL_RGBA         CL_UNSIGNED_INT32     
  56    CL_BGRA         CL_UNORM_INT8         
  57    CL_BGRA         CL_SIGNED_INT8       
  58    CL_BGRA         CL_UNSIGNED_INT8     
  59    CL_ARGB         CL_UNORM_INT8         
  60    CL_ARGB         CL_SIGNED_INT8       
  61    CL_ARGB         CL_UNSIGNED_INT8     
  62    CL_INTENSITY    CL_FLOAT             
  63    CL_INTENSITY    CL_HALF_FLOAT         
  64    CL_INTENSITY    CL_UNORM_INT8         
  65    CL_INTENSITY    CL_UNORM_INT16       
  66    CL_INTENSITY    CL_SNORM_INT16       
  67    CL_LUMINANCE    CL_FLOAT             
  68    CL_LUMINANCE    CL_HALF_FLOAT         
  69    CL_LUMINANCE    CL_UNORM_INT8         
  70    CL_LUMINANCE    CL_UNORM_INT16       
  71    CL_LUMINANCE    CL_SNORM_INT16       

oclDeviceQuery, Platform Name = NVIDIA CUDA, Platform Version = OpenCL 1.0 CUDA 3.2.1, SDK Revision = 7027912, NumDevs = 1, Device = GeForce GTX 465

System Info:

 Local Time/Date = 18:18:22, 11/17/2010
 CPU Arch: 0
 CPU Level: 6
 # of CPU processors: 4
 Windows Build: 6002
 Windows Ver: 6.0 (Windows Vista / Windows 7)


PASSED


Press <Enter> to Quit...
-----------------------------------------------------------

2547
3D-Tech News Around The Web / PC Wizard 2010 v1.96
« on: November 17, 2010, 05:43:06 PM »
Download here until website is updated:
ftp://ftp.cpuid.com/pc-wizard/pc-wizard_2010.1.96-setup.exe (64 & 32 bit)
ftp://ftp.cpuid.com/pc-wizard/pc-wizard_2010.1.96.zip (32 bit)

Quote
PC Wizard 2010 Release Notes
Copyright © 1996-2010 Laurent KUTIL & Franck DELATTRE
 
2010/08/24
- Intel Atom N455, N475 processors support
- Intel Atom D425, D525 processors support
- Intel Pentium E6700, E5700 processors support
- Intel Core i3 330E, Core i7 660UE processors support
- Intel Celeron U3400, U3405 processors support

2010/08/26
- Sony ACPI Thermal Zones support
- nVidia SLI Ready Motherboard detection support

2010/08/30
- ATI HD6xxx Cayman, Antilles, BlackComb, Barts, Whistler, Onega, Turks, Seymour, Caicos video cards preliminary support
- Intel SandyBridge improved support
- Intel Core i7 2600K, 2600, 2600S, 2920XM, 2820QM, 2720QM, 2620M processors support
- Intel Core i5 2500K, 2500, 2500S, 2500T, 2400, 2400S, 2390T, 2540M, 2520M processors support
- Intel Core i3 2120, 2100T processors support

2010/09/04
- Nuvoton NCT6776 hardware sensor support
- AMD Lightweight Profiling processor support
- AMD XOP, FMA4, TBM, F16C, BMI features processor support
- nVIDIA GTS450, GTS455 video card support
- Microsoft Internet Explorer 9 Beta support

2010/10/03
- Intel Core i7 680UM, 660LM, 640M processors support
- Intel Core i5 580M, 560UM, 560M, 470UM, 460M processors support
- Intel Core i3 380M, 380UM processor support
- Intel Pentium P6100, P6200 processors support
- Intel Celeron T3500, P4600, P4500 processors support
- Intel Xeon X5690 processor support

2010/10/30
- nVidia GTX580 video card preliminary support
- HP Hardware Sensor CMI support
- Intel Core i7 995/995X processor support
- Intel Core i3 560 processor support

2010/11/09
- nVidia Video Cards external connectors type detection support
- nVidia Video Cards Activity Load support
- ATI Eyefinity information support

2010/11/16
- Devices plugged or removed auto-detection support

Multiple OpenCL vendors are recognised correctly.


2548
Benchmark: ATI Stream CPU mode vs. INTEL



Configuration: Intel Core i3 330m / CEDAR 5470, window size 600 x 1 pixel to get higher framerates

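           ATI  Intel
Julia        4      5
Particles   21     36
Deformer    29     27
PostFX      79     60


As you can see, the results are inconsistent: the Intel platform is faster in Julia and Particles, while ATI Stream is faster in Deformer and PostFX.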
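
To put numbers on that, here is a minimal Python sketch that works out which platform wins each demo and by what factor. It assumes the table values are framerates (higher is better); the names `RESULTS` and `compare` are made up for this illustration.

```python
# Framerates copied from the table above: (ATI Stream CPU mode, Intel).
# Assumption: higher is better.
RESULTS = {
    "Julia": (4, 5),
    "Particles": (21, 36),
    "Deformer": (29, 27),
    "PostFX": (79, 60),
}


def compare(results):
    """Return {demo: (faster platform, ratio of faster over slower)}."""
    summary = {}
    for demo, (ati, intel) in results.items():
        if ati >= intel:
            summary[demo] = ("ATI", ati / intel)
        else:
            summary[demo] = ("Intel", intel / ati)
    return summary


if __name__ == "__main__":
    for demo, (winner, ratio) in compare(RESULTS).items():
        print(f"{demo:<10} {winner:<5} {ratio:.2f}x")
```

With only four data points and a 600 x 1 pixel window, the ratios are a rough indication at best.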

2549
OpenCL demos actually running on INTEL platform



I managed to run the demos on the Intel platform by disabling the GPU vendor in the registry (no reboot required).
To disable a platform, set its value to dword:00000001; dword:00000000 leaves it enabled.



Here are the locations on 64-bit Windows:

Quote
Windows Registry Editor Version 5.00

[HKEY_LOCAL_MACHINE\SOFTWARE\Wow6432Node\Khronos\OpenCL\Vendors]
"nvcuda.dll"=dword:00000000
"intelocl.dll"=dword:00000000

Quote
Windows Registry Editor Version 5.00

[HKEY_LOCAL_MACHINE\SOFTWARE\Wow6432Node\Khronos\OpenCL\Vendors]
"atiocl.dll"=dword:00000000
"atiocl64.dll"=dword:00000000
"intelocl.dll"=dword:00000000

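If you switch platforms often, the .reg text can be generated instead of edited by hand. A minimal Python sketch, assuming the convention above (0 = enabled, 1 = disabled) and the key path and DLL names shown in the quotes; `VENDORS_KEY` and `build_reg` are names invented for this example, and the value names on your system may differ (check with regedit first).

```python
# Key path as quoted in the post (32-bit view on 64-bit Windows).
VENDORS_KEY = r"HKEY_LOCAL_MACHINE\SOFTWARE\Wow6432Node\Khronos\OpenCL\Vendors"


def build_reg(vendors, key=VENDORS_KEY):
    """Build .reg file text from {dll_name: enabled} pairs.

    Assumption taken from the post: dword 0 keeps a vendor enabled,
    dword 1 disables it.
    """
    lines = ["Windows Registry Editor Version 5.00", "", f"[{key}]"]
    for dll, enabled in vendors.items():
        value = 0 if enabled else 1
        lines.append(f'"{dll}"=dword:{value:08x}')
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Example: hide the NVIDIA platform, keep Intel visible.
    print(build_reg({"nvcuda.dll": False, "intelocl.dll": True}))
```

The script only prints the text; save it as a .reg file and import it yourself, so nothing in the registry changes until you decide to.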
2550
Intel OpenCL SDK tested on notebook with ATI GPU and Core i3 330m



Same as above: NVIDIA's OpenCL Device Query fails to recognise the Intel platform.



GPUCapsviewer recognises both the ATI and Intel platforms, but gets confused and keeps using the ATI platform even though Intel is selected.
You can tell because the GPU demos still run, although the Intel platform only offers a CPU device.





AMD's Stream 2.2 CLINFO recognises the ATI platform (2 devices) and the Intel platform (1 device):

Code:
Number of platforms: 2
  Platform Profile: FULL_PROFILE
  Platform Version: OpenCL 1.1 ATI-Stream-v2.2 (302)
  Platform Name: ATI Stream
  Platform Vendor: Advanced Micro Devices, Inc.
  Platform Extensions: cl_khr_icd cl_amd_event_callback cl_khr_d3d10_sharing
  Platform Profile: FULL_PROFILE
  Platform Version: OpenCL 1.1 WINDOWS
  Platform Name: Intel OpenCL
  Platform Vendor: Intel Corporation
  Platform Extensions: cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_gl_sharing cl_khr_byte_addressable_store cl_khr_icd


  Platform Name: ATI Stream
Number of devices: 2
  Device Type: CL_DEVICE_TYPE_CPU
  Device ID: 4098
  Max compute units: 4
  Max work items dimensions: 3
    Max work items[0]: 1024
    Max work items[1]: 1024
    Max work items[2]: 1024
  Max work group size: 1024
  Preferred vector width char: 16
  Preferred vector width short: 8
  Preferred vector width int: 4
  Preferred vector width long: 2
  Preferred vector width float: 4
  Preferred vector width double: 0
  Max clock frequency: 2128Mhz
  Address bits: 32
  Max memory allocation: 536870912
  Image support: No
  Max size of kernel argument: 4096
  Alignment (bits) of base address: 1024
  Minimum alignment (bytes) for any datatype: 128
  Single precision floating point capability
    Denorms: Yes
    Quiet NaNs: Yes
    Round to nearest even: Yes
    Round to zero: Yes
    Round to +ve and infinity: Yes
    IEEE754-2008 fused multiply-add: No
  Cache type: Read/Write
  Cache line size: 64
  Cache size: 32768
  Global memory size: 1073741824
  Constant buffer size: 65536
  Max number of constant args: 8
  Local memory type: Global
  Local memory size: 32768
  Profiling timer resolution: 481
  Device endianess: Little
  Available: Yes
  Compiler available: Yes
  Execution capabilities:
    Execute OpenCL kernels: Yes
    Execute native function: Yes
  Queue properties:
    Out-of-Order: No
    Profiling : Yes
  Platform ID: 00EDD40C
  Name: Intel(R) Core(TM) i3 CPU       M 330  @ 2.13GHz
  Vendor: GenuineIntel
  Driver version: 2.0
  Profile: FULL_PROFILE
  Version: OpenCL 1.1 ATI-Stream-v2.2 (302)
  Extensions: cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_printf cl_khr_d3d10_sharing
  Device Type: CL_DEVICE_TYPE_GPU
  Device ID: 4098
  Max compute units: 2
  Max work items dimensions: 3
    Max work items[0]: 128
    Max work items[1]: 128
    Max work items[2]: 128
  Max work group size: 128
  Preferred vector width char: 16
  Preferred vector width short: 8
  Preferred vector width int: 4
  Preferred vector width long: 2
  Preferred vector width float: 4
  Preferred vector width double: 0
  Max clock frequency: 750Mhz
  Address bits: 32
  Max memory allocation: 134217728
  Image support: Yes
  Max number of images read arguments: 128
  Max number of images write arguments: 8
  Max image 2D width: 8192
  Max image 2D height: 8192
  Max image 3D width: 2048
  Max image 3D height: 2048
  Max image 3D depth: 2048
  Max samplers within kernel: 16
  Max size of kernel argument: 1024
  Alignment (bits) of base address: 32768
  Minimum alignment (bytes) for any datatype: 128
  Single precision floating point capability
    Denorms: No
    Quiet NaNs: Yes
    Round to nearest even: Yes
    Round to zero: Yes
    Round to +ve and infinity: Yes
    IEEE754-2008 fused multiply-add: Yes
  Cache type: None
  Cache line size: 0
  Cache size: 0
  Global memory size: 536870912
  Constant buffer size: 65536
  Max number of constant args: 8
  Local memory type: Scratchpad
  Local memory size: 32768
  Profiling timer resolution: 1
  Device endianess: Little
  Available: Yes
  Compiler available: Yes
  Execution capabilities:
    Execute OpenCL kernels: Yes
    Execute native function: No
  Queue properties:
    Out-of-Order: No
    Profiling : Yes
  Platform ID: 00EDD40C
  Name: Cedar
  Vendor: Advanced Micro Devices, Inc.
  Driver version: CAL 1.4.879
  Profile: FULL_PROFILE
  Version: OpenCL 1.1 ATI-Stream-v2.2 (302)
  Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_printf cl_amd_media_ops cl_khr_d3d10_sharing


Passed!
  Platform Name: Intel OpenCL
Number of devices: 1
  Device Type: CL_DEVICE_TYPE_CPU
  Device ID: 32902
  Max compute units: 4
  Max work items dimensions: 3
    Max work items[0]: 1024
    Max work items[1]: 1024
    Max work items[2]: 1024
  Max work group size: 1024
  Preferred vector width char: 16
  Preferred vector width short: 8
  Preferred vector width int: 4
  Preferred vector width long: 2
  Preferred vector width float: 4
  Preferred vector width double: 2
  Max clock frequency: 2133Mhz
  Address bits: 32
  Max memory allocation: 536838144
  Image support: Yes
  Max number of images read arguments: 128
  Max number of images write arguments: 128
  Max image 2D width: 8192
  Max image 2D height: 8192
  Max image 3D width: 2048
  Max image 3D height: 2048
  Max image 3D depth: 2048
  Max samplers within kernel: 128
  Max size of kernel argument: 1024
  Alignment (bits) of base address: 1024
  Minimum alignment (bytes) for any datatype: 128
  Single precision floating point capability
    Denorms: Yes
    Quiet NaNs: Yes
    Round to nearest even: Yes
    Round to zero: No
    Round to +ve and infinity: No
    IEEE754-2008 fused multiply-add: No
  Cache type: Read/Write
  Cache line size: 64
  Cache size: 262144
  Global memory size: 2147352576
  Constant buffer size: 131072
  Max number of constant args: 128
  Local memory type: Global
  Local memory size: 32768
  Profiling timer resolution: 481
  Device endianess: Little
  Available: Yes
  Compiler available: Yes
  Execution capabilities:
    Execute OpenCL kernels: Yes
    Execute native function: Yes
  Queue properties:
    Out-of-Order: Yes
    Profiling : Yes
  Platform ID: 03B96D88
  Name: GenuineIntel
  Vendor: Intel Corporation
  Driver version: 1.1
  Profile: FULL_PROFILE
  Version: OpenCL 1.1
  Extensions: cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_gl_sharing cl_khr_byte_addressable_store


Passed!
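
For long dumps like the one above, a small script can pull out the device count per platform. This is a rough Python sketch that only assumes the text layout shown here (a "Platform Name:" line followed later by a "Number of devices:" line); `devices_per_platform` is a made-up helper, not part of CLINFO.

```python
def devices_per_platform(clinfo_text):
    """Map each platform name to the device count reported after it."""
    counts = {}
    current = None
    for raw in clinfo_text.splitlines():
        line = raw.strip()
        if line.startswith("Platform Name:"):
            # Remember the most recent platform name seen.
            current = line.split(":", 1)[1].strip()
        elif line.startswith("Number of devices:") and current is not None:
            counts[current] = int(line.split(":", 1)[1])
    return counts


if __name__ == "__main__":
    import sys

    # Usage (assumed): clinfo > dump.txt, then pass dump.txt as the argument.
    with open(sys.argv[1], encoding="utf-8", errors="replace") as fh:
        for name, count in devices_per_platform(fh.read()).items():
            print(f"{name}: {count} device(s)")
```

The platform names in the summary at the top of the dump are seen too, but only the name directly preceding a "Number of devices:" line gets a count.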

2551
Intel OpenCL SDK tested on desktop with NVIDIA GPU and Core2Quad 9450



NVIDIA's OpenCL Device Query fails to recognise the Intel platform.



GPUCapsviewer recognises both the NVIDIA and Intel platforms, but gets confused and keeps using the NVIDIA platform even though Intel is selected.





AMD's Stream 2.2 CLINFO recognises the NVIDIA and Intel platforms, but prints some errors:

Code:
Number of platforms: 2
  Platform Profile: FULL_PROFILE
  Platform Version: OpenCL 1.0 CUDA 3.2.1
  Platform Name: NVIDIA CUDA
  Platform Vendor: NVIDIA Corporation
  Platform Extensions: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll
  Platform Profile: FULL_PROFILE
  Platform Version: OpenCL 1.1 WINDOWS
  Platform Name: Intel OpenCL
  Platform Vendor: Intel Corporation
  Platform Extensions: cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_gl_sharing cl_khr_byte_addressable_store cl_khr_icd


  Platform Name: NVIDIA CUDA
Number of devices: 1
  Device Type: CL_DEVICE_TYPE_GPU
  Device ID: 4318
  Max compute units: 11
  Max work items dimensions: 3
    Max work items[0]: 1024
    Max work items[1]: 1024
    Max work items[2]: 64
  Max work group size: 1024
  Preferred vector width char: 1
  Preferred vector width short: 1
  Preferred vector width int: 1
  Preferred vector width long: 1
  Preferred vector width float: 1
  Preferred vector width double: 1
  Max clock frequency: 810Mhz
  Address bits: 17240136165097504
  Max memory allocation: 260423680
  Image support: Yes
  Max number of images read arguments: 128
  Max number of images write arguments: 8
  Max image 2D width: 4096
  Max image 2D height: 32768
  Max image 3D width: 2048
  Max image 3D height: 2048
  Max image 3D depth: 2048
  Max samplers within kernel: 16
  Max size of kernel argument: 4352
  Alignment (bits) of base address: 4096
  Minimum alignment (bytes) for any datatype: 128
  Single precision floating point capability
    Denorms: Yes
    Quiet NaNs: Yes
    Round to nearest even: Yes
    Round to zero: Yes
    Round to +ve and infinity: Yes
    IEEE754-2008 fused multiply-add: Yes
  Cache type: Read/Write
  Cache line size: 128
  Cache size: 180224
  Global memory size: 1041694720
  Constant buffer size: 65536
  Max number of constant args: 9
  Local memory type: Scratchpad
  Local memory size: 49152
  Profiling timer resolution: 1000
  Device endianess: Little
  Available: Yes
  Compiler available: Yes
  Execution capabilities:
    Execute OpenCL kernels: Yes
    Execute native function: No
  Queue properties:
    Out-of-Order: Yes
    Profiling : Yes
  Platform ID: 02551550
  Name: GeForce GTX 465
  Vendor: NVIDIA Corporation
  Driver version: 261.00
  Profile: FULL_PROFILE
  Version: OpenCL 1.0 CUDA
  Extensions: cl_khr_byte_addressable_store cl_khr_icd cl_khr_gl_sharing cl_nv_d3d9_sharing cl_nv_d3d10_sharing cl_khr_d3d10_sharing cl_nv_d3d11_sharing cl_nv_compiler_options cl_nv_device_attribute_query cl_nv_pragma_unroll  cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_fp64


Error : atomics mismatch!
Error : Bytes mismatch!
Error : d3d10Sharing mismatch!
Error : glSharing mismatch!
Error : images mismatch!
Error : printf mismatch!
Error : deviceAttributeQuery mismatch!
Failed!
  Platform Name: Intel OpenCL
Number of devices: 1
  Device Type: CL_DEVICE_TYPE_CPU
  Device ID: 32902
  Max compute units: 4
  Max work items dimensions: 3
    Max work items[0]: 1024
    Max work items[1]: 1024
    Max work items[2]: 1024
  Max work group size: 1024
  Preferred vector width char: 16
  Preferred vector width short: 8
  Preferred vector width int: 4
  Preferred vector width long: 2
  Preferred vector width float: 4
  Preferred vector width double: 2
  Max clock frequency: 3200Mhz
  Address bits: 17240136165097504
  Max memory allocation: 536838144
  Image support: Yes
  Max number of images read arguments: 128
  Max number of images write arguments: 128
  Max image 2D width: 8192
  Max image 2D height: 8192
  Max image 3D width: 2048
  Max image 3D height: 2048
  Max image 3D depth: 2048
  Max samplers within kernel: 128
  Max size of kernel argument: 1024
  Alignment (bits) of base address: 1024
  Minimum alignment (bytes) for any datatype: 128
  Single precision floating point capability
    Denorms: Yes
    Quiet NaNs: Yes
    Round to nearest even: Yes
    Round to zero: No
    Round to +ve and infinity: No
    IEEE754-2008 fused multiply-add: No
  Cache type: Read/Write
  Cache line size: 64
  Cache size: 6291456
  Global memory size: 2147352576
  Constant buffer size: 131072
  Max number of constant args: 128
  Local memory type: Global
  Local memory size: 32768
  Profiling timer resolution: 279
  Device endianess: Little
  Available: Yes
  Compiler available: Yes
  Execution capabilities:
    Execute OpenCL kernels: Yes
    Execute native function: Yes
  Queue properties:
    Out-of-Order: Yes
    Profiling : Yes
  Platform ID: 02E11220
  Name: GenuineIntel
  Vendor: Intel Corporation
  Driver version: 1.1
  Profile: FULL_PROFILE
  Version: OpenCL 1.1
  Extensions: cl_khr_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_gl_sharing cl_khr_byte_addressable_store


Error : atomics mismatch!
Error : Bytes mismatch!
Error : d3d10Sharing mismatch!
Error : glSharing mismatch!
Error : images mismatch!
Error : printf mismatch!
Error : deviceAttributeQuery mismatch!
Failed!

2552
3D-Tech News Around The Web / Intel releases OpenCL SDK
« on: November 16, 2010, 06:37:51 PM »
As I predicted earlier, Intel released their own OpenCL SDK, to be demonstrated at SC10.

I will do some tests ASAP...


2553
Quote
For professional users operating on Mac OS X Snow Leopard, this means the wait is over. The NVIDIA Quadro 4000 graphics processing unit (GPU) for Mac is optimized to accelerate workflows and drive a range of top professional applications.

Full story @ TPU

Download corresponding Quadro Mac OS X Driver Release 256.01.00f03

2554
3D-Tech News Around The Web / AMD lists rebranded HD6000 GPUS
« on: November 15, 2010, 06:51:55 PM »
Rebranded GPUs have the codename "Evergreen"; genuinely new GPUs have the codename "NI" (Northern Islands).

Full list here: ATI Catalyst™ PC VENDOR ID (1002) LIST

Quote
{ “ATI Radeon HD 6230, Evergreen, 68F9” }
{ “ATI Radeon HD 6250, Evergreen, 68F9” }
{ “ATI Radeon HD 6290, Evergreen, 68F9” }
{ “ATI Radeon HD 6390, Evergreen, 68DA” }
{ “ATI Radeon HD 6510, Evergreen, 68D9” }
{ “ATI Radeon HD 6750, Evergreen, 68BE” }
{ “ATI Radeon HD 6230, Evergreen, 68F9” }
{ “ATI Radeon HD 6250, Evergreen, 68F9” }
{ “ATI Radeon HD 6290, Evergreen, 68F9” }
{ “ATI Radeon HD 6390, Evergreen, 68DA” }
{ “ATI Radeon HD 6510, Evergreen, 68D9” }
{ “ATI Radeon HD 6750, Evergreen, 68BE” }

[via]

2555
3D-Tech News Around The Web / Alenka - SQL engine for CUDA
« on: November 15, 2010, 06:14:08 PM »
Quote
Alenka is a high level, high performance SQL-like language for data processing on CUDA hardware. All SQL operations like joins, groupings and sorts are implemented in a highly parallel fashion. Alenka uses regular files as sources of data. Read data are stored internally as columns, all operations are vector operations and thus are highly scalable on NVIDIA GPUs.

To use Alenka you need a GPU that supports CUDA compute capability 1.3 or higher (GTX 200 or GTX 400 series cards).

2556
3D-Tech News Around The Web / Notepad++ 5.8.4 released
« on: November 15, 2010, 05:18:00 PM »
Quote
Notepad++ v5.8.4 new features and fixed bugs (from v5.8.3):

   1. Fix memory leak problem while switching tab.
   2. Fix User Defined Language dialog docking problem under Windows 7.
   3. Fix backwards search with Asian codepage problem.
   4. Add a new capacity in context menu: the menu item and folder item can be renamed (and in whichever language).
   5. All the supported encoding charsets can be set as default in "New Document Settings".
   6. Remove Calltip restriction from plugins side.
   7. Add "-alwaysOnTop" command line argument.
   8. Fix icon display glitch in shell extension context menu.

2557
3D-Tech News Around The Web / VLC media player 1.1.5 final
« on: November 14, 2010, 10:57:02 AM »
Quote
What's new in 1.1.5

Two and a half months after VLC 1.1.4, and after almost 50 million downloads of VLC 1.1.4, here is a minor release of VLC 1.1.5.

Introducing small features and fixing important bugs and a security issue, here is VLC 1.1.5.

    * 1000 Web-shows listing integrated inside VLC's playlist, provided by Channels.com
    * New Game Music Emu (GME) support
    * Windows SMB security issue - VideoLAN-SA-1006
    * Major updates in translations, and new translations in Asturian, Armenian and Modern Greek
    * Mac/Power-PC port fixed in 1.1.4.1
    * live .webm streams support
    * GPU HD decoding using Intel IGP on Windows, using DxVA2
    * Miscellaneous fixes on all Interfaces, Demuxers and Decoders modules
    * Codecs updates

The chipmunk shows you how to enable GPU acceleration  ;D


2558
Quote
NVIDIA shook the high-end PC hardware industry earlier this month with the surprise launch of its GeForce GTX 580 graphics card, which extended the lead for single-GPU performance NVIDIA has been holding. It also managed to come up with some great performance per Watt improvements over the previous generation. The reference design board, however, made use of a clock speed throttling logic which reduced clock speeds when an extremely demanding 3D application such as Furmark or OCCT is run. While this is a novel way to protect components saving consumers from potentially permanent damage to the hardware, it does come as a gripe to expert users, enthusiasts and overclockers, who know what they're doing.

GPU-Z developer and our boss W1zzard has devised a way to make disabling this protection accessible to everyone (who knows what he's dealing with), and came up with a nifty new feature for GPU-Z, our popular GPU diagnostics and monitoring utility, that can disable the speed throttling mechanism. It is a new command-line argument for GPU-Z: "/GTX580OCP". Start the GPU-Z executable (within Windows, using Command Prompt or a shortcut) with that argument, and it will disable the clock speed throttling mechanism. For example: "X:gpuz.exe /GTX580OCP". It will stay disabled for the remainder of the session, so you can close GPU-Z. It will be enabled again on the next boot.



As an obligatory caution, be sure you know what you're doing. TechPowerUp is not responsible for any damage caused to your hardware by disabling that mechanism. Running the graphics card outside of its power specifications may result in damage to the card or motherboard. We have a test build of GPU-Z (which otherwise carries the same-exact feature-set of GPU-Z 0.4.8). We also ran a power consumption test on our GeForce GTX 580 card demonstrating how disabling that logic affects power consumption.

DOWNLOAD: TechPowerUp GPU-Z GTX 580 OCP Test Build

P.S.
This build is newer than the builds in today's previous post.

2559
Quote
Have you wondered how DX11 tessellation helps a game look better? Check out this video to see a DX11 Tessellation On/Off comparison featuring Tom Clancy's H.A.W.X. 2 on an NVIDIA GeForce graphics card. You'll notice the incredibly detailed terrain and environments when tessellation is turned on.


2560
3D-Tech News Around The Web / ATI Catalyst 8.791 @ MS Update
« on: November 13, 2010, 05:03:43 PM »
Attention: this driver "1011041549-8.791-101026a-107889C-ATI" does not expose OpenGL 4.1, and the profiles in UMD are older than in "Hotfix E 8.782.1".
Don't get fooled by version numbers!

Direct downloads:
Vista/7 64
Vista/7 32
XP 32
XP 64


