Show Posts

This section allows you to view all posts made by this member. Note that you can only see posts made in areas you currently have access to.


Messages - ljbade

Pages: 1 ... 3 4 [5]
82
3D-Tech News Around The Web / Re: MAXON CINEBENCH 11.5 available
« on: February 11, 2010, 07:35:30 AM »
My 5850 always fails the OpenGL 3.2 reference render test! (95.4%)

83
3D-Tech News Around The Web / Re: Futuremark patch round-up
« on: February 10, 2010, 10:19:46 PM »
Fixed hang in CPU Test 2 - Physics on systems with more than 12 logical processors
Added possibility to disable System Info scan in command line front-end
Fixed the score formula in texture fill rate test
Added buffered file input stream for faster test loading times
Updated System Info component version to 3.21.2.1
Installation package: 3DMark_Vantage_v102_1901.exe


Hopefully they fixed the system scan freeze when you enable AHCI SATA on a AMD chipset.

84
Don't forget that the current dev channel Google Chrome also supports WebGL!

Safari on Mac is supposed to support it too, but not Safari Win.

Only leaves IE in the dark... but who cares about IE.

85
These samples have also been in the Microsoft DX11 SDK.

87
3D-Tech News Around The Web / Re: New ATI drivers - Catalyst 10.2 RC2
« on: February 10, 2010, 08:26:27 AM »
Talking about DX11 stuff, this blog post seems to reveal some hidden 5xxx series extensions:
http://oscarbg.blogspot.com/2009/12/catalyst-912-adds-opengl-32-support-and.html

GL_AMD_gpu_shader5
GL_AMD_patch_tessellator
GL_AMDX_random_access_target
gl_amd_texture_compresion_dxt6,7
GL_EXT_tessellation_shader
GL_AMD_video_instruction?

88
3D-Tech News Around The Web / Re: New ATI drivers
« on: February 10, 2010, 07:03:51 AM »
Looks like new OpenCL driver?

Also 181 GL extensions!

3 new, 1 EXT, 2 ARB:
GL_ARB_blend_func_extended - more enhancements to blending? whats left in DX10/11 that OGL doesn't have?

GL_ARB_fragment_coord_conventions - DX9 compatibility (wasn't this in OpenGl 3.2?!? still missing transform_feedback2)

GL_EXT_texture_buffer_object_rgb32 - this one is interesting as GL_ARB_texture_buffer_object already lists all the RGBA32 F, I, and UI.

Also I note that 2 amd extensions have been documented:
http://www.opengl.org/registry/specs/AMD/seamless_cubemap_per_texture.txt - when did this get added?
http://www.opengl.org/registry/specs/AMD/shader_stencil_export.txt - from 10.1

Wonder how far away we are from GL 3.3. Still haven't seen DX11 stuff yet, but they must be working on it!

Can't see any sign of the rumored (or under NDA) per-game application profile support yet in CCC. Supposed to be in 10.2...

89
3D-Tech News Around The Web / New ATI drivers - Catalyst 10.2 RC2
« on: February 10, 2010, 06:15:31 AM »
New ATI driver version: 10.2 RC2 from http://www.ati-forum.de/allgemein/downloads/treiber/p19566-catalyst-beta-8-70-rc1-rc2/#post19566

Code: [Select]
===================================================
GPU Caps Viewer v1.8.2
http://www.ozone3d.net/gpu_caps_viewer/
===================================================


===================================[ System / CPU ]
- CPU Name: AMD Phenom(tm) II X4 20 Processor
- CPU Core Speed: 3214 MHz
- CPU Num Cores: 4
- Family: 15 - Model: 4 - Stepping: 2
- Physical Memory Size: 4093 MB
- Operating System: Windows Server 2007 ver.6.1 build 7600 [No Service Pack]
- DirectX Version: 10.0
- PhysX Version: 9091112


===================================[ Graphics Adapter / GPU ]
- OpenGL Renderer: ATI Radeon HD 5800 Series
- Drivers Renderer: ATI Radeon HD 5800 Series
- DB Renderer: ATI Radeon HD 5850
- Device Description: ATI Radeon HD 5800 Series
- Adapter String: ATI Radeon HD 5800 Series
- Vendor: ATI Technologies Inc.
- Vendor ID: 0x1002
- Device ID: 0x6899
- Drivers Version: 8.700.0.0 (1-13-2010) - atig6pxx.dll
- ATI Catalyst Version String:
- ATI Catalyst Release Version String: 8.70-100113a-094252E
- GPU Codename: Cypress
- GPU Unified Shader Processors: 1440
- GPU Vertex Shader Processors: 0
- GPU Pixel Shader Processors: 0
- Video Memory Size: 1024 MB
- BIOS String: 113-585AZNB-10
- Current Display Mode: 1280x1024 @ 60 Hz - 32 bpp


===================================[ OpenGL GPU Capabilities ]
- OpenGL Version: 3.2.9405 Compatibility Profile Context
- GLSL (OpenGL Shading Language) Version: 1.50
- ARB Texture Units: 8
- Vertex Shader Texture Units: 16
- Pixel Shader Texture Units: 16
- Geometry Shader Texture Units: 32
- Max Texture Size: 16384x16384
- Max Anisotropic Filtering Value: X16.0
- Max Point Sprite Size: 8192.0
- Max Dynamic Lights: 8
- Max Viewport Size: 16384x16384
- Max Vertex Uniform Components: 1024
- Max Fragment Uniform Components: 1024
- Max Geometry Uniform Components: 4096
- Max Varying Float: 64
- Max Vertex Bindable Uniforms: 15
- Max Fragment Bindable Uniforms: 15
- Max Geometry Bindable Uniforms: 15
- Frame Buffer Objects (FBO) Support:[yes]
- Multiple Render Targets / Max draw buffers: 8
- Pixel Buffer Objects (PBO) Support:[yes]
- S3TC Texture Compression Support:[yes]
- ATI 3Dc Texture Compression Support:[yes]
- Texture Rectangle Support:[yes]
- Floating Point Textures Support:[no]
- MSAA: 1X
- MSAA: 2X
- MSAA: 4X
- MSAA: 8X
- OpenGL Extensions: 181 extensions
    <li>GL_AMDX_name_gen_delete</li>
    <li>GL_AMDX_random_access_target</li>
    <li>GL_AMDX_vertex_shader_tessellator</li>
    <li>GL_AMD_draw_buffers_blend</li>
    <li>GL_AMD_performance_monitor</li>
    <li>GL_AMD_seamless_cubemap_per_texture</li>
    <li>GL_AMD_shader_stencil_export</li>
    <li>GL_AMD_texture_compression_dxt6</li>
    <li>GL_AMD_texture_compression_dxt7</li>
    <li>GL_AMD_texture_cube_map_array</li>
    <li>GL_AMD_texture_texture4</li>
    <li>GL_AMD_vertex_shader_tessellator</li>
    <li>GL_ARB_blend_func_extended</li>
    <li>GL_ARB_color_buffer_float</li>
    <li>GL_ARB_copy_buffer</li>
    <li>GL_ARB_depth_buffer_float</li>
    <li>GL_ARB_depth_clamp</li>
    <li>GL_ARB_depth_texture</li>
    <li>GL_ARB_draw_buffers</li>
    <li>GL_ARB_draw_buffers_blend</li>
    <li>GL_ARB_draw_elements_base_vertex</li>
    <li>GL_ARB_draw_instanced</li>
    <li>GL_ARB_fragment_coord_conventions</li>
    <li>GL_ARB_fragment_program</li>
    <li>GL_ARB_fragment_program_shadow</li>
    <li>GL_ARB_fragment_shader</li>
    <li>GL_ARB_framebuffer_object</li>
    <li>GL_ARB_framebuffer_sRGB</li>
    <li>GL_ARB_geometry_shader4</li>
    <li>GL_ARB_half_float_pixel</li>
    <li>GL_ARB_half_float_vertex</li>
    <li>GL_ARB_instanced_arrays</li>
    <li>GL_ARB_map_buffer_range</li>
    <li>GL_ARB_multisample</li>
    <li>GL_ARB_multitexture</li>
    <li>GL_ARB_occlusion_query</li>
    <li>GL_ARB_pixel_buffer_object</li>
    <li>GL_ARB_point_parameters</li>
    <li>GL_ARB_point_sprite</li>
    <li>GL_ARB_provoking_vertex</li>
    <li>GL_ARB_sample_shading</li>
    <li>GL_ARB_seamless_cube_map</li>
    <li>GL_ARB_shader_objects</li>
    <li>GL_ARB_shader_texture_lod</li>
    <li>GL_ARB_shading_language_100</li>
    <li>GL_ARB_shadow</li>
    <li>GL_ARB_shadow_ambient</li>
    <li>GL_ARB_sync</li>
    <li>GL_ARB_texture_border_clamp</li>
    <li>GL_ARB_texture_buffer_object</li>
    <li>GL_ARB_texture_compression</li>
    <li>GL_ARB_texture_compression_rgtc</li>
    <li>GL_ARB_texture_cube_map</li>
    <li>GL_ARB_texture_cube_map_array</li>
    <li>GL_ARB_texture_env_add</li>
    <li>GL_ARB_texture_env_combine</li>
    <li>GL_ARB_texture_env_crossbar</li>
    <li>GL_ARB_texture_env_dot3</li>
    <li>GL_ARB_texture_float</li>
    <li>GL_ARB_texture_gather</li>
    <li>GL_ARB_texture_mirrored_repeat</li>
    <li>GL_ARB_texture_multisample</li>
    <li>GL_ARB_texture_non_power_of_two</li>
    <li>GL_ARB_texture_query_lod</li>
    <li>GL_ARB_texture_rectangle</li>
    <li>GL_ARB_texture_rg</li>
    <li>GL_ARB_texture_snorm</li>
    <li>GL_ARB_transpose_matrix</li>
    <li>GL_ARB_uniform_buffer_object</li>
    <li>GL_ARB_vertex_array_bgra</li>
    <li>GL_ARB_vertex_array_object</li>
    <li>GL_ARB_vertex_buffer_object</li>
    <li>GL_ARB_vertex_program</li>
    <li>GL_ARB_vertex_shader</li>
    <li>GL_ARB_window_pos</li>
    <li>GL_ATI_draw_buffers</li>
    <li>GL_ATI_envmap_bumpmap</li>
    <li>GL_ATI_fragment_shader</li>
    <li>GL_ATI_meminfo</li>
    <li>GL_ATI_separate_stencil</li>
    <li>GL_ATI_texture_compression_3dc</li>
    <li>GL_ATI_texture_env_combine3</li>
    <li>GL_ATI_texture_float</li>
    <li>GL_ATI_texture_mirror_once</li>
    <li>GL_EXT_abgr</li>
    <li>GL_EXT_bgra</li>
    <li>GL_EXT_bindable_uniform</li>
    <li>GL_EXT_blend_color</li>
    <li>GL_EXT_blend_equation_separate</li>
    <li>GL_EXT_blend_func_separate</li>
    <li>GL_EXT_blend_minmax</li>
    <li>GL_EXT_blend_subtract</li>
    <li>GL_EXT_compiled_vertex_array</li>
    <li>GL_EXT_copy_buffer</li>
    <li>GL_EXT_copy_texture</li>
    <li>GL_EXT_draw_buffers2</li>
    <li>GL_EXT_draw_instanced</li>
    <li>GL_EXT_draw_range_elements</li>
    <li>GL_EXT_fog_coord</li>
    <li>GL_EXT_framebuffer_blit</li>
    <li>GL_EXT_framebuffer_multisample</li>
    <li>GL_EXT_framebuffer_object</li>
    <li>GL_EXT_framebuffer_sRGB</li>
    <li>GL_EXT_geometry_shader4</li>
    <li>GL_EXT_gpu_program_parameters</li>
    <li>GL_EXT_gpu_shader4</li>
    <li>GL_EXT_histogram</li>
    <li>GL_EXT_multi_draw_arrays</li>
    <li>GL_EXT_packed_depth_stencil</li>
    <li>GL_EXT_packed_float</li>
    <li>GL_EXT_packed_pixels</li>
    <li>GL_EXT_pixel_buffer_object</li>
    <li>GL_EXT_point_parameters</li>
    <li>GL_EXT_provoking_vertex</li>
    <li>GL_EXT_rescale_normal</li>
    <li>GL_EXT_secondary_color</li>
    <li>GL_EXT_separate_specular_color</li>
    <li>GL_EXT_shadow_funcs</li>
    <li>GL_EXT_stencil_wrap</li>
    <li>GL_EXT_subtexture</li>
    <li>GL_EXT_texgen_reflection</li>
    <li>GL_EXT_texture3D</li>
    <li>GL_EXT_texture_array</li>
    <li>GL_EXT_texture_buffer_object</li>
    <li>GL_EXT_texture_buffer_object_rgb32</li>
    <li>GL_EXT_texture_compression_latc</li>
    <li>GL_EXT_texture_compression_rgtc</li>
    <li>GL_EXT_texture_compression_s3tc</li>
    <li>GL_EXT_texture_cube_map</li>
    <li>GL_EXT_texture_edge_clamp</li>
    <li>GL_EXT_texture_env_add</li>
    <li>GL_EXT_texture_env_combine</li>
    <li>GL_EXT_texture_env_dot3</li>
    <li>GL_EXT_texture_filter_anisotropic</li>
    <li>GL_EXT_texture_integer</li>
    <li>GL_EXT_texture_lod</li>
    <li>GL_EXT_texture_lod_bias</li>
    <li>GL_EXT_texture_mirror_clamp</li>
    <li>GL_EXT_texture_object</li>
    <li>GL_EXT_texture_rectangle</li>
    <li>GL_EXT_texture_sRGB</li>
    <li>GL_EXT_texture_shared_exponent</li>
    <li>GL_EXT_texture_snorm</li>
    <li>GL_EXT_texture_swizzle</li>
    <li>GL_EXT_timer_query</li>
    <li>GL_EXT_transform_feedback</li>
    <li>GL_EXT_vertex_array</li>
    <li>GL_EXT_vertex_array_bgra</li>
    <li>GL_IBM_texture_mirrored_repeat</li>
    <li>GL_KTX_buffer_region</li>
    <li>GL_NV_blend_square</li>
    <li>GL_NV_conditional_render</li>
    <li>GL_NV_copy_depth_to_color</li>
    <li>GL_NV_explicit_multisample</li>
    <li>GL_NV_primitive_restart</li>
    <li>GL_NV_texgen_reflection</li>
    <li>GL_SGIS_generate_mipmap</li>
    <li>GL_SGIS_texture_edge_clamp</li>
    <li>GL_SGIS_texture_lod</li>
    <li>GL_SUN_multi_draw_arrays</li>
    <li>GL_WIN_swap_hint</li>
    <li>WGL_ARB_extensions_string</li>
    <li>WGL_ARB_pixel_format</li>
    <li>WGL_ATI_pixel_format_float</li>
    <li>WGL_ARB_pixel_format_float</li>
    <li>WGL_ARB_multisample</li>
    <li>WGL_EXT_swap_control</li>
    <li>WGL_ARB_pbuffer</li>
    <li>WGL_ARB_render_texture</li>
    <li>WGL_ARB_make_current_read</li>
    <li>WGL_EXT_extensions_string</li>
    <li>WGL_ARB_buffer_region</li>
    <li>WGL_EXT_framebuffer_sRGB</li>
    <li>WGL_ATI_render_texture_rectangle</li>
    <li>WGL_EXT_pixel_format_packed_float</li>
    <li>WGL_I3D_genlock</li>
    <li>WGL_NV_swap_group</li>
    <li>WGL_ARB_create_context</li>
    <li>WGL_AMD_gpu_association</li>
    <li>WGL_AMDX_gpu_association</li>
    <li>WGL_ARB_create_context_profile</li>


===================================[ OpenCL Capabilities ]
- Num OpenCL platforms: 1
- Name: ATI Stream
- Version: OpenCL 1.0 ATI-Stream-v2.0.0
- Profile: FULL_PROFILE
- Vendor: Advanced Micro Devices, Inc.
- Num devices: 2

- CL_DEVICE_NAME: AMD Phenom(tm) II X4 20 Processor
- CL_DEVICE_VENDOR: AuthenticAMD
- CL_DRIVER_VERSION: 1.0
- CL_DEVICE_PROFILE: FULL_PROFILE
- CL_DEVICE_VERSION: OpenCL 1.0 ATI-Stream-v2.0.0
- CL_DEVICE_TYPE: CPU
- CL_DEVICE_VENDOR_ID: 0x1002
- CL_DEVICE_MAX_COMPUTE_UNITS: 4
- CL_DEVICE_MAX_CLOCK_FREQUENCY: 3214MHz
- CL_DEVICE_ADDRESS_BITS: 32
- CL_DEVICE_MAX_MEM_ALLOC_SIZE: 524288KB
- CL_DEVICE_GLOBAL_MEM_SIZE: 1024MB
- CL_DEVICE_MAX_PARAMETER_SIZE: 4096
- CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE: 64 Bytes
- CL_DEVICE_GLOBAL_MEM_CACHE_SIZE: 64KB
- CL_DEVICE_ERROR_CORRECTION_SUPPORT: NO
- CL_DEVICE_LOCAL_MEM_TYPE: Global
- CL_DEVICE_LOCAL_MEM_SIZE: 32KB
- CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE: 64KB
- CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3
- CL_DEVICE_MAX_WORK_ITEM_SIZES: [1024 ; 1024 ; 1024]
- CL_DEVICE_MAX_WORK_GROUP_SIZE: 1024
- CL_EXEC_NATIVE_KERNEL: 4628960
- CL_DEVICE_IMAGE_SUPPORT: NO
- CL_DEVICE_MAX_READ_IMAGE_ARGS: 0
- CL_DEVICE_MAX_WRITE_IMAGE_ARGS: 0
- CL_DEVICE_IMAGE2D_MAX_WIDTH: 0
- CL_DEVICE_IMAGE2D_MAX_HEIGHT: 0
- CL_DEVICE_IMAGE3D_MAX_WIDTH: 0
- CL_DEVICE_IMAGE3D_MAX_HEIGHT: 0
- CL_DEVICE_IMAGE3D_MAX_DEPTH: 0
- CL_DEVICE_MAX_SAMPLERS: 0
- CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR: 16
- CL_DEVICE_PREFERRED_VECTOR_WIDTH_SHORT: 8
- CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT: 4
- CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG: 2
- CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT: 4
- CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE: 0
- CL_DEVICE_EXTENSIONS: 5
- Extensions:
- cl_khr_global_int32_base_atomics
- cl_khr_global_int32_extended_atomics
- cl_khr_local_int32_base_atomics
- cl_khr_local_int32_extended_atomics
- cl_khr_byte_addressable_store

- CL_DEVICE_NAME: Cypress
- CL_DEVICE_VENDOR: Advanced Micro Devices, Inc.
- CL_DRIVER_VERSION: CAL 1.4.553
- CL_DEVICE_PROFILE: FULL_PROFILE
- CL_DEVICE_VERSION: OpenCL 1.0 ATI-Stream-v2.0.0
- CL_DEVICE_TYPE: GPU
- CL_DEVICE_VENDOR_ID: 0x1002
- CL_DEVICE_MAX_COMPUTE_UNITS: 18
- CL_DEVICE_MAX_CLOCK_FREQUENCY: 765MHz
- CL_DEVICE_ADDRESS_BITS: 32
- CL_DEVICE_MAX_MEM_ALLOC_SIZE: 262144KB
- CL_DEVICE_GLOBAL_MEM_SIZE: 256MB
- CL_DEVICE_MAX_PARAMETER_SIZE: 1024
- CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE: 0 Bytes
- CL_DEVICE_GLOBAL_MEM_CACHE_SIZE: 0KB
- CL_DEVICE_ERROR_CORRECTION_SUPPORT: NO
- CL_DEVICE_LOCAL_MEM_TYPE: Local (scratchpad)
- CL_DEVICE_LOCAL_MEM_SIZE: 32KB
- CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE: 64KB
- CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3
- CL_DEVICE_MAX_WORK_ITEM_SIZES: [256 ; 256 ; 256]
- CL_DEVICE_MAX_WORK_GROUP_SIZE: 256
- CL_EXEC_NATIVE_KERNEL: 4628960
- CL_DEVICE_IMAGE_SUPPORT: NO
- CL_DEVICE_MAX_READ_IMAGE_ARGS: 0
- CL_DEVICE_MAX_WRITE_IMAGE_ARGS: 0
- CL_DEVICE_IMAGE2D_MAX_WIDTH: 0
- CL_DEVICE_IMAGE2D_MAX_HEIGHT: 0
- CL_DEVICE_IMAGE3D_MAX_WIDTH: 0
- CL_DEVICE_IMAGE3D_MAX_HEIGHT: 0
- CL_DEVICE_IMAGE3D_MAX_DEPTH: 0
- CL_DEVICE_MAX_SAMPLERS: 0
- CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR: 16
- CL_DEVICE_PREFERRED_VECTOR_WIDTH_SHORT: 8
- CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT: 4
- CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG: 2
- CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT: 4
- CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE: 0
- CL_DEVICE_EXTENSIONS: 4
- Extensions:
- cl_khr_global_int32_base_atomics
- cl_khr_global_int32_extended_atomics
- cl_khr_local_int32_base_atomics
- cl_khr_local_int32_extended_atomics


===================================[ Misc. ]


===================================[ Related Graphics Drivers ]
- http://www.geeks3d.com/?page_id=752
- http://downloads.guru3d.com/download.php?id=18
- http://www.tweakguides.com/ATICAT_1.html


===================================[ Related Graphics Cards Reviews ]

90
3D-Tech News Around The Web / Re: ATI Catalyst 10.-1 WHQL available
« on: January 28, 2010, 02:33:12 AM »
From Twitter:

RT @PackFanJason: @CatalystMaker Can you give us any hints at the new features? Please say Profiles! ;) ME - profiles

10.1 is just a standard release to fix up some bugs, but 10.2 and 10.3 will indeed have some major new features

The correct 10.1 is up, the issue is that the version number just shows up as 9.12 - just use them as normal

Thanks for letting me know guys. Someone forgot to update the version number in CCC. It says 9.12 rather than 10.1

91
General Discussion / Re: Sailfish OpenCL fluid simulation
« on: January 22, 2010, 11:39:39 AM »
Since I can't attach files, here is a zip file:
http://www.leithalweapon.geek.nz/sailfish-mod.zip

92
General Discussion / Sailfish OpenCL fluid simulation
« on: January 22, 2010, 11:23:05 AM »
Hi JeGX,

Have you looked at the OpenCL fluid simulation Sailfish before?
http://sailfish.us.edu.pl/index.html

It uses pyopencl!

I have been hacking away to get this running under Windows. I have installed Python all the required modules, including compiling pyopencl against the ATI Stream 2.0 SDK.

I had to make a few modifications to Sailfish as it was designed for Linux. This included removing a hardcoded path, and adding support for the OpenCL ICD (selecting an OpenCL platform...).

I have had mixed results.
The example scripts compile fine, and run with no exceptions, yet I get black pixels, and the frames simulated per second is impossibly fast. I have the feeling the kernel is not running, but no error seems to be generated...
Also the ATI compiler always returns warnings that  the -cl-single-precision-constant and -cl-fast-relaxed-math compiler options are not supported despite being in the OpenCL specification...

But I can get the simulation to work perfectly and produce the red pixels if I run it via clprofile.exe! It probably runs slower though thanks to all the logging to stdout and the CSV file...
Why does profiling make it work?!?

Also the 3D simulations always fail with the following error:
Code: [Select]
Traceback (most recent call last):
  File "c:\sailfish\examples\lbm_ldc_3d.py", line 54, in <module>
    sim.run()
  File "C:\python26\lib\site-packages\sailfish\lbm.py", line 715, in run
    self.vis.main()
  File "C:\python26\lib\site-packages\sailfish\vis2d.py", line 312, in main
    self.sim.sim_step(self._tracers)
  File "C:\python26\lib\site-packages\sailfish\lbm.py", line 532, in sim_step
    self.backend.run_kernel(kerns[1], self.kern_grid_size)
  File "C:\python26\lib\site-packages\sailfish\backend_opencl.py", line 54, in r
un_kernel
    cl.enqueue_nd_range_kernel(self.queue, kernel, global_size, kernel.block)
pyopencl.LogicError: enqueue_nd_range_kernel failed: invalid value - global/work
 work sizes have differing dimensions

I have traced this back possibly being to line 517 in lbm.py:
Code: [Select]
self.kern_grid_size = (self.options.lat_nx/self.block_size * self.options.lat_ny, self.options.lat_nz)but if I change this to:
Code: [Select]
self.kern_grid_size = (self.options.lat_nx/self.block_size, self.options.lat_ny, self.options.lat_nz)the script runs but I get black pixels (even when inside the profiler).

Could you try this out on your setup and see what you get?
I have attached the modified python files.

In case it is any help I have attached a GPU Caps Viewer XML file.

Also:
Latest 32bit 2.6 python, and latest version of all required modules.
Compiled pyopencl with Visual Studio 2008 Professional SP1.
I am only running OpenCL on CPU as I have not got a graphics card for my new system yet.

Leith

Pages: 1 ... 3 4 [5]