AMD Catalyst 13.4 WHQL Released with New OpenGL 4.3 Extensions

AMD Catalyst logo, ascii


AMD has released a new set of WHQL drivers for Radeon HD 5000/6000/7000 videocards. Catalyst 13.4 adds the support added for AMD Radeon HD 7790 and AMD Radeon HD 7990, brings some optimizations for Far Cry 3, Crysis 3, and 3DMark, significantly improves latency performance in Skyrim, Boderlands 2, Guild Wars 2, Tomb Raider and Hitman Absolution.

Catalyst 13.4 downloads

AMD Catalyst 13.4 WHQL, CCC information

AMD Catalyst 13.4 WHQL, + Radeon HD 7970 + GPU Caps Viewer

Catalyst 13.4 is still an OpenGL 4.2 driver (4.2.12217 Compatibility Profile/Debug Context 12.104.0.0) and exposes 259 OpenGL extensions (GL=236 and WGL=23) for a Radeon HD 7970 under Windows 7 64-bit. Compared to Catalyst 13.3 beta 2, Catalyst 13.4 adds four new extensions, three being related to OpenGL 4.3:

  • GL_AMD_interleaved_elements
  • GL_ARB_compute_shader
  • GL_ARB_program_interface_query
  • GL_ARB_shader_storage_buffer_object

Here is the complete list of all 259 OpenGL extensions for the HD 7970 under Win7 64-bit:

  • GL_AMDX_debug_output
  • GL_AMD_blend_minmax_factor
  • GL_AMD_conservative_depth
  • GL_AMD_debug_output
  • GL_AMD_depth_clamp_separate
  • GL_AMD_draw_buffers_blend
  • GL_AMD_interleaved_elements
  • GL_AMD_multi_draw_indirect
  • GL_AMD_name_gen_delete
  • GL_AMD_performance_monitor
  • GL_AMD_pinned_memory
  • GL_AMD_query_buffer_object
  • GL_AMD_sample_positions
  • GL_AMD_seamless_cubemap_per_texture
  • GL_AMD_shader_stencil_export
  • GL_AMD_shader_stencil_value_export
  • GL_AMD_shader_trace
  • GL_AMD_shader_trinary_minmax
  • GL_AMD_sparse_texture
  • GL_AMD_stencil_operation_extended
  • GL_AMD_texture_cube_map_array
  • GL_AMD_texture_texture4
  • GL_AMD_transform_feedback3_lines_triangles
  • GL_AMD_transform_feedback4
  • GL_AMD_vertex_shader_layer
  • GL_AMD_vertex_shader_viewport_index
  • GL_ARB_ES2_compatibility
  • GL_ARB_ES3_compatibility
  • GL_ARB_arrays_of_arrays
  • GL_ARB_base_instance
  • GL_ARB_blend_func_extended
  • GL_ARB_clear_buffer_object
  • GL_ARB_color_buffer_float
  • GL_ARB_compatibility
  • GL_ARB_compressed_texture_pixel_storage
  • GL_ARB_compute_shader
  • GL_ARB_conservative_depth
  • GL_ARB_copy_buffer
  • GL_ARB_debug_output
  • GL_ARB_depth_buffer_float
  • GL_ARB_depth_clamp
  • GL_ARB_depth_texture
  • GL_ARB_draw_buffers
  • GL_ARB_draw_buffers_blend
  • GL_ARB_draw_elements_base_vertex
  • GL_ARB_draw_indirect
  • GL_ARB_draw_instanced
  • GL_ARB_explicit_attrib_location
  • GL_ARB_explicit_uniform_location
  • GL_ARB_fragment_coord_conventions
  • GL_ARB_fragment_layer_viewport
  • GL_ARB_fragment_program
  • GL_ARB_fragment_program_shadow
  • GL_ARB_fragment_shader
  • GL_ARB_framebuffer_object
  • GL_ARB_framebuffer_sRGB
  • GL_ARB_geometry_shader4
  • GL_ARB_get_program_binary
  • GL_ARB_gpu_shader5
  • GL_ARB_gpu_shader_fp64
  • GL_ARB_half_float_pixel
  • GL_ARB_half_float_vertex
  • GL_ARB_imaging
  • GL_ARB_instanced_arrays
  • GL_ARB_internalformat_query
  • GL_ARB_invalidate_subdata
  • GL_ARB_map_buffer_alignment
  • GL_ARB_map_buffer_range
  • GL_ARB_multi_draw_indirect
  • GL_ARB_multisample
  • GL_ARB_multitexture
  • GL_ARB_occlusion_query
  • GL_ARB_occlusion_query2
  • GL_ARB_pixel_buffer_object
  • GL_ARB_point_parameters
  • GL_ARB_point_sprite
  • GL_ARB_program_interface_query
  • GL_ARB_provoking_vertex
  • GL_ARB_sample_shading
  • GL_ARB_sampler_objects
  • GL_ARB_seamless_cube_map
  • GL_ARB_separate_shader_objects
  • GL_ARB_shader_atomic_counters
  • GL_ARB_shader_bit_encoding
  • GL_ARB_shader_image_load_store
  • GL_ARB_shader_image_size
  • GL_ARB_shader_objects
  • GL_ARB_shader_precision
  • GL_ARB_shader_stencil_export
  • GL_ARB_shader_storage_buffer_object
  • GL_ARB_shader_subroutine
  • GL_ARB_shader_texture_lod
  • GL_ARB_shading_language_100
  • GL_ARB_shading_language_420pack
  • GL_ARB_shading_language_packing
  • GL_ARB_shadow
  • GL_ARB_shadow_ambient
  • GL_ARB_stencil_texturing
  • GL_ARB_sync
  • GL_ARB_tessellation_shader
  • GL_ARB_texture_border_clamp
  • GL_ARB_texture_buffer_object
  • GL_ARB_texture_buffer_object_rgb32
  • GL_ARB_texture_buffer_range
  • GL_ARB_texture_compression
  • GL_ARB_texture_compression_bptc
  • GL_ARB_texture_compression_rgtc
  • GL_ARB_texture_cube_map
  • GL_ARB_texture_cube_map_array
  • GL_ARB_texture_env_add
  • GL_ARB_texture_env_combine
  • GL_ARB_texture_env_crossbar
  • GL_ARB_texture_env_dot3
  • GL_ARB_texture_float
  • GL_ARB_texture_gather
  • GL_ARB_texture_mirrored_repeat
  • GL_ARB_texture_multisample
  • GL_ARB_texture_non_power_of_two
  • GL_ARB_texture_query_levels
  • GL_ARB_texture_query_lod
  • GL_ARB_texture_rectangle
  • GL_ARB_texture_rg
  • GL_ARB_texture_rgb10_a2ui
  • GL_ARB_texture_snorm
  • GL_ARB_texture_storage
  • GL_ARB_texture_storage_multisample
  • GL_ARB_timer_query
  • GL_ARB_transform_feedback2
  • GL_ARB_transform_feedback3
  • GL_ARB_transform_feedback_instanced
  • GL_ARB_transpose_matrix
  • GL_ARB_uniform_buffer_object
  • GL_ARB_vertex_array_bgra
  • GL_ARB_vertex_array_object
  • GL_ARB_vertex_attrib_64bit
  • GL_ARB_vertex_buffer_object
  • GL_ARB_vertex_program
  • GL_ARB_vertex_shader
  • GL_ARB_vertex_type_2_10_10_10_rev
  • GL_ARB_viewport_array
  • GL_ARB_window_pos
  • GL_ATI_draw_buffers
  • GL_ATI_envmap_bumpmap
  • GL_ATI_fragment_shader
  • GL_ATI_separate_stencil
  • GL_ATI_texture_compression_3dc
  • GL_ATI_texture_env_combine3
  • GL_ATI_texture_float
  • GL_ATI_texture_mirror_once
  • GL_EXT_abgr
  • GL_EXT_bgra
  • GL_EXT_bindable_uniform
  • GL_EXT_blend_color
  • GL_EXT_blend_equation_separate
  • GL_EXT_blend_func_separate
  • GL_EXT_blend_minmax
  • GL_EXT_blend_subtract
  • GL_EXT_compiled_vertex_array
  • GL_EXT_copy_buffer
  • GL_EXT_copy_texture
  • GL_EXT_depth_bounds_test
  • GL_EXT_direct_state_access
  • GL_EXT_draw_buffers2
  • GL_EXT_draw_instanced
  • GL_EXT_draw_range_elements
  • GL_EXT_fog_coord
  • GL_EXT_framebuffer_blit
  • GL_EXT_framebuffer_multisample
  • GL_EXT_framebuffer_object
  • GL_EXT_framebuffer_sRGB
  • GL_EXT_geometry_shader4
  • GL_EXT_gpu_program_parameters
  • GL_EXT_gpu_shader4
  • GL_EXT_histogram
  • GL_EXT_multi_draw_arrays
  • GL_EXT_packed_depth_stencil
  • GL_EXT_packed_float
  • GL_EXT_packed_pixels
  • GL_EXT_pixel_buffer_object
  • GL_EXT_point_parameters
  • GL_EXT_provoking_vertex
  • GL_EXT_rescale_normal
  • GL_EXT_secondary_color
  • GL_EXT_separate_specular_color
  • GL_EXT_shader_image_load_store
  • GL_EXT_shadow_funcs
  • GL_EXT_stencil_wrap
  • GL_EXT_subtexture
  • GL_EXT_texgen_reflection
  • GL_EXT_texture3D
  • GL_EXT_texture_array
  • GL_EXT_texture_buffer_object
  • GL_EXT_texture_compression_bptc
  • GL_EXT_texture_compression_latc
  • GL_EXT_texture_compression_rgtc
  • GL_EXT_texture_compression_s3tc
  • GL_EXT_texture_cube_map
  • GL_EXT_texture_edge_clamp
  • GL_EXT_texture_env_add
  • GL_EXT_texture_env_combine
  • GL_EXT_texture_env_dot3
  • GL_EXT_texture_filter_anisotropic
  • GL_EXT_texture_integer
  • GL_EXT_texture_lod
  • GL_EXT_texture_lod_bias
  • GL_EXT_texture_mirror_clamp
  • GL_EXT_texture_object
  • GL_EXT_texture_rectangle
  • GL_EXT_texture_sRGB
  • GL_EXT_texture_sRGB_decode
  • GL_EXT_texture_shared_exponent
  • GL_EXT_texture_snorm
  • GL_EXT_texture_storage
  • GL_EXT_texture_swizzle
  • GL_EXT_timer_query
  • GL_EXT_transform_feedback
  • GL_EXT_vertex_array
  • GL_EXT_vertex_array_bgra
  • GL_EXT_vertex_attrib_64bit
  • GL_IBM_texture_mirrored_repeat
  • GL_KTX_buffer_region
  • GL_NV_blend_square
  • GL_NV_conditional_render
  • GL_NV_copy_depth_to_color
  • GL_NV_copy_image
  • GL_NV_explicit_multisample
  • GL_NV_float_buffer
  • GL_NV_half_float
  • GL_NV_primitive_restart
  • GL_NV_texgen_reflection
  • GL_NV_texture_barrier
  • GL_SGIS_generate_mipmap
  • GL_SGIS_texture_edge_clamp
  • GL_SGIS_texture_lod
  • GL_SUN_multi_draw_arrays
  • GL_WIN_swap_hint
  • WGL_EXT_swap_control
  • WGL_ARB_extensions_string
  • WGL_ARB_pixel_format
  • WGL_ATI_pixel_format_float
  • WGL_ARB_pixel_format_float
  • WGL_ARB_multisample
  • WGL_EXT_swap_control_tear
  • WGL_ARB_pbuffer
  • WGL_ARB_render_texture
  • WGL_ARB_make_current_read
  • WGL_EXT_extensions_string
  • WGL_ARB_buffer_region
  • WGL_EXT_framebuffer_sRGB
  • WGL_ATI_render_texture_rectangle
  • WGL_EXT_pixel_format_packed_float
  • WGL_I3D_genlock
  • WGL_NV_swap_group
  • WGL_ARB_create_context
  • WGL_AMD_gpu_association
  • WGL_AMDX_gpu_association
  • WGL_ARB_create_context_profile
  • WGL_NV_DX_interop
  • WGL_NV_DX_interop2

Catalyst 13.4 supports OpenCL 1.2 and exposes 20 extensions. Here is the report from GPU Caps Viewer (the test machine has also Intel OpenCL drivers, that’s why there are two OpenCL platforms):

- Num OpenCL platforms: 2
- CL_PLATFORM_NAME: Intel(R) OpenCL
- CL_PLATFORM_VENDOR: Intel(R) Corporation
- CL_PLATFORM_VERSION: OpenCL 1.2 
- CL_PLATFORM_PROFILE: FULL_PROFILE
- Num devices: 1

	- CL_DEVICE_NAME:        Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz
	- CL_DEVICE_VENDOR: Intel(R) Corporation
	- CL_DRIVER_VERSION: 1.2
	- CL_DEVICE_PROFILE: FULL_PROFILE
	- CL_DEVICE_VERSION: OpenCL 1.2 (Build 63463)
	- CL_DEVICE_TYPE: CPU
	- CL_DEVICE_VENDOR_ID: 0x8086
	- CL_DEVICE_MAX_COMPUTE_UNITS: 8
	- CL_DEVICE_MAX_CLOCK_FREQUENCY: 3500MHz
	- CL_DEVICE_ADDRESS_BITS: 32
	- CL_DEVICE_MAX_MEM_ALLOC_SIZE: 524256KB
	- CL_DEVICE_GLOBAL_MEM_SIZE: 2047MB
	- CL_DEVICE_MAX_PARAMETER_SIZE: 3840
	- CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE: 64 Bytes
	- CL_DEVICE_GLOBAL_MEM_CACHE_SIZE: 256KB
	- CL_DEVICE_ERROR_CORRECTION_SUPPORT: NO
	- CL_DEVICE_LOCAL_MEM_TYPE: Global
	- CL_DEVICE_LOCAL_MEM_SIZE: 32KB
	- CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE: 128KB
	- CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3
	- CL_DEVICE_MAX_WORK_ITEM_SIZES: [1024 ; 1024 ; 1024]
	- CL_DEVICE_MAX_WORK_GROUP_SIZE: 1024
	- CL_EXEC_NATIVE_KERNEL: 2682968
	- CL_DEVICE_IMAGE_SUPPORT: YES
	- CL_DEVICE_MAX_READ_IMAGE_ARGS: 480
	- CL_DEVICE_MAX_WRITE_IMAGE_ARGS: 480
	- CL_DEVICE_IMAGE2D_MAX_WIDTH: 16384
	- CL_DEVICE_IMAGE2D_MAX_HEIGHT: 16384
	- CL_DEVICE_IMAGE3D_MAX_WIDTH: 2048
	- CL_DEVICE_IMAGE3D_MAX_HEIGHT: 2048
	- CL_DEVICE_IMAGE3D_MAX_DEPTH: 2048
	- CL_DEVICE_MAX_SAMPLERS: 480
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR: 1
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_SHORT: 1
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT: 1
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG: 1
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT: 1
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE: 1
	- CL_DEVICE_EXTENSIONS: 14
	- Extensions:
		- cl_khr_fp64
		- cl_khr_icd
		- cl_khr_global_int32_base_atomics
		- cl_khr_global_int32_extended_atomics
		- cl_khr_local_int32_base_atomics
		- cl_khr_local_int32_extended_atomics
		- cl_khr_byte_addressable_store
		- cl_intel_printf
		- cl_ext_device_fission
		- cl_intel_exec_by_local_thread
		- cl_khr_gl_sharing
		- cl_intel_dx9_media_sharing
		- cl_khr_dx9_media_sharing
		- cl_khr_d3d11_sharing
- CL_PLATFORM_NAME: AMD Accelerated Parallel Processing
- CL_PLATFORM_VENDOR: Advanced Micro Devices, Inc.
- CL_PLATFORM_VERSION: OpenCL 1.2 AMD-APP (1124.2)
- CL_PLATFORM_PROFILE: FULL_PROFILE
- Num devices: 2

	- CL_DEVICE_NAME: Tahiti
	- CL_DEVICE_VENDOR: Advanced Micro Devices, Inc.
	- CL_DRIVER_VERSION: 1124.2 (VM)
	- CL_DEVICE_PROFILE: FULL_PROFILE
	- CL_DEVICE_VERSION: OpenCL 1.2 AMD-APP (1124.2)
	- CL_DEVICE_TYPE: GPU
	- CL_DEVICE_VENDOR_ID: 0x1002
	- CL_DEVICE_MAX_COMPUTE_UNITS: 32
	- CL_DEVICE_MAX_CLOCK_FREQUENCY: 925MHz
	- CL_DEVICE_ADDRESS_BITS: 32
	- CL_DEVICE_MAX_MEM_ALLOC_SIZE: 524288KB
	- CL_DEVICE_GLOBAL_MEM_SIZE: 2048MB
	- CL_DEVICE_MAX_PARAMETER_SIZE: 1024
	- CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE: 64 Bytes
	- CL_DEVICE_GLOBAL_MEM_CACHE_SIZE: 16KB
	- CL_DEVICE_ERROR_CORRECTION_SUPPORT: NO
	- CL_DEVICE_LOCAL_MEM_TYPE: Local (scratchpad)
	- CL_DEVICE_LOCAL_MEM_SIZE: 32KB
	- CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE: 64KB
	- CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3
	- CL_DEVICE_MAX_WORK_ITEM_SIZES: [256 ; 256 ; 256]
	- CL_DEVICE_MAX_WORK_GROUP_SIZE: 256
	- CL_EXEC_NATIVE_KERNEL: 2682964
	- CL_DEVICE_IMAGE_SUPPORT: YES
	- CL_DEVICE_MAX_READ_IMAGE_ARGS: 128
	- CL_DEVICE_MAX_WRITE_IMAGE_ARGS: 8
	- CL_DEVICE_IMAGE2D_MAX_WIDTH: 16384
	- CL_DEVICE_IMAGE2D_MAX_HEIGHT: 16384
	- CL_DEVICE_IMAGE3D_MAX_WIDTH: 2048
	- CL_DEVICE_IMAGE3D_MAX_HEIGHT: 2048
	- CL_DEVICE_IMAGE3D_MAX_DEPTH: 2048
	- CL_DEVICE_MAX_SAMPLERS: 16
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR: 4
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_SHORT: 2
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT: 1
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG: 1
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT: 1
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE: 1
	- CL_DEVICE_EXTENSIONS: 20
	- Extensions:
		- cl_khr_fp64
		- cl_amd_fp64
		- cl_khr_global_int32_base_atomics
		- cl_khr_global_int32_extended_atomics
		- cl_khr_local_int32_base_atomics
		- cl_khr_local_int32_extended_atomics
		- cl_khr_int64_base_atomics
		- cl_khr_int64_extended_atomics
		- cl_khr_3d_image_writes
		- cl_khr_byte_addressable_store
		- cl_khr_gl_sharing
		- cl_ext_atomic_counters_32
		- cl_amd_device_attribute_query
		- cl_amd_vec3
		- cl_amd_printf
		- cl_amd_media_ops
		- cl_amd_media_ops2
		- cl_amd_popcnt
		- cl_khr_d3d10_sharing
		- cl_amd_c1x_atomics

	- CL_DEVICE_NAME:        Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz
	- CL_DEVICE_VENDOR: GenuineIntel
	- CL_DRIVER_VERSION: 1124.2 (sse2)
	- CL_DEVICE_PROFILE: FULL_PROFILE
	- CL_DEVICE_VERSION: OpenCL 1.2 AMD-APP (1124.2)
	- CL_DEVICE_TYPE: CPU
	- CL_DEVICE_VENDOR_ID: 0x1002
	- CL_DEVICE_MAX_COMPUTE_UNITS: 8
	- CL_DEVICE_MAX_CLOCK_FREQUENCY: 3510MHz
	- CL_DEVICE_ADDRESS_BITS: 32
	- CL_DEVICE_MAX_MEM_ALLOC_SIZE: 1048576KB
	- CL_DEVICE_GLOBAL_MEM_SIZE: 2048MB
	- CL_DEVICE_MAX_PARAMETER_SIZE: 4096
	- CL_DEVICE_GLOBAL_MEM_CACHELINE_SIZE: 64 Bytes
	- CL_DEVICE_GLOBAL_MEM_CACHE_SIZE: 32KB
	- CL_DEVICE_ERROR_CORRECTION_SUPPORT: NO
	- CL_DEVICE_LOCAL_MEM_TYPE: Global
	- CL_DEVICE_LOCAL_MEM_SIZE: 32KB
	- CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE: 64KB
	- CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS: 3
	- CL_DEVICE_MAX_WORK_ITEM_SIZES: [1024 ; 1024 ; 1024]
	- CL_DEVICE_MAX_WORK_GROUP_SIZE: 1024
	- CL_EXEC_NATIVE_KERNEL: 2682968
	- CL_DEVICE_IMAGE_SUPPORT: YES
	- CL_DEVICE_MAX_READ_IMAGE_ARGS: 128
	- CL_DEVICE_MAX_WRITE_IMAGE_ARGS: 8
	- CL_DEVICE_IMAGE2D_MAX_WIDTH: 8192
	- CL_DEVICE_IMAGE2D_MAX_HEIGHT: 8192
	- CL_DEVICE_IMAGE3D_MAX_WIDTH: 2048
	- CL_DEVICE_IMAGE3D_MAX_HEIGHT: 2048
	- CL_DEVICE_IMAGE3D_MAX_DEPTH: 2048
	- CL_DEVICE_MAX_SAMPLERS: 16
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR: 16
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_SHORT: 8
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT: 4
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_LONG: 2
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT: 4
	- CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE: 2
	- CL_DEVICE_EXTENSIONS: 17
	- Extensions:
		- cl_khr_fp64
		- cl_amd_fp64
		- cl_khr_global_int32_base_atomics
		- cl_khr_global_int32_extended_atomics
		- cl_khr_local_int32_base_atomics
		- cl_khr_local_int32_extended_atomics
		- cl_khr_3d_image_writes
		- cl_khr_byte_addressable_store
		- cl_khr_gl_sharing
		- cl_ext_device_fission
		- cl_amd_device_attribute_query
		- cl_amd_vec3
		- cl_amd_printf
		- cl_amd_media_ops
		- cl_amd_media_ops2
		- cl_amd_popcnt
		- cl_khr_d3d10_sharing

5 thoughts on “AMD Catalyst 13.4 WHQL Released with New OpenGL 4.3 Extensions”

  1. oscarbg

    Some bugs in new 4.3 compute shader, sbos see my bug in AMD and OGL forums report:
    Hi,
    I have been testing new OGL compute shader and storage buffer objects extension and found following bugs (13.4 on 7950):
    (please note all the samples I use for testing this work correctly on Nvidia OGL 4.3 cards)
    *using atomicMax and atomicMin on shared variables hang the GLSL compiler others like atomicOr are OK!

    groupshared uint ldsZMax;
    uint z;
    atomicMax( ldsZMax, z );

    *using a compute shader with following launch size and shared arrays usage:
    #define BLOCK_SIZE 32
    layout (local_size_x = BLOCK_SIZE, local_size_y = BLOCK_SIZE) in;
    shared double As[BLOCK_SIZE*BLOCK_SIZE];
    shared double Bs[BLOCK_SIZE*BLOCK_SIZE];
    crashes with:
    Compute shader(s) failed to link.
    Compute link error: HW_UNSUPPORTED.
    Compute shader not supported by hardware

    diminishing BLOCK_SIZE to less than 32 seems to work.. I have tested using
    layout (local_size_x = 32, local_size_y = 32) in;
    isn’t a issue so 32 should work as for this conf each of this two shared arrays is size 8192 (sizeof(double)*32*32) so total shared mem usage is 2*8192 and is equal to reported max

    size (GL_MAX_COMPUTE_SHARED_MEMORY_SIZE: 32768).. I verify this issue is on shared mem size usage as using something like
    (with BLOCK_SIZE=32):
    shared double As[BLOCK_SIZE*BLOCK_SIZE-1];
    shared double Bs[BLOCK_SIZE*BLOCK_SIZE];
    seems to compile so please fix to be able to use not only 32767 bytes of shared mem but full 32768 bytes..

    *using sbo on non compute shaders (like fragment shaders seems no be not correct)
    *getting GL_MAX_COMPUTE_WORK_GROUP_COUNT and GL_MAX_COMPUTE_WORK_GROUP_SIZE I get using debug_output bug:
    glGetIntegerv parameter has an invalid enum ‘0x91be’ (GL_INVALID_ENUM)
    other new like GL_MAX_COMPUTE_ATOMIC_COUNTERS seem to work..

    Related altough no_attachments extension is not adversited new entry points are present so I played with it using default and seems a simple test works on 79xx but not on 58xx

    series..
    glGenFramebuffers(1,&noat);
    glBindFramebuffer(GL_FRAMEBUFFER_EXT,noat);
    glFramebufferParameteri(GL_FRAMEBUFFER_EXT,GL_FRAMEBUFFER_DEFAULT_WIDTH, w);
    glFramebufferParameteri (GL_FRAMEBUFFER_EXT,GL_FRAMEBUFFER_DEFAULT_HEIGHT, h);
    a sample using this works on 7xxx series but not on 5xxx series..

  2. sfsdf

    What OGL extensions is this driver missing to get full OGL 4.3 capabilities.

Comments are closed.