Mesa 21.2.0 Release Notes / 2021-08-04
======================================

Mesa 21.2.0 is a new development release. People who are concerned
with stability and reliability should stick with a previous release or
wait for Mesa 21.2.1.

Mesa 21.2.0 implements the OpenGL 4.6 API, but the version reported by
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
4.6 is **only** available if requested at context creation.
Compatibility contexts may report a lower version depending on each driver.

Mesa 21.2.0 implements the Vulkan 1.2 API, but the version reported by
the apiVersion property of the VkPhysicalDeviceProperties struct
depends on the particular driver being used.

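
As an illustration of the Vulkan version reporting described above, the
apiVersion value is a packed 32-bit integer. The following is a minimal
sketch of how such a value can be split into its parts, assuming the bit
layout used by the Vulkan VK_API_VERSION_MAJOR / VK_API_VERSION_MINOR /
VK_API_VERSION_PATCH macros and ignoring the variant bits. The function
names are hypothetical and are not part of Mesa or the Vulkan headers.

.. code-block:: python

   def make_api_version(major, minor, patch):
       """Pack a version triple the way VK_MAKE_API_VERSION does (variant 0)."""
       return (major << 22) | (minor << 12) | patch


   def decode_api_version(api_version):
       """Split a packed apiVersion into (major, minor, patch)."""
       major = (api_version >> 22) & 0x7F   # 7 bits of major version
       minor = (api_version >> 12) & 0x3FF  # 10 bits of minor version
       patch = api_version & 0xFFF          # 12 bits of patch version
       return major, minor, patch


   def supports_vulkan_1_2(api_version):
       """True if the reported version is at least Vulkan 1.2."""
       major, minor, _ = decode_api_version(api_version)
       return (major, minor) >= (1, 2)

An application would pass the apiVersion it read from
VkPhysicalDeviceProperties to decode_api_version() rather than assuming
that every driver in this release reports 1.2.
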
SHA256 checksum
---------------

::

   0cb3c802f4b8e7699b1602c08c29d06a4d532ab5b8f7a64676c4ca6bb8f4d426  mesa-21.2.0.tar.xz

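
One way to check a downloaded tarball against the checksum above is
sketched here using Python's standard hashlib module. The helper names
and the default chunk size are illustrative assumptions, not part of any
Mesa tooling.

.. code-block:: python

   import hashlib

   # Checksum published in these release notes for mesa-21.2.0.tar.xz.
   EXPECTED_SHA256 = (
       "0cb3c802f4b8e7699b1602c08c29d06a4d532ab5b8f7a64676c4ca6bb8f4d426"
   )


   def sha256_of_file(path, chunk_size=1 << 20):
       """Return the hex SHA-256 digest of a file, read in chunks."""
       digest = hashlib.sha256()
       with open(path, "rb") as f:
           for chunk in iter(lambda: f.read(chunk_size), b""):
               digest.update(chunk)
       return digest.hexdigest()


   def verify_tarball(path, expected=EXPECTED_SHA256):
       """True if the file at path matches the expected checksum."""
       return sha256_of_file(path) == expected

Calling verify_tarball("mesa-21.2.0.tar.xz") on the downloaded file
should return True only if the archive is intact.
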
New features
------------
- zink supports GL_ARB_texture_filter_minmax, GL_ARB_shader_clock
- VK_EXT_provoking_vertex on RADV.
- VK_EXT_extended_dynamic_state2 on RADV.
- VK_EXT_global_priority_query on RADV.
- VK_EXT_physical_device_drm on RADV.
- VK_KHR_shader_subgroup_uniform_control_flow on Intel and RADV.
- VK_EXT_color_write_enable on RADV.
- 32-bit x86 builds now disable x87 math by default and use SSE2.
- GL ES 3.1 on GT21x hardware.
- VK_EXT_acquire_drm_display on RADV and ANV.
- VK_EXT_vertex_input_dynamic_state on lavapipe
- wideLines on lavapipe
- VK_EXT_line_rasterization on lavapipe
- VK_EXT_multi_draw on ANV, lavapipe, and RADV
- VK_KHR_separate_depth_stencil_layouts on lavapipe
- VK_EXT_separate_stencil_usage on lavapipe
- VK_EXT_extended_dynamic_state2 on lavapipe
- NGG shader based primitive culling is now supported by RADV.
- Panfrost supports OpenGL ES 3.1
- New Asahi driver for the Apple M1
- GL_ARB_sample_locations on zink
- GL_ARB_sparse_buffer on zink
- GL_ARB_shader_group_vote on zink
- DRM format modifiers on zink
- freedreno+turnip: Initial support for a6xx gen4 (a660, a635)
Bug fixes
---------
- The image is distorted while use iGPU(Intel GPU) rendering and output via dGPU (AMD GPU)
- lima: regression in plbu scissors cmd
- freedreno: regression in org.skia.skqp.SkQPRunner#gles_multipicturedraw_*_tiled
- Incorrect rendering
- intel/isl: Wrong surface format name in batch
- [RADV] FSR in Resident Evil: Village looks very pixelated on Polaris
- 21.2.0rc1 Build Failure - GCC6.3
- Crash in update_buffers after closing KDE "splash screen" downloader
- Firefox (wayland) crash in wayland_platform
- radeonsi: persistent, read-only buffer maps are slow to read
- substance painter flickering with jagged texture and masks shown black
- radv: FP16 mode in FidelityFX FSR doesn't look right
- Regression in Turnip with KGSL and Zink running opengl in proot
- Validation crash on wlroots after wl_shm appeared
- [RADV] Blocky corruption in Scarlet Nexus and vkd3d-proton 2.4
- Use out encoding for float immediates
- Radeon RX580 and 5700 XT: Reloading ARB assembly shaders causes very glitchy rendering
- i915g: dEQP-GLES2.functional.fragment_ops.depth_stencil..* failures
- i915g: dEQP-GLES2.functional.fragment_ops.interaction.basic_shader.* and fragment_ops.random.* failures
- KHR-GL46.shader_ballot_tests.ShaderBallot* tests fails on the main
- i965 nir compiler is lowering fdiv twice or initializing struct twice
- [radv] Textures missing in Doom (2016) w/ any temporal AA setting enabled
- Drop Android.mk
- [build error] macros.h:88:26: error: size of unnamed array is negative
- Game Issue: Nuclear Throne crashes in RadeonSI
- iris: reduce shader storage buffer object alignment
- agx_compile.h:29:10: fatal error: asahi/lib/agx_pack.h: No such file or directory
- radv: VBO range check issues with odd strides and sizes
- Crash in glLinkProgram while trying to craft the link error
- i915g: wide point failures
- Wolfenstein II: The New Colossus - Screen goes black in some cases
- [radv] [regression] Textures missing in Doom (2016)
- Shader compilation memory leaks
- radv: fd leak in Android WSI radv_AcquireImageANDROID
- SpaceEngine in Steam Proton cannot start with Mesa >=20.3
- [bisected] KDE plasma menu text renders like stretched strangely [amdgpu]
- radeonsi: glitches in Euro Truck Simulator 2
- White box for Webrender Firefox with R600_DEBUG=nir on Evergreen GPU
- radv_AllocateDescriptorSets: validation on variable description count is too strict
- Luna Sky Crashes on Launch
- Mesa crashes on undefined texture behaviour
- cache_test uses uninitialized stack memory
- nir/opt_load_store_vectorize: check_for_robustness() crashes on derefs
- [anv] GravityMark (benchmark) crashes on ANV
- turnip: corrupted geometry after tesselation shaders in GTAV
- [opengl] We happy few not being rendered correctly
- anv: dynamic state prim type is hard
- [iris][bisected] piglit test ...ext_external_objects.vk-image-display-muliple-textures failing after enabled
- Factorio: GPU hang when opening machine inventory
- RuneScape on Mesa 21.1.1 (VEGA10) has bad performance and leaks memory
- LLVM12 breaks atomicCompSwap tests with radeonsi
- freedreno: tex-miplevel-selection causes a creation of too many BOs without flushes, causing a crash
- RADV: Resident Evil Village Freezes during a specific cutscene
- Supraland: flickering black bars on ground
- u_queue.c:173:7: error: implicit declaration of function 'timespec_get' is invalid in C99
- radv: front face and cull mode dynamic state can desync
- radv: GPU hang in Cyberpunk 2077 on Big Navi
- Cyberpunk 1.22 crashes with amdgpu ring gfx_0.0.0 timeout
- [amdgpu][renoir][rx5500m]: [drm:0xffffffff8198ad5e] \*ERROR* ring gfx timeout, signaled seq=10952, emitted seq=10954
- [spirv-fuzz] SPIR-V parsing FAILED: Invalid back or cross-edge in the CFG
- anv: dEQP-VK.graphicsfuzz.spv-stable-bubblesort-flag-complex-conditionals fails
- panfrost Mount and Blade: Warband (glitches)
- anv: dEQP-VK.robustness.robustness2*no_fmt_qual.null_descriptor.samples* fail
- KHR-GLES31.core.shader_image_load_store.basic-glsl-earlyFragTests may be flakey on RPI4
- SPIRV AMD Driver compiler memory leak/usage - 8Gb+ to compile single 41Kb SPIRV file, Ubuntu 21.04
- Regression of !10941: mutter Wayland on bare metal crashes when closing HW accelerated windows
- radv: ACO miscompiles a specific DIRT 5 shader
- Copy paste bug in v3dv_cmd_buffer.c
- Segfault in mtx_unlock/amdgpu_bo_slab_destroy
- [radeonsi] Incorrect rendering when using glDisable(GL_MULTISAMPLE) with multisample backbuffer
- virtio(vulkan): Error building on Android/arm32
- [i965] regression: piglit.spec.arb_depth_buffer_float.fbo-depthstencil-gl_depth32f_stencil8-drawpixels
- iris: Select memory map cacheability settings at BO allocation time
- zink: regression for primitive-restart on ANV
- zink: Unimplemented ALU {un,}pack_half_2x16 in nir_to_spirv
- venus: dEQP-VK.api.command_buffers.bad_inheritance_info_random test failure
- [radeonsi] glClearTexImage on 1D array only clears first slice
- zink: Expected Image Operand ConstOffset to be a const object
- docs: bullet-lists no longer show any bullets
- [RADV] - Path of Exile (238960) - Ground decals are missing or corrupted using the Vulkan renderer.
- [ADL-S / TGL-U / TGL-H] Pixels missing / flickering when render some app on weston
- [iris][biected] piglit.spec.glsl-1_50.execution.geometry.clip-distance-vs-gs-out
- panfrost ff9a0000.gpu: Unhandled Page fault in AS1 at VA 0x0000000009801200
- black stripes in X/Xwayland applications under panfrost/midgard
- radv: significant overhead from radv_pipeline_has_ngg()
- mesa-21.1.1/src/gallium/drivers/vc4/vc4_resource.c:790: reading from uninitialised array
- [iris,i965][hsw,ivb,snb,bdw][bisected][regression] wflinfo crashes
- [radv] Revert !7207 once BG3 is fixed.
- piglit test 'spec.glsl-1_30.execution.range_analysis_fsat_of_nan' failure
- iris: Rework iris_transfer_map to better use iris_has_color_unresolved
- ir3_cf breaks nir_op_fquantize2f16
- [i915g] implicit fallthrough
- Add asahi to CI's build
- RADV - Regression - CPU side 'hang' in metro exodus
- Assassin's Creed Odyssey crash on launch
- Metro Exodus not starting under xwayland
- [i915g] PIPE_SHADER_CAP_MAX_HW_ATOMIC_COUNTERS is Unknown cap 38
- Elite Dangerous: Odyssey alpha crashes GPU on launch
- glmark2-es2 -b terrain crashes since Bifrost FP16
- [iris][bisected][regression] dEQP-GLES31.functional.texture.multisample.* tests crash on multiple platforms
- gallium: tc regression
- panfrost: Look into invalidate_resource()
- [ivb,hsw][i965][bisected] dEQP-GLES3.functional.shaders.texture_functions.texture.sampler2darrayshadow_vertex failing
- ci: Explicitly test shader caching
- radv: RoTR fails on Raven APU
- Tungsten Graphics links in Gallium docs
- Obs Studio Broken on Latest Mesa Git(Regression)(Bisected)
- Graphics corruption and GPU hang with RADV/LLVM
- old kernels (4.19) support in radv
- CSGO: Some default variables can cause problems with trust mode
- freedreno: dEQP-GLES3.functional.fence_sync.client_wait_sync_finish flakes
- glxclient.h:56:10: fatal error: 'loader.h' file not found
- mesa git started to break wine + UnrealTournament.exe (old dx6 game)
- SuperTuxKart artifacting on RK3399
- [amdgpu]: Golf With Your Friends (431240): ERROR Waiting for fences timed out
- don't flush for each blit/grid
- No sRGB capable visuals/fbconfigs reported in glx
- Documentation request: AMD debug variables
- docs: versions is out-of-date
- Strange results when trying to read from VK_FORMAT_R64_SFLOAT in compute shader
- anv: dEQP-VK.binding_model.buffer_device_address.set3.depth3.basessbo.convertcheck* slow
- Iris doesn't support INTEL_performance_query anymore
- [iris][bisected][regression] dEQP-GLES3.functional.texture.specification.teximage2d* failures
- RADV: TRUNC_COORD breaks gather operations
- [RADV] corruption in avatar after dying in Heroes of the Storm
- drm-shim build error with glibc 2.33
- Metro Exodus crashing due to memory overflow
- Sauerbraten shader rendering broken on RV530 (r300g)
- texture glitches on CS:GO on Tiger Lake
- Incorrect texture blitting/mapping when running Unigine-Heaven 4.0 on ADL-S / TGL-H, TGL-U
- Build fail due to "parameter name omitted" on Gallium Nine
- v3dv: DiligentEngine fail to run with SRGB swapchain
- Non-DRI builds broken by recent cleanups in Mesa core
- Cinnamon core dump after installing latest oibaf mesa build (165a69d2)
- yuv sampler lowering regression
- clover llvm build failure class llvm::VectorType has no member named getNumElements;
- Possible corruption for apps using multiple Z-buffers on TGL
Changes
-------
Aaron Liu (1):
- amd: add Yellow Carp support
Abel García Dorta (2):
- i915g: add HW atomic counters as unsupported
- i915g: fix implicit fallthrough
Adam Jackson (53):
- Revert "glx: Lift sending the MakeCurrent request to top-level code"
- gallium/xlib: Fix for recent gl_config changes
- glx/drisw: Enable GLX_ARB_create_context_no_error
- glx: Remove silly __glXGetGLVersion() indirection
- glx: Remove some truly ancient unused code
- glx: Remove major/minor version tracking from extension table
- glx: Mark GLX_{ATI_pixel_format_float,NV_float_buffer} as supported
- glx: Remove some non-functional GL extension from the table
- glx: Generalize __glXGetStringFromTable a little
- glx: Remove redundant client_support field from extension table
- glx: Enable pure-client-library extensions explicitly
- glx: Fold client_gl_only array into its one real user
- glx: Remove some ancient backwards-compatibility typedefs
- zink: Learn about VK_KHR_swapchain
- zink: Fix format query for minmax reduction support
- gallium: Fix PIPE_BIND_SAMPLER_REDUCTION_MINMAX definition to be unique
- dri: Try harder to infer the drawable fbconfig if needed
- glx: Return the right GLX opcode in synthetic MakeCurrent errors
- glx: s/dri_message/glx_message/
- glx: Add and use DebugMessageF convenience macro
- glx: Convert undocumented LIBGL_DIAGNOSTIC to LIBGL_DEBUG=verbose
- glx: Remove unused debugging printfs
- glx: Implement GLX_EXT_no_config_context
- glx: Stop pretending the GLX major number isn't 1
- glx: Stop force-enabling extensions "implied" by GLX 1.3
- glx: Simplify glXIsDirect
- glx: Remove warn-once-ery around GLX 1.3 functions
- glx: Remove unused opcode argument to __glX{Get,QueryServer}String
- glx: Stash a copy of the XExtCodes in the glx_display
- glx: Simplify some overuse of GetGLXScreenConfigs
- glx: Move server GLX vendor and version strings to glx_screen
- glx: s/Display \*/struct glx_display \*/ over internal API
- glx: Remove some dead declarations from glxclient.h
- Revert "glx: s/Display \*/struct glx_display \*/ over internal API"
- include: Remove unused i810_pci_ids.h
- i915c: Add a symlink for i830_dri.so
- mesa: Remove unused _mesa_{create,destroy}_visual
- mesa: Ignore the depth buffer when computing framebuffer floatness
- gallium: Reset attachments to ST_ATTACHMENT_INVALID when revalidating
- format/fxt1: Clean up fxt1_variance's argument list
- mesa: s/malloc/calloc/ to silence a warning
- gallium: Remove unused st_visual::render_buffer
- gallivm: Fix a signature mismatch warning
- zink/ntv: Don't call free() on ralloc'd memory
- gallium/dri: Remove unused dri_drawable::drisw_surface
- drisw: Don't bzero displaytarget pixels
- intel: properly constify isl_format_layouts
- classic/xlib: Fix the build after !9817
- swrast: Fix a warning from gcc 11
- loader/dri3: Properly initialize the XFIXES extension
- loader/dri3: Don't churn through xfixes regions in SwapBuffers
- vl/dri3: Don't leak regions on the X server
- meson: Make prefer-{crocus,iris} always take effect
Alejandro Piñeiro (30):
- v3dv/debug: print correct stage name
- v3dv/debug: use gl stage when checking debug flag
- v3dv/pipeline: track descriptor maps per stage, not per pipeline
- v3dv: remove custom icd json generation
- v3dv: move extensions table to v3dv_device
- v3dv: don't use typedef enum with broadcom stages
- v3dv: remove unused v3dv_zs_buffer_from_vk_format
- broadcom/compiler: use proper type field for atomic operations
- v3d/simulator: capture hub interrupts
- v3d/simulator: add a cache flush mode enum
- v3d/simulator: wait for cache flushes
- v3d/simulator: use the proper register when waiting on a CSD submit
- v3d/simulator: use BFC/RFC registers to wait for bin/render to complete
- broadcom/common: move v3d_tiling to common
- v3d/simulator: hw mem is now an v3d_size_t, typedef to uint32_t
- v3d/simulator: get rid of has_gca wrapper
- v3dv: rename v3dv_pack for v3dvx_pack
- v3dv/cmd_buffer: add helper job_emit_binning_prolog
- v3dv/cmd_buffer: move cl_emit calls for Draw methods to helpers
- v3dv: start to move and wrap hw-version code with v3dv_queue
- v3dv: split v3dv_pipeline hw version dependant code to a new source file
- v3dv: split v3dv_image hw version dependant code to a new source file
- v3dv: split v3dv_format hw version dependant code to a new source file
- v3dv: split v3dv_device hw version dependant code to a new source file
- v3dv: move several hw version dependant code to their own (v3dvx) source file
- v3dv: split v3dv_descriptor hw version dependant to a new source file.
- v3dv: don't use cl_packet_length for prepacked data
- v3dv: remove gen-dependant includes from v3dv_private
- v3dv/build: meson infrastructure for multi-hw-version support
- v3dv/format: expose properly that some formats are not filterable
Alexander Monakov (2):
- freedreno/drm-shim: pretend to offer DRM 1.6.0
- freedreno/drm-shim: keep GEM buffers page-aligned
Alexander Shi (1):
- mesa: texparam: Add a clamping macro to handle out-of-range floats returned as integers.
Alexey Nurmukhametov (1):
- tu/kgsl: Fix file descriptor double close
Alyssa Rosenzweig (668):
- nir: Update some comments referring to imov
- panfrost: Don't allow_forward_pixel_to_kill for Z/S blit
- panfrost: Set allow_forward_pixel_to_be_killed for blit
- panfrost: Set clean_fragment_write for blits
- panfrost: Invert blend_reads_dest logic
- panfrost: Don't allow FPK if a RT is missing
- panfrost: Allow FPK when there are no side effects
- panfrost: Keep Bifrost blendable -> pixel in table
- panfrost: Specialize blendable formats for sRGB
- panfrost: Simplify format_to_bifrost_blend prototype
- panfrost: Drop blendable format accessor
- panfrost: Always pick dithered tb formats
- panfrost: Remove padded unorm blendable formats
- docs/macos: Explain Apple GLX versus OSMesa on macOS
- nir/lower_fragcolor: Use shader_instructions_pass
- nir/lower_fragcolor: Handle fp16 outputs
- panfrost: Fix formats converting uninit from AFBC
- nir/lower_fragcolor: Fix driver_location assignment
- nir/lower_fragcolor: Take max cbufs as argument
- d3d12: Switch to nir_lower_fragcolor
- util/bitset: Add BITSET_COUNT helper
- nir: Add fsin_agx opcode
- asahi: Stub command-line compiler for AGX G13B
- agx: Add opcode descriptions as Python
- agx: Generate opcode list
- agx: Generate runtime-accessible opcode table
- agx: Generate builder routines
- agx: Stub NIR backend compiler
- agx: Remap varyings to match AGX ABI
- agx: Stub control flow walking
- agx: Stub NIR instruction iteration
- agx: Stub emit_intrinsic
- agx: Implement load_const as mov
- agx: Implement direct st_vary
- agx: Add agx_alu_src_index helper for emit_alu
- agx: Implement vec2/vec3/vec4 ops
- agx: Implement fragment_out
- agx: Add instruction printing
- agx: Add a trivial register allocator
- agx: Add instruction packing
- agx: Add packing for memory loads/stores
- agx: Add st_vary(_final) instruction packing
- agx: Terminate programs with stop and traps
- agx: Implement ld_vary
- agx: Implement simple floating point ops
- agx: Implement fsin/fcos
- agx: Add 8-bit AGX minifloat routines
- agx: Add minifloat tests
- agx: Implement native float->int conversions
- agx: Implement native int->float conversions
- agx: Add bitwise operations
- agx: Add iadd/imad integer arithmetic
- agx: Add saturated integer add/subtract support
- agx: Add 32-bit bitwise shifts
- agx: Add forward optimizing pass for fmov
- agx: Add dead code eliminator
- agx: Propagate fmov backwards as well
- agx: Propagate immediates
- agx: Implement limited case of i2i16/i2i32 as iadd
- agx: Add sysval management helper
- agx: Implement load_ubo/kernel_input
- agx: Set flag on last st_vary instruction
- agx: Lower load_attr to device memory accesses
- agx: Implement vertex_id
- agx: Add agx_tex_dim helper
- agx: Emit texture ops
- agx: Pack texture ops
- agx: Add min/max support
- agx: Support 1-bit booleans
- agx: Implement b2f
- agx: Add b2i implementation
- agx: Pack cmpsel
- agx: Support bcsel
- asahi: Add hexdump utility
- asahi: Add command buffer XML definitions
- asahi: Add allocation data structure
- asahi: Add a GenXML fork
- asahi: Add (clean room) IOKit uABI header
- asahi: Add command buffer decode helpers
- asahi: Add tiling routines
- asahi: Add device abstraction
- asahi: Add pool data structure
- asahi: Add uniform upload routines
- asahi: Add some magic IOGPU routines
- asahi: Add vertex formats table
- asahi: Add Gallium driver
- nir/opcodes: Reword confusing comment
- pan/bi: Add missing sr_count to pseudo-atomics
- pan/bi: Don't reference uninit source in ATOM_C1
- pan/bi: Add simple constant folding pass
- pan/bi: Don't reference nir_lower_mediump_outputs
- pan/bi: Simplify Python expression
- pan/bi: Union modifiers from across variants
- pan/bi: Support 16-bit load_interpolated_input
- pan/bi: Emit int CSEL instead of float by default
- pan/bi: Implement vectorized f32_to_f16
- pan/bi: Fix 16-bit fsat
- pan/bi: Improve assert for vector size errors
- pan/bi: Implement vectorized int downcasts
- pan/bi: Fix loads and stores smaller than 32 bits
- pan/bi: Lower swizzles on CLPER
- pan/bi: Add and use bi_negzero helper
- pan/bi: Don't schedule clamps to +FADD.v2f16
- pan/bi: Workaround \*V2F32_TO_V2F16 erratum
- panfrost: Don't unroll loops in GLSL
- panfrost: Remove old dEQP workaround
- pan/bi: Track dual-src blend type
- pan/bi: Handle different sizes of LD_TILE
- pan/bi: Add single-component 8-bit mkvec lowering
- pan/bi: Handle swizzles in i2i8
- pan/bi: Lower 8-bit fragment input
- panfrost: Make comment less confusing
- panfrost: Support alpha_to_one
- panfrost: Minor cleanup of blend CSO
- panfrost: Don't clobber RT0 if RTn is disabled
- pan/lower_blend: Clean up type size handling
- pan/lower_blend: Use NIR helpers
- pan/lower_blend: Rename is_bifrost->scalar
- panfrost/blend: Fix outdated comments
- panfrost/blend: Workaround a v7 implementation-detail
- panfrost/blend: Distribute to_c_factor
- panfrost/blend: Prepare for lower_fragcolor
- panfrost: Call nir_lower_fragcolor based on key
- panfrost: Assume lower_fragcolor has been called
- panfrost/lower_framebufffer: Don't use i2imp
- pan/blend: Emit explicit conversions for all types
- panfrost: Key blend shaders to the input types
- pan/mdg: Hide units behind MIDGARD_MESA_DEBUG=verbose
- pan/mdg: More concise RMU name
- pan/mdg: Don't print zero shifts
- pan/mdg: Suppress most attribute tables
- pan/mdg: Don't print explicit .rte
- pan/mdg: Don't print mem addr brackets
- pan/mdg: Reduced printed parens
- pan/mdg: Don't print zero
- pan/bi: Add imm_uintN helper
- pan/bi: Handle integer min/max ourselves
- pan/bi: Handle ineg
- pan/bi: Handle b2f ourselves
- pan/bi: Handle b2i8/16
- pan/bi: Track scalarness of 16-bit ALU
- pan/bi: Don't swizzle scalars
- pan/bi: Switch to 1-bit bools
- pan/bi: Use nir_lower_to_bit_size
- pan/mdg: Use _output_ type for outmod printing
- pan/mdg: Remove midgard_opt_copy_prop_reg
- pan/mdg: Enable nir_opt_{move, sink}
- panfrost/blend: Inline blend constants
- pan/mdg: Model blend shader interference
- panfrost: Fix typo handling blend types
- pan/bi: Change swizzled scalars to identity
- pan/bi: Adapt branching for 1-bit bools
- pan/bi: Handle make_vec with 1-bit bools
- pan/bi: Temporarily switch back to 0/~0 bools
- pan/bi: Enable NIR vectorization
- pan/bi: Fix int<-->float size converts
- pan/bi: Copyprop constants
- pan/bi: Garbage collect bifrost_nir.h
- pan/bi: Enable mediump BLEND lowering
- panfrost: Enable 16-bit support on Bifrost
- pan/lower_framebuffer: Fix bitsize mismatch
- nir: Add blend lowering pass
- panfrost: Use common blend lowering
- nir/divergence_anlysis: Add intrinsics for Bifrost
- pan/bi: Drop load_sampler_lod_parameters_pan
- pan/bi: Map load_subgroup_invocation to FAU
- pan/bi: Add "lanes per warp" accessor
- pan/bi: Add divergent intrinsic lowering pass
- asahi: Translate blend CSO to lower_blend options
- asahi: Augment Gallium key with blend state
- asahi: Call nir_lower_blend with selected key
- asahi: Garbage collect bind_state
- asahi: Implement set_blend_color
- asahi: Add blend constant system value
- asahi: Call nir_lower_fragcolor
- asahi: Fix shader key hash function
- asahi: Pass through "reads tilebuffer?" bit
- agx: Return agx_instr* from emit_intrinsic
- agx: Implement blend constant color sysvals
- agx: Rename blend -> st_tile
- agx: Add ld_tile opcode
- agx: Assume lower_fragcolor has been called
- agx: Condition writeout ops on already being emitted
- agx: Implement load_output
- agx: Set reads_tib appropriately
- panfrost: Drop panfrost_fence in favour of pipe_fence_handle
- docs: Simplify now that kmsro is autoenabled
- pan/bi: Add first_vertex to vertex ID
- panfrost: Track buffers needing resolve
- panfrost: Set discard based on the resolve set
- panfrost: Implement framebuffer invalidation
- panfrost: Hide CAP_INT16 behind is_deqp
- panfrost: Don't translate compare funcs
- panfrost: Remove spurious assignment
- panfrost: Clean up cases for emit_fbd
- panfrost: Don't upload empty push uniform table
- pan/mdg: Use smaller LD_UNIFORM instructions
- ci: Build asahi in meson-gallium job
- panfrost: Fix major flaw in BO cache
- panfrost: Drop random #define
- panfrost: Use natural shader limits
- panfrost: Make clear which limits are arbitrary
- panfrost: Garbage collect comment
- panfrost: Shorten iffy comment
- pan/mdg: Remove unused midgard_int_alu_op_prefix
- pan/mdg: Fix output types for scalar fields
- pan/mdg: Fix spills to TLS
- pan/mdg: Set lower_uniforms_to_ubo
- panfrost: Add unowned mode to pan_pool
- panfrost: Label all BOs in userspace
- panfrost: Label pools
- panfrost: Make pool slab size configurable
- panfrost: Add reference type for unowned pool
- panfrost: Pool shaders
- panfrost: Pool texture views
- panfrost: Reduce blitter pool size
- panfrost: Fix blending for unbacked MRT
- panfrost: Fix the reads_dest prototype
- panfrost: Fix is_opaque prototype
- panfrost: Fix blend constant fetch prototype
- panfrost: Fix blend fixed-function prototype
- panfrost: Fix pan_blend_to_fixed_function_equation prototype
- panfrost: Move blend properties to CSO create
- panfrost: Translate fixed-function blend at CSO create
- panfrost: Garbage collect Gallium blend includes
- panfrost: Pack blend equations at CSO create time
- panfrost: Distribute out constant colour code
- panfrost: Simplify blend_final
- panfrost: Pass batch to panfrost_get_blend
- panfrost: Streamline fixed-function get_blend path
- panfrost: Remove unused dither flag
- panfrost: Split Bifrost BLEND emit by word
- panfrost: Precompute bifrost_blend_type_from_nir
- panfrost: Add draw-time merge helper
- panfrost: Prepack partial RSD at compile time
- panfrost: Move depth/stencil/alpha to CSO create
- panfrost: Preset evaluate_per_sample
- panfrost: Correct the type of sample_mask
- panfrost: Fill out the rasterizer CSO
- panfrost: Move early-z decision earlier
- panfrost: Streamline the !fs_required case
- panfrost: Hoist allow_forward_pixel_to_be_killed
- panfrost: Partially determine FPK state
- panfrost: Distribute masks for FPK selection
- panfrost: Pull erratum workaround into own function
- panfrost: Hoist part of shader_reads_tilebuffer
- panfrost: Pack draw-time RSD all-at-once
- panfrost: Move batch_set_requirements to the CSO
- panfrost: Deduplicate some code from indirect/direct draws
- panfrost: Pass batch to panfrost_get_index_buffer_bounded
- panfrost: Remove silly assertion
- panfrost: Mark job_index > 10000 as unlikely
- panfrost: Simplify panfrost_bind_sampler_states
- panfrost: Express viewport in terms of the batch
- asahi: Set PACKED_STREAM_OUTPUT
- glsl: Fix subscripted arrays with no XFB packing
- glsl: Fix packing of matrices for XFB
- panfrost: Streamline varying linking code
- panfrost: Define dirty tracking flags
- panfrost: Add the usual clean/dirty helpers
- panfrost: Dirty all state when batch is set
- panfrost: Dirty track RSDs
- panfrost: Dirty track textures/samplers
- panfrost: Dirty track viewport descriptor
- panfrost: Dirty track fragment images
- panfrost: Add PAN_MESA_DEBUG=dirty option
- panfrost/ci: Disable GLES2 jobs when we run GLES3
- panfrost/ci: Disable G72 jobs for now
- panfrost/ci: Split rules by ISA
- ci: Condition ppc64-el on specific drivers
- ci: Condition s390x on specific drivers
- panfrost: Only link varyings once in good conditions
- panfrost: Lower max inputs again
- panfrost: Abort on faults in SYNC mode
- panfrost: Remove minimal mode
- panfrost: Increase tiler_heap max allocation to 64MB
- panfrost/ci: Disable terrain trace
- panfrost/ci: Remove reference to dated flag
- panfrost/ci: Run jobs with PAN_MESA_DEBUG=sync
- panfrost: Add Message Preload descriptor XML
- panfrost: Add message preload to pan_shader_info
- panfrost: Inline pan_prepare_shader_descriptor
- panfrost: Don't take ctx in panfrost_shader_compile
- panfrost: Expose PIPE_CAP_SHAREABLE_SHADERS
- asahi: Fix meson.build definition to depend on agx_pack.h
- agx: Drop cmdline version back to ES3.0
- agx: Pack ld_var Dx
- agx: Enable 1-bit load_const
- agx: Implement boolean mov
- agx: Track current_block
- agx: Track block offsets
- agx: Add nest field to IR
- agx: Add invert_cond (ccn) to IR
- agx: Add branch target to IR
- agx: Add inner loop nesting count field
- agx: Model control flow instructions
- agx: Model pop_exec
- agx: Add push_exec alias
- agx: Pack control flow instructions
- agx: Model jump instructions
- agx: Fix up branch offsets at pack time
- agx: Implement emit_if the simplest way
- agx: Optimize out empty else blocks
- agx: Implement loops in the simplest way
- agx: Add break/continue support
- agx: Zero r0l before first use of control flow
- asahi: Fix scissor descriptor definition
- asahi: Add "set scissor" command
- asahi: Add scissor enable bit
- asahi: Defer viewport pack
- asahi: Dirty track viewport descriptor
- asahi: Track scissor states
- asahi: Mark scissor dirty if rast->scissor changes
- asahi: Skip draws if the scissor culls everything
- agx: Add scissor upload BO
- asahi: Expose PIPE_CAP_CLIP_HALFZ
- asahi: Add unknown bits seen with the GL driver
- asahi: Enable depth culling
- asahi: Update viewport descriptor depth fields
- asahi: Implement scissors and scissor to viewport
- asahi: Fix off-by-one in viewport scissoring
- asahi: Implement wide lines
- asahi: Determine tiling vs linear for internal textures
- asahi: Use dt_stride for line_stride where needed
- asahi: Add layout enum to XML
- asahi: Translate layouts for texture and RTs
- asahi: Identify line stride in texture/RT XML
- asahi: Respect linear strides
- asahi: Handle linear display targets as well as tiled
- asahi: Note that "render target" lacks an sRGB bit
- asahi: Align strides to 16 bytes
- asahi: Print unknown enum values
- asahi: Add format enums
- asahi: Hide pixel formats behind an opaque type
- asahi: Scaffold format table
- asahi: Use pixel table in is_format_supported
- asahi: Respect render target format swizzle
- asahi: Add ETC2 formats to table
- asahi: Add "hacks for dEQP" flag
- asahi: Lift streamout scaffolding from Panfrost
- asahi: Fake CAPs for ES3 with AGX_MESA_DEBUG=deqp
- asahi: Flesh out the formats table
- asahi: Allow half-float vertex buffers
- asahi: Make data_valid a bitset to save memory
- asahi: Abort on blit()
- asahi: Add mipmapping state to the XML
- asahi: Set levels in texture descriptor
- asahi: Allocate slices for mipmapping
- panfrost: Update comment
- panfrost: Shrink pan_draw_mode return type
- panfrost: Add draw parameters dirty flags
- panfrost: Analyze sysval dirty flags
- panfrost: Dirty track constant buffers
- panfrost: Don't allocate empty varying buffer
- panfrost: Dirty track stack sizes
- panfrost: Write translate_index_size better
- panfrost: Minor changes to draw_vbo
- panfrost: Bubble up errors
- panfrost: Elucidate thread group split field
- panfrost: Eliminate reserve_* functions
- panfrost/ci: Report flakes on IRC
- vc4: Use Rn_UINT instead of In_UINT for index buffers
- v3d: Use Rn_UINT instead of In_UINT for index buffers
- etnaviv: Use Rn_UINT instead of In_UINT for index buffers
- freedreno: Use Rn_UINT instead of In_UINT for index buffers
- lima: Use Rn_UINT instead of In_UINT for index buffers
- si: Use Rn_UINT instead of In_UINT for index buffers
- docs/gallium: Document the index buffer format convention
- nir: Add nir_intrinsic_load_back_face_agx
- asahi: Mark special fragment inputs as sysvals
- agx: Model get_sr
- agx: Generate enums from Python
- agx: List sr enum in Python
- agx: Pack SR immediate
- agx: Lower front face to back face
- agx: Handle load_back_face_agx
- ci: Disable the iris APL jobs
- nir/lower_fragcolor: Avoid redundant load_output
- pan/bi: Pull out bi_count_write_registers
- pan/bi: Use TEXS_2D for rect textures
- pan/bi: Simplify TEXC codegen for sr_count=0
- pan/bi: Fix bi_rewrite_passthrough ordering
- pan/bi: Bundle after RA
- pan/bi: Add post-RA optimizer
- pan/bi: Track liveness while scheduling
- pan/bi: Allow IADD.u32 on FMA as \*IADDC
- pan/bi: Use explicit affinities in RA
- pan/bi: Inline spilling in RA
- pan/bi: Explicit zero reg_live_{in, out} when needed
- pan/bi: Model interference with preloaded regs
- pan/bi: Allow move/sink in blend shaders
- pan/bi: Don't restrict the register file in non-blend shaders
- pan/bi: Model +BLEND clobbering of r48
- pan/bi: Handle images in vertex shaders
- pan/bi: Lower loads with component > 0
- pan/bi: Lower stores with component != 0
- pan/bi: Lower 64-bit ints again
- pan/bi: Emit a dummy ATEST if needed
- pan/bi: Simplify spill code
- pan/bi: Track words instead of bytes in RA
- pan/bi: Don't allocate past the end of the reg file
- panfrost: Remove AFBC format fixups
- panfrost: Add missing 'Reverse issue order flag'
- panfrost: Disable AFBC on v7
- panfrost: Don't duplicate attribute buffers
- panfrost: Separate image attribute and buffer emit
- panfrost: Be explicit in image modifier handling
- panfrost: Use util_last_bit for images
- panfrost: Default indirect attributes to 1D type
- pan/indirect: Factor out is_power_of_two_or_zero
- pan/indirect_draw: Use unsigned comparisons
- pan/indirect_draw: Fix 1 instance, nonzero divisor
- panfrost: Correctly size varyings
- panfrost: Use varying format from frag shader
- pan/bi: Force u32 for flat varyings
- panfrost: Fix vertex image attribute overrun
- panfrost: Simplify compute_checksum_size formula
- panfrost: Fix crc_valid condition
- panfrost: Zero r_dimension for buffer textures
- panfrost: Add util_draw_indirect() debug path
- panfrost: Align NPOT divisor records
- panfrost: Fix src_offset data type
- panfrost: Make instancing code more obvious
- panfrost: Assert alignment of indirect records
- pan/mdg: Use consistent casing in midgard_print
- pan/mdg: Make -Wswitch happy
- pan/mdg: Stub memory_barrier{_image}
- panfrost: Clarify how fs_sidefx works with oq
- panfrost: Simplify Midgard blend disable
- panfrost: Don't force early-z with occlusion query
- panfrost: Respect early-Z force on Midgard
- pan/mdg: Fix units for SUBSAT
- pan/mdg: Handle {i,u}{add,sub}_sat
- pan/mdg: Update r1.w comment
- pan/mdg: Fix incorrect rewrite in Midgard scheduler
- panfrost: Mark 16/32_UNORM as non-renderable (v5)
- panfrost: Don't allocate WLS when not needed
- pan/mdg: Wire in PAN_SYSVAL_VERTEX_INSTANCE_OFFSETS
- pan/mdg: Lower away gl_VertexID offset
- pan/mdg: Use more accurate ld/st reg estimates
- pan/mdg: Don't skip unit-based checks in choose_instruction
- pan/mdg: Assert scheduled instructions are reasonable
- pan/mdg: Insert moves to load/store registers
- panfrost: Fix dirty state emission
- panfrost: Emulate indirect draws on Midgard
- panfrost: Add some missing BGRA formats
- panfrost: Remove scissor_culls_everything
- panfrost: Don't set a blend shader for no_colour
- panfrost: Allocate XFB buffers per-instance
- panfrost: Fix BUFFER image handling
- panfrost: Make image buffers robust
- panfrost: Lower max compute size
- panfrost: Set PIPE_COMPUTE_CAP_SUBGROUP_SIZE
- panfrost: Set PIPE_COMPUTE_CAP_MAX_THREADS_PER_BLOCK
- panfrost: Drop todo on PIPE_COMPUTE_CAP_IMAGES_SUPPORTED
- panfrost: Don't CRC mipmapped textures
- panfrost: Reduce pan_image_state indirection
- pan/indirect_dispatch: Indent NIR blocks
- pan/indirect_dispatch: Simplify empty command case
- pan/indirect_dispatch: Distinguish minus-1 defs
- pan/indirect_dispatch: Expand split expressions
- pan/indirect_dispatch: Use extracted values
- panfrost: Use direct dispatch with shared memory
- panfrost: Don't clobber indirect dispatch fields
- panfrost: Make data_valid a bitset
- panfrost: Remove pan_image_state
- panfrost: Set valid_buffer_range for GPU writes
- panfrost: Add XML for vertex/instance ID records
- panfrost: Clean up vertex/instance ID on Midgard
- panfrost: Flush everything for glMemoryBarrier
- panfrost: Flush before compute jobs
- panfrost: Set vertex job_barrier
- panfrost: Add "Cache Flush" job XML
- panfrost: Advertise GLES3.1
- pan/decode: Fix image attribute counting
- pan/decode: Handle cache flush jobs
- panfrost/ci: Blank G52 flakes file
- panfrost/ci: Don't skip SSBO tests on G52
- panfrost/ci: Do fractional dEQP-GLES31 run on Midgard
- docs/features: Mark GLES3.1 as done on Panfrost
- docs/panfrost: Update API versions
- pan/bi: Include modifier info in opcode table
- pan/bi: Move bi_word_node to common code
- pan/bi: Move typesize to common code
- pan/bi: Track instruction size in opcode table
- pan/bi: Handle fsat_signed and fclamp_pos
- pan/bi: Report tuples, not nops, in shader-db
- pan/bi: Propagate fabs/neg/sat
- pan/bi: Add back custom algebraic opts
- pan/bi: Fuse fclamp_pos and fsat_signed
- pan/bi: Schedule FCMP.v2f16 with abs modifier
- pan/bi: Fuse abs into FCMP/FMIN/FMAX.v2f16
- nir: Fix constant folding for irhadd/urhadd
- agx: Mark components as ASSERTED
- agx: Add agx_immediate_f helper
- agx: Add perspective bit to ld_var
- agx: Update ld_vary encoding mask
- agx: Add ld_vary_flat opcode
- asahi: Identify varying descriptor fields
- agx: Rename remap_varyings -> remap_varyings_vs
- agx: Implement nir_intrinsic_load_frag_coord
- agx: Implement ld_vary_flat
- agx: Rename agx_pack to agx_pack_binary
- agx: Remap fragment shader varyings explicitly
- asahi: Unify varying linking code with vertex shaders
- agx: Pull out agx_write_components
- agx: Add agx_exit_block helper
- agx: Add liveness analysis pass
- agx: Mark sources that kill
- agx: Count write registers, not components
- agx: Lift agx_block_add_successor from Panfrost
- agx: Track logical control flow graph
- asahi: Wire in tgsi_to_nir
- asahi: Fix random \*2
- asahi: Guard for overflow when packing
- asahi: Always flush when setting framebuffer state
- asahi: Handle Z16_UNORM textures
- asahi: Add zsbuf to batch
- asahi: Save zsbuf ptr
- asahi: Add internal (renderable) formats to the table
- asahi: Set fragment key for non-U8NORM render targets
- asahi: Implement colour buffer reloads
- asahi: Remove spurious assignment
- asahi: Remove spurious varying assignment
- asahi: Generalize varying linking
- asahi: Add ASAHI_MESA_DEBUG=no16 option
- agx: Fix 32-bit bitwise shifts
- agx: Fix LOD_MIN enum
- agx: Pack LOD descriptors
- agx: Fix lod_mode shift
- agx: Legalize LOD sources to be 16-bit only
- agx: Handle txl
- asahi: Fail on LOD clamps/bias
- asahi: Identify texture/sampler count fields
- asahi: Identify vertex texture/sampler counts
- asahi: Set vertex texture/sampler counts
- asahi: Track more Gallium state
- asahi: Wire in u_blitter
- asahi: Handle nonzero first_level
- asahi: Fix meson dependency on packing in compiler
- asahi: Prepack rasterizer faces
- asahi: Implement the stencil test
- asahi: Flush for accesses to Z/S buffer
- asahi: Comment on an embedded data structure
- asahi: Skip over holes in the vbufs
- asahi: Add XML for the attachment structure
- asahi: Sync attachment magic with asahi demo
- asahi: Parametrize software "command buffer" size
- asahi: Identify "command buffer" size field in map
- asahi: Move IOGPU header to XML
- asahi: Extend IOGPU header to contain encoder
- asahi: Use GenXML for main bind fragment
- asahi: Identify attachment length field
- asahi: Set data_valid for the depth buffer
- asahi: Enable primitive restart
- asahi: Use XML for interpolation packet
- panfrost: Express dependencies as resources, not BOs
- panfrost: Wrap occlusion query in pipe_resource
- panfrost: Split "flush writer" from "flush accessing"
- panfrost: Eliminate redundant flushes with AFBC
- panfrost: Add secondary shader XML fields
- pan/decode: Handle IDVS jobs on Bifrost
- agx: Don't choke on registers in the optimizer
- agx: Count read registers as well
- agx: Assign registers locally
- agx: Pipe in nir_register
- agx: Ensure we don't overallocate registers
- panfrost: Move draw_vbo to pan_cmdstream.c
- panfrost: Move most CSO creates to pan_cmdstream.c
- panfrost: Split out prepare_rsd into a vtbl
- panfrost: Move blend CSO to cmdstream/context
- panfrost: Don't ralloc panfrost_blend_state
- panfrost: Move launch_grid to pan_cmdstream
- panfrost: Move panfrost_emit_tile_map to pan_job
- panfrost: Use vtable for fragment descriptor functions
- panfrost: Clean up pan_cmdstream.h
- panfrost: Move sample accessor to pan_cmdstream
- panfrost: Remove pan_cmdstream.h
- panfrost: Remove unused midgard-pack.h includes
- docs: Update relnotes for panfrost/asahi
- pan/bi: Improve clause printing
- pan/bi: Fix skip/lod_mode aliasing with VAR_TEX
- pan/bi: Add bi_foreach_instr_global_rev_safe helper
- pan/bi: Pack staging_barrier for the -next- clause
- pan/bi: Try to hit full occupancy on v7
- pan/bi: Only spill nodes that could progress in RA
- pan/bi: Report cycle counts
- pan/bi: Track LOD mode even for TEXC
- pan/bi: Analyze helper invocations
- pan/bi: Fuse LD_VAR+TEXS_2D -> VAR_TEX
- pan/bi: Add a constant subexpression elimination pass
- pan/bi: Workaround widen restrictions on +FADD.f32
- pan/bi: Simplify cube map descriptor generation
- pan/bi: Comment the fexp2 implementation
- pan/bi: Factor out exp2/log2 code
- pan/bi: Don't lower fpow
- panfrost: Fix FPK enable condition
- panfrost: Add a performance counter dump utility
- panfrost: Don't set zs_update_operation in vertex shaders
- panfrost: Zero depth_source in vertex shaders
- panfrost: Query tiler features
- panfrost: Enable more tiler levels if we can
- panfrost: Generalize pan_blitter's reg count assert
- panfrost: Set register allocation in the v7 RSD
- asahi: Move fixed internal shaders to agx_blit.c
- asahi: Add missing copyright/guards for magic.c/h
- asahi: Remove unused bo_access property
- asahi/decode: Only dump mapped allocations
- asahi: Make track_free safer
- asahi/decode: Check fewer zeroes after a command buffer
- asahi: Reserve more space to stop a command buffer
- asahi: Identify more unknown fields in the memmap
- asahi/decode: Fix up high word
- asahi/decode: Handle CULL packets
- asahi/decode: Fix decoding of draw calls
- asahi: Allow specifying an encoder ID
- asahi: Allocate global IDs
- asahi: Consolidate some magic numbers
- asahi: Garbage collect senseless cmdbuf struct
- asahi/decode: Print clear/store pipelines
- asahi/decode: Print some IOGPU stuff
- asahi: Set bits in UNK11 needed for points
- asahi: Set point magic bit in rasterizer
- asahi: Set bit for psiz
- asahi: Lower PIPE_CAPF_MAX_POINT_WIDTH to hw limit
- asahi: Unpack varying descriptors (1x)
- asahi: Identify triangle/lines vs point varyings
- asahi: Handle point coordinates
- agx: Flip point coordinates because OpenGL
- panfrost: Inline flip_compare_func into pan_encoder.h
- panfrost: Move panfrost_vertex/instance_id to per-gen
- panfrost: Inline away pan_pool.c
- panfrost: Express pack_work_groups more concisely
- panfrost: Inline away pan_invocation.c
- panfrost: Assert that injected jobs are for blits
- panfrost: Inline panfrost_get_z_internal_format
- panfrost: Move arch-independent pan_format code
- panvk: Don't use panfrost_bifrost_swizzle
- panfrost: Remove panfrost_bifrost_swizzle
- panfrost: Add GenXML macros
- panfrost: Compile format table multiple times
- panfrost: Specialize blendable_formats for v6
- panfrost: Use smaller sizes in blend table
- panfrost: Give WLS Instances a default
- panfrost: Pin an architecture for blending
- panfrost: Use generic delete for ZSA
- panfrost: Remove reference to mali_blend_equation_packed
- panfrost: Avoid GenXML enum dependences
- panfrost: Remove pan_blitter integration
- panfrost: Init/destroy blitter from per-gen file
- panfrost: Only access blitter from per-gen
- pan/bi: Refuse to CSE non-SSA sources
- pan/bi: Make bi_foreach_instr_in_tuple safer
- pan/bi: Update ins->link after scheduling
- pan/bi: Do helper termination analysis on clauses
- pan/bi: Handle multiple destinations in scheduler
- pan/bi: Add bi_before_tuple convenience method
- pan/bi: Handle 4-src instructions in scheduler
- pan/bi: Calculate dependency graph when bundling
- pan/bi: Add a bundling heuristic
- panfrost: Fix format swizzles on G72
- targets/graw-xlib: Add missing dep_x11
- pan/mdg: Garbage collect silly quirk
- asahi: Fix sampler filtering flag
- agx: Fix mismatched units in load_ubo
- agx: Plug memory leak in register allocator
- pan/bi: Restrict swizzles on same cycle temporaries
- pan/bi: Remove incorrect errata workaround
Andres Gomez (25):
- ci: Uprev piglit to 9d87cc3d79e ("framework/replay: send backend's subprocess stderr to sys.stderr")
- ci: Add test which occasionally times out to lavapipe-vk skips
- ci: add xorg to the x86_test-vk container
- ci: allow starting xorg for piglit run
- ci: remove results directory content only with piglit runners
- ci: make sure we only read the first line from install/VERSION
- ci: update some radv trace checksums
- ci: update some radv trace checksums
- ci: update radv's trace job tag for Raven
- ci: remove radv's trace job for Polaris10
- ci: uprev apitrace to 10.0
- ci: uprev DXVK to 1.8.1
- ci: add radv's trace job for Navy Flounder
- ci: include VKD3D-Proton tests into the VK test container
- ci: add VKD3D-Proton testsuite runner
- ci: add VKD3D-Proton testsuite job for radv's Navy Flounder
- ci: disentangle tags for containers and artifacts produced by them
- ci: remove glslangValidator installation from the VK test container
- ci: replace glslangValidator with glslang-tools
- ci: fix the vkd3d-proton runner
- ci: build the hang-detection tool into x86_test-vk
- ci: update some radv trace checksums
- ci: bump x86_test-base tag
- ci: remove unzip from several containers that don't use it at all
- ci: use bash with download-git-cache.sh
Andrii Simiklit (1):
- Remove redundant assignment
Antonio Caggiano (15):
- panfrost: Fix invalid conversions
- panfrost: Meson dependency
- util: Perfetto SDK v15.0
- pps: Gfx-pps v0.3.0
- pps: Gfx-pps config tool
- pps: Documentation
- intel/perf: Extern C
- pps: Intel pps driver
- pps: Intel documentation
- ci: Add a manual job for tracking the performance of Freedreno
- panfrost: Counter definitions
- panfrost: Performance configuration
- panfrost: Fix pan_pool_ref construction
- pps: Panfrost pps driver
- pps: Panfrost documentation
Anuj Phogat (39):
- intel: Rename files with gen_debug prefix
- intel: Rename gen_debug prefix to intel_debug
- intel: Rename GEN_DEBUG prefix to INTEL_DEBUG
- intel: Rename intel_device_info.c to intel_dev_info.c
- intel: Rename gen_device prefix in filenames
- intel: Rename gen_device prefix to intel_device
- intel: Fix alignment and line wrapping due to gen_device renaming
- intel: Rename GEN_DEVICE prefix in macros to INTEL_DEVICE
- intel: Rename gen_get_device prefix to intel_get_device
- intel: Rename gen_get_aperture_size to intel_get_aperture_size
- intel: Drop gen prefix in gen_has_get_tiling()
- intel: Rename gen_context.h to intel_context.h
- intel: Rename gen_context prefix to intel_context
- intel: Rename gen_perf prefix in filenames to intel_perf
- intel: Rename gen_perf prefix to intel_perf in source files
- intel: Fix alignment and line wrapping due to gen_perf renaming
- intel: Rename GEN_PERF prefix to INTEL_PERF in build files
- intel: Rename GEN_PERF prefix to INTEL_PERF in source files
- intel: Rename gen_{pipeline, oa, counter, hw} to intel_{..}
- intel: Rename brw_gen_enum.h to brw_gfx_ver_enum.h
- intel: Rename gen enum to gfx_ver
- intel: Rename gen keyword in test_eu_validate.cpp
- intel: Rename gens keyword to gfx_vers
- intel: Rename index_gen keyword to index_ver
- intel: Rename eu compact instruction tests
- intel: Rename gen_{mapped, clflush, invalidate} prefix to intel_{..}
- intel: Remove devinfo_to_gen() helper function
- intel: Rename isl_to_gen keyword to isl_encode
- intel: Rename vk_to_gen keyword to vk_to_intel
- intel: Rename gen_10 to ver_10
- intel: Rename calculate_gen_slm_size to intel_calculate_slm_size
- intel: Rename _gen_{program, part, batch, freq} to _intel_{..}
- intel: Rename GEN_PART to INTEL_PART
- intel: Rename {i965, iris, anv, isl}_gen prefix in build files
- intel: Rename since_gen to since ver
- intel: Rename _gen keyword to _gfx_ver in few build files
- intel: Fix GEN_GEN macro checks
- intel/gfx12+: Add Wa_14013840143
- intel: Rename GFX 12.5 to XE_HP
Axel Davy (1):
- st/nine: Fix compilation error on non-x86 platforms
Bas Nieuwenhuizen (41):
- radv: Fix memory leak on descriptor pool reset with layout_size=0.
- amd/common: Use cap to test kernel modifier support.
- radv: Only require DRM 3.23.
- radeon/vcn: Use the correct pitch for chroma surface.
- nir: Add load_sbt_amd intrinsic.
- radv: Add sbt descriptors user SGPR input.
- aco: Add load_sbt_amd intrinsic implementation.
- radv: Use global BO list with raytracing.
- radv: Add support for RT bind point.
- radv: Add RT pipeline bind.
- radv: Implement vkCmdTraceRays.
- radv: Use correct border swizzle on GFX9+.
- nir: Add bvh64_intersect_ray_amd intrinsic.
- aco: Implement bvh64_intersect_ray_amd intrinsic.
- nir/lower_returns: Deal with single-arg phis after if.
- radv: Don't skip barriers that only change queues.
- radv: Actually return correct value for read-only DCC compressedness.
- radv: Allow DCC images to be compressed with foreign queues.
- gallium/vl: Use format plane count for sampler view creation.
- gallium/va: Add support for PRIME_2 import.
- radv: Use the global BO list for acceleration structures.
- radv: Add initial CPU BVH building.
- radv: Implement device-side BVH building.
- radv: Add acceleration structure descriptor set support.
- radv: Convert lower_intrinsics to a switch statement
- radv: Implement load_vulkan_descriptor for acceleration structures.
- radv: Expose formats for acceleration structure.
- radv: Add rt perftest flag.
- radv: Enable VK_KHR_acceleration_structure with RADV_PERFTEST=rt.
- radv: Add -Wpointer-arith.
- util/fossilize_db: Pull seek into lock.
- util/fossilize_db: Split out reading the index.
- util/fossilize_db: Do not lock the fossilize db permanently.
- util/fossilize_db: Only lock the db file, not the index.
- nir: Add lowered vendor independent raytracing intrinsics.
- nir: Add raytracing shader call lowering pass.
- meson: Bump libdrm for amdgpu to 2.4.107.
- radv/winsys: Return vulkan errors for buffer creation.
- radv/winsys: Add support for a fixed VA address for replay.
- radv: Support address capture and replay.
- ac/surface: Handle non-retiled displayable DCC correctly for modifiers.
Bastian Beranek (1):
- glx: Assign unique serial number to GLXBadFBConfig error
BillKristiansen (2):
- d3d12: Fixes stale context bindings after copy, resolve, and clear
- d3d12: Sets all SRV descriptors as data-static
Billy Laws (1):
- meson: Increase Android Platform SDK version limit
Boris Brezillon (60):
- panfrost: Don't advertise AFBC mods when the format is not supported
- panfrost: Reserve thread storage descriptor in panfrost_launch_grid()
- panfrost: Fix RSD emission on Bifrost v6
- panfrost: Fix indirect draws
- pan/bi: Don't set the EOS flag if there's at least one successor
- panfrost: Keep panfrost_batch_reserve_framebuffer() private
- panfrost: Fix ZS reloading on Bifrost v6
- pan/midg: Fix 2 memory leaks
- pan/bi: Expand pseudo instructions when nosched is set
- pan/midg: Fix midgard_pack_common_store_mask()
- pan/midg: Make sure the constant offset is in range in mir_match_iadd()
- panfrost: Make sure pack_work_groups_compute() is passed valid dimensions
- panfrost: Add helpers to emit indirect dispatch jobs
- panfrost: Hook-up indirect dispatch support
- panfrost: Only advertise INDIRECT_DRAW if the kernel supports HEAP BOs
- ci: Update to a kernel that has the panfrost MMU fixes
- panfrost/ci: Test GLES 3.1 on Bifrost
- panfrost/ci: Skip draw_indirect.compute_interop.large.*
- panfrost/ci: Run the full deqp-gles3 testsuite
- panfrost: Fix format definitions to match gallium expectations
- Revert "gallium/util: Fix depth/stencil blit shaders"
- panfrost: Pass an image view to panfrost_estimate_texture_payload_size()
- panfrost: Fix blit shader names
- panfrost: Pack pan_blit_surface fields
- panfrost: Get rid of the vertex_count arg in pan_preload_emit_varying()
- panfrost: Make pan_preload_emit_*_sampler() applicable to blits
- panfrost: Stop assigning ->position in pan_preload_emit_varying()
- panfrost: Make pan_preload_emit_*_textures() applicable to blits
- panfrost: Make pan_preload_emit_viewport() applicable to blits
- panfrost: Rename pan_preload_emit_varying()
- panfrost: Shrink the number of args passed to prepare_{bifrost,midgard}_rsd()
- panfrost: Don't select the blit shader fragout type twice
- panfrost: Stop assuming the viewport will always cover the framebuffer
- panfrost: Extend pan_blitter to support blit/resolve operations
- panfrost: Use pan_blit() when PAN_MESA_DEBUG=panblit
- panfrost: Split the indexed and !indexed indirect draw info structs
- pan/bi: Add support for gl_{BaseVertex,BaseInstance}
- pan/bi: Add support for gl_DrawID
- panfrost: Expose the DRAW_PARAMETERS cap on Bifrost
- panfrost: Flag indirect draw/dispatch shaders as internal
- panfrost: Relax the stride check when importing resources
- panfrost: Try to align scanout resource stride on 64 bytes
- panfrost: Don't freeze blit batches
- panfrost: Avoid duplicate entries in access->readers
- panfrost: Simplify the dependency tracking logic
- panfrost: Limit the number of active batch to 32
- ci: Update to a new kernel fixing a bug in the panfrost driver
- panfrost: Constify the constants pointers passed to pan_blend functions
- panfrost: Make panfrost_scoreboard_initialize_tiler() return the job pointer
- pan/midg: Add a flag to dump internal shaders
- panfrost: Add a Vulkan driver for Midgard/Bifrost GPUs
- panfrost: Don't add blit context BOs twice
- panfrost: Pass a memory pool to pan_blit_ctx_init()
- panfrost: Add alignment info to write-value and cache-flush jobs
- panfrost: Allocate WRITE_VALUE jobs with panfrost_pool_alloc_desc()
- panvk: Use the desc allocator when we can
- panfrost: Start splitting the panfrost pool logic
- panvk: Support returning BOs allocated by panvk_pool to a 'free BO' pool
- panfrost: Replace the batch->bos hashmap by a sparse array
- panfrost: Do tracking of resources, not BOs
Boyuan Zhang (8):
- vl: add st_rps_bits for HEVC decode
- frontends/va: get st_rps_bits from VA pic param hevc
- frontends/vdpau: disable UseStRpsBits for vdpau hevc
- radeon/vcn: enable parsing support for st_rps_bits
- frontends/omx: use pipe buffer map instead of texture map
- radeon/vcn: move calc_dpb_size into create_decoder
- radeon/vcn: allocate non-tmz context buffer for VCN2+
- radeon/vcn: use st_rps_bits only when it's set
Caio Marcelo de Oliveira Filho (25):
- spirv: Don't replicate patch bool in vtn_variable
- nir: Remove now unnecessary conditions from emit_load/store helpers
- intel/compiler: Add common function for CS dispatch info
- iris: Use brw_cs_get_dispatch_info()
- anv: Use brw_cs_get_dispatch_info()
- i965: Use brw_cs_get_dispatch_info()
- intel/compiler: Remove unused exported functions
- nir: Move shared_memory_explicit_layout bit into common shader_info
- intel/compiler: Clarify why VUE is recomputed by FS
- nir: Rename nir_is_per_vertex_io to nir_is_arrayed_io
- compiler: Rename local_size to workgroup_size
- compiler: Rename SYSTEM_VALUE_LOCAL_GROUP_SIZE to SYSTEM_VALUE_WORKGROUP_SIZE
- nir: Rename nir_intrinsic_load_local_group_size to nir_intrinsic_load_workgroup_size
- nir: Rename WORK_GROUP (and similar) to WORKGROUP
- nir: Move zero_initialize_shared_memory into common shader_info
- nir: Move workgroup_size and workgroup_variable_size into common shader_info
- anv: Support workgroup memory in other shaders
- nir/lower_io: Rename vertex_index to array_index in helpers
- nir/gather_info: Rename per_vertex to is_arrayed
- spirv: Fix handling of OpBranchConditional with same THEN and ELSE
- nir/opt_if: Don't split ALU for single block infinite loops
- nir: Add test to check edge case in Split ALU optimization
- spirv: Update headers and metadata from latest Khronos commit
- spirv: Support SPV_KHR_subgroup_uniform_control_flow
- anv: Advertise VK_KHR_shader_subgroup_uniform_control_flow
Carsten Haitzler (Rasterman) (1):
- panfrost: Fix BO imports to not take the process down if fd is invalid
Charlie (10):
- v3dv: enable KHR_image_format_list
- v3dv: enable KHR_sampler_mirror_clamp_to_edge
- v3dv: enable KHR_incremental_present
- v3dv: enable KHR_uniform_buffer_standard_layout
- v3dv: clamp srgb render targets
- v3dv: remove sRGB blending workaround
- v3dv: add the unswizzled RGBA4444 format
- v3dv: divide by block size in copy_image_blit
- v3dv: add ASTC formats to get_compatible_tlb_format
- v3dv: enable ASTC formats
Charlie Birks (1):
- v3dv: document two supported extensions
Charlie Turner (5):
- ci: Remove obsolete reference to DEQP_SKIPS
- radv: Merge dEQP default skips into all generation-specific skip lists
- radv: Add a STONEY baseline for dEQP.
- radv: Provide a toggle to avoid warnings about unsupported devices.
- spirv_to_nir: Add environment variable to change default log level
Charmaine Lee (1):
- svga: fix texture rectangle sampling when no sampler view declaration is defined
Chia-I Wu (93):
- venus: stop using vn_renderer_sync in vn_fence
- venus: stop using vn_renderer_sync in vn_semaphore
- venus: stop using vn_renderer_sync in vn_queue
- venus: remove vn_renderer_sync support from vn_queue_submission
- venus: remove VN_SYNC_TYPE_SYNC
- venus: remove vn_queue::sync_queue_index
- venus: remove vn_ring_wait_all
- venus: wait on vkQueuePresentKHR
- venus: remove vn_renderer_info::has_timeline_sync
- venus: add vn_image_init_memory_requirements
- venus: add vn_image_create
- venus: add vn_wsi_create_scanout_image
- venus: refactor vn_queue_submission_count_semaphores
- venus: clang-format clean
- venus: change SpaceBeforeParens style option
- venus: provide accessors for vn_instance_submit_command
- venus: update venus-protocol headers to use accessors
- venus: rename VN_CS_ENCODER_INITIALIZER
- venus: add vn_renderer_shmem
- venus: use vn_renderer_shmem
- venus: add dev->renderer pointer
- venus: pass vn_renderer in vn_renderer_bo functions
- venus: move vn_renderer_bo_ops to vn_renderer
- venus: merge bo create and init ops
- venus: move some common members to vn_renderer_bo
- venus: use sparse array to manage vn_renderer_bo
- venus: make sure gem_handle and vn_renderer_bo are 1:1
- venus: update venus-protocol for external memory
- venus: rework external memory capability queries
- venus: enable external memory support
- venus: fix render pass without attachments
- venus: fix dmabuf import mmap_size check
- venus: fix dmabuf import fail path
- venus: add VN_MAX_API_VERSION
- venus: rename vn_instance::renderer_version
- venus: clarify/fix instance renderer versions
- venus: clarify/fix device renderer version
- venus: refactor vn_physical_device_init_extensions
- venus: avoid strcmp for spec version override
- venus: refactor vn_physical_device_init_supported_extensions
- venus: init supported extensions in one place
- venus: add extension check for ANDROID_native_buffer
- venus: clean up vn_device_fix_create_info
- venus: get rid of #ifdef's in vn_CreateImage
- pps: fix a missing include in Intel pps driver
- util/u_thread: fix u_thread_setname for long names
- venus: add struct vn_command_buffer_builder
- venus: remember cmd buffer level and queue family
- venus: ignore pInheritanceInfo when we should
- docs: add basic documentation for venus
- vulkan/wsi: provide more info in wsi_image_create_info
- venus: add vn_device_memory_alloc as a helper
- venus: fix asserts on mem bo
- venus: fix opaque fd re-import
- venus: move wsi_image_create_info parsing
- venus: remember image wsi states
- venus: handle VN_COMMAND_BUFFER_STATE_INVALID
- venus: remember render pass PRESENT_SRC attachments
- venus: remember render pass PRESENT_SRC barriers
- venus: remember image view image
- venus: remember framebuffer attachments
- venus: remember cmd buffer render pass and framebuffer
- venus: remember cmd buffer fb attachments
- venus: add vn_image_memory_barrier_has_present_src
- venus: add vn_cmd_get_image_memory_barriers
- venus: clean up vn_get_intercepted_barriers
- venus: undo wsi image ownership transfer for Android
- venus: reland wsi image ownership transfer for Android
- venus: enable wsi image ownership transfer for common wsi
- vulkan/util: add vk_default_allocator
- radv: use vk_default_allocator
- v3dv: use vk_default_allocator
- tu: use vk_default_allocator
- anv: use vk_default_allocator
- venus: use vk_default_allocator
- venus: silence compiler warnings
- venus: query experimental features in one call
- venus: document the darkest corner of venus
- venus: move vn_renderer_sync_ops to vn_renderer
- venus: simplify vn_renderer_sync creation
- venus: update venus-protocol headers
- venus: add support for external fence on Android
- venus: add support for external semaphores on Android
- venus: clean up vn_physical_device_get_native_extensions
- venus: fix compatibility with older host drivers
- venus: be verbose about which physical devices are skipped
- vulkan/wsi: fix select_memory_type when all MTs are local
- venus: fix empty submits with BOs
- egl/surfaceless: try kms_swrast before swrast
- meson: allow egl_native_platform to be specified
- venus: clean up vn_AllocateMemory
- venus: suballocate memory in more cases
- vulkan/wsi/x11: do not inherit last_present_mode
Christian Gmeiner (3):
- ci: disable initrd support
- drm-shim: fix compile with glibc >= 2.33
- ci: bare-metal: drop webdav support
Connor Abbott (133):
- ir3: Fix list corruption in legalize_block()
- ir3: Reduce max const file indirect offset base to 9 bits
- ir3, tu: Add compiler flag for robust UBO behavior
- tu: Correctly preserve old push descriptor contents
- tu: Handle robust UBO behavior for pushed UBO ranges
- tu: Handle null descriptors
- tu: Expose VK_EXT_robustness2
- ir3/parser: Fix oob write with immediates array
- ir3: Improve cat1 modifier disassembly
- ir3: Assemble and disassemble swz/gat/sct
- ir3: Prevent oob writes to inputs/outputs array
- nir/lower_clip_disable: Fix store writemask
- ir3, tu: Cleanup indirect i/o lowering
- freedreno: Don't lower indirects in GLSL IR
- freedreno/a6xx: Better document SP_GS_PRIM_SIZE
- freedreno/a6xx: Fix SP_GS_PRIM_SIZE for large sizes
- tu: Fix SP_GS_PRIM_SIZE for large sizes
- ir3/postsched: Fix dependencies for a0.x/p0.x
- ir3/cp: Clone registers for compare-folding optimization
- ir3/sched: Use correct src index
- ir3/postsched: Use correct src index
- ir3/delay: Remove special case for array deps
- ir3/postsched: Fix ir3_postsched_node::delay calculation
- ir3/cp_postsched: Fixup SSA use pointer for direct reads
- ir3: Refactor nir->ir3 block handling
- ir3: Make predecessors an array
- ir3: Rework outputs
- ir3: Don't assume regs[1] exists in ir3_fixup_src_type()
- nir/lower_phis_to_scalar: Add "lower_all" option
- ir3/cf: Rewrite pass
- ir3: Use round-to-nearest-even for fquantize2f16
- ir3: Call nir_lower_wrmask() again after lowering scratch
- ir3: Only use per-wave pvtmem layout for compute
- ir3: Introduce phi and parallelcopy instructions
- ir3: Add ir3_start_block()
- ir3: Readd support for translating NIR phi nodes
- ir3: Prepare for instructions with multiple destinations
- ir3: Improve register printing for SSA
- ir3: Add ir3_register::array.base
- ir3/delay: Fix full->half and half->full delay
- ir3: Add reg_elems(), reg_elem_size(), and reg_size()
- ir3: Make branch conditions non-SSA
- ir3: Rewrite delay calculation
- ir3/delay: Delete pre-RA repeat handling
- ir3/postsched: Don't use SSA source information
- ir3: Remove unused check_src_cond()
- ir3: Add dominance infrastructure
- ir3: Add pass to lower arrays to SSA
- ir3: Expose occupancy calculation functions
- ir3: Rewrite register allocation
- ir3/ra: Add a validation pass
- ir3: Remove right and left copy prop restrictions
- ir3/sched: Don't schedule collect early
- ir3/sched: Make collects count against tex/sfu limits
- ir3/sched: Consider unused destinations when computing live effect
- ir3: Add simple CSE pass
- ir3: Insert output collects in the main shader
- ir3: Copy propagate immed/const to meta instructions
- ir3: Improve printing of array parallelcopies/phis
- ir3/ra: Fix array parallelcopy confusion
- ir3: Make tied sources/destinations part of the IR
- ir3: Split read-modify-write array dests in two
- ir3: Update ir3_register::instr when cloning instructions
- ir3: Validate that ir3_register::instr is correct
- ir3: Add is_reg_special()
- ir3: Make ir3_instruction::address a normal register
- ir3: Split ir3_reg_create() into ir3_{src,dst}_create()
- ir3: Add separate src/dst count in ir3_instr
- ir3/legalize: Construct branch properly
- ir3: Add srcs/dsts arrays to ir3_instruction
- freedreno/isa: Convert to srcs/dsts
- freedreno/tests: Convert to srcs/dsts
- ir3/sched: Convert to srcs/dsts arrays
- ir3/core: Switch to srcs/dsts arrays
- ir3/ra: Switch to srcs/dsts arrays
- ir3/parser: Switch to srcs/dsts arrays
- ir3/array_to_ssa: Switch to srcs/dsts arrays
- ir3/legalize: Switch to srcs/dsts arrays
- ir3/print: Switch to srcs/dsts arrays
- ir3/validate: Switch to srcs/dsts arrays
- ir3/opts: Switch to srcs/dsts arrays
- ir3/frontend: Switch to srcs/dsts arrays
- ir3: Remove regs array
- ir3: Remove IR3_REG_DEST
- ir3/ra: Fix corner case in collect handling
- freedreno/a6xx: Make SP_XS_PVT_MEM_HW_STACK_OFFSET non-inline
- freedreno, tu: Set SP_XS_PVT_MEM_HW_STACK_OFFSET
- freedreno/computerator: Fix local_size typo
- ir3/sched: Speed up live_effect
- ir3: Stop creating dummy dest registers
- ir3: Prepare dest helpers for multi-dst instructions
- ir3: Add foreach_dst/foreach_dst_n
- ir3: Support multi-mov instructions
- ir3/delay: Support multi-mov instructions
- ir3/postsched: Support multi-mov instructions
- ir3/legalize: Support multi-mov instructions
- ir3: Use correct flags for movmsk & multi-mov
- ir3/validate: Support multi-mov instructions
- ir3: Print multi-mov instructions
- ir3: Add min gen for multi-mov instructions
- ir3/lower_parallelcopy: Use SWZ
- nir/subgroups: Replace lower_vote_eq_to_ballot with lower_vote_eq
- nir/subgroups: Support > 1 ballot components
- nir: Add read_invocation_cond_ir3 intrinsic
- tu, ir3: Plumb through support for CS subgroup size/id
- ir3/nir: Call nir_lower_subgroups
- ir3: Handle shared register liveness correctly
- ir3: Handle unreachable blocks
- ir3: Prevent propagating shared regs out of loops
- ir3: Better valid flags for shared regs
- ir3: Actually allow shared reg moves to be folded
- ir3: Fix shared reg delay
- ir3: Make MOVMSK use repeat
- ir3: Fix infinite loop in scheduler when splitting
- ir3/sched: Handle branch condition in split_pred()
- ir3: Cleanup ir3_legalize jump optimization
- ir3: Support any/all/getone branches
- ir3: Add subgroup pseudoinstructions
- ir3: Handle shared registers in lower_parallelcopy
- ir3: Implement nir subgroup intrinsics
- ir3: Fix convergence behavior for loops with continues
- ir3/legalize: Fix loop convergence behavior
- tu: Update subgroup properties
- ir3/nir: Lower indirect references of compact variables
- ir3: Add missing include to ir3_parser.y
- ir3: Add ir3_collect() for fixed-size collects
- ir3/lower_parallelcopy: Don't manually set wrmask
- ir3: Update .editorconfig and .dir-locals.el
- ir3: Manually reformat some places
- freedreno: Add some options to .clang-format
- ir3: Reformat source with clang-format
- ir3/print: Manual formatting fixups
- ir3: Preserve gl_ViewportIndex in the binning shader
Corentin Noël (4):
- ci: Use the caching proxy for Mesa artifacts
- ci: Re-enable virgl tessellation shader
- ci: Bump virglrenderer
- ci: actually run piglit tests with virgl
Daniel Schürmann (33):
- aco: fix additional register requirements for spilling
- aco: relax validation rules for p_reduce dst RegType
- driconf: set vk_x11_strict_image_count for Metro: Exodus
- aco/ra: prevent underflow register for p_create_vector operands
- radv: call nir_copy_prop() after load-store vectorization
- aco/ra: also prevent overflow register for p_create_vector operands
- aco: remove condition operand from branch in invert block
- radv,aco: scalarize all phis via nir_lower_phis_to_scalar()
- aco: simplify Phi RegClass selection
- aco/ra: only create phi-affinities for killed operands
- aco/ra: refactor affinity coalescing
- aco/ra: refactor register assignment for vector operands
- amd/ci: add hawaii-specific skip and fail lists
- aco/ra: handle copies of definition registers
- aco/ra: handle copies of copies better
- aco/util: replace DIV_ROUND_UP(n+1,m) by n/m+1
- aco: reorder and cleanup #includes
- aco: add missing Licenses and remove Authors from files
- aco: add 'common/' and 'llvm/' prefix to #includes
- aco/meson: remove unnecessary dependencies
- aco: refactor SDWA opcode validation
- aco: remove (wrong) GCC array-bounds warning
- util/meson: include inc_gallium
- aco: add .clang-format file
- aco: Format.
- aco/meson: remove inc_gallium from include_directories
- aco: fix self-intersecting register swaps
- aco: fix extract_vector optimization
- aco/isel: avoid unnecessary calls to nir_unsigned_upper_bound()
- aco/insert_waitcnt: Remove many unnecessary wait_imm.combine()
- aco/live_var_analysis: change worklist to a single integer
- aco/optimizer: ensure to not erase high bits when propagating packed constants
- aco: include <cstddef> in aco_util.h
Daniel Stone (69):
- CI: Disable Panfrost and radeonsi
- CI: Disable all Panfrost/AMD/Iris automatic jobs
- CI: Disable rk3399-gru-kevin jobs for now
- doc: Gratuitous promotion of Wayland
- docs: Even more gratuitous nitpicks
- Revert "CI: Disable rk3399-gru-kevin jobs for now"
- CI: Fix path confusion in OpenCL Piglit execution
- ci/zink: Skip flaky GLX test
- ci/radeonsi: Skip flaky glx-swap-copy test
- ci/windows: Artifact Meson build and test logs
- ci/windows: Re-enable Windows build
- ci: Add Piglit gl-1.0-blend-func to everyone's skips
- ci/lava: Iterate all job results, not just the first
- ci/lava: Handle proxy download failures
- ci/lava: Add validate-only mode to job submitter
- ci/lava: Add --dump-yaml option to submitter
- ci/bare-metal: Factor out environment to a separate script
- ci/bare-metal: Don't leak JWT into logs
- ci/lava: Move LAVA files to lava/
- ci/lava: Pass JWT separately from environment variables
- ci/lava: Cosmetic reordering of job init
- ci/lava: Wrap submission in a shell script
- ci/lava: Clean up variable naming, document them
- ci: Make PIPELINE_ARTIFACTS_BASE a common variable
- ci: Add JOB_ARTIFACTS_BASE variable
- ci: Use JOB_ARTIFACTS_BASE for Piglit fails
- ci/lava: Use per-job rootfs overlay for environment
- ci/panfrost: Remove useless variable
- ci/lava: Generate job name from lava-submit.sh
- ci/lava: Remove unused arguments
- ci/lava: Add explicit fatal-error handler
- ci/lava: Disable stdout/stderr buffering
- ci/lava: Dump and artifact YAML again
- ci/lava: Avoid tee as it ruins exit status
- ci/piglit: Fix path to uploaded images
- ci/lava: Always upload Piglit replay images to MinIO
- ci/lava: Set PIGLIT_NO_WINDOW
- ci/lava: Explicitly start Xorg for Iris EGL tests
- ci/bare-metal: Rename BM_KERNEL_MODULES to HWCI_KERNEL_MODULES
- ci/lava: Use HWCI_KERNEL_MODULES to load modules
- ci/lava: Rename environment variable script
- ci/bare-metal: Try harder to do NTP
- ci/bare-metal: Reorder init so network comes first
- ci: Move bare-metal init script to common directory
- ci: Be consistent about install path
- ci/bare-metal: Consistently set library paths
- ci/bare-metal: Split init script into two stages
- ci/bare-metal: Move devcoredump capture to CI common
- ci/lava: Start using devcoredump captures
- ci: Consistent pass/fail result output
- ci: Unify {BM,LAVA}_START_XORG environment
- ci: Unify {BARE_METAL,LAVA}_TEST_SCRIPT environment
- ci/bare-metal: Set CPU and GPU governors to max, disable GPU runtime PM
- ci/lava: Pass MinIO path on the command line
- ci/lava: Use common stage-2 init
- ci/lava: Drop bitrotten fastboot support
- ci/lava: Make kernel image type a normal argument
- ci/lava: Generate YAML from Python, not Jinja
- llvmpipe: Add handle export for resource_get_param
- Revert "ci: disable panfrost t760 jobs"
- CI: Disable LAVA devices for maintenance
- Revert "CI: Disable LAVA devices for maintenance"
- util/disk_cache: Don't leak when cache is empty
- panfrost/genxml: Decode Bifrost index-driven vertex jobs
- ci/panfrost: Temporarily disable sun50i/RK3288
- Revert "ci/panfrost: Temporarily disable sun50i/RK3288"
- vulkan/wsi/wayland: Initialise wl_shm pointer in VkImage
- egl/wayland: Error on invalid native window
- egl/wayland: Allow EGLSurface to outlive wl_egl_window
Danylo Piliaiev (36):
- nir: add lowering pass for helperInvocationEXT()
- turnip: implement VK_EXT_shader_demote_to_helper_invocation
- turnip: implement VK_KHR_shader_terminate_invocation
- ir3: treat 16b imul as mul.s24
- turnip: enable shaderInt16
- ir3: do not double threadsize when exceeding branchstack limit
- ir3: make possible to specify branchstack up to 64
- tu: do not corrupt unwritten render targets
- ir3: do not move varying inputs that depend on unmovable instrs
- ir3: do not fold cmps from different blocks with non-null address
- ir3: memory_barrier also controls shared memory access order
- ir3: update bar/fence bits in accordance to blob
- turnip: implement VK_KHR_vulkan_memory_model
- docs: mark off VK_KHR_vulkan_memory_model for turnip
- turnip,freedreno/a6xx: SP_BLEND_CNTL has per-mrt blend enable bit
- freedreno/a5xx: SP_BLEND_CNTL has per-mrt blend enable bit
- turnip: copy all layers specified in vkCmdCopyImage
- ci/turnip: drop fail annotation for float_control tests
- ci/turnip: drop fail annotation for image.extend_operands_spirv1p4.*
- turnip: do not ignore early_fragment_tests
- turnip: make possible to create read-only bo with tu_bo_init_new
- turnip: make cmdstream bo's read-only to GPU
- turnip: place a limit on the growth of BOs
- freedreno: reduce the upper bound of IB size by one
- turnip: reset push descriptor set on command buffer reset
- turnip: emit vb stride dynamic state when it is dirty
- turnip: fix register_index calculations of xfb outputs
- turnip: implement VK_EXT_provoking_vertex
- turnip: do not re-emit same vs params
- turnip: early exit in tu6_draw_common to save cpu cycles
- freedreno/computerator: pass iova of buffer to const register
- freedreno/isa: add uoffset type to print positive-only offsets
- ir3: add ldg.a,stg.a which allow complex in-place offset calculation
- glsl: Prohibit implicit conversion of mem parameter in atomicOP functions
- ir3: add newly found shlg.b16 instruction
- freedreno: fix wrong tile alignment for 3 CCU gpu
Dave Airlie (146):
- iris: move get_time into a static in bufmgr code.
- iris: move target to isl dim translate to inline.
- lavapipe: add support for non-dri loader on linux
- llvmpipe: split screen init up.
- llvmpipe: wrap late screen init with a mutex.
- llvmpipe: delay late screen creation until context init.
- lavapipe: fix mipmapped resolves.
- lavapipe: mark event_storage as volatile
- intel: move brw_ff_gs_prog_key/data to compiler.
- intel/compiler: add support for compiling fixed function gs
- i965: port fixed function geom shader to use compiler paths
- i965: drop old brw ff gs code.
- intel/genxml: align gen4/5 xml for store data immediate
- intel/genxml: rewrite the prefilterop xml to be more consistent.
- intel/genxml: move blitter command to render on gen4/5
- intel/genxml: fix raster op fields on gen4/5
- intel/decoder: fixup batch decoder for binding tables on gen4/5
- intel/decoder: add gen4/5 geometry state decode
- gallivm: handle texture arrays in non-fragment shaders with lod.
- llvmpipe: fix non-multisampled rendering to multisampled framebuffer
- llvmpipe: add the interesting bit of cpu detection to the cache.
- st/nir: always revectorise if scalarising happens.
- intel/gfx6: move xfb_setup outside the gs compiler into the driver.
- intel/isl: decrease isl_format_layouts size by 36k
- intel/isl: convert null surface fill to a struct.
- intel/isl: add levels and minimum array element to null fill
- intel/isl: add blend enable flag to gen4/5
- u_blitter: fix fs used when no color emitted
- u_blitter: fix stencil blit fallback for crocus.
- iris: drop unused function declaration
- nir/edgeflags: update outputs written when lowering edge flags.
- st/mesa: also disable other int textures
- intel/decode: handle gen4/5 WM state fragment shaders
- intel: reorder base program key.
- intel/compiler: add flag to indicate edge flags vertex input is last
- crocus: initial gallium driver for Intel gfx 4-7
- ci: add crocus to the build tests
- crocus: Don't call SET_TILING for dmabuf imports
- crocus: Make iris_bo_import_dmabuf take a modifier
- crocus: introduce main resource configuration helper.
- crocus: Drop buffer support in resource_from_handle
- crocus: hook up memory object creation from handle
- crocus: hook up resource creation from memory object
- crocus: plumb device/driver UUID generators
- crocus: enable GL_EXT_memory_object feature on gen7
- crocus: fix scanout tiling so glamor/modesetting can work.
- crocus: fixed some missing WM dirtys.
- crocus: fixup render aux usage function.
- crocus: disable Z16
- crocus/gen6: fix depth blit blorp regression.
- i965: fix regression in pipe control on g45
- crocus: drop dead gen prototypes.
- crocus: fixup stray tab
- crocus: rename genX proto functions to avoid iris conflicts.
- crocus: fixup workaround_bo to match 965.
- crocus: convert a bunch of is_haswell into verx10 checks.
- crocus: refactor blend state code.
- crocus/gen8: limit some pipe controls to gen7/hsw
- crocus: limit texture gather workarounds to gen7/hsw
- crocus/stencil: limit stencil workaround to gen7
- crocus/query: add gen8 support to queries by extending hsw checks
- crocus: extend l3 config setup to gen8
- crocus/gen8: add push constant support (extend hsw)
- crocus/gen8: extend some compute + state functions to gen8
- crocus/gen8: extend image support to gen8
- crocus: extend hsw cut index to gen8
- crocus/gen8: extend predicate handling to gen8.
- crocus/gen8: add sampler / border color support for gen8
- crocus/gen8: add l3 config support
- crocus/gen8: extending gen7 binding table pointers
- crocus/gen8: limit vertex buffer workarounds to ivb
- crocus/gen8: add raw pipe control support for gen8 workarounds
- crocus/gen8: add support for vertex instancing and index buffers.
- crocus/gen8: state base address + misc setup state.
- crocus/gen8: add VF topology support
- crocus/gen8: add PMA fix from iris
- crocus/gen8: add streamout support
- crocus/gen8: add SBE swiz support
- crocus/gen8: add VF SGVS support.
- crocus/gen8: add PS blend command support.
- crocus/gen8: refactor blend state for gen8
- crocus/gen8: add rasterizer state changes.
- crocus/gen8: add viewport support
- crocus/gen8: add depth stencil state support
- crocus/gen8: port over vs/gs/ds state changes.
- crocus/gen8: port over ps/wm state changes from iris.
- crocus/gen8: port over VFE/compute state changes
- crocus/gen8: handle gfx8 surface aux addr reloc.
- crocus/gen8: handle sampler differences
- crocus/gen8: hookup gen8 state generators
- crocus/gen8: add support for cherryview (env var for bdw)
- crocus: limit stencil swizzle change to older generations
- crocus/bufmgr: fix userptr left over fail
- crocus: Explicitly cast value to uint64_t
- crocus: free context state properly.
- crocus: fix vertex buffer leak on screen end.
- crocus: fix batch state bo leak
- meson/crocus: add prefer-crocus option.
- crocus/query: poll the syncobj in the no wait situation
- intel/genxml: fix gfx6 GS SVB_INDEX encoding
- crocus/gfx6: fix sampler view first level.
- crocus: dirty blend state more often.
- crocus: Avoid replacing backing storage for buffers with no contents
- crocus/gfx6: always be dirtying gs attachments for xfb
- crocus: fix another printf specifier.
- crocus/gen8: add back z16 support for gen8
- crocus: disable Z16 unorm textures on pre-gen8 as well.
- gallium/sw: add sw_vk bit to avoid having to futz with env vars for lavapipe
- zink: drop getenv hacking now that gallium is fixed.
- iris: make iris_bind_reserve_3d and Wa_1604061319 only check for dirty render bindings
- crocus: cleanup some deadcode in the gen5 blend emit
- crocus: expose ARB_blend_func_extended on gen 45/50
- crocus/gen5: enable support for GL_EXT_gpu_shader4
- crocus: fix crash on index buffer rebinding.
- crocus: fixup index buffer dirtying.
- draw: fix tessellation output vertex size calculation
- draw/tess: write correct primitive id into vertices
- crocus: inline the d/s resource handling functions
- crocus: don't update draw parameters unless needed
- crocus: optimise bo_unref path a little.
- crocus: inline group_index<->bti
- crocus: reorder version checks on indirect xfb
- crocus: restrict prim_restart on index buffer check to pre-hsw
- crocus: support rebinding streamout target buffers
- crocus: use threaded context base classes
- crocus/tc: init/deinit threaded resource
- crocus: add unsync transfer pool
- crocus: enable threaded context support
- ac: fix win32 build
- crocus/gen8: fix wrap mode needs border color.
- crocus: add GL_CLAMP emulation in driver again.
- vulkan/wsi/wl: add wl_shm support for lavapipe.
- lavapipe: add the separate depth/stencil layout enable.
- crocus: use simple_mtx in the bufmgr
- lvp: fixup multi draw memcpys
- draw: handle resetting draw_id between instances.
- softpipe/aniso: move DDQ calculation to after scaling.
- crocus/gen4-5: fix ff gs emit on VS vue map change.
- llvmpipe: add support for time elapsed queries.
- draw/llvmpipe: multiply polygon offset units by 2
- teximage: return correct desktop GL error for compressedteximage
- crocus/gen4: restrict memcpy mapping to gen5
- intel/fs: restrict max push length on older GPUs to a smaller amount
- crocus/gen45: fix mapping compressed textures
- intel/genxml: fix raster operation field in blt genxml
- crocus: add support for set alpha to one with blt.
Dmitry Baryshkov (2):
- freedreno/regs: split DSI PHY registers to separate xml files.
- freedreno/regs: split old/not used phy registers to separate DB
Drew Davenport (1):
- radeonsi: Report multi-plane formats as unsupported
Duncan Hopkins (3):
- zink: Correct compiler issue with have_moltenvk member having been moved.
- gallium/dri: Guard DRI driver global variables on MacOS if Zink is enabled.
- zink: Fix MacOS compiling issues
Dylan Baker (27):
- meson: OpenMP is supposed to be optional
- docs: add release notes for 21.0.3
- docs: update sha256 sum for mesa 21.0.3
- docs: update calendar and link releases notes for 21.0.3
- docs: update calendar for 21.1.0-rc1
- docs: update calendar for 21.1.0-rc2
- docs: update calendar for 21.1.0-rc3
- meson/vulkan: fix linkage on windows
- docs: Add calendar entries for 21.2 release candidates.
- VERSION: bump for 21.2-rc1
- .pick_status.json: Update to f40a08d25c91256cd3dff0211b8e10d5bbb3734e
- .pick_status.json: Update to a62973580b7846f2213cbd2589e9473c26596683
- .pick_status.json: Update to 27534a49cf3872646cb8ef9371707d74a81b1986
- VERSION: bump for 21.2-rc2
- .pick_status.json: Update to b45cddda183230232937387f91d009500b2372c9
- .pick_status.json: Update to 49908c602ffd2d84063effa7ddd0ee842be41a89
- VERSION: bump for 21.2.0-rc3
- .pick_status.json: Update to dff0d9911d176802b54890c796e19f56c50f24e1
- .pick_status.json: Update to b8e29e89366a5264391dc7c10e778330b7add66a
- freedreno/ir3: Add build id to the disassembler test
- .pick_status.json: Mark 8cb795b4772f882024b20c4d4b051b2411dd1a8c as denominated
- .pick_status.json: Update to 87b0962fef4e447a2ea9c76a611aa20b109a259d
- .pick_status.json: Update to 842b8c8965327615f4692384a905dd63f1fba63d
- .pick_status.json: Update to 97be8e42e42f3b739c3de808553094f86ad8879f
- bin/gen_release_notes: Add basic tests for parsing issues
- bin/gen_release_notes: Don't consider issues for other projects
- bin/gen_release_notes: Fix commits with multiple Closes:
Eleni Maria Stea (5):
- egl: fix in expected type
- util: replaced ENODATA with ENOATTR for non-Linux systems
- util: Removed unused statement from FreeBSD build
- intel: struct bitset is renamed to brw_bitset
- intel: PAGE_SIZE used in allocators shouldn't be defined on FreeBSD
Ella-0 (1):
- anv: expose primary node to VK_EXT_physical_device_drm even when VK_KHR_display is not enabled
Emil Velikov (1):
- gbm: list to stderr all the missing extension
Emma Anholt (251):
- ci/freedreno: Merge a630 piglit to a single job.
- freedreno: Fix YUV sampler regression.
- ci/virgl: Mark a couple of new Crash tests as flakes.
- ci/freedreno: Skip some precision tests on a530.
- nir_to_tgsi: Use ARL instead of UARL in the !native_integers case.
- nir: Generate load_ubo_vec4 directly for !PIPE_CAP_NATIVE_INTEGERS
- ci/lavapipe: Don't include deqp's shader_cache in the artifacts.
- ci/lava: Return the run's results/ artifacts from the DUTs.
- ci/piglit: Always include the HTML summary in a run.
- ci/lava: Point the shader cache at tmpfs.
- mesa: Remove dead _mesa_unpack_rgba_block().
- util: Switch the non-block formats to unpacking rgba rows instead of rects.
- util/format: Add some NEON intrinsics-based u_format_unpack.
- panfrost: Enable packed uniforms.
- zink: Enable PIPE_CAP_PACKED_UNIFORMS.
- ci: Build deqp-egl targeting x11_egl_glx
- ci/llvmpipe: Test dEQP-EGL against Xvfb.
- ci/freedreno: Test dEQP-EGL against Xorg.
- mapi: Respect MESA_DEBUG=silent for no-op debug output.
- freedreno: Mark glsl-fs-fogscale as a Fail.
- freedreno/a6xx: Don't try to do Z-as-RGBA blits for mismatched formats.
- util: Fix big-endian handling of z/s formats.
- mesa: Deduplicate _mesa_pack_ubyte_stencil_row()
- mesa: Deduplicate _mesa_pack_float_z_row().
- mesa: Deduplicate _mesa_pack_uint_z_row().
- mesa: Remove dead _mesa_get_pack_float_z_func().
- mesa: Move z24s8-to-z24s8 packing fastpath to swrast.
- mesa: Move per-pixel Z pack functions to swrast.
- mesa: Remove dead _mesa_pack_ubyte_rgba_rect().
- mesa: Replace _mesa_pack_ubyte_rgba_row() with pack_ubyte_rgba_8unorm().
- ci/radeonsi: Mark a glx_arb_sync_control/timing flake.
- turnip: Only write the tu_RegisterDeviceEXT() out fence on success.
- ci: Add missing vulkan dep for freedreno (turnip) and v3dv test jobs.
- u_format: Fix z32_s8x24 s8 unpacking on big-endian.
- u_format: Add missing BE swizzles for R8SG8SB8UX8U_NORM
- ci/freedreno: Mark dEQP-EGL flakes reported on IRC since its introduction.
- ci/freedreno: Mark new flakes from the go-fast branch.
- ci/freedreno: Mark another recent piglit flake.
- ci/freedreno: Fix the recent-a5xx-texture-flakes matches.
- ci/freedreno: Add another db820c flake that's appeared in the last few months.
- tgsi: Mark the tgsi_exec_channel and tgsi_double_channel ALIGN16.
- turnip: Add support for VK_EXT_separate_stencil_usage.
- ci/freedreno: Mark a5xx texture gather as flaky.
- turnip: Demote API version to 1.1.
- ci/llvmpipe: Add testing of gles3/31/gl.
- ci/lavapipe: Add fractional NIR stress test job.
- freedreno/a5xx: Fix up border color pointers.
- gallium/tgsi_exec: Drop the unused dst_datatypes from dest stores.
- tgsi_exec: Drop unused destination dimension support.
- tgsi_exec: Mark the store file default case as unreachable.
- gallium/tgsi_exec: Simplify bounds checks on the const file.
- turnip: Switch to the shared vulkan ICD generator.
- turnip: Move the extension tables to tu_device.c
- ci/freedreno: Add another daily dose of a530 flakes.
- turnip: Drop wideLines properties since we don't support wide lines.
- turnip: Claim 2 discrete queue priorities.
- freedreno: Update editorconfig and emacs settings for freedreno reformat.
- ci/turnip: Clean up some stale fail annotations.
- ci/turnip: Add some links to issues and MRs for some test failures.
- turnip: Drop fail annotation for driver_properties.
- ci: Switch to apitraces for glmark2
- ci/panfrost: Add some more traces to replay.
- ci/iris: Add some more traces to replay.
- ci/freedreno: Skip refract on a306 now that it hangchecks sometimes.
- midgard: Fix type for vertex_builtin_arg() and compute_builtin_arg().
- ci/freedreno: Skip a test that's taking out the a530 boards.
- ci/freedreno: Mark two more recent intermittent a530 flakes.
- ci/deqp: Make DEQP_EXPECTED_RENDERER a required regex for VK like for GLES.
- ci/intel: Add test jobs for dEQP.
- vulkan: Avoid stomping array padding in the MemoryProperties wrapper.
- mesa/st: Only use 16-bit ints or floats in the NIR path.
- i915g: Disable 3D-pipeline clears.
- i915g: Switch batchbuffer dumping to mesa_logi().
- i915g: Fix dumping of the FS in batchbuffers.
- ci/i915g: Introduce manual testing of i915g using anholt's runner.
- i915g: Make the FS for compile failures write red instead of DIFFUSE.
- i915g: Add support for the .Absolute flag on TGSI srcs.
- i915g: Stop advertising support for indirect addressing in the FS.
- i915g: Fix writing of undefined depth value if not writing any outputs.
- i915g: Fix undefined results for TGSI_OPCODE_KILL
- ci/iris: Switch GLK back to manual testing.
- ci/freedreno: Clear compswap flake annotation.
- ci/freedreno: Clear stale validation failure flake annotation.
- ci/freedreno: Drop a630 flake annotation from the go-fast changes.
- ci/freedreno: Add a link explaining get_display_plane_capabilities
- ci/freedreno: Drop VK flake annotations not seen in the last ~year.
- ci/freedreno: Consolidate ssbo.fragment_binding_array flake annotation.
- ci/freedreno: Mark a630 glx-visuals-depth/stencil as piglit flakes.
- ci/freedreno: Also mark waitformsc as flaky.
- ci/freedreno: Add glx-copy-sub-buffer to flakes on a530 and a630.
- mesa/st: Fix iris regression with clip distances.
- ci/freedreno: Add another a630 piglit flake.
- ci/freedreno: Turn off default a530 quick_gl testing, do full quick_shader.
- turnip: Reorganize copy_format()'s switch statement.
- turnip: Make sure that SNORM blits don't clamp ambiguous -1.0 values.
- Revert "ci: Configure DUTs for max performance"
- ci: Add known-flake handling for the IRC flake reports
- ci: Move the flakes channels to OFTC
- util: Add a helper macro for defining initial-exec variables.
- android: Fix ELF TLS support.
- ci/android: Update to building for SDK 29 by default.
- u_format: Fix some pep8 in u_format_parse.py.
- u_format: Drop redundant .name init.
- u_format: Move the BE swizzle computation into Format init.
- u_format: Use the nice helper for reversing an array.
- u_format: Assert that array formats don't include BE swizzles.
- u_format: Define tests for r3g3b2 formats and fix BE swizzles for them.
- u_format: Fix the BE channel ordering for R5G5B5A1_UINT.
- u_format: Sanity check the BE channels for all bitmask formats.
- u_format: Sanity check that BE swizzles are appropriately mapped from LE.
- u_format: Use the computed BE channels/swizzles for bitmask formats.
- ci/freedreno: Add some more known flakes from recent marge runs.
- docs/freedreno: Update for the fanin/fanout -> collect/split rename.
- docs/freedreno: Rewrite the section on array access.
- tgsi_exec: Garbage-collect the FAST_MATH path.
- u_math: Reduce fast-log2 table size from 65k entries back to 256.
- llvmpipe: Don't call util_init_math().
- ra: Add a unit test.
- ra: Document that class index is allocated in order, use that in r300.
- ra: Use struct ra_class in the public API.
- ra: Add fast-path support for register classes of contiguous regs.
- vc4: Use the ra_alloc_contig_reg_class() function to speed up RA.
- v3d: Use the ra_alloc_contig_reg_class() function to speed up RA.
- intel/fs: Use ra_alloc_contig_reg_class() to speed up RA.
- intel/vec4: Use ra_alloc_contig_reg_class() to reduce RA overhead.
- lima: Use ra_alloc_contig_reg_class().
- util/ra: Use the conflicting neighbor to skip unavailable registers.
- ci/i915g: Fix incorrect expectation.
- i915g: Make sure we don't try to texture from the const file.
- ci/lava: Finish garbage-collecting the TEST_SUITE variable
- ci: Update piglit and deqp/piglit-runner.
- ci/freedreno: Enable running all of piglit_gl for a530's manual test.
- ci/piglit: Skip WGL on all the Linux runs.
- ci/fastboot: Add a serial timeout to catch fastboot prompt failure.
- ci/fastboot: Consistently restart the run on intermittent conditions.
- ci/iris: Enable piglit testing on AML-Y.
- ci: Disable Xorg's screensaver entirely.
- ci/deqp: Drop stress/perf skips lists.
- ci/deqp: Skip flush_finish on all CI jobs.
- ci/softpipe: Move the flake to the flakes list.
- ci: Add a flakes IRC channel for llvmpipe/softpipe.
- ci/deqp: Skip dEQP-VK.wsi.display.get_display_plane_capabilities
- ci/piglit: Move the WGL skip to a common skips file.
- ci/piglit: Skip glx_arb_sync_control@timing.* on all systems.
- freedreno: Drop batch-cache orphan tracking.
- freedreno: Make a bunch of the batch cache take ctx as the arg.
- freedreno: Drop a bit of indirection around the batch cache flush path.
- freedreno: Be more strict about QUERY_AVAILABLE to simplify the code.
- freedreno: Fix batch reference handling in flush_resource().
- freedreno: Move the !MAP_WRITE write batch refcounting to the branch.
- freedreno: Remove broken back_blit optimization.
- freedreno: Flush batches upon destroying the ctx.
- freedreno: Add perf_debug() for our software conditional rendering.
- freedreno: Move FD_MESA_DEBUG=msgs output to mesa_logi.
- freedreno/fdl: Give the tiling mode a nice name in debug dumps.
- freedreno: Add more detailed blit debug in FD_MESA_DEBUG=msgs.
- freedreno: Skip staging blits from uninitialized resources.
- freedreno: Add some cheza flakes from the last week.
- i915: Disable vertex texturing and delete the code.
- i915: Drop assertion failure about seeing each const decled once.
- i915g: Allow fragment coord conventions TGSI properties to be set.
- nir/lower_int_to_float: Make sure the cursor is in the right spot.
- nir: Do peephole select on other instructions if the limit is ~0.
- nir_to_tgsi: Fix internal handling of NIR uints for !CAP_INTEGERS
- nir_to_tgsi: Support integer sysvals on !CAP_INTEGERS hardware.
- i915g: Handle fragment depth being in OUT[1] not OUT[0].
- i915g: Switch to using nir-to-tgsi.
- i915g: Add triangle provoking vertex support.
- freedreno: Move some driver debug printfs to mesa_logd.
- freedreno/ir3: Move the assert output to mesa_loge().
- util/log: Add a streaming printf interface.
- freedreno/ir3: Use mesa_log_stream() for ir3 disassembly.
- freedreno/ir3: Move the native code output to mesa_log as well.
- nir: Add an interface for logging shaders with mesa_log*.
- freedreno/ir3: Move NIR printing to mesa_log.
- gallium/draw: Garbage collect draw_set_force_passthrough
- gallium/draw: Garbage collect draw_pt_fetch_emit.
- gallium/util: Introduce a helper for finding whole-resource blits.
- freedreno: Move the rsc-based batch flushing to helper functions.
- freedreno: Handle full blit discards by invalidating the resource.
- freedreno: Cooperate with tc to stop checking the BC for resource_busy().
- ci/llvmpipe: Mark two more multithread program link flakes.
- i915g: Remove nr_cbufs loop.
- i915g: Create an i915_surface for our pipe_surfaces.
- i915g: Compute 3DSTATE_BUF_INFO flags at surface create time.
- i915g: Move cbuf color swizzle lookup to CSO creation time.
- i915g: Simplify color write mask setup.
- i915g: Use the color swizzle to reshuffle the blend const color.
- i915g: Clear xfails for vertex texturing.
- freedreno: Fix leak of the screen hash table.
- i915g: Fix GL_ARB_copy_buffer assertion fails.
- i915g: Fix bad naming of depth texture formats.
- i915g: Finish out blend factor overrides for both RGBx and A8.
- ci/i915g: Skip the piglit glx tests since we're not running X.
- freedreno/ir3: Report RA failure with mesa_loge().
- turnip: Link more MRs and issues related to our xfails.
- turnip: Use vk_startup_errorf() in more startup paths.
- ci/turnip: Document create_instance_device_intentional_alloc_fail's fail.
- turnip: Disable buffer texturing on 422 formats.
- Revert "freedreno: Cooperate with tc to stop checking the BC for resource_busy()."
- nir: Add a helper for chasing movs with nir_ssa_scalar().
- turnip: Short-circuit if ladder generation for constant index SSBO/UBOs.
- i915g: Apply clang-format.
- i915g: Bake the decls and program together.
- i915g: Allow use of I915_DEBUG= options on non-DEBUG builds.
- i915g: Enable dumping of fragment shaders under I915_DEBUG=fs.
- i915g: Use the normal compile error path for empty FSes.
- i915g: Log program compile errors to mesa_loge().
- i915g: Stop translating the fragment program on the first error.
- i915g: Improve logging of unsupported opcodes.
- i915g: replace "uint" with normal uint32_t.
- i915g: Use stdbool.h instead of custom bools.
- i915g: Remove redundant p->error setting.
- i915g: Mark program errors on setting up temps, constants, and immediates.
- i915g: Fix off-by-one in constant count assertion.
- intel: Early exit from inst_is_in_block().
- i915g: Finish the uint -> uint32_t conversion.
- i915g: Add the nice cube map layout comments from i915c.
- i915g: Fix FS debug dumping for declarations.
- i915g: Delete redundant i915_hw_sampler_views atom.
- i915g: Add curly braces for normal mesa style (and helps clang-format)
- i915g: Set up the cube map texture wrap modes.
- freedreno: Update comments about PIPE_BUFFER shadowing.
- freedreno: swap ->valid when shadowing resources.
- freedreno/a5xx: Make sure to mark blit read/write access in the BC.
- freedreno: Stop manually marking blit dst buffers as valid.
- freedreno: Swap needs_ubwc_clear when shadowing.
- freedreno: Flush the shadowed resource's write batch up front.
- i915g: Add support for per-vertex point size.
- i915g: whitespace fixup from the cube map fix.
- i915g: Force 1D textures to use wrap mode for the Y coordinate.
- i915g: Make sure the 1D texture Y channel is initialized.
- anv: Fix unused var warning on release builds from an assertion.
- nir: Add a nir_instr_remove that recursively removes dead code.
- nir: Use remove_and_dce for nir_shader_lower_instructions().
- nir: Free the instructions in a DCE instr removal.
- i915g: Fix writemasking of SEQ/SNE/SSG.
- nir_to_tgsi: Run copy prop (and thus dce) after lower_bool_to_float.
- nir_to_tgsi: Declare immediates as float on non-native-ints hardware.
- turnip: Fix allocation size for vkCmdUpdateBuffer.
- i915g: Fix dumping of 3DSTATE_BACKFACE_STENCIL_OPS.
- i915g: Fix backface stencil when front_ccw is set.
- ci: Make sure that we build the piglit dmabuf tests.
- freedreno: Suballocate our long-lived ring objects.
- freedreno/a6xx: Reduce the size of the config stateobj allocation.
- freedreno/a6xx: Reduce the size of the long-lived texture stateobj.
- freedreno/a6xx: Allocate just enough memory for SO state, only if we do SO.
- freedreno: Optimize duplicate obj-obj ring relocs.
- i915g: Fix release build compiler warnings.
- ci: Enable testing of i915g in the debian -Werror release build.
- freedreno: Lock access to msm_pipe for RB object suballocation.
Enrico Galli (10):
- microsoft/compiler: zero out unused WebGPU system values
- microsoft/compiler: Remove de-duplication of arbitrary semantic names
- d3d12, microsoft/compiler: Switching semantic names to TEXCOORD
- d3d12, microsoft/compiler: Moving driver_location allocation to compiler
- util: Add simple test for util_qsort_r
- util: Add qsort_r/s args adapter for MSVC and BSD/macOS
- nir: Add modes filter to nir_sort_variables
- microsoft/compiler: Switch io sort to use nir_sort_variables_with_modes
- microsoft/spirv_to_dxil: Add driver_location assignment
- microsoft/compiler: Add support for get_ssbo_size to translator
Eric Engestrom (15):
- VERSION: bump to 21.2.0-devel
- docs: reset new_features.txt
- egl/x11: don't forget to exit the attrib list loop
- docs: add release notes for 21.1.0
- docs: add release notes for 21.1.1
- docs: update calendar and link release notes for 21.1.0
- docs: update calendar and link release notes for 21.1.1
- docs/release-calendar: add the schedule for the 21.1 branch
- docs: add release notes for 21.1.2
- docs: update calendar and link release notes for 21.1.2
- docs: add release notes for 21.1.3
- docs: update calendar and link release notes for 21.1.3
- docs: add release notes for 21.1.4
- docs: update calendar and link release notes for 21.1.4
- docs/release-calendar: add a few more 21.1 releases
Erico Nunes (7):
- gallium/hud: create vs_text to match fs_text
- gallium/hud: extend check for has_srgb
- docs/lima: add an initial page for Lima
- lima: enable z16 format
- lima: add reload command to the command dump
- meson: kmsro: require dri3 for X11
- lima: avoid crash with negative viewport values
Erik Faye-Lund (193):
- zink: fix stencil-export cap emission
- lavapipe: resolve border-color when creating sampler
- lavapipe: implement VK_EXT_custom_border_color
- nir/lower_tex: do not stumble on 16-bit inputs
- zink: document requirement of VK_EXT_custom_border_color
- gallivm: handle 16-bit input in i2b32
- gallivm: run nir_opt_algebraic_late
- gallivm: add 16-bit integer support
- zink: do not require vulkan memory model for shader-images
- docs: write basic meta-documentation
- zink: do not read outside of array
- docs: remove out-of-date gles info
- docs: remove documentation of MESA_CI_VISUAL
- docs: remove documentation of MESA_PRIVATE_CMAP
- docs: remove documentation of MESA_HPCR_CLEAR
- docs: nest cherry-pick example under note
- docs: use tables instead of pre-formatted text
- docs: use math notation for example matrices
- docs: use code-block for console-content
- docs: use code-block for glsl
- docs: use code-block for c
- docs: use code-block for ini
- zink: only emit extended-formats cap if needed
- zink: remove memory-model leftovers
- docs: fixup link to extension
- docs: fix quoting around a few limits
- zink: correct image cap checks
- docs: add missing zink-requirement
- docs: someome -> someone
- zink: enable required instance ext
- zink: make zink_binding private
- zink: remove stray semicolons
- zink: fixup bad indentation
- docs: remove out-of-date versions doc
- zink: fix shader-image requirements
- zink: correct an extension-link
- docs: fixup indentation of radeonsi envvar values
- docs: document r600 envvars
- zink: use UINT32_MAX instead of UINT_MAX
- zink: respect bit-size of dref-result
- zink: run nir_opt_algebraic_late
- zink: always lower function-temp derefs
- zink: support emitting 16-bit int types
- zink: enable 16-bit int support
- zink: support emitting 16-bit float types
- zink: perform fp16 texture-lookups as fp32 and then convert
- zink: enable 16-bit float support
- zink/codegen: prefer first definition of prop/feature structs
- zink: also enable float16 from KHR extension
- lavapipe: consistently use nir macros
- docs: update gallium doxygen docs
- zink: handle matrix-types after vectors
- zink: cache SpvId for aggregate glsl_types
- zink: always enable fixed shader-caps
- zink: do not check for varying output for fragment shaders
- zink: emit cap early
- zink: remove needless shader-info from context
- zink: emit sample-shading cap early
- zink: emit cap early
- zink: only emit ImageBuffer cap if needed
- docs: do not generate redirects on error
- gallium/u_vbuf: avoid dereferencing NULL pointer
- freedreno/a5xx: Remove ppgtt hack
- docs: remove doxygen support
- zink: remove incorrect border-swizzle assumption
- lavapipe: emit correct textures_used for texture-arrays
- zink: do not ask glsl-compiler to unroll
- lavapipe: fix fsum with swizzle
- st/mesa: do not take util_logbase2 of a negative size
- zink: check for error when binding memory
- gallium: allow to report errors from p_screen::resource_bind_backing
- lavapipe: report out-of-memory when binding
- llvmpipe: allow calculating size of overly large texture
- lavapipe: report allocation-error
- lavapipe: correct reported number of UBOs
- translate: reserve more vertex-shader outputs
- translate: assert that nr_elements is in range
- ci: Uprev piglit to 3351e8952 ("max-texture-size: report merged results")
- docs/features: document GL_ARB_ES3_2_compatibility support for zink
- docs/features: mark a few more extensions as done for zink
- zink: fix provoking-vertex cap for quads
- docs: promote #dri-devel on oftc over freenode
- docs: update link to #zink
- docs: update location of #panfrost
- docs: update link to #lima
- zink: simplify emit_load_const
- v3d: use helper to simplify things
- ci: downgrade sphinx to v3.x
- docs: update another IRC reference
- docs: update another IRC reference
- docs: drop clayton from intel-ci notice
- zink: use actual const for const offset
- lavapipe: handle cube-array image-views
- lavapipe: do not interpret cube-compatible as cubemap
- zink: only mark resources as cube-compatible if supported
- zink: mark 2d-arrays as cube-compatible
- zink: implement half-float packing
- zink: untangle have_EXT_debug_utils and ZINK_DEBUG_VALIDATION
- zink: add support for string-markers
- util/prim_restart: revert part of bad fix
- docs: quote a few defines
- docs: fix header-levels in envvars.rst
- docs: use file-role for paths
- docs: use envvar role for envvars
- docs: add the doc-comment for fse-vars
- docs: do not list all gles major versions
- docs: update list of apis to match website
- docs: update llvm requirement
- docs: rename vmware-guest article
- docs: clean up list of deprecated systems
- docs: move swrast to deprecated drivers list
- docs: clean up software-drivers list
- docs: clean up openswr links
- docs: split out layered driver to its own list
- docs: clean up freedreno links
- docs: add links to documented drivers
- r600: explicitly advertise index buffer format support
- zink: limit images we mark as cube-compatible
- zink: rename spirv_15 bool to spirv_1_4_interfaces
- zink: allow to specify any spir-v version to nir_to_spirv
- zink: calculate spir-v version based on vk version
- zink: only enable vote if we can support it
- zink: use a macro for spir-v versions
- st/pbo: use correct type for images and textures
- docs: update master -> main in edit-links
- zink/ci: increase piglit and deqp-runner timeouts
- llvmpipe: fix edge-rule logic for lines
- llvmpipe: consistently deal with post-rast state
- llvmpipe: fix multisample lines again
- llvmpipe: do not always use pixel-rounded coordinates for points
- zink/ci: re-enable test
- zink: reject more illegal blits
- zink: limit non-extension version feature to spirv 1.5
- zink: use correct type for u_bit_scan
- zink: do not unmap dt-buffers twice
- zink: drop paranoid code
- zink: add missing compiler-dependency
- zink: drop some more vla usage
- zink: fix more initializer styles
- zink: introduce a define for max descriptors per type
- zink: use max-descriptor define
- zink: use alloca instead of hard-to-size vlas
- zink: correct type of flags to flush
- zink: fixup signedness of subtraction
- zink: remove unused function
- zink: drop repeated usage-bit
- zink: do not check buffer-format for usage-bits
- docs: remove outdated meson-section
- docs: remove outdated clarification
- docs: drop historic meson details
- docs: use more file-roles
- docs: use rst captions
- wgl: remove hard limit on pixelformats
- zink: drop unused macros
- zink: remove unused function-pointers
- zink: unbreak moltenvk code
- zink: remove unused moltenvk functions
- zink: do not store moltenvk functions in screen
- zink: remove some needless moltenvk details
- libgl-gdi: add missing include
- iris/ci: disable amly jobs
- aux/trace: fix bool argument
- zink: cast pointers to uintptr_t
- ci/windows: work around meson encoding issues
- ci/windows: enable msvc builds of zink
- ci/windows: fix zink msvc build-rules
- gallium/u_threaded: do not apply start twice
- ci: fix source-deps for radv on windows
- zink: hook up line-rasterization ext
- zink: use bit-allocation for boolean rasterizer-state
- zink: support line stippling
- zink: fill in the right line-mode based on state
- docs: update zink requirements
- llvmpipe: reject unsupported shader-image formats
- lavapipe: query formats for shader-image support
- llvmpipe: only report supported shader-image formats
- lavapipe: expose more storage-image features
- lavapipe: do not disable multisampling for smooth lines
- lavapipe: fix disable_multisample condition
- gallium: explicitly specify line rasterization mode
- draw: respect line_rectangular state
- llvmpipe: respect rectangular_lines
- lavapipe: re-expose line-rasterization extension
- lavapipe: expose strict-lines feature
- zink: implement support for non-planar DRM modifiers
- zink: remove duplicate format-mapping on little-endian
- vulkan: do not map zero-sized region of memory
- vulkan: allocate host-visible memory for swapchain images
- zink: check for right feature
- zink: respect line_rectangular state
- lavapipe: do not assert on more than 32 samplers
- lavapipe: do not mark unsupported tests as crashing
- d3d12: split up root parameter update and set
Erik Kurzinger (1):
- vulkan/device_select: avoid segfault on Wayland if wl_drm is unavailable
Ernst Sjöstrand (1):
- nv50: Fix use of initializers on older compilers
Ezequiel Garcia (2):
- panfrost: Add GPU IDs for G52 1-Core-2EE (RK3568/RK3566)
- panfrost: Rename G52 product ID 0x7402 as G52r1
Felix DeGrood (16):
- intel: add L3 Bypass Disable to gen xml
- iris: Cache VB/IB in L3$ for Gen12
- iris: reduce redundant tile cache flushes
- intel/blorp: remove tile flush from emit surface state
- intel/compiler: Use switch for DERIVATIVE_GROUP logic
- intel/compiler: refactor DERIVATIVE_GROUP logic
- intel/compiler: tileY friendly LID order for CS
- intel/compiler: balanced tileY/linear friendly LID order for CS
- anv: Cache VB/IB in L3$ for Gfx12
- anv: Add debug messages for DEBUG_PIPE_CONTROL
- anv: Clear all pending stall after pipe flush
- anv: Remove Tile Cache flush from SBA, Pipe Select
- anv: remove unnecessary Tile Cache flushes
- anv: Only flush Tile Cache on VK_ACCESS_HOST_R/W
- anv: Add ANV_PIPE_HDC_PIPELINE_FLUSH_BIT
- anv: Replace DC Flush with HDC Pipeline Flush
Francisco Jerez (20):
- intel/fs: Implement representation of SWSB cross-pipeline synchronization annotations.
- intel/fs: Add helper functions inferring sync and exec pipeline of an instruction.
- intel/fs: Represent SWSB in-order dependency addresses as vectors.
- intel/fs: Calculate SWSB cross-pipeline synchronization information.
- intel/fs: Use CHV/BXT implementation of 64-bit MOV_INDIRECT on XeHP+.
- intel/fs: Fix repclear assembly for XeHP+ regioning restrictions.
- intel/fs: Handle regioning restrictions of split FP/DP pipelines.
- intel/eu: Teach EU validator about FP/DP pipeline regioning restrictions.
- intel/compiler: Lower integer division on XeHP.
- intel/fs: Introduce lowering pass to implement derivatives in terms of quad swizzles.
- intel/fs: Add more efficient fragment coordinate calculation.
- iris/gen12: Work around push constant corruption on context switch.
- iris/gfx12: Invalidate ISP at the end of every batch.
- intel/fs/xehp: Assert that the compiler is sending all 3 coords for cubemaps.
- intel/fs: Track single accumulator in scoreboard lowering pass.
- intel/fs: Implement Wa_22012725308 for cross-pipe accumulator data hazard.
- intel/fs: Add SWSB dependency annotations for cross-pipeline WaR data hazards on XeHP+.
- intel/fs: Teach IR about EOT instruction writing the accumulator implicitly on TGL+.
- intel/fs: Fix synchronization of accumulator-clearing W/A move on TGL+.
- intel/fs: Implement Wa_14013745556 on TGL+.
Georg Lehmann (11):
- radv: Fix compatible image handle type for dmabufs.
- v3dv: use VKAPI_ATTR and VKAPI_CALL.
- zink: Add a missing VKAPI_ATTR.
- vulkan: Update the XML and headers to 1.2.180
- radv: Implement VK_EXT_global_priority_query.
- ac: Check me_fw_feature for 32bit predication on gfx10.3
- ac: Enable 32bit predication on gfx10.
- ac: Enable 32bit predication on gfx9 with fw feature version 52.
- lavapipe: Use common default allocator.
- lavapipe: Add a missing VKAPI_ATTR.
- vulkan/wsi/wayland: Add support for more SRGB formats.
Gert Wollny (40):
- Revert "r600: don't set an index_bias for indirect draw calls"
- Revert "r600: Don't advertise support for scaled int16 vertex formats"
- r600: don't set an index_bias for indirect draw calls
- virgl: use pipe_draw_info::restart_index only when primitive_restart is enabled
- r600: update pipe_draw_info::restart_index only when primitive_restart is enabled
- nir/opt_algebraic: optimizations for add umax/umin with zero
- nir: Add filter callback for lower_to_scalar to the options
- gallium: pass lower_to_scalar_filter to lower_to_scalar pass
- r600/sfn: lower to scalar with filter applied
- mesa: add an extension MESA_bgra
- compiler/nir: check whether var is an input in lower_fragcoord_wtrans
- nir/linker: add option to ignore the IO precisions for better varying packing
- r600/sfn: Ignore precision when linking
- r600: don't put INTERP_X and INTERP_Z into one instruction group
- r600/sfn: Use valid pixel mode only in fragment shaders
- r600/sfn: Use valid pixel mode for SSBO and Image result fetches
- r600/sfn: force new CF if fetch through TC would be used in same clause
- r600/sfn: Lower FS pos input w-transform in NIR
- r600/sfn: Don't check the faction when searching for the input slot
- r600/sfn: count only distinct literals per instruction group
- r600/sfn: Fix Cayman trans ops
- r600/sfn: Use unified index register code for samplers
- r600/sfn: Use unified code path for index register load
- r600/sfn: Fix texture gather for Cayman
- r600/sfn: Fix ssbo/image atomic swap for Cayman
- r600/sfn: Fix Cayman SSBO write with more than one value
- r600/sfn: Fix Geometry shader for Cayman
- r600/sfn: read number of images from shader info
- r600/sfn: Fix cube query layer number for indirect access
- r600/sfn: Add lowering pass to legalize image access
- r600/sfn: legalize image access on Cayman
- r600: Enable NIR debug flags also for Cayman
- r600/sfn: don't use designated initializers, since they are c++20
- r600/sfn: don't read back unused image atomic result values
- r600/sfn: Drop method for emit_atomic_add, it is handled in generic code
- r600/sfn: Don't read return values of atomic ops that are not used
- r600/sfn: Clean up some ALU lowering and move code
- r600/sfn: Lower offset in TXF instructions
- virgl: Enable ASTC formats also for 3D textures
- r600/sfn: initialize all texture lower options
Gustavo Padovan (10):
- traces-iris: fix expectation for Intel GLK
- gitlab-ci: enable Intel AML-Y as experimental
- gitlab-ci: rule anchor for experimental devices as manual in MRs
- gitlab-ci: enable all 3 intel devices as manual in MR pipelines
- iris/ci: disable failing gimark test for now
- iris/ci: enable intel devices automatically in MR pipelines
- gitlab-ci: add python script to submit lava jobs
- gitlab-ci: enable testing on Intel Kaby Lake as experimental
- ci/lava: properly report test failure through sys.exit()
- ci/lava: do not save lava.yaml in the artifacts
Hans-Kristian Arntzen (2):
- radv: Take image alignment into account when allocating MUTABLE pool.
- radv: Allocate buffer list for MUTABLE descriptor types as well.
Heinrich Fink (6):
- softpipe: add missing sentinel to debug option array
- llvmpipe: unmap display target of shader image/sampler
- softpipe: unmap display target of shader sampler
- llvmpipe: do not leak map of display target in fs setup
- llvmpipe: do not leak display target mapped ptr in cs setup
- gbm/dri: Fix leaking bo memory on failure path
Hoe Hao Cheng (15):
- vulkan/util: generate vk_dispatch_table that combines all dispatch tables
- nir: define NIR_ALU_MAX_INPUTS
- zink: remove variable length arrays in ntv
- zink: introduce vk_dispatch_table
- zink/codegen: split commands into three groups
- zink/codegen: add zink_verify_*_extensions()
- zink: slight refactor of load_device_extensions()
- zink: use the dispatch tables
- zink/codegen: allow conditional enabling of instance extensions
- zink/codegen: clean the constructor of Extension up
- zink: do not fail when EXT_calibrated_timestamps is unavailable
- zink: move extension function verification to when it is used
- zink: zero-init structs with ISO C
- zink: standardize zero-init code style
- zink: make codegen compatible with python 3.5
Hubert Jasudowicz (1):
- docs/egl: Add missing backticks
Hyunjun Ko (6):
- turnip: prep work for timeline semaphore support
- turnip: Implement VK_KHR_timeline_semaphore.
- turnip/kgsl: Fix to build on android.
- turnip: add missing VKAPI_ATTR/CALL
- turnip: Copy command buffers to deferred submit request
- turnip/kgsl: new flag TU_USE_KGSL
Iago Toral Quiroga (118):
- v3dv: avoid redundant BO job additions for textures and samplers
- v3dv: avoid redundant BO job additions for UBO/SSBO
- v3dv: avoid redundant BO job additions for spill / shared BOs
- v3dv: optimize a few cases of BO job additions
- v3dv: use a bitfield to implement a quick check for job BO tracking
- v3dv: fix descriptor set limits
- v3dv: fix array sizes when tracking BOs during uniform setup
- v3dv: don't use a dedicated BO for each occlusion query
- v3dv: fix sRGB blending workaround
- v3dv: improve dirty descriptor set state tracking
- v3dv: dirty viewport doesn't affect fragment shaders
- v3dv: better tracking of dirty push constant state
- vulkan/wsi: give drivers the option to decide if they need to blit
- v3dv: implement wsi hook to decide if we can present directly on device
- compiler/nir: add a divergence analysis option for non-uniform workgroup id
- v3dv: choose a larger CSD supergroup size if possible
- broadcom/compiler: track if a shader has control barriers in prog_data
- v3dv: limit supergroup size in presence of TSY barriers
- broadcom/common: move CSD supergroup sizing to a common helper
- v3d: choose a larger CSD supergroup size if possible
- broadcom/compiler: add a loop unrolling pass
- v3dv: setup loop unrolling
- v3d: move NIR compiler options to GL driver
- broadcom/compiler: add a compiler strategy to disable loop unrolling
- broadcom/compiler: refactor compile strategies
- broadcom/compiler: specify maximum thread count in compile strategies
- v3d: enable NIR loop unrolling
- v3d: re-enable GLSL loop unrolling
- broadcom/compiler: change register allocation policy for accumulators
- broadcom/compiler: move vertex shader output handling to its own function
- broadcom/compiler: implement non-uniform offset on vertex outputs
- broadcom/compiler: make vir_VPM_WRITE_indirect handle non-uniform offsets
- broadcom/compiler: enable PIPE_SHADER_CAP_INDIRECT_OUTPUT_ADDR
- broadcom/compiler: don't use nir_src_is_dynamically_uniform
- v3dv: don't lower indirect derefs on output variables
- broadcom/compiler: don't unroll due to indirect indexing of outputs
- v3d: disable GLSL loop unrolling again
- broadcom/compiler: clarify PIPE_SHADER_CAP_INDIRECT_INPUT_ADDR setting
- broadcom/compiler: don't emit TLB loads for components that don't exist
- broadcom/compiler: consider RT component size when lowering logic ops in Vulkan
- broadcom/ci: update fail list for v3dv
- v3d: take TLB blit framebuffer dimensions from smallest surface dimensions
- v3dv: implement VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_POINT_CLIPPING_PROPERTIES
- v3dv: fix texture_size()
- v3dv: allow creating uncompressed views from compressed images and vice versa
- v3dv: expose VK_KHR_maintenance2
- v3dv: define V3D_MAX_BUFFER_RANGE
- v3dv: implement VK_KHR_maintenance3
- v3dv: implement VK_KHR_bind_memory2
- v3dv: implement VK_KHR_get_memory_requirements2
- v3dv: keep track of whether an image may be backed by external memory
- v3dv: implement VK_KHR_dedicated_allocation
- v3dv: trivially handle VK_STRUCTURE_TYPE_EXPORT_MEMORY_ALLOCATE_INFO_KHR
- v3dv: add v3dv_GetImageSparseMemoryRequirements back
- v3dv: implement vkCmdDispatchBase
- v3dv: create a helper for image creation
- v3dv: implement interactions of VK_KHR_device_group with VK_KHR_swapchain
- v3dv: implement VK_KHR_device_group
- v3dv: don't keep an open file descriptor for imported fences/semaphores
- v3dv: implement external semaphore/fence extensions
- v3dv: increase number of supported SSBOs
- v3dv: expose KHR_relaxed_block_layout
- v3dv: document VK_KHR_relaxed_block_layout as implemented
- v3dv: expose VK_KHR_storage_buffer_storage_class
- v3dv: refactor descriptor updates
- v3dv: implement VK_KHR_descriptor_update_template
- v3dv: fix incorrect render area setup
- v3dv: expose KHR_variable_pointers
- v3dv: don't lower vulkan resource index result to scalar
- v3dv: implement VK_KHR_get_display_properties2
- v3dv: handle Vulkan 1.1 feature and property queries
- v3dv: don't support VK_IMAGE_CREATE_BLOCK_TEXEL_VIEW_COMPATIBLE_BIT
- Revert "v3dv: allow creating uncompressed views from compressed images and vice versa"
- v3dv: expose VK_KHR_shader_non_semantic_info
- v3dv: implement VK_EXT_index_type_uint8
- v3dv: implement vkCmdBlitImage2KHR
- v3dv: implement vkCmdCopyBuffer2KHR
- v3dv: implement vkCmdCopyBufferToImage2KHR and vkCmdCopyImageToBuffer2KHR
- v3dv: implement vkCmdCopyImage2KHR
- v3dv: implement vkCmdResolveImage2KHR
- v3dv: expose VK_KHR_copy_commands2
- v3dv: remove const qualifier for resource pointer in view objects
- broadcom/compiler: implement nir_intrinsic_load_subgroup_id correctly
- broadcom/compiler: lower nir_intrinsic_load_num_subgroups
- broadcom/compiler: add FLAFIRST and FLNAFIRST opcodes
- broadcom/compiler: implement more subgroup intrinsics
- broadcom/compiler: add a ntq_emit_cond_to_bool helper
- broadcom/compiler: add a set_a_flags_for_subgroup helper
- broadcom/compiler: track if a compute shader uses subgroup functionality
- broadcom/util: don't use compute supergroup packing with subgroups
- v3dv: expose correct subgroup size
- v3dv: expose support for basic subgroup operations
- broadcom/compiler: use nir_sort_variables_with_modes
- v3dv: account for dst offset of copy query results operations
- v3dv: always free pipeline stages after compiling
- v3dv: extend broadcom stages to include geometry
- v3dv: define a generic helper to create binning pipeline stages
- v3dv: add a few more broadcom shader stage helpers
- broadcom/compiler: track if geometry shaders write gl_PointSize
- v3dv: add support for geometry shaders to pipelines
- broadcom/compiler: create a helper for computing VPM config
- v3dv: emit state packets for geometry shaders
- v3dv: handle QUNIFORM_FB_LAYERS
- v3dv: fix copy buffer to image TFU path for 3D images
- broadcom/compiler: handle compact input arrays for geometry shaders
- broadcom/compiler: don't ignore constant offset on per-vertex input loads
- v3dv: implement layered attachment clears
- v3dv: remove fallback path for vkCmdClearAttachments
- v3dv: remove deferred vkCmdClearAttachments path
- broadcom/ci: update expected fails for v3dv after enabling geometry shaders
- v3dv: expose geometry shaders
- v3dv: fix push constant range for texel buffer copy pipelines
- v3dv: implement layered texel buffer copies using a geometry shader
- v3dv: allow batching texel buffer copies for 3D images
- v3dv: use defines for push constant offsets used by texel buffer copy shaders
- v3d: better scissor tracking
- broadcom/compiler: implement gl_PrimitiveID in FS without a GS
- v3dv: remove more dead clearing code
Ian Romanick (49):
- tgsi_exec: Fix NaN behavior of saturate
- tgsi_exec: Fix NaN behavior of min and max
- ci: Uprev piglit to b3a9fa345 ("framework/replay: Quote resource names before signing")
- tgsi_exec: Use C99 functions for min and max instead of open coding
- gallivm: Fix NaN behavior of min and max
- gallivm: Use range analysis to generate better fmin and fmax code
- gallivm: Use GALLIVM_NAN_RETURN_OTHER_SECOND_NONNAN for norm clamping
- gallivm: Remove unused GALLIVM_NAN_RETURN_NAN
- nir/algebraic: Remove some optimizations of comparisons with fsat
- nir/algebraic: Tautology replacements require sources be numbers
- nir/algebraic: Invert comparisons less often
- nir/algebraic: Equality comparison inversions require sources be numbers
- nir/algebraic: Mark some more comparison reductions exact
- nir/algebraic: Mark some more logic-joined comparison reductions as exact
- nir/algebraic: Rearrange some logic-joined comparisons and reduce
- nir/algebraic: Add algebraic opt for float comparisons with identical operands.
- util/format: Delete trailing whitespace
- dri: Fix typo before __DRI_IMAGE_COMPONENTS defines
- egl/dri2: Rely on drm-uapi for DRM_FORMAT defines
- mesa/st: Always call st_nir_lower_tex_src_plane if samplers were lowered
- nir/lower_tex: Add support for lowering Y41x formats
- util/format: Add Y41x formats
- egl/dri2: Add Y41x formats
- gallium/dri: Add Y41x formats
- util/format: Add Y21x formats
- egl/dri2: Add Y21x formats
- gallium/dri: Add Y21x formats
- intel/isl: Add mappings for PIPE_FORMAT_R8G8_R8B8_UNORM and PIPE_FORMAT_G8R8_B8R8_UNORM
- mesa: Add R8G8_R8B8 and G8R8_B8R8 formats
- nir/lower_tex: Add support for lowering YUYV formats
- gallium/dri: Allow use of R8G8_R8B8 for YUYV and G8R8_B8R8 for UYVY
- iris: Return correct enum names from fmt_swizzle
- iris: Silence warnings about implicit enum type conversions
- mesa/st: Don't assert !unify_interfaces in the passthrough edge flags case
- radeonsi: Use util_cpu_caps to detect number of CPUs
- util: Zero out all of mask in util_set_thread_affinity
- util: Change order of PIPE_OS_UNIX code in util_cpu_detect_once
- util: Trivial cleanup in the BSD code of util_cpu_detect_once
- util: Fix setting nr_cpus on some BSD variants
- util: Set util_cpu_caps.num_cpu_mask_bits based on total CPUs in the system
- util: Use maximum number of CPUs for determining cache topology
- util: Consider CPU affinity when detecting number of CPUs
- v3d: ci: Add KHR-GLES31.core.shader_image_load_store.basic-glsl-earlyFragTests to flakes
- intel/compiler: Add the ability to defer IP updates in backend_instruction::remove
- intel/compiler: Add cfg_t::adjust_block_ips() method
- intel/compiler: Update block IPs once in dead_code_eliminate
- intel/compiler: Update block IPs once in register_coalesce
- intel/compiler: Update block IPs once in opt_cmod_propagation
- nir/gcm: Clear out pass_flags before starting
Icecream95 (38):
- panfrost: Assert staging resource allocation was successful
- panfrost: Unset shared/scanout binding flags for staging resources
- pan/bi: Skip nir_opt_move/sink for blend shaders
- panfrost: Fix shader texture count
- pan/decode: Allow frame shader DCDs to be in another BO than the FBD
- pan/decode: Print errors when closing dump file
- pan/mdg: Fix calculation of available work registers
- panfrost: Remove incorrect comment
- panfrost: Fix viewport scissor for preload draws
- panfrost: Split panfrost_batch_submit to prevent stack overflows
- pan/bi: Add "lane_dest" modifier
- pan/bi: Replace lane0 modifier with lane_dest for load instructions
- panfrost: Make pan_select_crc_rt a non-static function
- panfrost: Always write reloaded tiles when making CRC data valid
- pan/bi: Add two tuples to a clause when needed with NOSCHED
- panfrost: Skip blit shader labelling if the buffer has no space
- panfrost: Fix polygon list size computations
- pan/mdg: Try scheduling load/store ops in pairs
- pan/decode: Flush the dump stream after decoding
- panfrost: Call abort() when aborting on fault
- panfrost: Use first_tiler to check if tiling is needed
- pan/mdg: Add a bundle ID to instructions
- pan/mdg: Reorder some code in mir_spill_register
- pan/mdg: Fill from TLS before spilling non-SSA nodes
- pan/mdg: Fix reading a spilt register in the bundle it's written
- pan/mdg: Add 16 bytes of padding to the end of shaders
- panfrost: Don't set dirty_mask for constant buffers
- pan/bi: Create a mask of UBOs that need to be uploaded
- pan/mdg: Create a mask of UBOs that need to be uploaded
- panfrost: Only upload UBOs when needed
- panfrost: Set bound dimensions to framebuffer size
- Revert "panfrost: Fix crc_valid condition"
- panfrost: Always use a fragment shader when alpha test is enabled
- panfrost: Fix GPU ID for t76x in get_perf_config
- panfrost: Fix full_threads calculation on v6
- pan/bi: Create a nop clause when the shader starts with ATEST
- panfrost: Initialise the blend equation in create_blend_state
- pan/mdg: Analyze helper termination after scheduling
Ilia Mirkin (29):
- nv50/ir: offset accesses to shared memory
- nv50/ir: refine limitation on load/store loading offsets, include atomics
- nv50/ir: "zero" register does not work with g[] memory
- nv50/ir: mark ATOM as having 3 arguments
- nv50/ir: wipe any info about memory when seeing a locking op
- nv50/ir: optimize shift of 0 bits
- nv50: pass surface/buffer parameters to shader via aux buffer
- nv50/ir: add surface op lowering
- nv50/ir: add lowering for shared atomics
- nv50: add compute invocations counter
- nv50: add remapping of buffers/images into unified space
- nv50: add support for doing membars
- nv50: add indirect compute support
- intel: fix MI builder for pre-gen7
- nv50: fix streamout queries
- nvc0: fix 3d images
- vdpau: allow state tracker to report a lower number of macroblocks
- nouveau: improve video limit reporting
- st/mesa: avoid enabling image/buffer/compute extensions for weak hardware
- mesa: relax ES 3.1 compute shader requirements
- st/mesa: properly encode OES_geometry_shader requirement
- mesa/get: allow image/buffer/atomic variables to be fetched in es3.1
- st/mesa: allow hardware to claim ES 3.1 without hw indirect draws
- nv50: expose images/buffers/compute
- nv50: expose GL ES 3.1 for nva3+ hardware
- mesa: always expose NV_image_formats and OES_shader_image_atomic
- mesa: also flush after compute dispatch when debug flag enabled
- nv50: use the no-mipmap texture type for 2d ms views
- st/mesa: always report the max samples as supported
Ishi Tatsuyuki (1):
- radv: ignore redundant variable descriptor counts (v2)
Italo Nicola (28):
- pan/mdg: fix midgard writemask encoding for stores
- util: add util_sign_extend
- pan/mdg: clean up redundant/unused variables in disassemble.c
- pan/mdg: rename dest_override to shrink_mode
- pan/mdg: improve outmod printing
- pan/mdg: refactor mir_pack_swizzle
- pan/mdg: add proper expand_mode enum
- pan/mdg: encode/decode expand_mode properly
- pan/mdg: add midgard_src_expand_mode validation
- pan/mdg: improve input modifier printing
- pan/mdg: improve swizzle decoding
- pan/mdg: fix/change ALU opcodes descriptions and add some missing ops
- pan/mdg: stop querying datatype by reading opcode name
- pan/mdg: print input data type for ALU opcodes
- pan/mdg: stop using size disambiguation suffixes
- pan/mdg: fix midgard.h indentation
- pan/mdg: improve mask decoding
- pan/mdg: remove register prefixes
- pan/mdg: print special alu arg outmods
- pan/mdg: misc cleanups
- pan/mdg: add helpers for load/store special read regs
- pan/mdg: improve ldst opcode names and add missing ops
- pan/mdg: print names of non-work registers
- pan/mdg: properly encode/decode ldst instructions
- pan/mdg: improve tex opcode decoding and add missing ops
- panfrost/ci: Improve coverage for T860
- virgl: implement EXT_multisampled_render_to_texture
- panfrost: fix GL_EXT_multisampled_render_to_texture regression
Iván Briano (2):
- intel/nir: Fix txs for null surfaces
- anv: fix feature/property/sizes reported for fragment shading rate
James Jones (18):
- gbm: Remove stat and refcount fields from gbm_device
- gbm: Inline load_backend function content
- gbm: Create device directly in find_backend
- gbm: Consolidate env var and default backend loops
- gbm: Give getenv backend override its own function
- gbm: Give gbm_device a reference to its backend
- gbm: Add gbm_core struct to export code to backends
- gbm: Move majority of gbmint.h to gbm_backend_abi.h
- gbm: Version the GBM backend interface
- gbm: Add backend ABI-check test
- gbm: Rename backend description list to builtin_backends
- loader: Factor out driver library loading code
- meson: Add a GBM backends search path build option
- gbm: Rename the DRI backend from gbm_dri.so to dri
- gbm: Put common device creation in a helper function
- gbm: Support dynamically loading named backends
- gbm: Load backend based on DRM device driver name
- loader: Handle failure to load DRI driver library
James Park (14):
- meson: Fix winflexbison warnings
- ac/surface: Move drm_fourcc.h to common header
- radv: Use ac_drm_fourcc.h
- meson: Add wrap for libelf on Windows
- meson: Disable libdrm for RADV on Windows
- meson: Disable MSVC warning 5105
- amd: Fix warnings around variable sizes
- radv: Add _WIN32 guard in radv_check_gpu_hangs
- radv: Fix unused label warning on Windows
- radv: Add <io.h> on Windows for missing close()
- draw/clip: Use NAN to make MSVC happy
- llvmpipe: Remove stray ## operator for MSVC
- ci: Update Windows image to build RADV
- vulkan: Support 32-bit "weak" symbols on MSVC
Jan Beich (1):
- anv: adjust headers for non-GNU after e9e1e0362b6c
Jason Ekstrand (139):
- intel/compiler: Don't insert barriers for NULL sources
- anv: Use the same re-order mode for streamout as for GS
- vulkan: Update the XML and headers to 1.2.177
- anv: Implement VK_EXT_provoking_vertex
- gallium: Add a transcode_astc driconf option
- intel/isl: There are seven aux states
- intel/isl: Fix isl_color_value_unpack to match the prototype
- intel/eu: SVB writes only happen on Gen6
- intel/fs: Stop using brw_dp_read/write_desc in Gen7+ only code
- intel/eu: Set message subtype properly for SIMD8 FB fetch
- intel/fs: Don't use pixel_z for Gen4-5 source_depth_to_render_target
- intel/nir: Set lower txs with non-zero LOD
- nir/builder: Move clamp helpers to nir_builder.h
- anv: Check offset instead of alloc_size for freeing surface states
- anv: Allow storage on all formats that support typed writes
- anv: Plumb the shader into push constant helpers
- anv: Support pushing shader constants
- anv: Push at most 32 regs for vec4 shaders
- intel/vec4: Don't spill fp64 registers more than once
- intel/vec4: Add some asserts to move_push_to_pull
- intel/vec4: Update nr_params in pack_uniform_registers
- intel/vec4: Set up push ranges before we emit any code
- intel/vec4: Add support for masking pushed data
- intel/vec4: Add support for UBO pushing
- nir: Add a nir_instr_move helper
- nir/gather_info: Expose a nir_intrinsic_writes_external_memory helper
- nir: Add a discard optimization pass
- intel/fs: Handle non-perspective-correct interpolation on gen4-5
- intel/nir,i965: Move HW generation check for UBO pushing to i965
- intel/vec4: Also use MOV_FOR_SCRATCH for swizzle resolves
- intel/isl: Fix isl_format_is_valid
- intel/fs/ra: Fix payload node setup for SIMD16 on Gen4-5
- ttn: Stop manually managing system_values_read
- anv: Require softpin on Gen8+
- anv: Make use_softpin compile-time in genX code
- anv: Handle OOM in the pinned path in anv_reloc_list_add
- anv: Add a helper to add a BO to the batch list without a reloc
- anv: Make anv_batch_emit_reloc inline and optimize SKL+
- anv: Fast-path surface relocs when we have softpin
- anv: Optimize anv_address_physical when ANV_ALWAYS_SOFTPIN
- anv/blorp: Optimize addresses/relocations when ANV_ALWAYS_SOFTPIN
- iris: Use isl_surf_get_image_surf instead of hand-rolling it
- iris: Move target_to_isl_surf_dim to iris_resource.c
- intel/isl: Add a isl_surf_get_image_offset_B_tile_el helper
- intel/blorp: Use isl_surf_get_image_offset_B_tile_el in ccs_ambiguate
- intel/isl: Make the offset helpers four dimensional
- intel/isl: Make tile logical extents four dimensional
- intel/isl: Use a 4D physical total extent for size calculations
- i965: Use nir_lower_passthrough_edgeflags
- anv: Aggressively no-op Flush/InvalidateMappedMemoryRanges
- docs: Begin documenting ISL
- isl: Document more members of isl_surf
- docs/isl: Document ISL's units
- docs/isl: Add detailed documentation about isl formats
- docs/isl: Add detailed documentation about tiling on Intel GPUs
- docs/isl: Add detailed documentation about CCS compression
- util: Move the 4x4 matrix inverse function to u_math
- crocus: Drop extra_aux support
- nir,amd: Suffix nir_op_cube_face_coord/index with _amd
- nir,panfrost: Suffix fsat_signed and fclamp_pos with _mali
- nir,vc4: Suffix a bunch of unorm 4x8 opcodes _vc4
- vulkan: Update the XML and headers to 1.2.182
- nir: Require vectorized ALU ops to be all-or-nothing
- nir,docs: Add docs for NIR ALU instructions
- nir: Document all the ALU opcodes
- docs,isl: Document Sandy Bridge HiZ/stencil
- editorconfig: Use 3-space tabs for .rst
- docs/nir: Use 3-space tabs
- docs/isl: Consistently use 3-space tabs
- spirv: Create acceleration structure and shader record variables
- anv: Add minimal boilerplate for VK_KHR_acceleration_structure
- anv: Add stub support for acceleration structures
- anv: Add support for binding acceleration structures
- anv: Add minimal boilerplate for VK_KHR_ray_tracing_pipeline
- anv: Get ready for more pipeline stages
- anv: Add a ray-tracing pipeline object
- anv: Add support for binding ray-tracing pipelines
- anv,iris: Move the SHADER_RELOC enums to brw_compiler.h
- intel/compiler: Generalize shader relocations a bit
- intel/compiler: Add a U32 reloc type
- intel/fs: Add support for compiling bindless shaders with resume shaders
- intel/rt: Use reloc constants for the resume SBT
- anv: Disallow UBO pushing for bindless shaders
- nir/apply_pipeline_layout: Handle bindless shaders
- anv: Support fetching descriptor addresses from push constants
- anv: Compile ray-tracing shaders
- anv: Compile trivial return and trampoline shaders
- intel/fs: Don't pull CS push constants if uses_inline_data
- anv: Create and return ray-tracing pipeline SBT handles
- anv: Compute scratch sizes for ray-tracing pipelines and shader groups
- anv: Add support for vkCmdSetRayTracingPipelineStackSizeKHR
- anv: Allow _anv_combine_address with a NULL batch
- anv: Make anv_address::offset 64-bit
- anv: Implement vkCmdTraceRays and vkCmdTraceRaysIndirect
- isl: Assert some iris invariants in isl_surf_get_ccs_surf
- isl: Take a hiz_or_mcs_surf in isl_surf_supports_ccs
- isl,iris: Move the extra_aux_surf logic into iris
- isl,docs: Add a chapter on AUX state tracking
- docs/isl: Improve the bit[6] swizzling section of the tiling chapter
- include/drm-uapi: bump headers
- anv: Claim to be a discrete GPU if has_lmem
- util: Add an implementation of qsort_r for non-GNU platforms
- nir: Add a function for sorting variables
- intel/genxml: Add SURFTYPE_SCRATCH on GFX version 12.5
- intel/isl: Add support for scratch buffers
- intel/fs: Implement spilling on XeHP
- intel/fs: Implement load/store_scratch on XeHP
- intel/genxml: Add new ScratchSpaceBuffer fields on GFX version 12.5
- iris: Add a MEMZONE_BINDLESS and uploader
- iris: Add support for scratch on XeHP
- anv: Add support for scratch on XeHP
- intel/genxml: Remove old scratch fields on GFX version 12.5
- iris/bufmgr: Stop changing mapping modes on buffers
- intel/devinfo: Add a has_lsc bit
- intel/compiler: Add LSC to messages brw_ir_performance
- intel/fs: Lower uniform pull constant load message to LSC dataport
- docs/isl/tiling: Fix swizzle pattern for X-tiling
- intel/isl: Pull the uncompressed surface view code from anv
- intel/blorp: Adjust the compressed copy rectangle before convert_to_single_slice
- intel/blorp: Use isl_surf_get_uncompressed_surf
- intel/isl: Add more cases to isl_surf_get_uncompressed_surf
- iris: Don't leak the surface if uncompressed re-interp fails
- iris: Use isl_surf_get_uncompressed_surf
- nir: Drop nir_ssa_def::name and nir_register::name
- android: Drop the Android.mk build system
- android: Restore android/Android.mk
- nir/lower_subgroups: Pad ballot values before bitcasting
- docs: Add docs for running a local Mesa build
- mailmap: Update for Emma's new e-mail address
- Convert a few files to UTF-8
- mailmap: Add two more lines for Alyssa Rosenzweig
- glsl: Delete lower_texture_projection
- anv/allocator: Use list->u64 in free_list_push
- iris: Re-emit MEDIA_VFE_STATE for variable group size shaders
- anv: Handle errors properly in anv_i915_query
- intel: Pull anv_i915_query into common code
- anv: Use intel_i915_query_alloc for memory regions
- iris: Use intel_i915_query for meminfo
- nir/lower_tex: Rework invalid implicit LOD lowering
Jeremy Huddleston (2):
- libgl-xlib: Set darwin-versions
- libgl-xlib: Add missing dep_x11 dependency
Jeremy Newton (1):
- Update libva requirement
Jesse Natalie (44):
- microsoft/spirv_to_dxil: Lower samplers from deref to index
- microsoft/spirv_to_dxil: Lower loads/stores to DXIL
- microsoft/compiler: Support raw SRVs/UAVs through dxil_module_get_res_type
- microsoft/compiler: Support arrays of UBOs
- microsoft/compiler: Emit CBVs via variables for Vulkan
- microsoft/compiler: Emit SSBO variables
- microsoft/compiler: Split Vulkan resource_index / descriptor processing
- microsoft/compiler: Better support UBO/SSBO references to descriptors
- microsoft/compiler: Store nir_shader in the ntd_context
- microsoft/compiler: Support raw SRVs in addition to typed SRVs
- microsoft/compiler: Propagate access when lowering SSBO loads
- microsoft/clc: If local size isn't specified either in the shader or at runtime, set it to (1,1,1)
- gallium: Define PIPE_ARCH_AARCH64 for MSVC arm64 builds
- nir: Fix MSVC warning C4334 (32bit shift cast to 64bit)
- d3d12: Fix MSVC warning C4334 (32bit shift cast to 64bit)
- microsoft/clc: Fix MSVC unreferenced variable warnings
- microsoft/clc: Fix undeclared function warning
- microsoft/compiler: Fix MSVC warning C4334 (32bit shift cast to 64bit)
- shader_enums: Fix MSVC warning C4334 (32bit shift cast to 64bit)
- gallium/aux: Fix MSVC warning C4334 (32bit shift cast to 64bit)
- llvmpipe: Fix MSVC warning C4334 (32bit shift cast to 64bit)
- xmlconfig: Fix MSVC warning C4334 (32bit shift cast to 64bit)
- CI: Windows: Bump warning level to W3 (except for zlib)
- microsoft/compiler: Remove hardcoded limits on numbers of resource arrays
- microsoft/compiler: Remove assert-only resource size or usage tracking
- microsoft/compiler: For Vulkan environment, don't create resource handles upfront
- vtn: Propagate access data that's present on all struct members to the struct itself
- vtn: Propagate access data from UBO/SSBO/push constant types to variables of that type, not just their pointers
- nir: Rename nir_lower_cl_images_to_tex, replace 'cl' with 'readonly'
- nir_lower_readonly_images_to_tex: Support non-CL semantics
- nir_lower_readonly_images_to_tex: Use nir_shader_lower_instructions
- microsoft/compiler: Treat read-only SSBOs as SRVs
- microsoft/spirv_to_dxil: Treat read-only storage images as SRVs
- d3d12, microsoft/compiler: Use SRV/sampler variable binding data
- microsoft/compiler: Rewrite sampler splitting pass to be smarter and handle derefs
- microsoft/compiler: Fix function signature for bufferStore to support overloads
- microsoft/compiler: Map descriptor set -> binding space
- microsoft/compiler: Handle unbounded arrays
- llvmpipe: Fix optimization loop to actually loop
- nir: Add relaxed 24bit opcodes
- vtn: Use relaxed 24bit opcodes for CL 24bit math
- microsoft/compiler: Change behavior for emitting inexpressible barriers
- nir_lower_readonly_images: Clear variable data when changing the type
- mesa/main: Check for fbo attachments when importing EGL images to textures
John Bates (1):
- add execmem build option
Jonathan Marek (4):
- freedreno/registers: define REG_DSI_CPHY_MODE_CTRL
- tu: remove workaround for conditional rendering + hw binning
- freedreno/a6xx: larger gmem_page_align from tile align instead of gpu id
- freedreno/common: unhardcode CCU color cache offset
Jordan Justen (20):
- Revert "intel/compiler: Silence unused parameter warning in update_inst_scoreboard"
- intel/eu: Allow 64-bit registers on XeHP.
- intel/fs: Disable 3-src immediates on XeHP.
- intel/fs: End compute shader with message gateway on XeHP.
- intel/compiler: Lower txd for 3D samplers on XeHP.
- intel/compiler: Fix INTEL_DEBUG=hex
- commit_in_branch_test.py: Rename branch master to main
- bin/pick: Rename master branch to main
- .gitlab-ci.yml: Use main branch for gitlab ci
- issue_templates/Bug Report: Rename master branch to main
- docs/releasing.rst: Rename master branch to main
- docs: Rename master branch to main
- mesa: NOTE! Default branch is now main
- intel/isl: Add Wa_22011186057 to disable CCS on ADL GT2 A0
- intel/dev: Add device info for ADL GT2
- intel: Add 2 ADL-S pci-ids
- intel/gen125.xml: Drop GPGPU_WALKER
- intel/devinfo: Add has_local_mem
- iris/bufmgr: Align vma addresses to 64K for local memory
- intel/dev: Set has_local_mem for DG1
Jose Maria Casanova Crespo (5):
- v3d: YUV formats at is_dmabuf_modifier_supported are external_only
- v3d: YUV formats at query_dmabuf_modifiers are external_only
- v3d: DRM_FORMAT_MOD_BROADCOM_SAND128 only available for NV12 format.
- ci/v3d: Update piglit expectations.
- v3d/driconf: Expose non-MSAA texture limits for mutter and gnome-shell
Joshua Ashton (5):
- radv: Handle unnormalized samplers in YCbCr lowering
- venus: Fix zero-initialized fd causing apps to hang/crash
- driconf: Add more workarounds for Teardown
- llvmpipe: Handle NULL views in llvmpipe_cleanup_stage_sampling
- lavapipe: Use common Vulkan format helpers
Joshua Watt (1):
- v3d, vc4: Fix dmabuf import for non-scanout buffers
José Fonseca (8):
- lavapipe: Fix lvp_execute_cmds' pipe_stream_output_target leak.
- lavapipe: Fix lvp_pipeline_compile's nir_xfb_info leak.
- wgl: Remove opengl32.mingw.def.
- draw: Allocate extra padding for extra shader outputs.
- draw: Plug leak when combining tessellation with primitive assembly.
- d3d10umd,d3d10sw: Initial import.
- d3d10sw: Add a sanity test.
- d3d10umd: Avoid duplication in CreateDevice.
Juan A. Suarez Romero (42):
- v3dv: avoid dereferencing null value
- ci: support KHR-GLES testing
- ci/v3d: add KHR-GLES test jobs
- ci/llvmpipe: run KHR-GLES2.* tests
- ci/softpipe: run KHR-GLESxx tests
- iris: hook up memory object creation from handle
- iris: hook up resource creation from memory object
- iris: enable GL_EXT_memory_object feature
- ci/broadcom: update expected results
- ci/vc4: add KHR-GLES2.* job test
- ci/broadcom: add EGL testing jobs
- v3dv: check returned values
- ci/v3d: execute all piglit tests
- v3dv/pipeline_cache: bail out in case of error
- ci/v3d: fix typo in job name
- ci/v3dv: update flakes
- ci/baremetal: propagate ASAN_OPTIONS to devices
- ci/broadcom: update expected results
- v3d: rename header include guards
- v3d: rename VC5 enums and definitions
- broadcom/qpu: rename from VC5 to V3D
- broadcom/simulator: change references to VC5
- v3dv: rename VC5 to V3D
- v3dv: check dest bitsize in color blit
- util/hash_table: do not leak u64 struct key
- ci/broadcom: update expected results
- v3d: fix resource leak in error path
- st/mesa: fix pipe resource leak
- broadcom/compiler: fix dynamic-stack-buffer-overflow error
- ci: Update VK-GL-CTS to 1.2.6.1
- ci/broadcom: update expected results
- vc4: initialize array
- ci/v3dv: update expected results
- ci/broadcom: unset manual jobs
- ci/v3dv: test v3dv in arm64 environment
- broadcom/ci: Report flakes on IRC
- ci/vc4: update piglit failures
- ci: update VK-GL-CTS to 1.2.6.2
- broadcom/compiler: emit TMU flush before a jump
- v3dv: assert job->cmd_buffer is valid
- broadcom: remove v3dv3 from neon library
- gallium/hud: initialize query
Kai-Heng Feng (1):
- iris: Avoid abort() if kernel can't allocate memory
Karol Herbst (18):
- clover/llvm: handle Fixed vs Scalable vectors explicitly starting with llvm-11
- util/format: fix value declarations for big endian
- nv50/query: fix stringop-overflow gcc warning
- nvc0: fix implicit-fallthrough gcc warning
- clover/memory: fix data race in buffer subclasses
- nouveau: fix race in nouveau_screen_get_name
- nouveau/mm: pass mm_bucket to mm_slab_new
- nouveau/mm: remove unused nouveau_mm_allocation.next field
- nv50/ir: when constant folding shl(mul, a) we need to copy muls type
- nv50/ir: don't optimize shl(mul_hi, a) to mul_hi
- nv50/ir/ra: fixes upcoming barrier file
- nv50/ir: add barrier and thread_state files
- gv100/ir: add support for barrier thread state files for OP_CVT
- gm107/ir: emit barrier sources for quadon/pop
- gv100/ir: fix quadop/pop lowering
- nv50/ir: fix surface lowering when values get shared across operations
- nv50/ir/nir: fix smem size for GL
- nv30: fix emulated vertex index buffers
Keith Packard (1):
- vulkan/x11: Mark present complete using serial instead of MSC
Kenneth Graunke (29):
- iris: only flush the render cache for aux changes, not format changes
- isl: Work around NVIDIA and AMD display pitch requirements
- i965: Don't advertise Y-tiled modifiers for scanout buffers on Gfx8-
- iris: Don't advertise Y-tiled modifiers for scanout buffers on Gfx8
- iris: Replace no_gpu flag with PIPE_MAP_DIRECTLY
- iris: Promote to MAP_DIRECTLY when required before NULL return
- iris: Delete a comment suggesting we use tiled staging buffers
- iris: Make an iris_bo_is_external() helper and use it in a few places
- iris: Track imported vs. exported status separately
- iris: Use staging blits for reads from uncached buffers.
- iris: Use staging blits for transfers involving imported BOs
- iris: Assert on mapping a tiled buffer without MAP_RAW
- iris: Drop fallback GEM_MMAP_GTT if GEM_MMAP with I915_MMAP_WC fails
- iris: Delete GTT mapping support
- iris: Pick a single mmap mode (WB/WC) at BO allocation time
- iris: Use bo->mmap_mode in transfer map read check
- iris: Add a flags argument to iris_bo_alloc()
- iris: Add an alignment parameter to iris_bo_alloc()
- iris: Only use SET/GET_TILING when exporting/importing BOs
- iris: Add a BO_ALLOC_SMEM flag for allocating from system memory
- anv: Fix dynamic primitive topology for tess on Gfx7.x too
- iris: Stop calling I915_GEM_SET_CACHING on discrete GPUs
- iris: Fail BO allocation if we can't enable snooping properly.
- iris: Delete unused bo->cache_coherent flag
- vulkan/wsi: Fix prime blits to use system memory for the destination
- iris: Reduce SSBO alignment requirements from 64B to 4B
- crocus: Reduce SSBO alignment requirements from 64B to 4B.
- iris: Force device local memory for u_upload_mgr buffers
- iris: Use simple_mtx in the bufmgr.
Leo Liu (9):
- frontends/va: add VASurfaceAttribUsageHint attribute
- frontends/va: fix multi planes for external memory type
- frontends/va: use pipe buffer map instead of texture map
- radeon/vcn/enc: use surface swizzle mode instead of linear
- radeonsi: add PIPE_FORMAT_P010 for HEVC Main10 profile to encode param
- radeonsi: separate video hw info based on HW engine individually
- frontends/va: use the correct entrypoint to get config attributes
- frontends/va: include the profile queries for encoder as well
- frontends/va: use the entrypoint from context instead of the hard-coded one
Lepton Wu (3):
- virgl: move new added field to the end.
- Revert "virgl: Cache depth and stencil buffers"
- gallium: Reset {d,r}Priv in dri_unbind_context
Lionel Landwerlin (59):
- anv: fix 3DSTATE_MULTISAMPLE emission on gen8+
- anv: disable baked in pipeline bits from dynamic emission path
- vulkan/util: cast enums to int64_t in switch
- spirv: fix uToAccelerationStructure handling
- spirv: fixup pointer_to/from_ssa with acceleration structures
- vulkan: bump headers/registry to version 1.2.175
- anv: drop extension check for dynamic state
- anv: prepare pipeline for delayed emission of color writes
- anv: implement VK_EXT_color_write_enable
- anv: reuse define for number of render target assert
- vulkan/wsi/display: don't report support if there is no drm fd
- i965/bufmgr: fix invalid assertion
- intel/dev: printout correct subslice/dualsubslice name
- intel/genxml: Add coarse pixel shading instructions
- intel/decoder: decode CPS_STATE
- intel/compiler: make sure we keep the lowest dispatch limit
- intel/compiler: rework message descriptors for render targets
- intel/compiler: use existing helpers to pull bits of descriptors
- intel/compiler: handle coarse pixel in render target writes descriptors
- intel/compiler: add support for fragment shading rate variable
- intel/compiler: add support for fragment coordinate with coarse pixels
- intel/compiler: add coarse pixel offset on Gfx12.5+
- intel/compiler: add restrictions related to coarse pixel shading
- anv: implement VK_KHR_fragment_shading_rate
- isl: document format fields
- intel/fs: use the final destination type for regioning restrictions
- intel/mi_builder: fix resolve call
- anv: fix perf query pass with command buffer batching
- anv: handle spirv parsing failure
- iris: fix assert to reflect correct limit for encoded size
- intel/perf: allow opening perf stream with no context filtering
- intel/perf: allow metric sets to be loaded with on OA reports
- anv: fixup physical device properties of fragment shading rate
- intel/fs: make sure shuffle is lowered to supported types
- intel/perf: update gen9/11 TestOa configs
- intel/perf: update Gen11 RenderBasic programming
- intel/perf: update Gen11 RenderBasic programming
- intel/perf: add EHL availability condition to HDCAndSF counters
- intel/perf: update Gen9/11 programming for AsyncCompute
- intel/perf: rename metric descriptions
- anv: implement VK_EXT_physical_device_drm
- blorp: add blorp string in shader keys
- anv: cache raytracing trampoline shader
- anv: store more RT shader data in pipeline_stage object
- anv: move trivial return shader to device
- anv: implement caching for ray tracing pipelines
- intel/rt: switch to common pass for shader calls lowering
- nir: drop the btd_resume_intel intrinsic
- nir: use a more fitting index for btd_stack_push_intel
- anv: bound checks buffer memory binding in debug builds
- anv: allocate bigger batches as we grow command buffers
- intel/perf: use the right popcount for 64bits
- intel/compiler: Track latency/perf of LSC fences
- isl: fix mapping of format->stringname
- loader/dri3: create linear buffer with scanout support
- nir/lower_shader_calls: adding missing stack offset alignment
- anv: fix submission batching with perf queries
- drm-shim: implement stat/fstat when xstat variants are not there
- intel/disasm: fix missing oword index decoding
Lucas Stach (9):
- etnaviv: fix vertex sampler setup
- dri: add loader_dri_create_image helper
- loader/dri3: convert to loader_dri_create_image
- loader/dri: hook up createImageWithModifiers2
- gallium/dri: copy image use in dup_image
- dri: don't call modifier interfaces when modifiers_count is 0
- frontend/dri: add EXPLICIT_FLUSH hint in dri2_resource_get_param
- etnaviv: remove double assignment of surface->texture
- etnaviv: flush used render buffers on context flush when necessary
Luis Felipe Strano Moraes (2):
- meson: print information about layers being built as part of summary
- overlay_layer: add missing undef
Maksim Sisov (2):
- iris: export GEM handle with RDWR access rights
- i965: export GEM handle with RDWR access rights
Marcin Ślusarz (28):
- intel/tools: remove unused macros
- intel/batch_decoder: set foreground color of decoded instructions
- i965: fully populate perf_config before using it to initialize perf_context
- iris: fully populate perf_config before using it to initialize perf_context
- intel/perf: move calculation of period_exponent to perf ctx init
- gallium/u_threaded: implement INTEL_performance_query hooks
- gallium/u_threaded: offload begin/end_intel_perf_query
- nir: handle float atomics in nir_gather_info
- nir: handle float atomics in nir_lower_memory_model
- intel: simplify is_haswell checks, part 1
- intel: simplify is_haswell checks, part 2
- i965: simplify gfx version checks
- intel/isl: replace format_gen by verx10
- intel/disasm: decode/describe more send messages
- intel/disasm: remove useless space after "("
- iris: fix error message on I915_GEM_[GS]ET_TILING failure
- intel/decoder: add assert for register size
- anv: fix potential integer overflows
- intel/tools: fix left shift overflow on 32-bit
- intel/tools: fix int-to-pointer/pointer-to-int cast warnings on 32-bit
- intel/tools: fix invalid type in argument to printf format specifier
- intel/tools: fix potential memory leaks
- intel/blorp: initialize BLEND_STATE using braced initializer list
- intel/fs: use stack for temporary array
- anv: keep descriptor set's address directly in anv_descriptor_set
- anv: handle push descriptor sets when they are sent with push constants
- anv: drop unused argument of anv_descriptor_set_address
- intel/compiler: document register types
Marek Olšák (190):
- ci: don't build clover with LLVM 9 on radeonsi because it's unsupported
- amd: drop support for LLVM 9
- amd: drop support for LLVM 10
- amd: remove some references to older LLVM versions in comments
- amd/registers: fix the kernel header parser with latest headers
- amd/registers: clean up gfx103.json
- amd/registers: rename IMG_FORMAT to GFX10_FORMAT to disambiguate the meaning
- radeonsi: don't decompress DCC for float formats in si_compute_copy_image
- radeonsi: fix incorrect comments in culling code and NIR lowering
- radeonsi: fix automatic DCC retiling after DCC clear and DCC decompression
- radeonsi: fix automatic DCC retiling after compute image stores
- gallium/util: add easy profiling helpers using TIME_ELAPSED queries
- Revert "st/pbo: use cso_set_vertex_buffers_and_elements() for st_pbo_draw"
- Revert "ci/radeonsi: Add expected failures due to #4674 having slipped in"
- ac/surface: document more meta equation dependencies
- radeonsi: make the gfx9 DCC MSAA clear shader depend on the number of samples
- radeonsi: remove the separate DCC optimization for Stoney
- amd: addrlib update for April
- gallium: renumber PIPE_MAP_* enums to remove holes
- gallium: remove 4 bytes from pipe_transfer
- gallium+(u_threaded,r300,r600,radeonsi): move transfer offset into pipe_transfer
- util: print CPU caps in release builds too
- util: fix (re-enable) L3 cache pinning
- Revert "gallium/u_threaded: align batches and call slots to 16 bytes"
- gallium/u_threaded: move base_valid_buffer_range to transfer where it belongs
- gallium/u_threaded: handle sampler views == NULL better
- gallium/u_threaded: rewrite slot layout to reduce wasted space
- gallium/u_threaded: don't set resource pointers to NULL after driver calls
- gallium/u_threaded: fix 32-bit breakage due to incorrect pointer arithmetic
- gallium/u_threaded: pass last into and return call size from execute callbacks
- gallium/u_threaded: merge draws in tc_call_draw_single
- gallium/u_threaded: add callbacks and documentation for resource busy checking
- gallium/u_threaded: track whether TCS, TES, or GS have ever been used
- gallium/u_threaded: query shader resource limits
- gallium/u_threaded: add buffer lists - tracking of buffers referenced by tc
- gallium/u_threaded: add driver-internal flush tracking for buffer lists
- gallium/u_threaded: don't invalidate idle buffers
- gallium/u_threaded: always map idle buffers unsynchronized
- gallium/u_threaded: don't reference resource in pipe_transfer
- util: add thread-safe version of idalloc
- zink: don't set u_resource_vtbl
- gallium: remove unused u_default_transfer_unmap
- gallium: remove resource_get_handle from u_resource_vtbl
- gallium: remove transfer_flush_region from u_resource_vtbl
- radeonsi: stop using u_resource_vtbl::resource_destroy
- r300: stop using u_resource_vtbl::resource_destroy
- nouveau: stop using u_resource_vtbl::resource_destroy
- i915g: stop using u_resource_vtbl::resource_destroy
- virgl: stop using u_resource_vtbl::resource_destroy
- svga: stop using u_resource_vtbl::resource_destroy
- r600: stop using u_resource_vtbl::resource_destroy
- gallium: remove u_resource_vtbl::resource_destroy
- gallium: split transfer_(un)map into buffer_(un)map and texture_(un)map
- gallium: remove u_resource_vtbl::transfer_(un)map
- gallium: remove empty structure u_resource_vtbl
- gallium: remove structure u_resource
- radeonsi: simplify the NGG culling vertex count heuristic
- amd: add Beige Goby support
- amd/registers: don't generate 32-bit register fields
- amd/registers: regenerate json files without 32-bit register fields
- amd: fix incorrect addrlib comment for HTILE equations
- ac/gpu_info: set has_zero_index_buffer_bug for Navi12 too
- ac/llvm: set target features per function instead of per target machine
- ac/llvm: expose set_range_metadata to more users
- ac/llvm: allow ac_build_optimization_barrier with SGPRs, pointers, and metadata
- ac/llvm: set range metadata on mbcnt and deduplicate get_thread_id
- ac/llvm: don't draw the primitive for the dummy export workaround for Navi1x
- winsys/amdgpu: don't hold a mutex while accessing is_shared
- radeonsi: remove unused SI_IMAGE_ACCESS_AS_BUFFER
- radeonsi: handle PIPE_CAP_MAX_VERTEX_BUFFERS
- radeonsi: add a gfx10 bug workaround for NOT_EOP
- radeonsi: fix a coherency issue when VS memory stores are not visible in PS
- radeonsi: always use the L2 LRU cache policy for faster clears and copies
- radeonsi: don't disable L2 caching for staging textures
- radeonsi: don't use GS fast launch with small instances
- radeonsi: fix the fast launch vert/prim thread counts if they are trimmed
- radeonsi: remove a twice duplicated workaround for VERT_GRP_SIZE
- radeonsi: re-enable fast launch with indexed tri strips because it doesn't hang
- radeonsi: improve generated culling code by adding optimization barriers
- radeonsi: change si_resource::alignment to alignment_log2 for better packing
- radeonsi: remove 8 bytes from si_resource, turn other 4 bytes into padding
- radeonsi: add a gfx10 hw bug workaround with the barrier before gs_alloc_req
- radeonsi: add missing threaded_resource_deinit calls in fail paths
- radeonsi: rewrite the prefix sum computation for shader culling
- radeonsi: allow changing the NGG subgroup size to 256 but don't change it yet
- radeonsi: generate buffer_id_unique for u_threaded_context
- radeonsi: implement threaded context callbacks for resource busy checking
- radeonsi: disable DFSM on gfx9 by default because it decreases performance a lot
- radeonsi: remove DFSM after we discovered how bad it is
- gallium/u_vbuf: add a fast path to skip refcounting for uploaded user buffers
- mesa: move _mesa_copy_vertex_attrib/buffer functions to their only use
- mesa: don't call _mesa_set_draw_vao in glPushClientAttrib
- mesa: optimize glPush/PopClientAttrib for GL_CLIENT_VERTEX_ARRAY_BIT
- mesa: optimize unreferencing VBOs in glPopClientAttrib
- mesa: don't call FLUSH_VERTICES in glPopClientAttrib
- mesa: don't save/restore VAO NumUpdates and IsDynamic to fix update tracking
- st/mesa: execute glFlush asynchronously if no image has been imported/exported
- gallium/u_threaded: don't update valid_buffer_range for read-only shader buffers
- gallium/u_threaded: clear valid buffer range only if it's not bound for write
- gallium/u_threaded: use tc_drop_resource_reference in call_draw_single_drawid
- gallium/u_threaded: merge draws faster by merging indexbuf unreferencing
- radeonsi: check is_buffer once instead of 4 times in si_set_sampler_view_desc
- radeonsi: use the restrict keyword to set sampler view descriptors faster
- radeonsi: don't clear register fields in si_set_mutable_tex_desc_fields
- radeonsi: restructure si_set_sampler_views for faster unbinding trailing slots
- radeonsi: remove no-op unref in si_set_constant_buffer
- radeonsi: set desc[3] of all buffer descriptors at context creation
- radeonsi: move a few functions from si_state_draw.cpp into si_gfx_cs.c
- radeonsi: compile si_state_draw.cpp for each gfx generation separately
- radeonsi: remove the chip_class dimension from the draw_vbo array
- radeonsi: remove -Wstrict-overflow=0 since it doesn't seem to be needed
- gallium/pb: change alignment to 32 bits
- shader_enums: change VERT_BIT back to the 32-bit shift
- glthread: change when glFlush flushes asynchronously
- st/mesa: fix an incorrect comment in st_context_flush
- st/mesa: move the st_flush_bitmap_cache call into st_flush
- mesa: add gallium flush_flags param into ctx->Driver.Flush
- mesa: move _mesa_notifySwapBuffers into the x11 swrast driver
- mesa: execute glFlush asynchronously if no image has been imported/exported
- radeonsi: fix compile failures with SI_PRIM_DISCARD_DEBUG enabled
- radeonsi: use ac_build_bit_count instead of opencoding it
- radeonsi: fix incorrect counting of compute_num_verts_rejected
- radeonsi: fix multi draws for the prim discard CS
- ac/llvm: add a callback to ac_cull_triangle to generate code in inner-most block
- radeonsi: move the accepting code into the bbox cull branch in NGG cull code
- ac/surface/tests: fix RB counts
- ac/surface: don't set DCC_PIPE_ALIGN modifier bit for gfx10 with 1 RB
- radeonsi: restructure si_get_vs_vgpr_comp_cnt for readability
- radeonsi: merge 2 conditional blocks with same condition into 1 in culling code
- radeonsi: set more precise max_waves in NGG code
- radeonsi: remove incorrect comment about PA
- radeonsi: try to keep all VS input loads together for better perf
- radeonsi: don't compile TES and GS draw_vbo variants for the prim discard CS
- radeonsi: remove the Z culling option from the primitive discard CS
- radeonsi: drop gfx7 support from the prim discard CS to simplify code
- radeonsi: drop support for triangle fans from the prim discard CS
- radeonsi: skip buffer_atomic_add(ptr, n) when n=0 in the prim discard CS
- radeonsi: cleanup some primitive discard CS TODOs regarding instancing, etc.
- ac/llvm: don't set skip-uniform-regions to fix atomic.cmpswap
- mesa: unreference zombie buffers when creating buffers to lower memory usage
- radeonsi: document why VBO descriptors in user SGPRs are beneficial
- radeonsi: if shader culling culls all vertices, cull the primitive exports too
- radeonsi: remove incorrect comment about hangs in gfx10_ngg_gs_emit_epilogue
- radeonsi: don't use NGG culling on 1 RB chips
- ac/gpu_info: adjust the condition for use_late_alloc
- radeonsi: optimize set_inlinable_constants when they don't change
- st/mesa: don't track VS sampler views for st_draw_feedback in st_context
- st/mesa: don't track FS sampler views for bitmap/drawpix in st_context
- st/mesa: don't memset the sampler view array, don't init trailing slots to NULL
- st/mesa: sink _mesa_get_samplerobj into st_update_single_texture
- st/mesa: read Target only once in st_update_single_texture
- st/mesa: return sview from st_update_single_texture via return value, not param
- st/mesa: remove the const qualifier for a few st_sampler_view instances
- st/mesa: sink refcounting from st_get_sampler_views into st_sampler_view.c
- st/mesa: add a mechanism to bypass atomics when binding sampler views
- st/mesa: remove the sampler min_lod/max_lod value swap
- cso: disallow NULL sampler state templates in cso_single_sampler
- cso: update max_sampler_seen only once in cso_set_samplers
- cso: don't look up a sampler CSO if the last one is identical
- mesa: use atomics instead of mutexes for refcounting texture objects
- mesa: use atomics instead of mutexes for refcounting sampler objects
- mesa: use atomics instead of mutexes for refcounting renderbuffers
- mesa: remove mutex locking from a glBindTexture early out path
- mesa: translate into pipe_sampler_state in GL functions
- mesa: add LodBias quantization from st/mesa
- mesa: add IsBorderColorNonZero to skip border color update for st/mesa faster
- mesa: lower GL_CLAMP in texture and sampler functions instead of st/mesa
- radeonsi: remove the GDS variants of compute-based primitive discard
- radeonsi: change how the prim discard CS is enabled and splitting limits
- radeonsi: fix issues with draw-level splitting for the prim discard CS
- radeonsi: add optimal multi draws and draw-level splitting for prim discard CS
- radeonsi: move the accepting code into the bbox cull branch in prim discard CS
- radeonsi: drop smoothing quality to 4xAA for better performance
- ac/llvm: don't return a status from ac_cull_triangle because it's unused
- ac/llvm: rework how negative W affects culling to not call accept_func twice
- radeonsi: rewrite a confusing comment in si_upload_and_prefetch_VB_descriptors
- ac/surface/tests: fix the ARM build
- radeonsi,radv: fix a late alloc deadlock with <= 6 CUs per SA
- radeonsi: move an incorrectly placed comment about late alloc
- ac,radeonsi: move late alloc computation into common code and shader states
- radeonsi: enable uniform inlining by default
- util/idalloc: change num_elements to units of elements instead of bits
- util/idalloc: fold the size call into init
- util/idalloc: reserving an ID that already exists should be no-op
- util/idalloc: hide or remove unused public functions
- util/idalloc: add exists and foreach helpers
- util/idalloc: add util_idalloc_alloc_range
- radeonsi: don't expose no-attachment MSAA 16x on all 1 RB chips due to issues
- mesa: fix incorrect comment in draw_gallium_multimode
- st/mesa: always use PIPE_USAGE_STAGING for GL_MAP_READ_BIT usage
Mark Janes (11):
- iris: Increase the size of upload buffers
- iris: Upload constant resources for efficient GPU access
- iris: Use const_uploader for iris_create_stream_output_target
- iris: Use const uploader for blorp vertex data
- iris: Use const uploader for draw parameters
- iris: Use const uploader for user index data
- intel/compiler: Add getter helpers for LSC message descriptor fields
- intel/compiler: Add LSC messages to brw_schedule_instructions
- intel/fs: Lower DW untyped r/w messages to LSC when available
- intel/fs: Lower untyped atomic messages to LSC when available
- intel/fs: Lower A64 untyped r/w messages to LSC when available
Martin Krastev (1):
- compiler/glsl: Use mutex lock while freeing up mem_ctx
Martin Peres (1):
- ci: add the dEQP expectations for radv on Renoir
Matt Turner (10):
- intel/eu: Add instruction compaction support on XeHP.
- compiler/glsl: Return progress from propagate_invariance()
- compiler/glsl: Propagate invariant/precise when splitting arrays
- compiler/glsl: Always propagate_invariance() last
- freedreno/afuc: Print uintptr_t with PRIxPTR
- sparc: Avoid some redefinition warnings
- tu: Provide a toggle to avoid warnings about unsupported devices
- freedreno/ci: Use TU_IGNORE_CONFORMANCE_WARNING to reduce warnings
- ci: Unify on MESA_VK_IGNORE_CONFORMANCE_WARNING
- amd/ci: Use MESA_VK_IGNORE_CONFORMANCE_WARNING to reduce warnings
Matti Hamalainen (11):
- gallium/tools: clean up tracediff.sh a bit
- gallium/tools: improve option handling in dump_state.py
- gallium/tools: implement better suppression of variants
- gallium/tools: implement 'named' pointers option in dump.py
- gallium/tools: use left-column output mode of sdiff in tracediff.sh
- gallium/tools: improve tracediff.sh argument handling
- gallium/tools: implement "high-level" overview mode option in dump scripts
- gallium/tools: improve pointer type tracking in parse.py
- gallium/tools: add option to use Meld for diffing
- aux/trace: add missing return value to trace output
- gallium/tools: improve handling of pointer arrays
Mauro Rossi (11):
- egl/android: include "util/compiler.h" for FALLTHROUGH macro
- android: panfrost/lib: add pan_cs.c to Makefile.sources
- android: gallium/radeonsi: add nir include path
- android: amd/common: add nir include path
- android: pan/bi: add bi_opt_constant_fold.c to Makefile.sources
- android: nir: add nir_lower_fragcolor.c to Makefile.sources
- android: intel/compiler: add brw_compile_ff_gs.c to Makefile.sources
- android: i965: remove brw_ff_gs_emit.c from Makefile.sources
- android: ac: add ac_nir_lower_ngg.c to Makefile.sources
- android: ac: add include src/util path
- android: aco: add aco_optimizer_postRA.cpp to Makefile.sources
Michael Tang (1):
- microsoft/compiler: Maintain sorting of resource type in the context
Michael Walle (1):
- kmsro: Add mali-dp
Michel Dänzer (18):
- lima/ppir: Cast pointer to uintptr_t instead of uint64_t
- util: Remove unused Android options_tbl_lock
- Convert most remaining free-form fall-through comments to FALLTHROUGH
- Guard FALLTHROUGH annotations after assert()
- llvmpipe: Drop switch with only default case
- iris: Drop unneeded default switch case
- Use explicit break instead of fall-through to break-only case
- ci: Enable -Werror in clang jobs
- osmesa: Replace default case FALLTHROUGH annotation by following return
- ci: Enable -Werror for the remaining GCC build jobs
- ci: Move -Werror enabling from job definitions to meson build script
- ci: Add test which occasionally times out to lavapipe-vk skips
- Fix up leftover "state_trackers" references to "frontends"
- turnip: Mark local variable ASSERTED
- ci: Add debian/ prefix to job names for Debian based docker images
- ci: Rename Debian based build jobs from meson-* to debian-*
- ci: Add Fedora 34 based x86 build docker image
- ci: Add Fedora release build job
Michel Zou (14):
- lavapipe: fix unused variable warning
- vulkan: fix duplicate win32 def
- gallium: fix uninitialized variable warning
- meson: link vulkan_util with link_whole on mingw
- docs: list more vulkan extensions
- vulkan/wsi: avoid wsi_x11_check_for_dri3 for sw device
- zink: fix win32 build
- swr: fix uninitialized variable warnings
- llvmpipe: restrict optim bug workaround to gcc 10.x
- glapi: fix Warray-parameter
- zink: Drop useless zink_dispatch_table
- zink: Fix win32 build
- zink: Fix unused-variable warning
- meson: don't use missing dumpbin path
Miguel Gomez (1):
- i965: Prevent invalid framebuffer usage
Mike Blumenkrantz (548):
- gallium: add PIPE_BIND_SAMPLER_REDUCTION_MINMAX
- gallium: split PIPE_CAP_SAMPLER_REDUCTION_MINMAX into modes
- mesa/st: plumb GL_TEXTURE_REDUCTION_MODE_ARB through QueryInternalFormat
- zink: hook up VK_EXT_sampler_filter_minmax
- zink: support format queries for VK_EXT_sampler_filter_minmax
- zink: handle minmax sampler creation for VK_EXT_sampler_filter_minmax
- zink: export PIPE_CAP_SAMPLER_REDUCTION_MINMAX_ARB
- docs: update GL_ARB_texture_filter_minmax for zink
- zink: compare against screen batch id when determining which semaphore to use
- zink: always copy the nir shader before compiling
- zink: fix tcs slot map eval for user vars
- zink: fix tcs input reservation for user vars
- st/pbo: use cso_set_vertex_buffers_and_elements() for st_pbo_draw
- zink: merge copy-to-scanout path into non-deferred flush path
- zink: force scanout sync when mapping scanout resource
- util/format: add util_format_is_rgbx_or_bgrx
- zink: use undefined layout for first scanout obj transition
- Revert "zink: force scanout sync when mapping scanout resource"
- zink: move scanout sync to end of batch
- zink: add a flag indicating whether scanout object needs updating
- zink: move wsi flush info conditional to queue submission
- zink: directly set batch->state->flush_res from flush_resource hook
- zink: add clear-on-flush mechanic deeper into flush codepath
- gallium: when tracing is enabled for threaded drivers, trace the driver thread
- nir/lower_fragcolor: set outputs_written for fragdata members
- softpipe: fix render condition checking
- softpipe: fix streamout queries
- softpipe: ci updates
- zink: track persistent resource objects, not resources
- zink: restore previous semaphore (prev_sem) handling
- zink: use cached memory for staging resources
- zink: init timeline semaphore on screen creation, not first batch creation
- zink: only reset query on suspend if the query has previously been stopped
- zink: when performing an implicit reset, sync qbos
- lavapipe: implement VK_EXT_provoking_vertex
- zink: hook up VK_EXT_provoking_vertex
- zink: implement VK_EXT_provoking_vertex
- zink: ci updates
- zink: update docs
- nir/gl_lower_buffers: set access for ssbo load/store instrs
- zink: use non-atomic load/store ops if intrinsic is not actually coherent
- zink: remove leftover references to flatshading in shader keys
- zink: hook up VK_KHR_shader_clock
- zink: add conversion util for nir_scope -> SpvScope
- zink: add spirv builder for unops with a const operand
- zink: support nir_intrinsic_shader_clock
- zink: export PIPE_CAP_TGSI_CLOCK
- zink: generate spirv 1.5 from ntv when using vk >= 1.2
- zink: create entrypoints for descriptor variables with spirv 1.5
- zink: add fastpath for getting default shader variants
- zink: use first-created shader variant as the default
- zink: hook up VK_EXT_sample_locations
- zink: hook up VK_EXT_conservative_rasterization
- zink: hook up VK_EXT_shader_subgroup_ballot
- zink: hook up EXT_image_drm_format_modifier
- docs: mark off GL_ARB_shader_clock for zink
- gallium: rename pipe_draw_start_count -> pipe_draw_start_count_bias
- gallium: move pipe_draw_info::index_bias to pipe_draw_start_count_bias
- mesa/st: rename DrawGalliumComplex -> DrawGalliumMultiMode
- gallium: split drawid out of pipe_draw_info and as a separate draw_vbo param
- gallium: remove padding members from pipe_draw_info
- util/tc: split out drawid-using draws into a separate call
- iris: fix indirect drawid
- zink: grab GetPhysicalDeviceMemoryProperties2 from instance
- zink: hook up VK_EXT_memory_budget
- zink: support PIPE_CAP_QUERY_MEMORY_INFO
- zink: minor refactoring of buffer map for read case
- zink: add a screen util function for handling VkResults
- zink: use zink_screen_handle_vkresult() for fence and timeline waiting
- zink: add a ctx function for handling device lost resets
- zink: use new ctx device lost checker function
- zink: add a pipe_context::resource_commit hook
- zink: implement sparse buffer creation/mapping
- zink: export PIPE_CAP_SPARSE_BUFFER_PAGE_SIZE
- aux/cso_cache: add handling for save/restore of compute states
- zink: clamp zs samplers to XXXX swizzle for all non-zero/one swizzles
- gallium/inlines: remove atomic set from pipe_reference_init()
- nir: add nir_isub_imm
- lavapipe: handle buffer sizes better in CmdBindTransformFeedbackBuffersEXT
- lavapipe: do not read sampler descriptor info during update if layout has immutables
- lavapipe: set events to the unsignalled state on creation
- lavapipe: flag renderpasses as having color/zs attachments
- lavapipe: update more states on null multisample pipeline info
- lavapipe: zero out the dsa state info and flag for updating on null dsa state
- lavapipe: zero out the blend state info and flag for updating on null blend state
- lavapipe: don't unnecessarily flag dsa states for updating
- lavapipe: ignore tess pipeline info if no tess shaders in pipeline
- lavapipe: don't access pipeline viewport state when it should be ignored
- lavapipe: don't access pipeline dsa state when it should be ignored
- lavapipe: don't access pipeline blend state when it should be ignored
- zink: split off descriptor layout from descriptor pools
- zink: unify pipeline layout creation functions
- zink: abstract descriptor init
- zink: abstract descriptor usage for programs
- zink: abstract descriptor pool usage for programs
- zink: use explicit types during descriptor updates
- zink: check descriptor layout support before creating it
- zink: move more vertex state stuff into the hw state
- zink: split vertex state pipeline hashing into its own value
- zink: flag pipeline for change more often when vbos change without dynamic state
- zink: return current pipeline object if state hasn't changed
- zink: hook up dynamic dsa states
- zink: start using dynamic front face state
- util/hash_table: _mesa_hash_table_create_u32_keys()
- zink: add a pipe_context::clear_buffer hook
- zink: never use LINEAR for VK_EXT_4444_formats
- zink: make ZINK_INLINE_UNIFORMS more standardized in function
- zink: clamp 3D surface viewtype to 2D only in the create_surface hook
- zink: add a target param to create_ivci()
- zink: simplify samplerview surface creation
- zink: only set layer info for samplerviews if there are multiple layers
- zink: handle in-renderpass clears in fb_clears_apply_internal()
- zink: break zs clear loop once both bits are set when beginning renderpass
- zink: add debug assert to verify that zink_clear_framebuffer() is accurate
- zink: remove compute cruft from resource mapping
- zink: break out draw dispatch into separate functions
- zink: fix texture barriers for real this time
- zink: rework memory_barrier hook again (third time's the charm)
- ci: skip glsl-uniform-interstage-limits tests for softpipe jobs
- zink: use DONTCARE renderpass when a new scanout fb attachment is set
- iris: refcount separate screen objects for resource tracking
- zink: stop invalidating descriptor sets on pool destroy
- zink: add context-based descriptor info tracking infrastructure
- zink: unify resource rebinding
- zink: track bind counts for descriptors
- zink: update samplerview descriptor layouts when image binds are set
- zink: don't track sampler states onto buffer sampler sets
- zink: track max slot idx for descriptor types
- zink: track number of tbos in shader data
- zink: add slot params to zink_context_invalidate_descriptor_state
- zink: use better iterating for buffer rebinds
- zink: call invalidate on invalid descriptor sets during recycle
- zink: make zink_context_update_descriptor_states() static
- zink: remove screen param from zink_descriptors_update()
- zink: pop descriptor refs when invalidating sets
- zink: flush every 100k draws/computes
- zink: check for a work_count-based stall in zink_maybe_flush_or_stall()
- zink: always do maybe_flush after draw/compute
- zink: stop overwriting buffer map pointers for stream uploader
- zink: fix DrawParameters shader cap usage
- lavapipe: fix fencing when submitting multiple cmdbufs
- zink: immediately return false when getting query result if it's not gonna happen
- util/queue: don't require a fence when adding a job
- zink: split out base renderpass begin into separate function
- zink: add a flag for tracking/validating renderpass clears
- zink: add flags for determining whether to update framebuffer and renderpass
- zink: emit some barriers out of renderpass where possible
- nir/builder: add nir_pad_vector and nir_pad_vec4 util functions
- zink: don't multiply cube array image layers
- zink: populate images with u_blitter if transfer_dst isn't available
- zink: add even more validation for linear images before creation
- util/primconvert: add C++ guards to header
- aux/trace: support pipe_screen::query_memory_info
- aux/trace: pipe_screen::query_dmabuf_modifiers
- aux/trace: pipe_context::is_dmabuf_modifier_supported
- aux/trace: propagate pipe_screen::transfer_helper pointer
- aux/trace: pipe_screen::get_dmabuf_modifier_planes
- aux/trace: trace pipe_screen::resource_create_with_modifiers
- util/prim_restart: fix util_translate_prim_restart_ib
- ci: more freedreno flakes
- aux/vbuf: prevent uint underflow and assert if no vbs are dirty
- aux/trace: add pipe_context::set_debug_callback hook
- aux/trace: more effectively unwrap pipe_context params from screen functions
- aux/trace: trace transfer ops
- aux/trace: stop dumping transfer data for threaded contexts
- aux/trace: hook tc methods
- aux/trace: fix set_inlinable_constants hook
- aux/trace: fix query handling with tc
- aux/trace: add a pipe_context::clear_buffer hook
- aux/trace: dump 'wait' param for get_query_result
- radeonsi: explicitly return support for all index buffer formats
- zink: rename ptr_add_usage -> batch_ptr_add_usage
- zink: make descriptor_layout_get a public util function
- zink: make a public util function for allocating descriptor sets
- zink: unify pipeline layout creation and move to descriptor_program_init
- zink: pass descriptor type to set layout create()
- zink: replace has_descriptors program member with a util function
- zink: abstract descriptor functionality and make descriptor structs private
- zink: improve samplerview update flagging
- zink: emit descriptor barriers and references during bind
- zink: add vertex buffer barriers during bind
- zink: make timeline_wait use only a screen param
- zink: move timeline_wait() to screen function
- zink: implement tc idalloc resource id stuff
- zink: force streamout rebind when mapping a streamout buffer for writing
- zink: implement a tc is_resource_busy hook
- zink: call tc_driver_internal_flush_notify() on flush
- zink: mark some buffer barrier functions inline/static
- zink: switch to memory barriers instead of actual buffer barriers
- zink: hook up push descriptor and descriptor template extensions
- zink: disable push descriptors on amd
- nir/builder: add nir_mask
- radv: make radv_pipeline::attrib_ends 32bit
- radv: set maxVertexInputAttributeOffset to UINT32_MAX
- zink: remove weird lod hack for texturing
- zink: ci updates
- llvmpipe: remove clamping to [0,1] for tri offset
- lavapipe: moar @optimize
- llvmpipe: split out scene surface info into separate struct
- llvmpipe: split out scene surface init into separate function
- llvmpipe: only dump tgsi shaders if they're actually tgsi shaders
- llvmpipe: store a screen pointer in resource struct
- llvmpipe: stop accessing pipe_resource::screen internally
- lavapipe: skip "pipeline barriers" if they're first or last in a cmdbuf
- lavapipe: also ignore multiple pipeline barriers in succession
- gallium/aux: add helper for pre-clamping clear_buffer value to dword
- zink: clamp clear_buffer values
- radeonsi: clamp clear_buffer values using new util helper
- zink: improve unsupported feature warning message
- aux/trace: avoid deadlock in screen::flush_frontbuffer hook
- gallivm: fix oob imageLoad with formats that have <4 components
- llvmpipe: ci updates
- aux/indices: break out primitive type conversion to separate function
- aux/indices: break out index size conversion to separate function
- aux/indices: break out index count conversion into separate function
- aux/indices: employ Delete The Code methodology
- lavapipe: add more format mappings for vertex buffer formats
- zink: reapply resource/surface refs after app flushes
- zink: reapply program refs automatically
- zink: remove barriers/refs from descriptor cache
- zink: mark some draw functions inline
- zink: only rebind pipelines when necessary
- zink: handle rebinds for vertex buffers
- zink: only rebind vertex buffers when necessary
- zink: only update viewport state when necessary
- zink: update scissor only when necessary
- zink: ref vertex buffers during set_vertex_buffers
- zink: stop using util_set_vertex_buffers_mask()
- Revert "zink: call tc_driver_internal_flush_notify() on flush"
- compiler/spirv: expand_to_vec4 -> nir_pad_vec4
- anv: fix availability for copying timestamp query results
- zink: add a second descriptor manager
- zink: unify code for updating res->bind_count values
- zink: unify more resource bind count tracking code
- zink: optimize buffer rebinds
- zink: ci updates
- aux/trace: dump resource for samplerview and surface
- aux/draw: if pipe_draw_info::index_bias_varies is not set, ignore index_bias for N>1
- aux/draw: fix aalines and aapoints for shaders with explicit FragData outputs
- radv: declare index_va in a single call for indexed draw packet emit
- radv: explicitly load a desc set layout struct member during set allocate
- zink: add a util function to create a null surface
- zink: replace context-based null framebuffer surfaces with internal ones
- zink: create dummy surface/bufferview for null descriptor use
- zink: handle null bufferview/imageview descriptors when robustness2 is missing
- zink: ci updates
- zink: no-op read access buffer barriers if existing access exists for earlier stage
- zink: emit fb attachment barriers inline during renderpass start
- zink: track number of fb attachment binds on resources
- zink: use VK_IMAGE_LAYOUT_DEPTH_STENCIL_READ_ONLY_OPTIMAL when possible
- aux/tc: fix ubo unbinding
- Revert "Revert "zink: call tc_driver_internal_flush_notify() on flush""
- nouveau: explicitly advertise index buffer format support
- r300: explicitly advertise index buffer format support
- d3d12: explicitly advertise index buffer format support
- zink: explicitly advertise index buffer format support
- zink: more accurately handle shader layer/viewport caps
- util/prim_restart: assert the index size at the start of the function
- util/prim_restart: pre-trim degenerate primitives during draw rewrite
- util/prim_restart: store index bounds while rewriting draws
- util/prim_restart: store the total index count when rewriting draws
- util/prim_restart: update index bounds before draws in util_draw_vbo_without_prim_restart
- util/prim_restart: simplify util_draw_vbo_without_prim_restart a bit
- zink: populate maxSampleLocationGridSize for all available sample sizes on init
- zink: set VK_IMAGE_CREATE_SAMPLE_LOCATIONS_COMPATIBLE_DEPTH_BIT_EXT on zs rts
- zink: add a pipe_screen::get_sample_pixel_grid hook
- zink: add a pipe_context::set_sample_locations hook
- zink: also flag sample_locations_changed if framebuffer samples changes
- zink: add a util function for populating VkSampleLocationsInfoEXT
- zink: update vk sample location info during framebuffer setup
- zink: add a pipe_context::evaluate_depth_buffer hook
- zink: use dynamic state to apply sample locations during draw
- zink: export PIPE_CAP_PROGRAMMABLE_SAMPLE_LOCATIONS
- util/vbuf: fix buffer overrun in attribute conversions
- zink: fix caching of shader variants with inlined uniforms
- util/blitter: remove duplicated set_sample_mask calls
- util/disk_cache: add nocopy variant of disk cache store function
- zink: use scissor region for discarding clears during blit
- zink: clamp PIPE_CAP_MAX_VIEWPORTS to PIPE_MAX_VIEWPORTS
- aux/cso: add flag to disable vbuf
- aux/cso: split cso_destroy_context into unbind and a destroy functions
- lavapipe: use cso caching
- zink: fix typo that's definitely not at all embarrassing or anything like that
- aux/cso: store flatshade_first state from rasterizer
- util/primconvert: add function for setting flatshade_first
- util/vbuf: add flatshade_first to vbuf context and api
- aux/cso: set flatshade_first onto vbuf when binding rasterizer
- aux/tc: pass rebind count and rebind bitmask with replace_buffer_storage func
- util/prim_restart: use more direct conversion for restart index
- zink: add a function for creating descriptor layouts for push sets
- zink: split lazy sets based on descriptor type
- zink: match lazy descriptor set layout in cache mode
- zink: modernize cached push ubo descriptor updating
- zink: modernize cached ubo descriptor updating
- zink: modernize cached ssbo descriptor updating
- zink: modernize cached image descriptor updating
- zink: remove sorting for dynamic ubo offset updating
- zink: move ubo range assert to update_descriptor_state()
- zink: unify cached descriptor update code
- zink: run lazy batch descriptor functions in cache mode
- zink: add is_buffer flag to union zink_descriptor_surface
- zink: update null sampler/image descriptor surface with is_buffer during hashing
- zink: move shader image descriptor set refs to underlying type
- zink: add funcs for descriptor_surface refs
- zink: move samplerview descset refs to base objects
- zink: enable templated descriptor updates in cache mode
- zink: add oob asserts for descriptor set ref setting
- zink: skip hash updates for descriptor types which aren't used
- zink: unblock last_set cached descriptor reuse when safe to do so
- zink: add ZINK_DESCRIPTORS env var to explicitly set a mode
- zink: remove zink_batch_state::descs_used
- zink: split batch state work_count into separate vars
- zink: reorder has_barriers flag in batch state struct
- zink: optimize zink_tc_fence struct packing
- zink: move batch usage functions to static inlines
- zink: remove atomic from batch usage setting
- zink: make batch_usage_unset take a batch state param
- zink: unset program batch usage on state reset
- zink: remove unnecessary conditionals in resource batch tracking
- zink: make batch_usage_set take a batch state param
- zink: make batch_usage_matches take a batch state param
- zink: cache descriptor update templates along with layout
- zink: track active use counts for descriptor layouts
- zink: destroy lazy descriptor pools during batch reset when unused
- zink: slightly refactor program updating during draw
- zink: remove return types from program update functions during draw
- zink: simplify zink_program_has_descriptors()
- zink: mark bind_stage() as inline
- zink: unify gfx shader create callbacks
- zink: use u_live_shader_cache
- zink: remove unnecessary draw checks
- zink: move batch decl to top of draw_vbo
- zink: stop sanitizing primitive_restart flag in draw info
- zink: handle nir_op_pack_64_2x32
- zink: add update flag for rasterizer state change
- zink: add update flag for dsa state change
- zink: split stencil ref changes to separate dirty flag
- anv: fix dynamic primitive topology for tess
- zink: update pipe_screen::num_contexts
- zink: set subdata hook as PIPE_MAP_ONCE
- zink: move queue init to screen creation
- util/queue: add a global data pointer for the queue object
- zink: add a more direct check for rgbx formats in create_sampler_view hook
- zink: smash dstAlphaBlendFactor to ZERO for RGBX attachments
- zink: also nope out of any dst alpha blends for rgbx formats
- zink: support more RGBX formats
- zink: ci updates
- zink: mark some functions inline
- zink: collapse host_visible and non-coherent alignment alloc cases
- zink: change a bunch of sparse buffer resource checks to host-visible checks
- zink: avoid caching visible vram allocations
- zink: key alloc cache on heap index, not heap flags
- zink: check actual mem props to determine if resource object is coherent
- zink: use fake buffer barriers for descriptors
- zink: always defer image descriptor barriers
- zink: remove duplicated bitflag filtering for inline uniforms
- zink: remove inlinable_uniforms_dirty_mask
- radv: move pipe_misaligned and l2_coherent image checks to flags set on init
- nine: only enable tgsi disk cache if the driver supports it
- nine: add zink to the build target
- zink: handle custom border color without matching wrap mode case
- zink: add a flag for disabling conditional render during blit
- zink: add more clear hooks
- zink: clear the fb clears array instead of freeing it on reset
- zink: support multidraw
- zink: use multidraw
- vk/util: add macros for multidraw
- zink: clear textures directly when possible
- zink: only update last_finished during batch reset if the batch was used
- zink: improve tc fence disambiguation
- zink: add and use fencing functions which take batch usage structs
- zink: use batch usage api for resource helper function
- zink: remove no-longer-used resource helper functions
- zink: queue v3.0
- zink: apply zink_resource_object::offset for memory flush/invalidates
- zink: break out offset alignment calculation into helper
- zink: make init_mem_range() a public function
- zink: enforce multi-context waiting for unflushed resources on foreign batches
- zink: move queue submit thread to screen
- zink: move sparse buffer commit to screen queue
- zink: move fence reset to submit thread
- zink: flag scanout updates to batch state, not resource
- zink: move some end-of-batch stuff to submit thread
- zink: don't clear batch resources on fence finish
- ci: disable panfrost t760 jobs
- aux/draw: add a util function for reading back indirect draw params
- util/prim_restart: break out draw rewriting into separate function
- util/primconvert: handle indirect draws
- util/primconvert: map index buffer before getting index translator function
- util/primconvert: handle rewriting of prim-restart draws with unsupported primtype
- util/primconvert: handle multidraws in primconvert
- gallium: add a pipe cap to rewrite index buffers for draws using a non-fixed restart index
- gallium: handle automatic 8bit -> 16bit index buffer rewrites
- gallium: add a pipe cap for performing automatic prim type conversion
- gallium: add a pipe cap for determining driver support for prim type in restarts
- zink: export PIPE_CAP_EMULATE_NONFIXED_PRIMITIVE_RESTART
- zink: export 8bit index buffer support based on extension presence
- zink: export supported prim types
- zink: export supported primitive restart types
- zink: remove primconvert
- zink: ci updates
- zink: use depth/stencil-only layouts for depth/stencil-only formats
- lavapipe: implement multidraw ext
- zink: break out image descriptor layout into util function
- zink: split deferring of barriers to image and buffer functions
- zink: only do deferred image barriers if layout changes
- zink: use bind counts to more accurately determine image descriptor's exact layout
- zink: improve automatic layout transitions for sampler+image descriptors
- zink: only queue deferred descriptor layout change on first bind or change
- zink: flush pending clears if a resource is bound as a descriptor
- zink: repack zink_context struct a bit
- anv: unify some draw state vertex constant emission
- anv: VK_EXT_multi_draw implementation
- util/vbuf: always claim support for PATCHES in restart modes
- util/vbuf: flag fallback_always if any prim types are missing from restart modes
- zink: add direct conversion from pipe_shader_type->VkPipelineStageFlags
- zink: split dummy buffer creation and populate
- zink: try for better buffer allocation heaps
- zink: don't align device-local buffer memory
- zink: make mem cache limits dynamically scalable
- zink: uncap mem caching
- zink: cache visible vram
- zink: attempt to handle some resource unmap cases in 32bit envs
- radv: pre-calc vertex buffer descriptor size on pipeline object
- lavapipe: hook up some bits for handling dynamic line stipple state
- lavapipe: implement EXT_vertex_input_dynamic_state
- zink: avoid unnecessarily rewriting gl_DrawID
- zink: unify/consolidate some barrier queuing
- zink: break up ctx descriptor state updating to per-type functions
- zink: add a ref for flush resource
- zink: unify fb surface unbinding
- zink: move line width and depth bias updating into conditional during draw
- zink: merge some streamout state emission into the same draw conditional
- zink: rework pipeline cache implementation
- zink: make prim type a bitfield in pipeline info
- zink: rename 'template' struct member
- zink: remove unnecessary return from zink_desc_type_from_vktype()
- zink: add c++ header guards
- zink: add more explicit casts to draw code
- zink: don't add batch tracking during buffer rebinds if refs are dirty
- zink: remove stencil resource batch tracking
- zink: split out resource tracking into more incremental functions
- zink: split batch usage setting from refcounting
- zink: split samplerview/imageview usage/refcounting calls
- zink: add resource refs after last descriptor unbind
- zink: use vkresult helper for map return
- zink: only flag persistent resource maps for invalidation if they aren't coherent
- zink: don't add mem allocation offset when copying buf2image
- zink: use pipe_resource::width0 for clamping ssbo sizes
- zink: use 0 as the offset when mapping qbos
- zink: stop screwing up buffer offsets during maps
- zink: add a screen function for waiting on a batch id
- zink: check last_finished before timeline waiting
- lavapipe: store whether the geometry shader outputs GL_LINES
- lavapipe: store the geometry shader prim type to render state
- lavapipe: implement VK_EXT_line_rasterization
- lavapipe: wideLines support
- zink: ci updates for wideline fails
- relnotes: add some line feature updates for lavapipe
- features: mark off line rasterization for lavapipe
- features: mark off some zink features
- features: fix ARB_shader_group_vote -> GL_ARB_shader_group_vote
- features: add VK_EXT_multi_draw
- features: mark off EXT_vertex_input_dynamic_state for lavapipe
- radv: use multidraw iteration for direct draws
- radv: emit NOT_EOP for multi indexed draws
- radv: emit drawid for multidraws
- radv: determine if hardware can emit NOT_EOP before emitting
- radv: split indexed draw cases based on whether drawid is used
- radv: add a gfx10 bug workaround for NOT_EOP
- radv: implement VK_EXT_multi_draw
- lavapipe: handle null vertex buffers more gracefully
- util/vbuf: check 3-component 16bit int formats for translation
- zink: make shader cache local to a single program
- zink: split up shader cache per-stage
- zink: set gfx program shaders and generate internal tcs during program creation
- zink: remove gfx program slot mapping
- zink: remove shader_id
- zink: tweak shader module update -> pipeline combined_dirty conditional
- lavapipe: implement EXT_separate_stencil_usage
- lavapipe: implement KHR_separate_depth_stencil_layouts
- features: more lavapipe extensions
- relnotes: more lavapipe features
- zink: add util function for transferring resource refs to batch
- zink: move resource object ref to batch in invalidate hook
- zink: move resource object ref to batch in init_storage_object
- zink: remove refs from buffer rebinds
- zink: remove refs from vertex buffers
- zink: remove refs from ubos
- zink: remove refs from shader buffers
- zink: remove refs from shader images
- zink: remove resource refs from samplerviews
- zink: remove refs from desc ref updating
- zink: add surface ref during rebind if unflushed usage
- zink: set new batch usage during surface rebinds
- zink: remove imageview refs from shader images
- zink: remove samplerview refs
- zink: remove fb surface refs
- zink: remove fb surface resource refs
- zink: remove some descriptor_refs_dirty checks from resource binding
- zink: add a per-stage mask for ubo binds
- zink: add a per-stage bind mask for ssbos
- zink: make samplerview bind mask apply to buffer resources too
- zink: make image_bind_count work for buffers
- zink: remove barriers from buffer rebinds
- zink: optimize buffer rebinds
- zink: redo streamout and texture components of memory_barrier hook
- zink: remove unnecessary stall during device-local map case
- lavapipe: only apply pipeline state for depth bias if it's enabled
- lavapipe: implement EXT_extended_dynamic_state2
- features: EXT_extended_dynamic_state2 for lavapipe
- relnotes: EXT_extended_dynamic_state2 for lavapipe
- zink: store the last vertex stage to the context during bind
- zink: use last_vertex_stage pointer to optimize streamout emission during draw
- zink: update streamout buffer strides inline
- zink: move descriptor update closer to start of draw
- zink: consolidate and optimize index buffer handling during draw
- features: mark off VK_EXT_multi_draw for radv
- zink: remove zink_shader_module refcounting
- zink: flag all shaders for create during gfx program init
- zink: keep a mask of stages present in a gfx program
- zink: flag shader modules as default
- zink: store the default variant hash for a program
- nir/format_convert: nir_shift -> nir_shift_imm
- nir/format_convert: add ssa version of uint packing
- lavapipe: disable line rasterization ext
- zink: ensure sparse allocations aren't marked host-visible
- zink: fix mem info query to be more permissive
- zink: zero out sampler/image descriptor surface info for null descriptor updates
- zink: ci updates
- zink: populate modifier props onto screen object during init
- zink: start storing modifiers to the base resource struct
- zink: store modifier aspect to resource
- zink: add a pipe_screen::resource_get_param hook
- zink: use VkImageDrmFormatModifierListCreateInfoEXT for creating from modifier array
- zink: explicitly disallow using the modifier image create for non-linear images
- zink: don't pass modifier count to first image create
- zink: add fallback for linear modifier use
- zink: add a pipe_screen::resource_create_with_modifiers hook
- features: mark off line rasterization for lavapipe
- relnotes: add some missing zink/lavapipe updates
- ci: add vulkan files to lavapipe rules
- ci: only trigger gallium_core_file_list jobs from dri and glx frontend changes
- zink: simplify modifier ifdefs
- zink: improve detection for broken drawids
- lavapipe: increment drawid for multidraws
- util/foz: stop crashing on destroy if prepare hasn't been called
- zink: use array size in spirv bo length calculations
Nanley Chery (8):
- anv: Add clear_supported to anv_layout_to_aux_state
- anv: Avoid sampling some MCS surfaces with clear
- iris: Avoid sampling some MCS surfaces with clear
- isl: Add isl_aux_usage_has_compression
- iris: Prefer more GPU-based uploads for compression
- intel: Limit the D16 workarounds to Gfx12.0
- anv,iris: Port the D16 workaround stalls to BLORP
- intel/isl: Fix HiZ+CCS comment about ambiguates
Neha Bhende (4):
- svga: Add target and sampler_return_type info into shader key
- svga: Use shader_key info to declare resources if TGSI shader is missing it
- svga: Initialize pipe_shader_state for transform shaders
- aux/indices: include provoking vertex check in prim type conversion
Neil Roberts (1):
- kmsro: Fix confusing comma expression
Niklas Haas (3):
- vulkan/wsi/x11: return VK_SUBOPTIMAL_KHR on mismatched swapchain
- vulkan/wsi/x11: lower resize events to VK_SUBOPTIMAL_KHR
- vulkan/wsi/wayland: implement the full format table
Olivier Fourdan (1):
- radeonsi: Check aux_context on si_destroy_screen()
Paul Gofman (1):
- util: add force_gl_names_reuse for SWKOTOR.
Paul Kocialkowski (1):
- lima: Take offset in account when checking BO size
Paulo Zanoni (2):
- iris: finish converting from drmIoctl to intel_ioctl
- iris: don't munmap NULL pointers
Petr Vaněk (1):
- docs/install: remove one extra when
Philipp Zabel (1):
- etnaviv: fix gbm_bo_get_handle_for_plane for multiplanar images
Philippe Normand (1):
- i915: Prevent invalid framebuffer usage
Pierre Moreau (2):
- clover/spirv: Properly size 3-component vector args
- clover/nir: Set constant buffer pointer size to host
Pierre-Eric Pelloux-Prayer (57):
- driconf: add workaround for Golf With Friends
- glx: init __GLXvendorInfo to NULL
- radeonsi/nir: enable nir_opt_move_discards_to_top pass
- radeonsi: enable glsl_correct_derivatives_after_discard by default
- st/mesa: fix clearing of 1D array textures
- frontend/dri: set PIPE_BIND_PROTECTED later
- frontend/dri: fix bool/int comparison
- radeonsi: allow write-only mapping of encrypted textures
- radeonsi: fix encryption check for buffers
- radeonsi: dirty msaa_config on rs->multisample_enable change
- winsys/amdgpu: don't read bo->u.slab.entry after pb_slab_free
- amdgpu/winsys: remove amdgpu_cs_has_chaining
- winsys/amdgpu: reduce amdgpu_cs size
- winsys/amdgpu: use int16 for buffer_indices_hashlist
- radeonsi: add _once suffix to depth_cleared_level_mask
- radeonsi: add si_install_draw_wrapper
- radeonsi: use si_install_draw_wrapper for tmz handling
- radeonsi/nir: add si_nir_is_output_const_if_tex_is_const
- radeonsi: use si_nir_is_output_const_if_tex_is_const
- vbo: delay vbo_exec_vtx_map call
- radeonsi: delay sample_pos_buffer creation until first use
- util/u_queue: move function definition up
- util/u_queue: add UTIL_QUEUE_INIT_SCALE_THREADS flag
- disk_cache: use UTIL_QUEUE_INIT_SCALE_THREADS
- radeonsi: skip instance_count==0 draws on <= GFX9
- radeonsi: disable ngg culling on llvm < 12
- mesa/shaderapi: change construct_name signature
- mesa/shaderapi: add an optional shader override mechanism
- ac/llvm: call the callback in all return paths of ac_cull_triangle
- radeonsi: fix fb_too_small condition
- radeonsi/gfx7: always sync pfp/me
- ac/surface: don't print stencil info if tex has no stencil
- radeonsi/driconf: add workaround for SpaceEngine
- glthread: add a last parameter to unmarshal functions
- glthread: return consumed bytes
- glthread: use custom marshal/unmarshal for CallList
- glthread: merge successive glCallList
- dlist: add locked param to _mesa_lookup_list
- dlist: prelock ctx->Shared->DisplayList before execute_list
- dlist: remove OPCODE_EXT_0
- dlist: remove InstSize
- dlist: unindent code
- dlist: use an union instead of allocating a 1-sized array
- dlist: always use merged primitive for drawing
- dlist: split hot/cold data from vertex_list
- dlist: use a separate opcode for vbo replay using loopback
- dlist: use a new OPCODE to avoid loading cold data
- dlist: increment/check list nesting when handling OPCODE_CALL_LIST(S)
- dlist: store all dlist in a continuous memory block
- dlist: remove _mesa_dlist_alloc_aligned
- dlist: remove unused _mesa_dlist_alloc
- dlist: skip NOP command at the head of a list
- mesa: clear shader_info::is_lowered in prog_to_nir
- mesa: fix bindless uniform samplers update
- dlist: don't handle unmerged draws as merged
- gallium/va: don't use key=NULL in hash tables
- amd/registers: fix fields conflict detection
Qiang Yu (1):
- st/mesa: fix size mismatch for some check
Rafael Antognolli (5):
- intel/fs: Lower dword integer multiplies on XeHP.
- iris/bufmgr: Query memory region info.
- iris/bufmgr: Add new set of buckets for local memory.
- iris/bufmgr: Add flag to allocate from local memory.
- iris: Map with WC on non-LLC platforms.
Rhys Perry (92):
- aco/ra: use original names when renaming loop carried phi operands
- aco/ra: remove live-in temporary from live_out_per_block when moving it
- radv: fix barrier in radv_decompress_dcc_compute shader
- radv: fix clearing DCC-compressed e5b9g9r9 images
- aco: set TRUNC_COORD=0 for nir_texop_tg4
- ac/nir: set TRUNC_COORD=0 for nir_texop_tg4
- aco: remove image parameter from get_sampler_desc()
- Revert "radeonsi: set TRUNC_COORD=0 for Total War: WARHAMMER to fix it"
- aco: don't update register demand during RA validation
- aco: allow SDWA sels smaller than the operand size
- aco: add and use Program::progress
- nir/load_store_vectorize: assume CAN_REORDER ops don't alias with stores
- nir/opt_load_store_vectorize: improve handling of swizzles
- nir/opt_load_store_vectorize: ignore load_vulkan_descriptor
- nir/opt_load_store_vectorize: loop internally
- radv: improve vectorization callback for small bit sizes
- radv: only set robust_modes if robustBufferAccess2 is enabled
- radv: disable VK_FORMAT_R64_SFLOAT
- radv: cleanup LLVM implementation of vulkan_descriptor_index
- radv: implement vulkan_resource_reindex
- nir/lower_non_uniform: allow lowering with vec2 handles
- radv,aco: use nir_address_format_vec2_index_32bit_offset
- vulkan: fix use-after-free in vk_common_DestroyDebugReportCallbackEXT
- radv: fix use-after-free upon GS copy shader cache hits
- radv: fix possible use-after-free when inserting GS copy shader from cache
- radv,ac/llvm: use a dword alignment for descriptor loads
- aco: group loads from the same vertex binding into the same clause
- radv,aco: use per-attribute vertex descriptors for robustness
- Revert "radv,aco: don't use MUBUF for multi-channel loads on GFX8 with robustness2"
- radv,aco: compact vertex buffer descriptors
- ci: remove expected robustness2 fails for Renoir
- aco/ra: initialize temp_in_scc earlier
- aco/ra: fix get_reg_for_operand() with no free registers
- aco/ra: fix get_reg_for_operand() when the blocking var is a vector
- aco/ra: fix get_reg_for_operand() with vector operands
- aco/ra: use flags instead of booleans for update_renames()
- aco: disallow SGPRs on DPP instructions
- radv: don't allocate DCC predicate if the image doesn't use DCC
- radv: add radv_absolute_depth_bias
- radv: workaround incorrect depthBiasConstantFactor by Path of Exile
- radv: fix formatting of radv_dri_options
- radv: make attrib_end variable in radv_flush_vertex_descriptors 32-bit
- aco: do not clause NSA instructions
- aco/tests: add tests for form_hard_clauses()
- aco/tests: improve reporting of failed code checks
- aco: don't create 4 and 5 dword NSA instructions on GFX10
- aco: don't use nir_block_is_unreachable()
- nir/unsigned_upper_bound: don't require dominance metadata
- nir/algebraic: optimize extract of extract
- nir, nir/algebraic: add byte/word insertion instructions
- aco: disallow SDWA for instructions with 64-bit definitions/operands
- aco: add p_extract/p_insert
- aco: implement nir_op_extract/nir_op_insert
- aco: use byte/word extract pseudo-instructions
- ac/llvm: implement byte/word extract/insert instructions
- radv: use byte/word extract/insert instructions
- aco: optimize 32-bit extracts and inserts using SDWA
- aco: make validate_ir() output usable in tests
- aco: disallow literals with some instruction formats
- aco/tests: add tests for p_extract/p_insert lowering
- aco/tests: add SDWA tests
- aco: use v1b/v2b for ds_read_u8/ds_read_u16
- radv: improve LDS alignment check for load/store vectorization
- aco: don't ever widen 8/16-bit sgpr load_shared
- aco: use ds_read_{u8,u16}_d16
- aco: fix emit_mbcnt() with a VGPR mask
- radv: increase maxComputeSharedMemorySize
- nir/load_store_vectorizer: fix check_for_robustness() with indirect loads
- nir/opt_load_store_vectorize: check for restrict at the variable
- nir/opt_load_store_vectorize: only require one variable to be restrict
- nir: document that ACCESS_RESTRICT is not set at intrinsics
- radv,aco: use all attributes in a binding to obtain an alignment for fetch
- aco: adjust the condition for expanding vertex fetch data format
- aco/ra: use adjust_max_used_regs() in compact_relocate_vars()
- aco: don't move descriptor loads below buffer loads
- aco: move VMEM instructions below descriptor loads
- aco/lower_phis: fix undef_operands initialization with >32 predecessors
- aco/lower_phis: don't allocate unused temporary ids
- nir: use a single set during CSE
- nir/cse: resize the instruction set
- nir/propagate_invariant: add invariant_prim option
- radv: allow VK_FORMAT_R8G8_SRGB sampling
- nir/opt_load_store_vectorize: fix check_for_robustness() with deref access
- aco/tests: fix 32-bit build
- docs/envvars: fix RADV_TEX_ANISO
- aco: remove resource flags
- aco: handle NIR loops without breaks
- radv: enable VK_KHR_shader_subgroup_uniform_control_flow
- radv: don't ever convert num_records to bytes if it's zero
- radv: adjust num_records when offset>stride
- radv: use null vertex descriptor if num_records=0
- aco: don't create v_madmk_f32/v_madak_f32 from v_fma_legacy_f16
Rob Clark (157):
- freedreno: Add .clang-format
- freedreno: Some manual reformatting
- freedreno: Re-indent
- freedreno: Manual fixups
- freedreno: Add missing foreach macros and update indentation
- freedreno/drm: Re-indent
- freedreno/afuc: Re-indent
- freedreno/common: Re-indent
- freedreno/computerator: Re-indent
- freedreno/decode: Re-indent
- freedreno/drm-shim: Re-indent
- freedreno/ir2: Re-indent
- freedreno/perfcntrs: Re-indent
- freedreno/fdl: Re-indent
- ir3: handle 16b op_i2b1
- ci: Update kernel with a few freedreno related fixes
- ci: Add timeout for traces jobs
- freedreno: Small indent fix
- freedreno: Avoid staging blits with stencil on older gens
- freedreno: Make sure we actually flush if we need a fence
- freedreno: Add a couple debug traces
- freedreno: Allow resource shadowing for TC
- freedreno/drm: Move submit->primary to base class
- freedreno/drm: Cleanup bo allocation flags
- freedreno/drm: Cleanup bo cpu_prep flags
- freedreno/drm: Add FD_BO_PREP_FLUSH
- freedreno/drm: Move the growable array helper
- freedreno/drm: Add locked version fd_{bo,pipe}_del()
- freedreno/drm: Userspace fences
- freedreno/drm: Inline the fence-table
- freedreno/batch: Don't create fences for every batch
- freedreno: last_fence optimization for TC async flushes
- freedreno: Move fence struct to header
- freedreno: Drop unused create_fence() arg
- freedreno/drm: Reference count submits
- freedreno: Re-work fd_submit fence interface
- freedreno/drm: Add pipe tracking for deferred submits
- freedreno/drm/sp: Split submit prep and finish
- freedreno/drm/sp: Implement deferred submit merging
- freedreno: Avoid flushing deferred submits for u_trace
- freedreno/drm: fd_submit should hold ref to fd_pipe
- freedreno/drm: pipe should hold reference to device
- freedreno/drm: Async submit support
- freedreno/drm: Assume explicit fences if in_fence_fd
- freedreno/ci: Disable counterstrike trace on a306 for now
- freedreno/ci: Skip texsubmimage cube_map_array
- ci: Add DEQP_CASELIST_INV_FILTER
- freedreno/ci: Isolate dEQP-EGL reset_context tests
- freedreno: Remove samples-per-tex tracking
- freedreno/drm: Allow FD_BO_PREP_FLUSH without _NOSYNC
- freedreno: Flush resources harder
- freedreno/ci: Mark client_wait_sync_finish as flake
- freedreno/ci: Update piglit skips/fails
- freedreno/drm: Initialize control->fence
- freedreno: Fix TC last_fence optimization
- freedreno: Consolidate needs_flush and clearing last_fence
- freedreno/query/acc: Set needs_flush
- freedreno/tools: Fix async flush vs fdperf/computerator
- pps: Lower min sampling interval
- util/perfetto: Add one-time init
- freedreno: Add freedreno pps driver
- gallium/aux: Add perfetto support to u_trace
- freedreno/drm: Add support to query device suspend count
- freedreno/pps: Detect GPU suspend on newer kernels
- freedreno: Moar header C++-proofing
- freedreno: Add perfetto renderpass support
- pps: Add a more interesting cfg example
- docs/perfetto: Updates for freedreno and render-stages
- gallium/u_threaded: Add to_call() helper
- gallium/u_threaded: Add call logging
- freedreno/ir3: Don't force RTNE if rounding mode is undefined
- freedreno/a6xx: Add a few registers
- freedreno: Rename internal resource_busy
- freedreno: Implement TC resource_busy
- freedreno/tu+drm: Extract out pm4 pkt header helpers
- freedreno: Move pkt parsing helpers to common
- freedreno/afuc: Split out instruction decode helper
- freedreno/afuc: Split out utils
- freedreno/afuc: Clean up special regs
- freedreno/afuc: Add pipe reg name decoding
- freedreno/afuc: Add emulator mode to afuc-disasm
- freedreno/registers: Add a few a6xx regs and notes
- freedreno/afuc: Extract full gpu-id
- freedreno/afuc: Split out helpers to parse labels and packet-table
- freedreno/afuc: Add emulator support to run bootstrap
- freedreno/ci: Add real packet-table loading for afuc test
- freedreno/afuc: Use emulator to extract jmptbl
- freedreno/headergen2: Fix compile warnings with CP_DRAW_INDIRECT_MULTI
- freedreno/a6xx: Fix mh31 intermittent faults
- freedreno: Fix typo
- freedreno: Don't return a flushed batch
- egl: zero is a valid fd
- egl+libsync: Add check for valid fence-fd
- frontend/dri: Fix fence-fd logic
- freedreno/ir3: Fix use after free
- Revert "st/mesa: execute glFlush asynchronously if no image has been imported/exported"
- freedreno: Fix batch flush race condition
- freedreno: Fix fdperf flush
- gallium/u_threaded: Missing driver-thread marking
- freedreno: Add string-marker debug trace
- freedreno: Move assert
- freedreno: Add tid to DBG() msgs
- freedreno: Remove assert
- freedreno/registers: add A5XX_RBBM_STATUS3 bit
- freedreno: Add missing valid range tracking for SSBOs/images
- docs: Update freedreno features
- freedreno/ci: Sort a630 piglit xfails
- freedreno/a6xx: Fix r16_snorm blits
- freedreno/a6xx: Handle non-UBWC surface views
- freedreno/a6xx: Improve UBWC demotion logic
- freedreno: Drop obsolete comment
- freedreno: Don't try staging blit for non-renderable formats
- freedreno: Add debugging for blitter fallback recursion
- freedreno: Avoid recursive re-entry of u_blitter
- freedreno/a6xx: Handle R8G8 sharp edges in validate_format()
- freedreno/a6xx: Also validate format in blitter path
- freedreno: Flush batches on shadow/uncompress
- freedreno: Fallback to sw for copy_image with compressed
- freedreno: Fix flushes with NULL batch
- freedreno/blitter: Flush before self-blits
- freedreno/a6xx: Use UNORM for SNORM copy blits
- freedreno/a6xx: Handle u/snorm vs u/sint validation
- freedreno: Fix for multi-draw blits
- freedreno/a6xx: Flip on copy_image
- freedreno/a6xx: Skip nv_copy_image tests
- freedreno: Defer freeing batch->key
- freedreno/ci: Start longest traces first
- freedreno/ci: Increase # of jobs for CI runners
- freedreno/ci: Garbage collect some a630 flakes
- freedreno/a6xx: Handle fb_read in sysmem path
- freedreno: Flush if at risk of overflowing bos table
- turnip: Use drmIoctl()
- turnip: Fix AcquireImageANDROID() handle type
- turnip: Add CrOS Gralloc support
- nir: Add pass to lower phi precision
- freedreno+ir3: Enable INT16
- freedreno/a6xx: Fix framebuffer_barrier crash
- turnip: avoid some UB
- turnip: Split tu6_emit_xs()
- freedreno/computerator: Add script to probe FLUT values
- freedreno/ir3: Add float immed "FLUT" support
- freedreno: Rename \*_dev_info
- freedreno: Generate device-info tables at build time
- freedreno: Convert fd_dev_info to const pointer
- turnip: Convert fd_dev_info to const pointer
- freedreno/ir3: Get tess_use_shared from fd_dev_info
- freedreno/ir3: Get reg_size_vec4 from fd_dev_info
- turnip: Drop unused vshs_workgroup param
- turnip: Get storage_16bit from fd_dev_info
- turnip: Get indirect_draw_wfm_quirk from fd_dev_info
- turnip: Get has_tex_filter_cubic from fd_dev_info
- turnip: Get has_sample_locations from fd_dev_info
- freedreno+turnip: Add has_cp_reg_write
- freedreno+turnip: Add has_8bpp_ubwc
- freedreno+turnip: Get device name from device-info table
- freedreno+turnip: Add a6xx gen4 support
- freedreno/a6xx: Add missing PC_CCU_INVALIDATE_x
Robert Foss (1):
- freedreno/regs: add 5nm DSI PHY/PLL regs
Robert Tarasov (1):
- iris: Check data alignment for copy_mem_mem
Rohan Garg (8):
- i965: plumb device/driver UUID generators
- i965: Initial implementation for EXT_memory_object_*
- i965: Implement semaphore support for EXT_external_objects
- i965: Implement BufferDataMem
- i965: fix in fences backend for ext_external_objects edge case
- i965: Enable EXT_memory_object_* for Gen 7 and above
- docs: mark external memory and semaphore extensions done for i965
- ci: Don't artifact rendered images when job succeeds
Roland Scheidegger (1):
- llvmpipe: fix nir dot products (fsum op)
Roman Stratiienko (7):
- anv_android: Add missing type
- meson: egl: Do not build platform_drm for Android
- android: Add scripts to build using meson
- nouveau: Don't require RTTI and use it only when enabled
- egl: android: prepare code for adding more buffer_info getters
- egl: android: add IMapper@4 metadata API buffer_info getter
- AOSP: Do not add '-Wl,--gc-sections' to the linker arguments
Ryan Houdek (3):
- Default enable SSE2 on mesa builds.
- Switch u_format_test to passed on i386
- Update release notes with mention that x87 is no longer used on x86
Sagar Ghuge (16):
- anv: Set correct fast clear value for depth during blorp operation
- anv: Avoid corrupting indirect depth clear values
- anv: Query memory region info
- anv: Wrapper around I915_GEM_CREATE_EXT_MEMORY_REGIONS
- anv: Allocate BO in appropriate region
- anv: Allocate scratch and workaround BO in local memory
- intel/compiler: Define new LSC data port encodings
- intel/compiler: Add support for LSC fence operations
- intel/compiler: Add helpers for LSC message descriptors
- intel/disasm: Disassemble LSC messages
- intel/disasm: Disassemble LSC message extended descriptors
- intel/fs: Lower untyped float atomic messages to LSC when available
- intel/fs: Lower Byte scattered r/w messages to LSC when available
- intel/fs: Lower A64 byte scattered r/w messages to LSC dataport
- intel/fs: Lower A64 atomic messages to LSC dataport
- intel/fs: Lower varying pull constant load message to LSC dataport
Samuel Iglesias Gonsálvez (13):
- turnip: move pipeline gras_su and rb{stencil,depth}_cntl_mask initialization
- turnip: initialize pipeline->rb_{stencil,depth}_cntl always
- turnip: refactor how LRZ state is calculated
- turnip/lrz: add support for VK_EXT_extended_dynamic_state
- turnip: document GRAS_LRZ_CNTL's UNK5 bitfield
- turnip/lrz: added support for depth bounds test enable
- turnip: fix typo in tu_CmdBeginRenderPass2()
- turnip: implement LRZ direction
- turnip: update LRZ state based on stencil test state
- turnip: group all geometry constant draw states in one
- turnip: fix setting dynamic state mask for VK_DYNAMIC_STATE_STENCIL_OP_EXT case
- turnip: add LRZ early-z support
- anv: do not dereference VkPipelineMultisampleStateCreateInfo always
Samuel Pitoiset (130):
- amd: drop support for LLVM 8
- radv: keep DCC compressed for clears on compute with image stores
- aco: fix opquantize2f16 on GFX6-7
- radv: fix fast clearing depth-only or stencil-only aspects with HTILE
- radv: fix emitting depth bias when beginning a command buffer
- radv: remove radv_image_iview::bo
- radv: remove radv_image_iview::multiplane_planes
- radv: allow concurrent MSAA images to be FMASK compressed
- radv: fix emitting default depth bounds state on GFX6
- radv/winsys: remove set but never used use_llvm
- radv: remove old comment about LLVM <= 8
- ac: move ac_lower_indirect_derefs() outside of the LLVM dir
- radv: cleanup LLVM related includes
- radv: remove RADV_DEBUG=nothreadllvm
- radv/winsys: fix allocating the number of CS in the sysmem path
- radv/winsys: fix resetting the number of padded IB words
- radv: make sure CP DMA is idle before executing secondary command buffers
- radv: remove warnings about RADV_PERFTEST=aco,llvm
- radv/llvm: implement the image load DCC bug
- radv: enable DCC stores with the LLVM backend
- radv: re-introduce missing skip list for Polaris10
- radv: fix various CMASK regressions on GFX9
- radv: add the provoking vertex mode to the pipeline/shader keys
- radv/llvm: adjust NGG if provoking vertex mode is last
- aco: adjust NGG if provoking vertex mode is last
- radv: implement VK_EXT_provoking_vertex
- radv: enable TC-compat CMASK on GFX8-9
- radv: fix computation of the number of user SGPRS for NGG GS state
- radv: check if DCC is enabled when resolving different levels
- radv: only keep concurrent MSAA images compressed if TC-compat CMASK
- radv/winsys: add GFX6_MAX_CS_SIZE instead of using a magic value
- radv/winsys: fix executing huge secondary command buffers on GFX6
- radv: implement RADV_FORCE_VRS for the LLVM backend
- util/math: change ROUND_DOWN_TO to return a uint64_t
- radv: adjust the computation of the total usage of memory used
- radv: expose 2/3rd of total memory as VRAM and 1/3rd as GTT on APUs
- radv: fix missing ITERATE_256 for D/S MSAA images that are TC-compat HTILE
- radv: declare VK_EXT_extended_dynamic_state2 but leave it disabled
- radv: declare new dynamic states for VK_EXT_extended_dynamic_state2
- radv: implement dynamic depth bias enable
- radv: implement dynamic primitive restart enable
- radv: implement dynamic rasterizer discard enable
- radv: advertise VK_EXT_extended_dynamic_state2
- radv: fix extending the dirty bits to 64-bit
- radv: dump the trap handler shader with RADV_DEBUG=metashaders
- nir/opt_access: fix getting variables in presence of similar bindings/desc
- radv: add missing entrypoints for VK_EXT_extended_dynamic_state2
- radv: enable DCC stores on RDNA2
- aco: fix derivatives/intrinsics with SGPR sources
- Revert "radv: Do not access set layout during vkCmdBindDescriptorSets."
- radv: fix heap indices when computing the budget
- ac: ac_gpu_info::has_vgt_flush_ngg_legacy_bug
- radv: fix fast clearing DCC if one level can't be compressed on GFX10+
- radv: simplify radv_pipeline_has_gs_copy_shader()
- radv: remove small overhead of radv_pipeline_has_ngg()
- radv: ignore dynamic blend constants if blend isn't enabled
- radv: remove a useless TODO for dynamic line width
- radv: pass an image range to radv_layout_dcc_compressed()
- radv: remove redundant call to radv_dcc_enabled()
- radv: only mark DCC as compressed when drawing if layout allows it
- radv: only init DCC if compressed in the HW resolve path
- radv: do not decompress DCC for partial resolves if stores are supported
- radv: use radv_dcc_enabled() for the FB mip flush workaround
- aco: fix emitting discard when the program just ends
- radv: stop reporting ACO from the device name
- radv: remove DFSM
- util/drirc: make engine_versions an optional field
- radv: add few new drirc options
- util/drirc: use application_name_match for the SotTR RADV workaround
- radv: move all game workarounds to drirc
- radv: fix missing default state for DB_DFSM_CONTROL
- radv: fix generating hang reports if mutable descriptors are used
- radv: enable RADV_DEBUG=invariantgeom for Monster Hunter World
- ac/rgp: mark SQTT_FILE_CHUNK_TYPE_ISA_DATABASE as deprecated
- ac/rgp: bump the SQTT file minor version to 5
- radv: enable RADV_DEBUG=invariantgeom for SotTR DX11/DX12 versions
- ac: import performance counters from RadeonSI
- ac: rename ac_dump_thread_trace() to ac_dump_rgp_capture()
- ac/rgp: fix ac_fill_sqtt_asic_info() name
- ac: add ac_thread_trace::data
- radv/winsys: allow to reserve a VMID
- radv: emit PA_SC_CONSERVATIVE_RASTERIZATION_CNTL only on GFX9+
- ac/debug: fix color printing PKT3 when count in header is too low
- aco: fix range checking for SSBO loads/stores with SGPR offset on GFX6-7
- radv: dump SPIR-V instead of using spirv-dis when generating a hang report
- aco: fix emitting literal offsets with SMEM on GFX7
- ci: update list of expected failures for Pitcairn/Oland (RADV)
- radv: do not launch an IB2 for secondary cmdbuf with INDIRECT_MULTI on GFX7
- radv/winsys: add a small comment explaining the CHAIN bit
- ci: add expected list of failures for Bonaire (RADV)
- radv: fix aligning the image offset by using align64()
- radv/winsys: adjust some error messages
- radv/winsys: remove useless errno.h includes
- radv: fix dynamic rasterizer discard enable state
- radv: reject binding buffer/image when the device memory is too small
- radv: always decompress both aspects of a depth/stencil image
- radv: create only one pipeline for decompressing depth/stencil images
- radv: fix dynamic culling and depth/stencil related dynamic states
- ac/perfcounters: remove ac_pc_block_base::num_prelude
- ac/perfcounters,radeonsi: rework performance counters layout
- ac/perfcounters: rename num_multi to num_spm_counters
- ac/perfcounters: add more SPM configuration fields
- ac/perfcounters: add a GPU block ID to every block definitions
- radv: implement dynamic logic op
- radv: advertise extendedDynamicState2LogicOp
- radv: fix RADV_FORCE_VRS for 2x1 and 1x2
- radv: fix fd leak in vkAcquireImageANDROID()
- radv: disable DCC for DOOM 2016 and Wolfenstein II
- radv: implement VK_EXT_color_write_enable
- radv: advertise VK_EXT_color_write_enable
- radv: add support for more HTILE clear codes
- radv: prevent fast clearing HTILE depth for unrestricted ranges
- radv: allow more fast clears for depth surfaces without TC-compat HTILE
- ci: update list of expected failures against CTS 1.2.6.2 for RADV
- ci: remove few CTS that are now skipped with RADV
- aco: fix emitting d16 for MIMG instructions on GFX9+
- aco: fix emitting a16 for MIMG instructions on GFX10+
- aco: fix shared_atomic_comp_swap if the second source isn't a VGPR
- radv: fix applying radv_disable_dcc for DOOM and Wolfenstein II
- aco: use nir_ssa_def_is_unused() to determine if atomic dest is used
- ac,radv: implement the cs_regalloc_hang HW bug workaround
- radv: fix applying radv_disable_dcc for DOOM 2016 again
- radv: lower primitive shading rate in NIR
- radv: only init the TC-compat ZRANGE metadata for the depth aspect
- radv: fix bounds checking for zero vertex stride on GFX6-7
- radv: report APUs as discrete GPUs for Red Dead Redemption 2
- radv: fix specifying the stencil layout for separate depth/stencil layouts
- radv: allow unused VkSpecializationMapEntries
- radv: fix selecting the first active CU when profiling with SQTT
- radv: fix missing cache flushes when clearing HTILE levels on GFX10+
Sergii Melikhov (1):
- util/format: Change the pointer offset.
Simon Ser (27):
- radeon/vcn: handle tiled buffers when decoding
- util/format: document block depth field
- ac/surface: use blocksizebits instead of blocksize
- radeonsi: stop special-casing YUV formats in si_query_dmabuf_modifiers
- ac/surface: allow non-DCC modifiers for YUV on GFX9+
- frontends/va: improve surface attribs processing
- gallium, va: add support for VASurfaceAttribDRMFormatModifiers
- radeonsi: implement pipe_context.create_video_buffer_with_modifiers
- radv: stop special-casing multi-planar formats in radv_get_modifier_flags
- dri: add createImageWithModifiers2 interface
- gallium/dri: implement createImageWithModifiers2
- i965: implement createImageWithModifiers2
- vulkan/wsi/wayland: simplify wl_surface version check
- docs/envvars: document MESA_VK_WSI_PRESENT_MODE
- radv: implement VK_EXT_physical_device_drm
- amd/addrlib: remove Meson debug message()
- vulkan/wsi: unify format logic in dmabuf_handle_modifier
- vulkan/wsi: prefer the Wayland linux-dmabuf protocol
- vulkan/wsi/wayland: remove swapchain wl_drm wrapper
- vulkan/wsi/wayland: remove unnecessary wl_proxy_set_queue call
- vulkan/wsi/wayland: fix wsi_wl_image_init error code
- vulkan/wsi/wayland: handle dmabuf params allocation failure
- etnaviv: fix renderonly check in etna_resource_alloc
- etnaviv: fail in get_handle(TYPE_KMS) without a scanout resource
- freedreno: fail in get_handle(TYPE_KMS) without a scanout resource
- panfrost: fail in get_handle(TYPE_KMS) without a scanout resource
- lima: fail in get_handle(TYPE_KMS) without a scanout resource
Simon Zeni (4):
- vulkan/wsi: add drm_fd param to wsi_display_get_connector
- vulkan/wsi: Implement VK_EXT_acquire_drm_display
- radv: Implement VK_EXT_acquire_drm_display
- anv: Implement VK_EXT_acquire_drm_display
Steve Pronovost (1):
- d3d12: Add mechanism for D3D12 Adapter Selection
Stéphane Marchesin (1):
- virgl: resources without any binding can be cached
SureshGuttula (3):
- frontends/va/picture:Fix wrong reallocation even surface is protected
- frontends/va: Derive image from interlaced buffers for h26[4/5]encode
- radeon/vcn: calc_dpb_size should be based on dpb_type
Tapani Pälli (21):
- anv: do not support image export with stencil aspect set
- glx: fix compilation error when function name not found
- glsl: ignore interface precision qualifier on desktop GL
- glx: revert "Downgrade sRGB-ful fbconfigs"
- i965: support only color formats with memory objects
- nir: skip assert check with empty structs
- isl: require hiz for depth surface in isl_surf_get_ccs_surf
- anv: require rendering support for blit destination feature
- mesa: fix error set for glCompressedTexSubImage calls
- gitlab-ci: enable building of Vulkan tests in Piglit
- anv: introduce new dynamic states
- anv: support rasterizer discard dynamic state
- anv: support depth bias enable dynamic state
- anv: support primitive restart enable dynamic state
- anv: centralize vk_to_intel_logic_op array
- anv: support blending logic op dynamic state
- anv: toggle on VK_EXT_extended_dynamic_state2
- docs: add VK_EXT_extended_dynamic_state2 features.txt entry
- anv: provide dummy vkCmdSetPatchControlPointsEXT
- iris: take a reference to memobj bo in iris_resource_from_memobj
- anv: fix emitting dynamic primitive topology
Thomas H.P. Andersen (9):
- nir: return progress from nir_lower_packing
- nir/lower_packing: use shader_instructions_pass
- anv: remove dead code
- nir/ifind_msb_rev: fix input check
- zink: remove initialization override
- lavapipe: remove initialization override
- broadcom/compiler: use correct flag enum
- broadcom/compiler: fix add vs. mul
- nine: Fix assert in tx_src_param
Thong Thai (1):
- radeon/vcn/enc: Add missing line to HEVC SPS header code
Timothy Arceri (23):
- mesa: fix incomplete GL_NV_half_float implementation
- mesa: make _mesa_find_temp_intervals() a static function
- mesa: fix _mesa_add{_typed}_unnamed_constant() declarations
- mesa: fix _mesa_add_state_reference() declaration mismatch
- mesa: fix glShaderSource() error handling
- util: disable glthread in CSGO
- glsl: create validate_component_layout_for_type() helper
- glsl: add missing support for explicit components in interface blocks
- nir/lower_io_to_vector: fix per vertex io handling for arrays
- Revert "util: disable glthread in CSGO"
- util: add work around for the game We Happy Few
- util/tests: initialise key in cache_test
- mesa: don't crash on incorrect texture use
- i965: don't crash on incorrect texture use
- glsl: force_glsl_version to shaders with no defined version
- util/driconf: add new ignore_write_to_readonly_var workaround
- util: add some workarounds for the game Luna Sky
- util/disk_cache: delete more cache items in one go when full
- util/radeonsi: add radeonsi workaround for Nuclear Throne
- glsl: replace some C++ code with C
- util: add workaround for Full Bore
- glsl: relax rule on varying matching for shaders older than 4.20
- intel/compiler: make sure swizzle is applied to if condition
Timur Kristóf (74):
- aco: Mark VCC clobbered for iadd8 and iadd16 reductions on GFX6-7.
- radv: Ignore GS copy shader when determining NGG GS wave size.
- radv: Properly enable Wave32 mode for NGG GS.
- nir: Support upper bound of subgroup_id/num_subgroups for non-compute.
- nir: Support upper bound of unsigned bit size conversions.
- nir: Allow load_primitive_id in VS in nir_divergence_analysis.
- nir: Add AMD specific intrinsics for merged shaders and NGG.
- aco: Allow workgroup barrier and shared scope for NGG shaders.
- aco: Fixup the NIR metadata after sanitize_cf_list.
- aco: Split ngg_emit_sendmsg_gs_alloc_req from the wave0 check.
- radv: Fill shader info earlier.
- radv: Gather NGG info sooner.
- aco: Implement new NGG specific NIR intrinsics.
- ac: Add new NIR pass to lower NGG VS/TES.
- radv: Use new NGG NIR lowering for VS/TES when ACO is used.
- ac: Add NIR lowering for NGG GS.
- radv: Use new NIR lowering of NGG GS when ACO is used.
- aco: Determine whether a few more instructions need exec.
- aco: Use Operand instead of Temp for the exec mask stack.
- aco: Remember when exec mask is const, and restore the const then.
- aco: Don't use s_and_saveexec with branches when exec is constant.
- aco: Refactor SSA elimination phi info to use vector instead of map.
- aco: Eliminate useless exec writes in jump threading.
- aco/insert_exec_mask: Fixed unused variable warning in release build.
- aco/util: Initialize IDSet::bits_set to zero.
- gallium/tessellator: Fix uninitialized variable warnings.
- anv: Fix unused function warnings for memory range checks.
- gallivm: Fix a few uninitialized variable warnings.
- nine: Fix uninitialized warning in texture9.c
- radv/cmd_buffer: Fix warning by initializing instance count.
- aco: Don't eliminate exec write when it's used by a copy later.
- aco: Don't DCE instructions that write non-temps, eg. exec.
- aco: Add Operand(Temp, PhysReg) constructor.
- aco: New writeout overloads for the test framework.
- aco: Introduce a new, post-RA optimizer.
- aco: Use s_cbranch_vccz/nz in post-RA optimization.
- aco: Eliminate SALU comparison when SCC can be used instead.
- radv: Remove duplicate code for getting GS info.
- radv: Don't generate GS copy shader when the pipeline has NGG.
- radv: Assert that there is no GS copy shader when the pipeline has NGG.
- aco: Add note about v_alignbyte in the ISA README.
- nir: Add nir_op_sad_u8x4 which corresponds to AMD's v_sad_u8.
- aco: Implement nir_op_sad_u8x4.
- aco: Add validation for v_permlane instructions.
- nir: Add AMD-specific byte and lane permute intrinsics.
- aco: Implement byte and lane permute intrinsics.
- aco: Keep VGPR destinations for uniform shared loads when beneficial.
- ac/nir: Refactor and optimize the repacking sequence.
- amd: Add extra source to the mbcnt_amd NIR intrinsic.
- aco: Use as_vgpr for the second source of mbcnt_amd.
- ac/nir: Update TCS output barriers with nir_var_mem_shared.
- aco: Fix checking if load_shared is used by cross lane instructions.
- radv/llvm: Emit s_barrier at the beginning of NGG non-GS shaders.
- aco/gfx10: NGG zero output workaround for conservative rasterization.
- aco/gfx10: Emit barrier at the start of NGG VS and TES.
- radv: Add last_vgt_api_stage and use it to simplify some code.
- radv: Move radv_optimize_nir_algebraic to a separate function.
- radv: Allow enabling vertex grouping, fix NGG info with it disabled.
- radv: Set parameter cache oversubscription according to the PC lines.
- nir: Add AMD specific intrinsics for NGG shader based culling.
- ac/nir: Add a NIR port of ac_llvm_cull.
- ac/nir: Use a ballot that matches the wave size during NGG lowering.
- ac/nir: Implement NGG deferred attribute culling in NIR.
- radv: Expose radv_get_viewport_xform in radv_private.h
- radv: New shader args for NGG culling settings and viewport.
- aco: Implement NGG culling related intrinsics.
- radv: Support NGG culling with new perftest environment variable.
- radv: Run algebraic optimizations before NGG lowering.
- ac/nir: Reuse the repacked output positions of culling shaders.
- ac/nir: Analyze culling shaders to remember which inputs are used when.
- ac/nir: Reuse uniforms from top part of culling shaders.
- radv, aco, ac/nir: Tweak position export scheduling for NGG culling.
- radv: Don't compile NGG culling into shaders that write viewport index.
- radv: Remove num_viewports from radv_skip_ngg_culling.
Tomeu Vizoso (51):
- ci: Reenable radeonsi jobs, and extend coverage
- ci/lava: Build all piglit profiles in LAVA images
- ci/lava: Update kernel for LAVA to 5.11
- ci/lava: Start Xorg on request, for Piglit
- ci: Test RadeonSI with piglit's quick_gl
- ci: Use a single kernel+rootfs for both baremetal and LAVA jobs
- ci: Drop hack to disable all modules from defconfig
- ci/radeonsi: Add expected failures due to #4674 having slipped in
- panfrost/ci: Enable some dEQP 3.1 tests on Mali T860
- Revert "CI: Disable Panfrost and radeonsi"
- panfrost: Don't access members of NULL pointers
- pan/midgard: Don't emit zero padding
- ci: Remove the need for an empty Piglit results file
- Revert "CI: Disable all Panfrost/AMD/Iris automatic jobs"
- ci: Update kernel to v5.13-rc2
- panfrost/ci: Test Panfrost on the Mali G72 GPU
- panfrost/ci: Add one more flake test for G72
- radv/ci: Test on Stoney on CI
- ci/lava: Add caching proxies for trace downloads
- ci/piglit: Use wget instead of ci-fairy to check a file exists
- ci: Configure DUTs for max performance
- ci: Uprev piglit to eee7d89611cf "tests: Replay profile frame times"
- ci: Uprev apitrace to 170424754bb4 "retrace: Get --loop to work without rewinding"
- radeonsi/ci: Add new Piglit failures
- ci/freedreno: Add depth32f_stencil8 flakes
- ci/zink: Add nearest_linear_mirror_l8_pot flake
- ci/freedreno: Fix name of flake
- ci/freedreno: Add new flake after "ci: Configure DUTs for max performance"
- ci/freedreno: Add spec@arb_copy_buffer@dlist flake on a530
- Partial revert of "ci: Add a manual job for tracking the performance of Freedreno"
- ci/freedreno: Skip Portal 2 trace on a630, due to flakiness
- Revert "ci/freedreno: Skip Portal 2 trace on a630, due to flakiness"
- ci/lava: Disable CPU frequency scaling
- ci/lava: Switch LAVA jobs to x86 runners
- ci: Disable windows builds due to runner not being available
- ci: Build Crosvm in our container
- ci: Move Kernel build tasks into its own file
- ci: Store the credentials in /tmp
- ci: Run tests inside Crosvm
- iris/ci: Update the checksums for the pixmark-piano trace
- panfrost/ci: Add some failures that crept in
- ci/lava: Improve error reporting in lava_job_submitter.py
- ci/lava: Don't overwrite PIGLIT_REPLAY_EXTRA_ARGS
- Revert "ci: Disable the iris APL jobs"
- ci/bare-metal: Add parens around shell command
- panfrost: Fork pan_pool for Gallium and Vulkan
- panvk: Add VkCommandPool support
- panvk: Support calls to CreateDescriptorSetLayout without bindings
- panvk: Make panvk_queue_transfer_sync more generic
- panfrost: Specify alignment for the Job Header descriptor
- panvk: Add vkEvents support
Tony Wasserka (18):
- radv: Remove assert about pDepthStencilState
- aco/spill: Fix improper handling of exec phis
- aco/scheduler: Fix register demand computation for downwards moves
- aco/scheduler: Fix register demand computation for upwards moves
- aco/scheduler: Verify register demand invariants in debug mode
- util: add support for defining bitwise operators on strongly typed enums
- util: tune signatures of generated enum operators
- aco/scheduler: Clean up register demand tracking
- aco/scheduler: Move cursor handling state to dedicated interfaces
- aco/ra: Fix off-by-one-error in print_regs
- aco/ra: Clean up print_regs output and support byte-allocated variables
- aco/ra: Split print_regs by lines of 64 registers
- aco: Replace Operand literal constructors with factory member functions
- aco: Remove use of deprecated Operand constructors in test_to_hw_instr.cpp
- aco: Remove use of deprecated Operand constructors in aco_builder.h
- aco: Remove use of deprecated Operand constructors
- aco: Clean up unneeded literal casts
- aco: Remove deprecated Operand constructors
Vasily Khoruzhick (3):
- lima: switch resource to linear layout if there's too many full updates
- lima: implement alpha test
- lima: handle fp16 vertex formats
Ville Syrjälä (2):
- i915: Implement __DRI_IMAGE_ATTRIB_OFFSET query
- i915: Implement __DRI2_FLUSH version 4
Vinson Lee (17):
- clover: Add constructor for constant_argument.
- glx: Fix macOS build.
- nv50/ir: Initialize Graph::Node member tag.
- nvc0: Remove unnecessary bsp_bo NULL check.
- nv50/ir: Initialize BuildUtil member tail.
- nv50/ir: Initialize CodeEmitterNV50 member progType.
- nv50/ir: Initialize GCRA::RIG_Node members.
- nvc0/ir: Initialize CodeEmitterGK110 member progType in constructor.
- nv50/ir: Add ConstantFolding constructor.
- travis: Download XQuartz from GitHub.
- v3dv: Fix assert.
- nvc0/ir: Initialize CodeEmitterNVC0 member progType in constructor.
- intel/vec4: Add missing break statement.
- nvc0/ir: Initialize Limits members in constructor.
- asahi: Fix macOS macro.
- st/xa: Mark default xa_get_pipe_format case unreachable.
- asahi: Move assignment after null check.
Yevhenii Kolesnikov (3):
- intel: fix leaking memory on shader creation
- glsl: Add operator for .length() method on implicitly-sized arrays
- glsl: Properly handle .length() of an unsized array
Yiwei Zhang (79):
- venus: update venus-protocol headers
- venus: implement dma_buf fd import and properties query
- venus: cap api version to 1.1 for Android
- venus: fix virtgpu_bo_init_dmabuf for classic resource
- venus: close the import memory fd on success
- venus: force a roundtrip after vn_renderer_bo_create_dmabuf
- venus: set bo->size to 0 for classic resource
- venus: update venus-protocol headers
- venus: implement VK_ANDROID_native_buffer v7
- venus: use VK_EXT_image_drm_format_modifier
- venus: update venus-protocol headers
- venus: enable VK_EXT_queue_family_foreign
- venus: handle VK_IMAGE_LAYOUT_PRESENT_SRC_KHR transfer
- venus: handle wsi image queue ownership transfer for Android
- venus: query extended resource info from gralloc
- venus: populate VK_ERROR_OUT_OF_HOST_MEMORY if applied
- virgl: do not use winsys info for guest storage of classic resource
- venus: fix vkEnumeratePhysicalDeviceGroups
- venus: stop advertising KHR_driver_properties for Android
- venus: clean up vn_android api names
- venus: add AHB format and VkFormat conversion helper functions
- venus: add vn_android_get_ahb_usage helper function
- venus: add ahb image and buffer properties query support
- venus: vn_GetAndroidHardwareBufferPropertiesANDROID (part 1/2)
- venus: vn_GetAndroidHardwareBufferPropertiesANDROID (part 2/2)
- anv: fix AHB leak upon exportable allocation
- radv: fix AHB leak upon exportable allocation
- gallium/st: add a back buffer fallback for front rendering
- gallium/dri: implement EGL_KHR_mutable_render_buffer
- egl/android: check front rendering support for cros gralloc
- venus: tiny refactor of vn_android_get_gralloc_buffer_info
- venus: complete the format conversion between AHB and Vulkan
- venus: fix vn_GetAndroidHardwareBufferPropertiesANDROID
- venus: fix AHB image format properties query
- venus: prepare image creation helpers for AHB
- venus: implement image creation for ahb handle type
- venus: refactor device memory fd import
- venus: implement AHB allocation and import (part 1/2)
- venus: implement AHB allocation and import (part 2/2)
- venus: implement vn_GetMemoryAndroidHardwareBufferANDROID
- venus: support AHB external format for sampler YCbCr conversion
- venus: advertise VK_ANDROID_external_memory_android_hardware_buffer
- venus: rename dmabuf to dma_buf when it represents a type
- venus: fix misaligned bo_flags between import and query
- venus: refactor for property query of dma_buf fd
- venus: fix mismatched bo mmap_size for export and multiple imports
- venus: initial AHB support for multi-planar format
- venus: update to the latest venus protocol
- venus: support AHB prop query with host dma_buf size
- venus: refactor gralloc buffer and drm modifier properties query
- venus: unify VkNativeBufferANDROID and AHardwareBuffer image create info
- venus: forward the host renderer hardware info
- egl/android: fix cached buffer slots for EGL Android winsys
- egl/android: refactor to use the legit vndk/window.h header
- vulkan: fix back compat with Android Oreo and below
- egl/android: add aosp_nougat system/window.h back for back compat
- virgl: forward the host renderer hardware info
- anv: fix Android WSI VkFence
- venus: silence a build warning
- venus: refactor vn_AcquireImageANDROID with globalFencing
- venus: moves GPU rendering off CPU timeline for Android WSI
- venus: add debug info for experimental features during init
- radv: fix build errors after commit 8b7ff784
- anv: fix build errors after commit 8b7ff78
- venus: remove workarounds for multi-planar format interop
- anv: fix some log formats
- anv: support multi-planar format in add_all_surfaces_explicit_layout
- anv: enable multi-planar support for drm format modifier
- venus: properly support GPU_DATA_BUFFER for AHB
- venus: use the mesa "drm-uapi/drm_fourcc.h" header
- venus: remove unsupported AHB formats
- venus: resolve AHB external format with DRM format
- venus: add more logs for Android WSI debugging
- venus: prepare vn_CreateBuffer for AHB
- venus: handle ahb backed VkBuffer creation properly
- venus: fix AHB VkBuffer memory requirement
- egl/android: only apply front rendering usage in shared buffer mode
- egl/android: restore image creation fallback path used by virgl
- venus: cache ahb backed buffer memory type bits requirement
Yogesh Mohanmarimuthu (4):
- radv: set RADEON_FLAG_GTT_WC flag for prime memory
- glx: Keep display fd open for prime
- glx: create DRI screen for display GPU in case of prime
- loader: allocate VRAM in display GPU in case of prime
Yurii Kolesnykov (1):
- c_std=c11 in meson default_options
Zhaofeng Li (1):
- Add default driver selections for RISC-V
Zhu Yuliang (1):
- gallium/vl: don't leak fd in vl_dri3_screen_create
Zoltán Böszörményi (2):
- crocus: Add pipe loader driver
- crocus: Make the driver loader use PCI IDs for crocus
cheyang (1):
- virgl:Fix the leak of hw_res used as fence
luc (1):
- panfrost: Only clear existing color buffers