| Mesa 19.1.0 Release Notes / June 11, 2019 |
| ========================================= |
| |
| Mesa 19.1.0 is a new development release. People who are concerned with |
| stability and reliability should stick with a previous release or wait |
| for Mesa 19.1.1. |
| |
| Mesa 19.1.0 implements the OpenGL 4.5 API, but the version reported by |
| glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / |
| glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being |
| used. Some drivers don't support all the features required in OpenGL |
| 4.5. OpenGL 4.5 is **only** available if requested at context creation. |
| Compatibility contexts may report a lower version depending on each |
| driver. |
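| |
| Because the reported version varies by driver, applications usually |
| compare it against the version they need. The Python sketch below is |
| only an illustration: the helper names ``parse_gl_version`` and |
| ``supports_gl`` are hypothetical, the sample strings are not captured |
| from a real driver, and it assumes the string begins with |
| ``major.minor``, optionally preceded by ``OpenGL ES``. |

```python
import re

# Leading "major.minor" of a GL_VERSION string; OpenGL ES contexts prefix
# the number with "OpenGL ES ". Vendor-specific text after it is ignored.
_VERSION_RE = re.compile(r"^(?:OpenGL ES )?(\d+)\.(\d+)")


def parse_gl_version(version_string):
    """Return (major, minor) parsed from a GL_VERSION-style string."""
    match = _VERSION_RE.match(version_string)
    if match is None:
        raise ValueError(f"unrecognized GL_VERSION string: {version_string!r}")
    return int(match.group(1)), int(match.group(2))


def supports_gl(version_string, major, minor):
    """Return True if the reported version is at least major.minor."""
    return parse_gl_version(version_string) >= (major, minor)
```

| With these helpers, ``supports_gl("4.5 (Core Profile) Mesa 19.1.0", 4, 5)`` |
| is true, while a string that starts with ``3.0`` fails the same check. |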
| |
| SHA256 checksums |
| ---------------- |
| |
| :: |
| |
| 2a6c3af3a803389183168e449c536304cf03e0f82c4c9333077933543b9d02f3 mesa-19.1.0.tar.xz |
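| |
| One way to check a downloaded tarball against the checksum above is |
| sketched below in Python. The helper names ``sha256_of_file`` and |
| ``verify`` are hypothetical, and the file path is whatever location |
| the tarball was saved to. |

```python
import hashlib

# Checksum published in these release notes for mesa-19.1.0.tar.xz.
EXPECTED_SHA256 = (
    "2a6c3af3a803389183168e449c536304cf03e0f82c4c9333077933543b9d02f3"
)


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash a file in chunks so a large tarball need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected value."""
    return sha256_of_file(path) == expected.lower()
```

| Calling ``verify("mesa-19.1.0.tar.xz")`` returns ``True`` only when |
| the file's digest matches the published value. |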
| |
| New features |
| ------------ |
| |
| - GL_ARB_parallel_shader_compile on all drivers. |
| - GL_EXT_gpu_shader4 on all GL 3.1 drivers. |
| - GL_EXT_shader_image_load_formatted on radeonsi. |
| - GL_EXT_texture_buffer_object on all GL 3.1 drivers. |
| - GL_EXT_texture_compression_s3tc_srgb on Gallium drivers and i965 (ES |
| extension). |
| - GL_NV_compute_shader_derivatives on iris and i965. |
| - GL_KHR_parallel_shader_compile on all drivers. |
| - VK_EXT_buffer_device_address on Intel and RADV. |
| - VK_EXT_depth_clip_enable on Intel and RADV. |
| - VK_KHR_ycbcr_image_arrays on Intel. |
| - VK_EXT_inline_uniform_block on Intel and RADV. |
| - VK_EXT_external_memory_host on Intel. |
| - VK_EXT_host_query_reset on Intel and RADV. |
| - VK_KHR_surface_protected_capabilities on Intel and RADV. |
| - VK_EXT_pipeline_creation_feedback on Intel and RADV. |
| - VK_KHR_8bit_storage on RADV. |
| - VK_AMD_gpu_shader_int16 on RADV. |
| - VK_AMD_gpu_shader_half_float on RADV. |
| - VK_NV_compute_shader_derivatives on Intel. |
| - VK_KHR_shader_float16_int8 on Intel and RADV (RADV only supports |
| int8). |
| - VK_KHR_shader_atomic_int64 on Intel. |
| - VK_EXT_descriptor_indexing on Intel. |
| - GL_INTEL_conservative_rasterization on iris. |
| - VK_EXT_memory_budget on Intel. |
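| |
| Which of these extensions is actually exposed depends on the driver |
| and hardware, so applications normally test for them at run time. The |
| Python sketch below assumes the extension names are already available |
| as one space-separated string (as ``glGetString(GL_EXTENSIONS)`` |
| returns in compatibility contexts; core contexts enumerate names with |
| ``glGetStringi`` instead). The helper names are hypothetical. |

```python
def has_extension(extension_string, name):
    """Return True if `name` appears as a whole token in the list.

    Splitting on whitespace avoids the substring pitfall where a short
    name matches the prefix of a longer, different extension name.
    """
    return name in extension_string.split()


def missing_extensions(extension_string, required):
    """Return the required extension names that are not advertised."""
    available = set(extension_string.split())
    return [name for name in required if name not in available]
```

| Exact token matching matters here: a plain substring search for |
| ``GL_EXT_texture_compression_s3tc`` would wrongly succeed on a driver |
| that only advertises ``GL_EXT_texture_compression_s3tc_srgb``. |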
| |
| Bug fixes |
| --------- |
| |
| - `Bug 81843 <https://bugs.freedesktop.org/show_bug.cgi?id=81843>`__ - |
| [SNB IVB HSW] ETC2 textures are not returned as compressed images |
| - `Bug 99781 <https://bugs.freedesktop.org/show_bug.cgi?id=99781>`__ - |
| Some Unity games fail assertion on startup in |
| glXCreateContextAttribsARB |
| - `Bug 100239 <https://bugs.freedesktop.org/show_bug.cgi?id=100239>`__ |
| - Incorrect rendering in CS:GO |
| - `Bug 100316 <https://bugs.freedesktop.org/show_bug.cgi?id=100316>`__ |
| - Linking GLSL 1.30 shaders with invariant and deprecated variables |
| triggers an 'mismatching invariant qualifiers' error |
| - `Bug 104272 <https://bugs.freedesktop.org/show_bug.cgi?id=104272>`__ |
| - [OpenGL CTS] [HSW] |
| KHR-GL46.direct_state_access.textures_compressed_subimage assert |
| fails |
| - `Bug 104355 <https://bugs.freedesktop.org/show_bug.cgi?id=104355>`__ |
| - Ivy Bridge ignores component mappings in texture views |
| - `Bug 104602 <https://bugs.freedesktop.org/show_bug.cgi?id=104602>`__ |
| - [apitrace] Graphical artifacts in Civilization VI on RX Vega |
| - `Bug 107052 <https://bugs.freedesktop.org/show_bug.cgi?id=107052>`__ |
| - [Regression][bisected]. Crookz - The Big Heist Demo can't be |
| launched despite the "true" flag in "drirc" |
| - `Bug 107505 <https://bugs.freedesktop.org/show_bug.cgi?id=107505>`__ |
| - [lars] |
| dEQP-GLES31.functional.geometry_shading.layered#render_with_default_layer_3d |
| failure |
| - `Bug 107510 <https://bugs.freedesktop.org/show_bug.cgi?id=107510>`__ |
| - [GEN8+] up to 10% perf drop on several 3D benchmarks |
| - `Bug 107563 <https://bugs.freedesktop.org/show_bug.cgi?id=107563>`__ |
| - [RADV] Broken rendering in Unity demos |
| - `Bug 107987 <https://bugs.freedesktop.org/show_bug.cgi?id=107987>`__ |
| - [Debug mesa only]. Crash happens when calling drawArrays |
| - `Bug 108250 <https://bugs.freedesktop.org/show_bug.cgi?id=108250>`__ |
| - [GLSL] layout-location-struct.shader_test fails to link |
| - `Bug 108457 <https://bugs.freedesktop.org/show_bug.cgi?id=108457>`__ |
| - [OpenGL CTS] |
| KHR-GL46.tessellation_shader.single.xfb_captures_data_from_correct_stage |
| fails |
| - `Bug 108540 <https://bugs.freedesktop.org/show_bug.cgi?id=108540>`__ |
| - vkAcquireNextImageKHR blocks when timeout=0 in Wayland |
| - `Bug 108766 <https://bugs.freedesktop.org/show_bug.cgi?id=108766>`__ |
| - Mesa built with meson has RPATH entries |
| - `Bug 108824 <https://bugs.freedesktop.org/show_bug.cgi?id=108824>`__ |
| - Invalid handling when GL buffer is bound on one context and |
| invalidated on another |
| - `Bug 108841 <https://bugs.freedesktop.org/show_bug.cgi?id=108841>`__ |
| - [RADV] SPIRV's control flow attributes do not propagate to LLVM |
| - `Bug 108879 <https://bugs.freedesktop.org/show_bug.cgi?id=108879>`__ |
| - [CIK] [regression] All opencl apps hangs indefinitely in |
| si_create_context |
| - `Bug 108999 <https://bugs.freedesktop.org/show_bug.cgi?id=108999>`__ |
| - Calculating the scissors fields when the y is flipped (0 on top) |
| can generate negative numbers that will cause assertion failure later |
| on. |
| - `Bug 109057 <https://bugs.freedesktop.org/show_bug.cgi?id=109057>`__ |
| - texelFetch from GL_TEXTURE_2D_MULTISAMPLE with integer format fails |
| - `Bug 109107 <https://bugs.freedesktop.org/show_bug.cgi?id=109107>`__ |
| - gallium/st/va: change va max_profiles when using Radeon VCN |
| Hardware |
| - `Bug 109216 <https://bugs.freedesktop.org/show_bug.cgi?id=109216>`__ |
| - 4-27% performance drop in Vulkan benchmarks |
| - `Bug 109326 <https://bugs.freedesktop.org/show_bug.cgi?id=109326>`__ |
| - mesa: Meson configuration summary should be printed |
| - `Bug 109328 <https://bugs.freedesktop.org/show_bug.cgi?id=109328>`__ |
| - [BSW BXT GLK] dEQP-VK.subgroups.arithmetic.subgroup regressions |
| - `Bug 109391 <https://bugs.freedesktop.org/show_bug.cgi?id=109391>`__ |
| - LTO Build fails |
| - `Bug 109401 <https://bugs.freedesktop.org/show_bug.cgi?id=109401>`__ |
| - [DXVK] Project Cars rendering problems |
| - `Bug 109404 <https://bugs.freedesktop.org/show_bug.cgi?id=109404>`__ |
| - [ANV] The Witcher 3 shadows flickering |
| - `Bug 109443 <https://bugs.freedesktop.org/show_bug.cgi?id=109443>`__ |
| - Build failure with MSVC when using Scons >= 3.0.2 |
| - `Bug 109451 <https://bugs.freedesktop.org/show_bug.cgi?id=109451>`__ |
| - [IVB,SNB] LINE_STRIPs following a TRIANGLE_FAN fail to use |
| primitive restart |
| - `Bug 109543 <https://bugs.freedesktop.org/show_bug.cgi?id=109543>`__ |
| - After upgrade mesa to 19.0.0~rc1 all vulkan based application stop |
| working ["vulkan-cube" received SIGSEGV in |
| radv_pipeline_init_blend_state at |
| ../src/amd/vulkan/radv_pipeline.c:699] |
| - `Bug 109561 <https://bugs.freedesktop.org/show_bug.cgi?id=109561>`__ |
| - [regression, bisected] code re-factor causing games to stutter or |
| lock-up system |
| - `Bug 109573 <https://bugs.freedesktop.org/show_bug.cgi?id=109573>`__ |
| - dEQP-VK.spirv_assembly.instruction.graphics.module.same_module |
| - `Bug 109575 <https://bugs.freedesktop.org/show_bug.cgi?id=109575>`__ |
| - Mesa-19.0.0-rc1 : Computer Crashes trying to run anything Vulkan |
| - `Bug 109581 <https://bugs.freedesktop.org/show_bug.cgi?id=109581>`__ |
| - [BISECTED] Nothing is Rendered on Sascha Willems' "subpasses" demo |
| - `Bug 109594 <https://bugs.freedesktop.org/show_bug.cgi?id=109594>`__ |
| - totem assert failure: totem: src/intel/genxml/gen9_pack.h:72: |
| \__gen_uint: Assertion \`v <= max' failed. |
| - `Bug 109597 <https://bugs.freedesktop.org/show_bug.cgi?id=109597>`__ |
| - wreckfest issues with transparent objects & skybox |
| - `Bug 109601 <https://bugs.freedesktop.org/show_bug.cgi?id=109601>`__ |
| - [Regression] RuneLite GPU rendering broken on 18.3.x |
| - `Bug 109603 <https://bugs.freedesktop.org/show_bug.cgi?id=109603>`__ |
| - nir_instr_as_deref: Assertion \`parent && parent->type == |
| nir_instr_type_deref' failed. |
| - `Bug 109645 <https://bugs.freedesktop.org/show_bug.cgi?id=109645>`__ |
| - build error on arm64: tegra_screen.c:33: |
| /usr/include/xf86drm.h:41:10: fatal error: drm.h: No such file or |
| directory |
| - `Bug 109646 <https://bugs.freedesktop.org/show_bug.cgi?id=109646>`__ |
| - New video compositor compute shader render glitches mpv |
| - `Bug 109647 <https://bugs.freedesktop.org/show_bug.cgi?id=109647>`__ |
| - /usr/include/xf86drm.h:40:10: fatal error: drm.h: No such file or |
| directory |
| - `Bug 109648 <https://bugs.freedesktop.org/show_bug.cgi?id=109648>`__ |
| - AMD Raven hang during va-api decoding |
| - `Bug 109659 <https://bugs.freedesktop.org/show_bug.cgi?id=109659>`__ |
| - Missing OpenGL symbols in OSMesa Gallium when building with meson |
| - `Bug 109698 <https://bugs.freedesktop.org/show_bug.cgi?id=109698>`__ |
| - dri.pc contents invalid when built with meson |
| - `Bug 109717 <https://bugs.freedesktop.org/show_bug.cgi?id=109717>`__ |
| - [regression] Cull distance tests asserting |
| - `Bug 109735 <https://bugs.freedesktop.org/show_bug.cgi?id=109735>`__ |
| - [Regression] broken font with mesa_vulkan_overlay |
| - `Bug 109738 <https://bugs.freedesktop.org/show_bug.cgi?id=109738>`__ |
| - Child of Light shows only a black screen |
| - `Bug 109739 <https://bugs.freedesktop.org/show_bug.cgi?id=109739>`__ |
| - Mesa build fails when vulkan-overlay-layer option is enabled |
| - `Bug 109742 <https://bugs.freedesktop.org/show_bug.cgi?id=109742>`__ |
| - vdpau state tracker on nv92 started to hit assert after vl compute |
| work |
| - `Bug 109743 <https://bugs.freedesktop.org/show_bug.cgi?id=109743>`__ |
| - Test fails: |
| piglit.spec.arb_sample_shading.arb_sample_shading-builtin-gl-sample-mask-mrt-alpha |
| - `Bug 109747 <https://bugs.freedesktop.org/show_bug.cgi?id=109747>`__ |
| - Add framerate to vulkan-overlay-layer |
| - `Bug 109759 <https://bugs.freedesktop.org/show_bug.cgi?id=109759>`__ |
| - [BISECTED][REGRESSION][IVB, HSW] Font rendering problem in OpenGL |
| - `Bug 109788 <https://bugs.freedesktop.org/show_bug.cgi?id=109788>`__ |
| - vulkan-overlay-layer: Only installs 64bit version |
| - `Bug 109810 <https://bugs.freedesktop.org/show_bug.cgi?id=109810>`__ |
| - nir_opt_copy_prop_vars.c:454: error: unknown field ‘ssa’ specified |
| in initializer |
| - `Bug 109929 <https://bugs.freedesktop.org/show_bug.cgi?id=109929>`__ |
| - tgsi_to_nir.c:2111: undefined reference to |
| \`gl_nir_lower_samplers_as_deref' |
| - `Bug 109944 <https://bugs.freedesktop.org/show_bug.cgi?id=109944>`__ |
| - [bisected] Android build test fails with: utils.c: error: use of |
| undeclared identifier 'PACKAGE_VERSION' |
| - `Bug 109945 <https://bugs.freedesktop.org/show_bug.cgi?id=109945>`__ |
| - pan_assemble.c:51:46: error: passing argument 2 of ‘tgsi_to_nir’ |
| from incompatible pointer type [-Werror=incompatible-pointer-types] |
| - `Bug 109980 <https://bugs.freedesktop.org/show_bug.cgi?id=109980>`__ |
| - [i915 CI][HSW] |
| spec@arb_fragment_shader_interlock@arb_fragment_shader_interlock-image-load-store |
| - fail |
| - `Bug 109984 <https://bugs.freedesktop.org/show_bug.cgi?id=109984>`__ |
| - unhandled VkStructureType |
| VK_STRUCTURE_TYPE_RENDER_PASS_INPUT_ATTACHMENT_ASPECT_CREATE_INFO |
| - `Bug 110134 <https://bugs.freedesktop.org/show_bug.cgi?id=110134>`__ |
| - SIGSEGV while playing large hevc video in mpv |
| - `Bug 110143 <https://bugs.freedesktop.org/show_bug.cgi?id=110143>`__ |
| - Doom 3: BFG Edition - Steam and GOG.com - white flickering screen |
| - `Bug 110201 <https://bugs.freedesktop.org/show_bug.cgi?id=110201>`__ |
| - [ivb] mesa 19.0.0 breaks rendering in kitty |
| - `Bug 110211 <https://bugs.freedesktop.org/show_bug.cgi?id=110211>`__ |
| - If DESTDIR is set to an empty string, the dri drivers are not |
| installed |
| - `Bug 110216 <https://bugs.freedesktop.org/show_bug.cgi?id=110216>`__ |
| - radv: Segfault when compiling compute shaders from Assassin's Creed |
| Odyssey (regression, bisected) |
| - `Bug 110221 <https://bugs.freedesktop.org/show_bug.cgi?id=110221>`__ |
| - build error with meson |
| - `Bug 110239 <https://bugs.freedesktop.org/show_bug.cgi?id=110239>`__ |
| - Mesa SIGABRT: src/intel/genxml/gen9_pack.h:72: \__gen_uint: |
| Assertion \`v <= max' failed |
| - `Bug 110257 <https://bugs.freedesktop.org/show_bug.cgi?id=110257>`__ |
| - Major artifacts in mpeg2 vaapi hw decoding |
| - `Bug 110259 <https://bugs.freedesktop.org/show_bug.cgi?id=110259>`__ |
| - radv: Sampling depth-stencil image in GENERAL layout returns |
| nothing but zero (regression, bisected) |
| - `Bug 110291 <https://bugs.freedesktop.org/show_bug.cgi?id=110291>`__ |
| - Vega 64 GPU hang running Space Engineers |
| - `Bug 110302 <https://bugs.freedesktop.org/show_bug.cgi?id=110302>`__ |
| - [bisected][regression] piglit egl-create-pbuffer-surface and |
| egl-gl-colorspace regressions |
| - `Bug 110305 <https://bugs.freedesktop.org/show_bug.cgi?id=110305>`__ |
| - Iris driver fails ext_packed_depth_stencil-getteximage test |
| - `Bug 110311 <https://bugs.freedesktop.org/show_bug.cgi?id=110311>`__ |
| - [IVB HSW SNB][regression][bisected] regressions on vec4 |
| deqp/gl{es}cts tests |
| - `Bug 110349 <https://bugs.freedesktop.org/show_bug.cgi?id=110349>`__ |
| - radv: Dragon Quest XI (DXVK) has a graphical glitch (regression, |
| bisected) |
| - `Bug 110353 <https://bugs.freedesktop.org/show_bug.cgi?id=110353>`__ |
| - weird colors seen in valley |
| - `Bug 110355 <https://bugs.freedesktop.org/show_bug.cgi?id=110355>`__ |
| - radeonsi: GTK elements become invisible in some applications (GIMP, |
| LibreOffice) |
| - `Bug 110356 <https://bugs.freedesktop.org/show_bug.cgi?id=110356>`__ |
| - install_megadrivers.py creates new dangling symlink [bisected] |
| - `Bug 110404 <https://bugs.freedesktop.org/show_bug.cgi?id=110404>`__ |
| - Iris fails piglit.spec.ext_transform_feedback.immediate-reuse test |
| - `Bug 110422 <https://bugs.freedesktop.org/show_bug.cgi?id=110422>`__ |
| - AMD_DEBUG=forcedma will crash OpenGL apps with SIGFAULT on VegaM |
| 8706G |
| - `Bug 110441 <https://bugs.freedesktop.org/show_bug.cgi?id=110441>`__ |
| - [llvmpipe] complex-loop-analysis-bug regression |
| - `Bug 110443 <https://bugs.freedesktop.org/show_bug.cgi?id=110443>`__ |
| - vaapi/vpp: wrong output for non 64-bytes align width (ex: 1200) |
| - `Bug 110454 <https://bugs.freedesktop.org/show_bug.cgi?id=110454>`__ |
| - [llvmpipe] piglit arb_color_buffer_float-render GL_RGBA8_SNORM |
| failure with llvm-9 |
| - `Bug 110462 <https://bugs.freedesktop.org/show_bug.cgi?id=110462>`__ |
| - Epic Games Launcher renders nothing with "-opengl" option |
| - `Bug 110474 <https://bugs.freedesktop.org/show_bug.cgi?id=110474>`__ |
| - [bisected][regression] vk cts fp16 arithmetic failures |
| - `Bug 110497 <https://bugs.freedesktop.org/show_bug.cgi?id=110497>`__ |
| - [DXVK][Regression][Bisected][SKL] Project Cars 2 crashes with Bug |
| Splat when loading finishes |
| - `Bug 110526 <https://bugs.freedesktop.org/show_bug.cgi?id=110526>`__ |
| - [CTS] dEQP-VK.ycbcr.{conversion,format}.\* fail |
| - `Bug 110530 <https://bugs.freedesktop.org/show_bug.cgi?id=110530>`__ |
| - [CTS] dEQP-VK.ycbcr.format.g8_b8_r8_3plane_420\* reports VM faults |
| on Vega10 |
| - `Bug 110535 <https://bugs.freedesktop.org/show_bug.cgi?id=110535>`__ |
| - [bisected] [icl] GPU hangs on crucible |
| func.miptree.r8g8b8a8-unorm.aspect-color.view-2d.levels01.array01.extent-512x512.upload-copy-with-draw |
| tests |
| - `Bug 110540 <https://bugs.freedesktop.org/show_bug.cgi?id=110540>`__ |
| - [AMD TAHITI XT] valve artifact broken |
| - `Bug 110573 <https://bugs.freedesktop.org/show_bug.cgi?id=110573>`__ |
| - Mesa vulkan-radeon 19.0.3 system freeze and visual artifacts (RADV) |
| - `Bug 110590 <https://bugs.freedesktop.org/show_bug.cgi?id=110590>`__ |
| - [Regression][Bisected] GTAⅣ under wine fails with GLXBadFBConfig |
| - `Bug 110632 <https://bugs.freedesktop.org/show_bug.cgi?id=110632>`__ |
| - "glx: Fix synthetic error generation in \__glXSendError" broke wine |
| games on 32-bit |
| - `Bug 110648 <https://bugs.freedesktop.org/show_bug.cgi?id=110648>`__ |
| - Dota2 will not open using vulkan since 19.0 series |
| - `Bug 110655 <https://bugs.freedesktop.org/show_bug.cgi?id=110655>`__ |
| - VK_LAYER_MESA_OVERLAY_CONFIG=draw,fps renders sporadically |
| - `Bug 110698 <https://bugs.freedesktop.org/show_bug.cgi?id=110698>`__ |
| - tu_device.c:900:4: error: initializer element is not constant |
| - `Bug 110701 <https://bugs.freedesktop.org/show_bug.cgi?id=110701>`__ |
| - GPU faults in Unigine Valley 1.0 |
| - `Bug 110721 <https://bugs.freedesktop.org/show_bug.cgi?id=110721>`__ |
| - graphics corruption on steam client with mesa 19.1.0 rc3 on polaris |
| - `Bug 110761 <https://bugs.freedesktop.org/show_bug.cgi?id=110761>`__ |
| - Huge problems between Mesa and Electron engine apps |
| - `Bug 110784 <https://bugs.freedesktop.org/show_bug.cgi?id=110784>`__ |
| - [regression][bisected] Reverting 'expose 0 shader binary formats |
| for compat profiles for Qt' causes get_program_binary failures on |
| Iris |
| |
| Changes |
| ------- |
| |
| Adam Jackson (1): |
| |
| - drisw: Try harder to probe whether MIT-SHM works |
| |
| Albert Pal (1): |
| |
| - Fix link release notes for 19.0.0. |
| |
| Alejandro Piñeiro (12): |
| |
| - blorp: introduce helper method blorp_nir_init_shader |
| - nir, glsl: move pixel_center_integer/origin_upper_left to |
| shader_info.fs |
| - nir/xfb: add component_offset at nir_xfb_info |
| - nir_types: add glsl_varying_count helper |
| - nir/xfb: adding varyings on nir_xfb_info and gather_info |
| - nir/xfb: sort varyings too |
| - nir_types: add glsl_type_is_struct helper |
| - nir/xfb: handle arrays and AoA of basic types |
| - nir/linker: use nir_gather_xfb_info |
| - nir/linker: fix ARRAY_SIZE query with xfb varyings |
| - nir/xfb: move varyings info out of nir_xfb_info |
| - docs: document MESA_GLSL=errors keyword |
| |
| Alexander von Gluck IV (1): |
| |
| - haiku: Fix hgl dispatch build. Tested under meson/scons. |
| |
| Alexandros Frantzis (1): |
| |
| - virgl: Fake MSAA when max samples is 1 |
| |
| Alok Hota (32): |
| |
| - swr/rast: update SWR rasterizer shader stats |
| - gallium/swr: Param defaults for unhandled PIPE_CAPs |
| - gallium/aux: add PIPE_CAP_MAX_VARYINGS to u_screen |
| - swr/rast: Convert system memory pointers to gfxptr_t |
| - swr/rast: Disable use of \__forceinline by default |
| - swr/rast: Correctly align 64-byte spills/fills |
| - swr/rast: Flip BitScanReverse index calculation |
| - swr/rast: Move knob defaults to generated cpp file |
| - swr/rast: FP consistency between POSH/RENDER pipes |
| - swr/rast: Refactor scratch space variable names |
| - swr/rast: convert DWORD->uint32_t, QWORD->uint64_t |
| - swr/rast: simdlib cleanup, clipper stack space fixes |
| - swr/rast: Add translation support to streamout |
| - swr/rast: bypass size limit for non-sampled textures |
| - swr/rast: Cleanup and generalize gen_archrast |
| - swr/rast: Add initial SWTag proto definitions |
| - swr/rast: Add string handling to AR event framework |
| - swr/rast: Add general SWTag statistics |
| - swr/rast: Fix autotools and scons codegen |
| - swr/rast: Remove deprecated 4x2 backend code |
| - swr/rast: AVX512 support compiled in by default |
| - swr/rast: enforce use of tile offsets |
| - swr/rast: add more llvm intrinsics |
| - swr/rast: update guardband rects at draw setup |
| - swr/rast: add SWR_STATIC_ASSERT() macro |
| - swr/rast: add flat shading |
| - swr/rast: add guards for cpuid on Linux |
| - swr/rast: early exit on empty triangle mask |
| - swr/rast: Cleanup and generalize gen_archrast |
| - swr/rast: Add initial SWTag proto definitions |
| - swr/rast: Add string handling to AR event framework |
| - swr/rast: Add general SWTag statistics |
| |
| Alyssa Rosenzweig (192): |
| |
| - panfrost: Initial stub for Panfrost driver |
| - panfrost: Implement Midgard shader toolchain |
| - meson: Remove panfrost from default driver list |
| - kmsro: Move DRM entrypoints to shared block |
| - panfrost: Use u_pipe_screen_get_param_defaults |
| - panfrost: Check in sources for command stream |
| - panfrost: Include glue for out-of-tree legacy code |
| - kmsro: Silence warning if missing |
| - panfrost: Clean-up one-argument passing quirk |
| - panfrost: Don't hardcode number of nir_ssa_defs |
| - panfrost: Add kernel-agnostic resource management |
| - panfrost: Remove if 0'd dead code |
| - panfrost: Remove speculative if 0'd format bit code |
| - panfrost: Elucidate texture op scheduling comment |
| - panfrost: Specify supported draw modes per-context |
| - panfrost: Fix build; depend on libdrm |
| - panfrost: Backport driver to Mali T600/T700 |
| - panfrost: Identify MALI_OCCLUSION_PRECISE bit |
| - panfrost: Implement PIPE_QUERY_OCCLUSION_COUNTER |
| - panfrost: Don't align framebuffer dims |
| - panfrost: Improve logging and patch memory leaks |
| - panfrost: Fix various leaks unmapping resources |
| - panfrost: Free imported BOs |
| - panfrost: Swap order of tiled texture (de)alloc |
| - panfrost: Cleanup mali_viewport (clipping) code |
| - panfrost: Preserve w sign in perspective division |
| - panfrost: Fix clipping region |
| - panfrost: Stub out separate stencil functions |
| - panfrost: Add pandecode (command stream debugger) |
| - panfrost: Implement pantrace (command stream dump) |
| - panfrost/midgard: Refactor tag lookahead code |
| - panfrost/midgard: Fix nested/chained if-else |
| - panfrost: Rectify doubleplusungood extended branch |
| - panfrost/midgard: Emit extended branches |
| - panfrost: Dynamically set discard branch targets |
| - panfrost: Verify and print brx condition in disasm |
| - panfrost: Use tiler fast path (performance boost) |
| - panfrost/meson: Remove subdir for nondrm |
| - panfrost/nondrm: Flag CPU-invisible regions |
| - panfrost/nondrm: Make COHERENT_LOCAL explicit |
| - panfrost/nondrm: Split out dump_counters |
| - panfrost/midgard: Add fround(_even), ftrunc, ffma |
| - panfrost: Decode render target swizzle/channels |
| - panfrost: Add RGB565, RGB5A1 texture formats |
| - panfrost: Identify 4-bit channel texture formats |
| - panfrost: Expose perf counters in environment |
| - panfrost/midgard: Allow flt to run on most units |
| - panfrost: Import job data structures from v3d |
| - panfrost: Decouple Gallium clear from FBD clear |
| - panfrost: Cleanup cruft related to clears |
| - panfrost/midgard: Don't force constant on VLUT |
| - panfrost: Flush with offscreen rendering |
| - panfrost/midgard: Promote smul to vmul |
| - panfrost/midgard: Preview for data hazards |
| - panfrost: List primitive restart enable bit |
| - panfrost/drm: Cast pointer to u64 to fix warning |
| - panfrost: Cleanup needless if in create_bo |
| - panfrost: Combine has_afbc/tiled in layout enum |
| - panfrost: Delay color buffer setup |
| - panfrost: Determine framebuffer format bits late |
| - panfrost: Allocate dedicated slab for linear BOs |
| - panfrost: Support linear depth textures |
| - panfrost: Document "depth-buffer writeback" bit |
| - panfrost: Identify fragment_extra flags |
| - util: Add a drm_find_modifier helper |
| - v3d: Use shared drm_find_modifier util |
| - vc4: Use shared drm_find_modifier util |
| - freedreno: Use shared drm_find_modifier util |
| - panfrost: Break out fragment to SFBD/MFBD files |
| - panfrost: Remove staging SFBD for pan_context |
| - panfrost: Remove staging MFBD |
| - panfrost: Minor comment cleanup (version detection) |
| - panfrost/mfbd: Implement linear depth buffers |
| - panfrost/mfbd: Respect per-job depth write flag |
| - panfrost: Comment spelling fix |
| - panfrost: Allocate extra data for depth buffer |
| - panfrost; Disable AFBC for depth buffers |
| - panfrost: Compute viewport state on the fly |
| - panfrost/midgard: Implement fpow |
| - panfrost: Workaround buffer overrun with mip level |
| - panfrost: Fix primconvert check |
| - panfrost: Disable PIPE_CAP_TGSI_TEXCOORD |
| - panfrost/decode: Respect primitive size pointers |
| - panfrost: Replay more varying buffers |
| - panfrost: Rewrite varying assembly |
| - panfrost/midgard: Fix b2f32 swizzle for vectors |
| - panfrost: Fix viewports |
| - panfrost: Implement scissor test |
| - panfrost/midgard: Add fcsel_i opcode |
| - panfrost/midgard: Schedule ball/bany to vectors |
| - panfrost/midgard: Add more ball/bany, iabs ops |
| - panfrost/midgard: Map more bany/ball opcodes |
| - panfrost/midgard: Lower bool_to_int32 |
| - panfrost/midgard: Lower f2b32 to fne |
| - panfrost/midgard: Lower i2b32 |
| - panfrost/midgard: Implement b2i; improve b2f/f2b |
| - panfrost/midgard: Lower source modifiers for ints |
| - panfrost/midgard: Cleanup midgard_nir_algebraic.py |
| - panfrost: Stub out ES3 caps/callbacks |
| - panfrost/midgard: Add ult/ule ops |
| - panfrost/midgard: Expand fge lowering to more types |
| - panfrost/midgard: Handle i2b constant |
| - panfrost/midgard: fpow is a two-part operation |
| - panfrost: Preliminary work for mipmaps |
| - panfrost: Fix vertex buffer corruption |
| - panfrost/midgard: Disassemble \`cube\` texture op |
| - panfrost/midgard: Add L/S op for writing cubemap coordinates |
| - panfrost: Preliminary work for cubemaps |
| - panfrost/decode: Decode all cubemap faces |
| - panfrost: Include all cubemap faces in bitmap list |
| - panfrost/midgard: Emit cubemap coordinates |
| - panfrost: Implement command stream for linear cubemaps |
| - panfrost: Extend tiling for cubemaps |
| - panfrost: Implement missing texture formats |
| - panfrost/decode: Print negative_start |
| - panfrost: Clean index state between indexed draws |
| - panfrost: Fix index calculation types and asserts |
| - panfrost: Implement FIXED formats |
| - panfrost: Remove support for legacy kernels |
| - nir: Add "viewport vector" system values |
| - panfrost: Implement system values |
| - panfrost: Cleanup some indirection in pan_resource |
| - panfrost: Respect box->width in tiled stores |
| - panfrost: Size tiled temp buffers correctly |
| - panfrost/decode: Add flags for tilebuffer readback |
| - panfrost: Add tilebuffer load? branch |
| - panfrost/midgard: Add umin/umax opcodes |
| - panfrost/midgard: Add ilzcnt op |
| - panfrost/midgard: Add ibitcount8 op |
| - panfrost/midgard: Enable lower_find_lsb |
| - panfrost: Remove "mali_unknown6" nonsense |
| - panfrost/midgard: Drop dependence on mesa/st |
| - panfrost: Cleanup indexed draw handling |
| - nir: Add nir_lower_viewport_transform |
| - panfrost/midgard: Use shared nir_lower_viewport_transform |
| - panfrost: Track BO lifetime with jobs and reference counts |
| - panfrost: Fixup vertex offsets to prevent shadow copy |
| - panfrost/mdg: Use shared fsign lowering |
| - panfrost/mdg/disasm: Print raw varying_parameters |
| - panfrost/midgard: Pipe through varying arrays |
| - panfrost/midgard: Implement indirect loads of varyings/UBOs |
| - panfrost/midgard: Respect component of bcsel condition |
| - panfrost/midgard: Remove useless MIR dump |
| - panfrost: Respect backwards branches in RA |
| - panfrost/midgard: Don't try to inline constants on branches |
| - panfrost/midgard: imul can only run on \*mul |
| - panfrost: Disable indirect outputs for now |
| - panfrost: Use actual imov instruction |
| - panfrost/midgard: Dead code eliminate MIR |
| - panfrost/midgard: Track loop depth |
| - panfrost/midgard: Fix off-by-one in successor analysis |
| - panfrost/midgard: Remove unused mir_next_block |
| - panfrost/midgard: Update integer op list |
| - panfrost/midgard: Document sign-extension/zero-extension bits |
| (vector) |
| - panfrost/midgard: Set integer mods |
| - panfrost/midgard: Implement copy propagation |
| - panfrost/midgard: Optimize MIR in progress loop |
| - panfrost/midgard: Refactor opcode tables |
| - panfrost/midgard: Add "op commutes?" property |
| - panfrost/midgard: Remove assembler |
| - panfrost/midgard: Reduce fmax(a, 0.0) to fmov.pos |
| - panfrost/midgard: Extend copy propagation pass |
| - panfrost/midgard: Optimize csel involving 0 |
| - panfrost/midgard: Copy prop for texture registers |
| - panfrost/midgard: Identify inand |
| - panfrost/midgard: Add new bitwise ops |
| - Revert "panfrost/midgard: Extend copy propagation pass" |
| - panfrost/midgard: Only copyprop without an outmod |
| - panfrost/midgard: Fix regressions in -bjellyfish |
| - panfrost/midgard: Fix tex propogation |
| - panfrost/midgard: imov workaround |
| - panfrost: Use fp32 (not fp16) varyings |
| - panfrost/midgard: Safety check immediate precision degradations |
| - panfrost: Workaround -bshadow regression |
| - panfrost: Remove shader dump |
| - panfrost/decode: Hit MRT blend shader enable bits |
| - panfrost: Fix blend shader upload |
| - panfrost/midgard: reg_mode_full -> reg_mode_32, etc |
| - panfrost/midgard/disasm: Catch mask errors |
| - panfrost/midgard/disasm: Extend print_reg to 8-bit |
| - panfrost/midgard/disasm: Fill in .int mod |
| - panfrost/midgard: Fix crash on unknown op |
| - panfrost/midgard: Rename ilzcnt8 -> iclz |
| - panfrost/midgard/disasm: Support 8-bit destination |
| - panfrost/midgard/disasm: Print 8-bit sources |
| - panfrost/midgard/disasm: Stub out 64-bit |
| - panfrost/midgard/disasm: Handle dest_override generalized |
| - panfrost: Support RGB565 FBOs |
| - panfrost/midgard: Fix integer selection |
| - panfrost/midgard: Fix RA when temp_count = 0 |
| - panfrost/midgard: Lower mixed csel (NIR) |
| - panfrost/midgard: iabs cannot run on mul |
| |
| Alyssa Ross (1): |
| |
| - get_reviewer.pl: improve portability |
| |
| Amit Pundir (1): |
| |
| - mesa: android: freedreno: build libfreedreno_{drm,ir3} static libs |
| |
| Andre Heider (5): |
| |
| - iris: fix build with gallium nine |
| - iris: improve PIPE_CAP_VIDEO_MEMORY bogus value |
| - iris: add support for tgsi_to_nir |
| - st/nine: enable csmt per default on iris |
| - st/nine: skip position checks in SetCursorPosition() |
| |
| Andreas Baierl (2): |
| |
| - nir: add rcp(w) lowering for gl_FragCoord |
| - lima/ppir: Add gl_FragCoord handling |
| |
| Andres Gomez (12): |
| |
| - mesa: INVALID_VALUE for wrong type or format in Clear*Buffer*Data |
| - gitlab-ci: install distro's ninja |
| - glsl: correctly validate component layout qualifier for dvec{3,4} |
| - glsl/linker: always validate explicit location among inputs |
| - glsl/linker: don't fail non static used inputs without matching |
| outputs |
| - glsl/linker: simplify xfb_offset vs xfb_stride overflow check |
| - Revert "glsl: relax input->output validation for SSO programs" |
| - glsl/linker: location aliasing requires types to have the same width |
| - docs: drop Andres Gomez from the release cycles |
| - glsl/linker: always validate explicit locations for first and last |
| interfaces |
| - docs/relnotes: add support for VK_KHR_shader_float16_int8 |
| - glsl/linker: check for xfb_offset aliasing |
| |
| Andrii Simiklit (5): |
| |
| - i965: consider a 'base level' when calculating width0, height0, |
| depth0 |
| - i965: re-emit index buffer state on a reset option change. |
| - util: clean the 24-bit unused field to avoid an issues |
| - iris: make the TFB result visible to others |
| - egl: return correct error code for a case req ver < 3 with |
| forward-compatible |
| |
| Antia Puentes (1): |
| |
| - nir/linker: Fix TRANSFORM_FEEDBACK_BUFFER_INDEX |
| |
| Anuj Phogat (7): |
| |
| - i965/icl: Add WA_2204188704 to disable pixel shader panic dispatch |
| - anv/icl: Add WA_2204188704 to disable pixel shader panic dispatch |
| - intel: Add Elkhart Lake device info |
| - intel: Add Elkhart Lake PCI-IDs |
| - iris/icl: Set Enabled Texel Offset Precision Fix bit |
| - iris/icl: Add WA_2204188704 to disable pixel shader panic dispatch |
| - intel: Add support for Comet Lake |
| |
| Axel Davy (49): |
| |
| - st/nine: Ignore window size if error |
| - st/nine: Ignore multisample quality level if no ms |
| - st/nine: Disable depth write when nothing gets updated |
| - st/nine: Do not advertise support for D15S1 and D24X4S4 |
| - st/nine: Do not advertise CANMANAGERESOURCE |
| - st/nine: Change a few advertised caps |
| - Revert "d3dadapter9: Support software renderer on any DRI device" |
| - st/nine: Fix D3DWindowBuffer_release for old wine nine support |
| - st/nine: Use FLT_MAX/2 for RCP clamping |
| - st/nine: Upload managed textures only at draw using them |
| - st/nine: Upload managed buffers only at draw using them |
| - st/nine: Fix buffer/texture unbinding in nine_state_clear |
| - st/nine: Finish if nooverwrite after normal mapping |
| - st/nine: Always return OK on SetSoftwareVertexProcessing |
| - st/nine: Enable modifiers on ps 1.X texcoords |
| - st/nine: Ignore nooverwrite for systemmem |
| - st/nine: Fix SINCOS input |
| - st/nine: Optimize surface upload with conversion |
| - st/nine: Optimize volume upload with conversion |
| - st/nine: rename \*_conversion to \*_internal |
| - st/nine: Refactor surface GetSystemMemPointer |
| - st/nine: Refactor volume GetSystemMemPointer |
| - st/nine: Support internal compressed format for surfaces |
| - st/nine: Support internal compressed format for volumes |
| - st/nine: Add drirc option to use data_internal for dynamic textures |
| - drirc: Add Gallium nine workaround for Rayman Legends |
| - st/nine: Recompile optimized shaders based on b/i consts |
| - st/nine: Control shader constant inlining with drirc |
| - st/nine: Regroup param->rel tests |
| - st/nine: Refactor param->rel |
| - st/nine: Compact nine_ff_get_projected_key |
| - st/nine: Compact pixel shader key |
| - st/nine: use helper ureg_DECL_sampler everywhere |
| - st/nine: Manually upload vs and ps constants |
| - st/nine: Refactor shader constants ureg_src computation |
| - st/nine: Make swvp_on imply IS_VS |
| - st/nine: Refactor ct_ctor |
| - st/nine: Track constant slots used |
| - st/nine: Refactor counting of constants |
| - st/nine: Prepare constant compaction in nine_shader |
| - st/nine: Propagate const_range to context |
| - st/nine: Cache constant buffer size |
| - st/nine: Handle const_ranges in nine_state |
| - st/nine: Enable computing const_ranges |
| - st/nine: Use TGSI_SEMANTIC_GENERIC for fog |
| - st/nine: Optimize a bit writeonly buffers |
| - st/nine: Throttle rendering similarly for thread_submit |
| - st/nine: Check discard_delayed_release is set before allocating more |
| - d3dadapter9: Revert to old throttling limit value |
| |
| Bart Oldeman (1): |
| |
| - gallium-xlib: query MIT-SHM before using it. |
| |
| Bas Nieuwenhuizen (105): |
| |
| - radv: Only look at pImmutableSamples if the descriptor has a sampler. |
| - amd/common: Add gep helper for pointer increment. |
| - amd/common: Implement ptr->int casts in ac_to_integer. |
| - radv: Fix the shader info pass for not having the variable. |
| - amd/common: Use correct writemask for shared memory stores. |
| - amd/common: Fix stores to derefs with unknown variable. |
| - amd/common: Handle nir_deref_type_ptr_as_array for shared memory. |
| - amd/common: handle nir_deref_cast for shared memory from integers. |
| - amd/common: Do not use 32-bit loads for shared memory. |
| - amd/common: Implement global memory accesses. |
| - radv: Do not use the bo list for local buffers. |
| - radv: Implement VK_EXT_buffer_device_address. |
| - radv: Use correct num formats to detect whether we should be use 1.0 |
| or 1. |
| - radv: Sync ETC2 whitelisted devices. |
| - radv: Clean up a bunch of compiler warnings. |
| - radv: Handle clip+cull distances more generally as compact arrays. |
| - radv: Implement VK_EXT_depth_clip_enable. |
| - radv: Disable depth clamping even without |
| EXT_depth_range_unrestricted. |
| - radv: Fix float16 interpolation set up. |
| - radv: Allow interpolation on non-float types. |
| - radv: Interpolate less aggressively. |
| - turnip: Add driver skeleton (v2) |
| - turnip: Fix up detection of device. |
| - turnip: Gather some device info. |
| - turnip: Remove abort. |
| - turnip: Fix newly introduced warning. |
| - turnip: Add buffer allocation & mapping support. |
| - turnip: Report a memory type and heap. |
| - turnip: Cargo cult the Intel heap size functionality. |
| - turnip: Initialize memory type in requirements. |
| - turnip: Disable more features. |
| - turnip: Add 630 to the list. |
| - turnip: Fix bo allocation after we stopped using libdrm_freedreno ... |
| - turnip: Fix memory mapping. |
| - turnip: Add image layout calculations. |
| - turnip: Stop hardcoding the msm version check. |
| - turnip: move tu_gem.c to tu_drm.c |
| - turnip: Implement pipe-less param query. |
| - turnip: Implement some format properties for RGBA8. |
| - turnip: Remove some radv leftovers. |
| - turnip: clean up TODO. |
| - turnip: Implement some UUIDs. |
| - turnip: Implement a slow bo list |
| - turnip: Add a command stream. |
| - turnip: Add msm queue support. |
| - turnip: Make bo_list functions not static |
| - turnip: Implement submission. |
| - turnip: Fill command buffer |
| - turnip: Shorten primary_cmd_stream name. |
| - turnip: Add emit functions in a header. |
| - turnip: Move stream functions to tu_cs.c |
| - turnip: Add buffer memory binding. |
| - turnip: Make tu6_emit_event_write shared. |
| - turnip: Add tu6_rb_fmt_to_ifmt. |
| - turnip: Implement buffer->buffer DMA copies. |
| - turnip: Add image->buffer DMA copies. |
| - turnip: Add buffer->image DMA copies. |
| - turnip: Add todo for copies. |
| - turnip: Fix GCC compiles. |
| - turnip: Deconflict vk_format_table regeneration |
| - gitlab-ci: Build turnip. |
| - radeonsi: Remove implicit const cast. |
| - radv: Allow fast clears with concurrent queue mask for some layouts. |
| - vulkan/util: Handle enums that are in platform-specific headers. |
| - vulkan: Update the XML and headers to 1.1.104 |
| - radv: Implement VK_EXT_host_query_reset. |
| - radv: Use correct image view comparison for fast clears. |
| - radv: Implement VK_EXT_pipeline_creation_feedback. |
| - ac/nir: Return frag_coord as integer. |
| - nir: Add access qualifiers on load_ubo intrinsic. |
| - radv: Add non-uniform indexing lowering. |
| - radv: Add bolist RADV_PERFTEST flag. |
| - ac: Move has_local_buffers disable to radeonsi. |
| - radv: Use local buffers for the global bo list. |
| - radv: Support VK_EXT_inline_uniform_block. |
| - radv: Add support for driconf. |
| - vulkan/wsi: Add X11 adaptive sync support based on dri options. |
| - radv: Add adaptive_sync driconfig option and enable it by default. |
| - radv: Add logic for subsampled format descriptions. |
| - radv: Add logic for multisample format descriptions. |
| - radv: Add multiple planes to images. |
| - radv: Add single plane image views & meta operations. |
| - radv: Support different source & dest aspects for planar images in |
| blit2d. |
| - radv: Add ycbcr conversion structs. |
| - radv: Add support for image views with multiple planes. |
| - radv: Allow mixed src/dst aspects in copies. |
| - ac/nir: Add support for planes. |
| - radv: Add ycbcr samplers in descriptor set layouts. |
| - radv: Update descriptor sets for multiple planes. |
| - radv: Add ycbcr lowering pass. |
| - radv: Run the new ycbcr lowering pass. |
| - radv: Add hashing for the ycbcr samplers. |
| - radv: Add ycbcr format features. |
| - radv: Add ycbcr subsampled & multiplane formats to csv. |
| - radv: Enable YCBCR conversion feature. |
| - radv: Expose VK_EXT_ycbcr_image_arrays. |
| - radv: Expose Vulkan 1.1 for Android. |
| - radv: Fix hang with YCBCR array textures. |
| - radv: Set is_array in lowered ycbcr tex instructions. |
| - radv: Restrict YUVY formats to 1 layer. |
| - radv: Disable subsampled formats. |
| - radv: Implement cosited_even sampling. |
| - radv: Do not use extra descriptor space for the 3rd plane. |
| - nir: Actually propagate progress in nir_opt_move_load_ubo. |
| - radv: Prevent out of bound shift on 32-bit builds. |
| |
| Benjamin Gordon (1): |
| |
| - configure.ac/meson.build: Add options for library suffixes |
| |
| Benjamin Tissoires (1): |
| |
| - CI: use wayland ci-templates repo to create the base image |
| |
| Boyan Ding (3): |
| |
| - gk110/ir: Add rcp f64 implementation |
| - gk110/ir: Add rsq f64 implementation |
| - gk110/ir: Use the new rcp/rsq in library |
| |
| Boyuan Zhang (1): |
| |
| - st/va: reverse qt matrix back to its original order |
| |
| Brian Paul (51): |
| |
| - st/mesa: whitespace/formatting fixes in st_cb_texture.c |
| - svga: assorted whitespace and formatting fixes |
| - svga: fix dma.pending > 0 test |
| - mesa: fix display list corner case assertion |
| - st/mesa: whitespace fixes in st_sampler_view.c |
| - st/mesa: line wrapping, whitespace fixes in st_cb_texture.c |
| - st/mesa: whitespace fixes in st_texture.h |
| - svga: init fill variable to avoid compiler warning |
| - svga: silence array out of bounds warning |
| - st/wgl: init a variable to silence MinGW warning |
| - gallium/util: whitespace cleanups in u_bitmask.[ch] |
| - gallium/util: add some const qualifiers in u_bitmask.c |
| - pipebuffer: use new pb_usage_flags enum type |
| - pipebuffer: whitespace fixes in pb_buffer.h |
| - winsys/svga: use new pb_usage_flags enum type |
| - st/mesa: move, clean-up shader variant key decls/inits |
| - st/mesa: whitespace, formatting fixes in st_cb_flush.c |
| - svga: refactor draw_vgpu10() function |
| - svga: remove SVGA_RELOC_READ flag in SVGA3D_BindGBSurface() |
| - pipebuffer: s/PB_ALL_USAGE_FLAGS/PB_USAGE_ALL/ |
| - st/mesa: init hash keys with memset(), not designated initializers |
| - intel/decoders: silence uninitialized variable warnings in |
| gen_print_batch() |
| - intel/compiler: silence uninitialized variable warning in |
| opt_vector_float() |
| - st/mesa: move utility functions, macros into new st_util.h file |
| - st/mesa: move around some code in st_context.c |
| - st/mesa: add/improve sampler view comments |
| - st/mesa: rename st_texture_release_sampler_view() |
| - st/mesa: minor refactoring of texture/sampler delete code |
| - docs: try to improve the Meson documentation (v2) |
| - drisw: fix incomplete type compilation failure |
| - gallium/winsys/kms: fix incomplete type compilation failure |
| - nir: silence a couple new compiler warnings |
| - docs: separate information for compiler selection and compiler |
| options |
| - docs: link to the meson_options.txt file gitlab.freedesktop.org |
| - st/mesa: implement "zombie" sampler views (v2) |
| - st/mesa: implement "zombie" shaders list |
| - st/mesa: stop using pipe_sampler_view_release() |
| - svga: stop using pipe_sampler_view_release() |
| - llvmpipe: stop using pipe_sampler_view_release() |
| - swr: remove call to pipe_sampler_view_release() |
| - i915g: remove calls to pipe_sampler_view_release() |
| - gallium/util: remove pipe_sampler_view_release() |
| - nir: fix a few signed/unsigned comparison warnings |
| - st/mesa: fix texture deletion context mix-up issues (v2) |
| - nir: use {0} initializer instead of {} to fix MSVC build |
| - util: no-op \__builtin_types_compatible_p() for non-GCC compilers |
| - docs: s/Aptril/April/ |
| - llvmpipe: init some vars to NULL to silence MinGW compiler warnings |
| - glsl: work around MinGW 7.x compiler bug |
| - svga: add SVGA_NO_LOGGING env var (v2) |
| - glsl: fix typo in #warning message |
| |
| Caio Marcelo de Oliveira Filho (61): |
| |
| - nir: keep the phi order when splitting blocks |
| - i965: skip bit6 swizzle detection in Gen8+ |
| - anv: skip bit6 swizzle detection in Gen8+ |
| - isl: assert that Gen8+ don't have bit6_swizzling |
| - intel/compiler: use 0 as sampler in emit_mcs_fetch |
| - nir: fix example in opt_peel_loop_initial_if description |
| - iris: Fix uses of gl_TessLevel\* |
| - iris: Add support for TCS passthrough |
| - iris: always include an extra constbuf0 if using UBOs |
| - nir/copy_prop_vars: don't get confused by array_deref of vectors |
| - nir/copy_prop_vars: add debug helpers |
| - nir/copy_prop_vars: keep track of components in copy_entry |
| - nir/copy_prop_vars: change test helper to get intrinsics |
| - nir: nir_build_deref_follower accept array derefs of vectors |
| - nir/copy_prop_vars: add tests for load/store elements of vectors |
| - nir: fix MSVC build |
| - st/nir: count num_uniforms for FS builtin shader |
| - nir/copy_prop_vars: rename/refactor store_to_entry helper |
| - nir/copy_prop_vars: use NIR_MAX_VEC_COMPONENTS |
| - nir/copy_prop_vars: handle load/store of vector elements |
| - nir/copy_prop_vars: add tests for indirect array deref |
| - nir/copy_prop_vars: prefer using entries from equal derefs |
| - nir/copy_prop_vars: handle indirect vector elements |
| - anv: Implement VK_EXT_external_memory_host |
| - nir: Add a pass to combine store_derefs to same vector |
| - intel/nir: Combine store_derefs after vectorizing IO |
| - intel/nir: Combine store_derefs to improve code from SPIR-V |
| - nir: Handle array-deref-of-vector case in loop analysis |
| - spirv: Add an execution environment to the options |
| - intel/compiler: handle GLSL_TYPE_INTERFACE as GLSL_TYPE_STRUCT |
| - spirv: Use interface type for block and buffer block |
| - iris: Clean up compiler warnings about unused |
| - nir: Take if_uses into account when repairing SSA |
| - mesa: Extension boilerplate for NV_compute_shader_derivatives |
| - glsl: Remove redundant conditions when asserting in_qualifier |
| - glsl: Enable derivative builtins for NV_compute_shader_derivatives |
| - glsl: Enable texture builtins for NV_compute_shader_derivatives |
| - glsl: Parse and propagate derivative_group to shader_info |
| - nir/algebraic: Lower CS derivatives to zero when no group defined |
| - nir: Don't set LOD=0 for compute shader that has derivative group |
| - intel/fs: Use TEX_LOGICAL whenever implicit lod is supported |
| - intel/fs: Add support for CS to group invocations in quads |
| - intel/fs: Don't loop when lowering CS intrinsics |
| - intel/fs: Use NIR_PASS_V when lowering CS intrinsics |
| - i965: Advertise NV_compute_shader_derivatives |
| - gallium: Add PIPE_CAP_COMPUTE_SHADER_DERIVATIVES |
| - iris: Enable NV_compute_shader_derivatives |
| - spirv: Add support for DerivativeGroup capabilities |
| - anv: Implement VK_NV_compute_shader_derivatives |
| - docs: Add NV_compute_shader_derivatives to 19.1.0 relnotes |
| - spirv: Add more to_string helpers |
| - spirv: Tell which opcode or value is unhandled when failing |
| - spirv: Rename vtn_decoration literals to operands |
| - spirv: Handle SpvOpDecorateId |
| - nir: Add option to lower tex to txl when shader don't support |
| implicit LOD |
| - intel/fs: Don't handle texop_tex for shaders without implicit LOD |
| - spirv: Properly handle SpvOpAtomicCompareExchangeWeak |
| - intel/fs: Assert when brw_fs_nir sees a nir_deref_instr |
| - anv: Fix limits when VK_EXT_descriptor_indexing is used |
| - nir: Fix nir_opt_idiv_const when negatives are involved |
| - nir: Fix clone of nir_variable state slots |
| |
| Carlos Garnacho (1): |
| |
| - wayland/egl: Ensure EGL surface is resized on DRI update_buffers() |
| |
| Chad Versace (17): |
| |
| - turnip: Drop Makefile.am and Android.mk |
| - turnip: Fix indentation in function signatures |
| - turnip: Fix result of vkEnumerate*LayerProperties |
| - turnip: Fix result of vkEnumerate*ExtensionProperties |
| - turnip: Use vk_outarray in all relevant public functions |
| - turnip: Fix a real -Wmaybe-uninitialized |
| - turnip: Fix indentation |
| - turnip: Require DRM device version >= 1.3 |
| - turnip: Add TODO for Android logging |
| - turnip: Use vk_errorf() for initialization error messages |
| - turnip: Replace fd_bo with tu_bo |
| - turnip: Add TODO file |
| - turnip: Fix 'unused' warnings |
| - turnip: Don't return from tu_stub funcs |
| - turnip: Annotate vkGetImageSubresourceLayout with tu_stub |
| - turnip: Fix error behavior for |
| VkPhysicalDeviceExternalImageFormatInfo |
| - turnip: Use Vulkan 1.1 names instead of KHR |
| |
| Charmaine Lee (5): |
| |
| - svga: add svga shader type in the shader variant |
| - svga: move host logging to winsys |
| - st/mesa: purge framebuffers with current context after unbinding |
| winsys buffers |
| - mesa: unreference current winsys buffers when unbinding winsys |
| buffers |
| - svga: Remove unnecessary check for the pre flush bit for setting |
| vertex buffers |
| |
| Chenglei Ren (1): |
| |
| - anv/android: fix missing dependencies issue during parallel build |
| |
| Chia-I Wu (78): |
| |
| - egl: fix KHR_partial_update without EXT_buffer_age |
| - turnip: add .clang-format |
| - turnip: use msm_drm.h from inc_freedreno |
| - turnip: remove unnecessary libfreedreno_drm dep |
| - turnip: add wrappers around DRM_MSM_GET_PARAM |
| - turnip: add wrappers around DRM_MSM_SUBMITQUEUE\_\* |
| - turnip: constify tu_device in tu_gem\_\* |
| - turnip: preliminary support for tu_QueueWaitIdle |
| - turnip: run sed and clang-format on tu_cs |
| - turnip: document tu_cs |
| - turnip: add tu_cs_add_bo |
| - turnip: minor cleanup to tu_cs_end |
| - turnip: update cs->start in tu_cs_end |
| - turnip: inline tu_cs_check_space |
| - turnip: add more tu_cs helpers |
| - turnip: build drm_msm_gem_submit_bo array directly |
| - turnip: add tu_bo_list_merge |
| - turnip: add cmdbuf->bo_list to bo_list in queue submit |
| - turnip: preliminary support for tu_BindImageMemory2 |
| - turnip: preliminary support for tu_image_view_init |
| - turnip: preliminary support for tu_CmdBeginRenderPass |
| - turnip: add tu_cs_reserve_space(_assert) |
| - turnip: emit HW init in tu_BeginCommandBuffer |
| - turnip: preliminary support for tu_GetRenderAreaGranularity |
| - turnip: add tu_tiling_config |
| - turnip: add internal helpers for tu_cs |
| - turnip: add tu_cs_{reserve,add}_entry |
| - turnip: specify initial size in tu_cs_init |
| - turnip: never fail tu_cs_begin/tu_cs_end |
| - turnip: add tu_cs_sanity_check |
| - turnip: provide both emit_ib and emit_call |
| - turnip: add tu_cs_mode |
| - turnip: add TU_CS_MODE_SUB_STREAM |
| - turnip: preliminary support for loadOp and storeOp |
| - turnip: add a more complete format table |
| - turnip: add functions to import/export prime fd |
| - turnip: advertise VK_KHR_external_memory_capabilities |
| - turnip: advertise VK_KHR_external_memory |
| - turnip: add support for VK_KHR_external_memory_{fd,dma_buf} |
| - turnip: fix VkClearValue packing |
| - turnip: preliminary support for fences |
| - turnip: respect color attachment formats |
| - turnip: mark IBs for dumping |
| - turnip: use 32-bit offset in tu_cs_entry |
| - turnip: more/better asserts for tu_cs |
| - turnip: add tu_cs_discard_entries |
| - turnip: tu_cs_emit_array |
| - turnip: fix tu_cs sub-streams |
| - turnip: simplify tu_cs sub-streams usage |
| - turnip: create a less dummy pipeline |
| - turnip: parse VkPipelineDynamicStateCreateInfo |
| - turnip: parse VkPipelineInputAssemblyStateCreateInfo |
| - turnip: parse VkPipelineViewportStateCreateInfo |
| - turnip: parse VkPipelineRasterizationStateCreateInfo |
| - turnip: parse VkPipelineDepthStencilStateCreateInfo |
| - turnip: parse VkPipeline{Multisample,ColorBlend}StateCreateInfo |
| - turnip: preliminary support for shader modules |
| - turnip: compile VkPipelineShaderStageCreateInfo |
| - turnip: parse VkPipelineShaderStageCreateInfo |
| - turnip: parse VkPipelineVertexInputStateCreateInfo |
| - turnip: add draw_cs to tu_cmd_buffer |
| - turnip: preliminary support for draw state binding |
| - turnip: preliminary support for tu_CmdDraw |
| - turnip: guard -Dvulkan-driver=freedreno |
| - turnip: preliminary support for tu_GetImageSubresourceLayout |
| - turnip: preliminary support for Wayland WSI |
| - vulkan/wsi: move modifier array into wsi_wl_swapchain |
| - vulkan/wsi: create wl_drm wrapper as needed |
| - vulkan/wsi: refactor drm_handle_format |
| - vulkan/wsi: add wsi_wl_display_drm |
| - vulkan/wsi: add wsi_wl_display_dmabuf |
| - vulkan/wsi: make wl_drm optional |
| - virgl: handle fence_server_sync in winsys |
| - virgl: hide fence internals from the driver |
| - virgl: introduce virgl_drm_fence |
| - virgl: fix fence fd version check |
| - virgl: clear vertex_array_dirty |
| - virgl: skip empty cmdbufs |
| |
| Chris Forbes (3): |
| |
| - glsl: add scaffolding for EXT_gpu_shader4 |
| - glsl: enable noperspective|flat|centroid for EXT_gpu_shader4 |
| - glsl: enable types for EXT_gpu_shader4 |
| |
| Chris Wilson (19): |
| |
| - i965: Assert the execobject handles match for this device |
| - iris: fix import from dri2/3 |
| - iris: IndexFormat = size/2 |
| - iris: Set resource modifier on handle |
| - iris: Wrap userptr for creating bo |
| - iris: AMD_pinned_memory |
| - iris: Record reusability of bo on construction |
| - iris: fix memzone_for_address since multibinder changes |
| - iris: Tidy exporting the flink handle |
| - iris: Fix assigning the output handle for exporting for KMS |
| - iris: Merge two walks of the exec_bos list |
| - iris: Tag each submitted batch with a syncobj |
| - iris: Add fence support using drm_syncobj |
| - iris: Wire up EGL_IMG_context_priority |
| - iris: Use PIPE_BUFFER_STAGING for the query objects |
| - iris: Use coherent allocation for PIPE_RESOURCE_STAGING |
| - iris: Use streaming loads to read from tiled surfaces |
| - iris: Push heavy memchecker code to DEBUG |
| - iris: Adapt to variable ppGTT size |
| |
| Christian Gmeiner (12): |
| |
| - etnaviv: rs: mark used src resource as read from |
| - etnaviv: blt: mark used src resource as read from |
| - etnaviv: implement ETC2 block patching for HALTI0 |
| - etnaviv: keep track of mapped bo address |
| - etnaviv: hook-up etc2 patching |
| - etnaviv: enable ETC2 texture compression support for HALTI0 GPUs |
| - etnaviv: fix resource usage tracking across different pipe_context's |
| - etnaviv: fix compile warnings |
| - st/dri: allow direct UYVY import |
| - etnaviv: shrink struct etna_3d_state |
| - nir: add lower_ftrunc |
| - etnaviv: use the correct uniform dirty bits |
| |
| Chuck Atkins (1): |
| |
| - meson: Fix missing glproto dependency for gallium-glx |
| |
| Connor Abbott (6): |
| |
| - nir/serialize: Prevent writing uninitialized state_slot data |
| - nir: Add a stripping pass for improved cacheability |
| - radeonsi/nir: Use nir stripping pass |
| - nir/search: Add automaton-based pre-searching |
| - nir/search: Add debugging code to dump the pattern matched |
| - nir/algebraic: Don't emit empty initializers for MSVC |
| |
| Daniel Schürmann (2): |
| |
| - nir: Define shifts according to SM5 specification. |
| - nir: Use SM5 properties to optimize shift(a@32, iand(31, b)) |
| |
| Daniel Stone (2): |
| |
| - panfrost: Properly align stride |
| - vulkan/wsi/wayland: Respect non-blocking AcquireNextImage |
| |
| Danylo Piliaiev (13): |
| |
| - anv: Handle VK_ATTACHMENT_UNUSED in colorAttachment |
| - radv: Handle VK_ATTACHMENT_UNUSED in CmdClearAttachment |
| - anv: Fix VK_EXT_transform_feedback working with varyings packed in |
| PSIZ |
| - anv: Fix destroying descriptor sets when pool gets reset |
| - anv: Treat zero size XFB buffer as disabled |
| - glsl: Cross validate variable's invariance by explicit invariance |
| only |
| - i965,iris,anv: Make alpha to coverage work with sample mask |
| - intel/fs: Make alpha test work with MRT and sample mask |
| - st/mesa: Fix GL_MAP_COLOR with glDrawPixels GL_COLOR_INDEX |
| - iris: Fix assert when using vertex attrib without buffer binding |
| - intel/compiler: Do not reswizzle dst if instruction writes to flag |
| register |
| - drirc: Add workaround for Epic Games Launcher |
| - anv: Do not emulate texture swizzle for INPUT_ATTACHMENT, |
| STORAGE_IMAGE |
| |
| Dave Airlie (63): |
| |
| - virgl: enable elapsed time queries |
| - virgl: ARB_query_buffer_object support |
| - docs: update qbo support for virgl |
| - glsl: glsl to nir fix uninit class member. |
| - radv/llvm: initialise passes member. |
| - radv: remove alloc parameter from pipeline init |
| - iris: fix some hangs around null framebuffers |
| - iris: fix crash in sparse vertex array |
| - iris: add initial transform feedback overflow query paths (V3) |
| - iris: fix cube texture view |
| - iris: execute compute related query on compute batch. |
| - iris: iris add load register reg32/64 |
| - iris: add conditional render support |
| - iris: fix gpu calcs for timestamp queries |
| - iris/WIP: add broadwell support |
| - iris: limit gen8 to 8 samples |
| - iris: setup gen8 caps |
| - iris: add fs invocations query workaround for broadwell |
| - iris: handle qbo fragment shader invocation workaround |
| - st/mesa: add support for lowering fp64/int64 for nir drivers |
| - softpipe: fix texture view crashes |
| - nir/spirv: don't use bare types, remove assert in split vars for |
| testing |
| - nir/deref: remove casts of casts which are likely redundant (v3) |
| - softpipe: fix 32-bit bitfield extract |
| - softpipe: handle 32-bit bitfield inserts |
| - softpipe: remove shadow_ref assert. |
| - softpipe: fix integer texture swizzling for 1 vs 1.0f |
| - nir/split_vars: fixup some more explicit_stride related issues. |
| - draw: bail instead of assert on instance count (v2) |
| - draw/gs: fix point size outputs from geometry shader. |
| - draw/vs: partly fix basevertex/vertex id |
| - softpipe: fix clears to only clear specified color buffers. |
| - softpipe/draw: fix vertex id in soft paths. |
| - softpipe: add indirect store buffer/image unit |
| - nir/deref: fix struct wrapper casts. (v3) |
| - nir: use proper array sizing define for vectors |
| - intel/compiler: use defined size for vector components |
| - iris: avoid use after free in shader destruction |
| - ddebug: add compute functions to help hang detection |
| - draw: add stream member to stats callback |
| - tgsi: add support for geometry shader streams. |
| - softpipe: add support for indexed queries. |
| - draw: add support to tgsi paths for geometry streams. (v2) |
| - softpipe: add support for vertex streams (v2) |
| - virgl: add support for missing command buffer binding. |
| - virgl: add support for ARB_multi_draw_indirect |
| - virgl: add support for ARB_indirect_parameters |
| - draw: fix undefined shift of (1 << 31) |
| - swrast: fix undefined shift of 1 << 31 |
| - llvmpipe: fix undefined shift 1 << 31. |
| - virgl/drm: cleanup buffer from handle creation (v2) |
| - virgl/drm: handle flink name better. |
| - virgl/drm: insert correct handles into the table. (v3) |
| - intel/compiler: fix uninit non-static variable. (v2) |
| - nir: fix bit_size in lower indirect derefs. |
| - r600: reset tex array override even when no view bound |
| - spirv: fix SpvOpBitSize return value. |
| - nir: fix lower vars to ssa for larger vector sizes. |
| - util/tests: add basic unit tests for bitset |
| - util/bitset: fix bitset range mask calculations. |
| - kmsro: add \_dri.so to two of the kmsro drivers. |
| - glsl: init packed in more constructors. |
| - Revert "mesa: unreference current winsys buffers when unbinding |
| winsys buffers" |
| |
| David Riley (3): |
| |
| - virgl: Store mapped hw resource with transfer object. |
| - virgl: Allow transfer queue entries to be found and extended. |
| - virgl: Re-use and extend queue transfers for intersecting buffer |
| subdatas. |
| |
| David Shao (1): |
| |
| - meson: ensure that xmlpool_options.h is generated for gallium targets |
| that need it |
| |
| Deepak Rawat (2): |
| |
| - winsys/drm: Fix out of scope variable usage |
| - winsys/svga/drm: Fix 32-bit RPCI send message |
| |
| Dominik Drees (1): |
| |
| - Add no_aos_sampling GALLIVM_PERF option |
| |
| Drew Davenport (1): |
| |
| - util: Don't block SIGSYS for new threads |
| |
| Dylan Baker (40): |
| |
| - bump version for 19.0 branch |
| - docs: Add relnotes stub for 19.1 |
| - gallium: wrap u_screen in extern "C" for c++ |
| - automake: Add --enable-autotools to distcheck flags |
| - android,autotools,i965: Fix location of float64_glsl.h |
| - meson: remove build_by_default : true |
| - meson: fix style in intel/tools |
| - meson: remove -std=c++11 from intel/tools |
| - get-pick-list: Add --pretty=medium to the arguments for Cc patches |
| - meson: Add dependency on genxml to anvil |
| - meson/iris: Use current coding style |
| - docs: Add release notes for 19.0.0 |
| - docs: Add SHA256 sums for 19.0.0 |
| - docs: update calendar, add news item, and link release notes for |
| 19.0.0 |
| - bin/install_megadrivers.py: Correctly handle DESTDIR='' |
| - bin/install_megadrivers.py: Fix regression for set DESTDIR |
| - docs: Add release notes for 19.0.1 |
| - docs: Add SHA256 sums for mesa 19.0.1 |
| - docs: update calendar, add news item and link release notes for |
| 19.0.1 |
| - meson: Error if LLVM doesn't have rtti when building clover |
| - meson: Error if LLVM is turned off but clover is turned on |
| - docs: Add release notes for 19.0.2 |
| - docs: Add sha256 sums for 19.0.2 |
| - docs: update calendar, and news item and link release notes for |
| 19.0.2 |
| - Delete autotools |
| - docs: drop most autoconf references |
| - ci: Delete autotools build jobs |
| - docs: add relnotes for 19.0.3 |
| - docs: Add SHA256 sums for mesa 19.0.3 |
| - docs: update calendar, and news item and link release notes for |
| 19.0.3 |
| - meson: always define libglapi |
| - glsl: fix general_ir_test with mingw |
| - meson: switch gles1 and gles2 to auto options |
| - meson: Make shader-cache a trillean instead of boolean |
| - meson: make nm binary optional |
| - util/tests: Use define instead of VLA |
| - glsl/tests: define ssize_t on windows |
| - tests/vma: fix build with MSVC |
| - meson: Don't build glsl cache_test when shader cache is disabled |
| - meson: Force the use of config-tool for llvm |
| |
| Eduardo Lima Mitev (5): |
| |
| - freedreno/a6xx: Silence compiler warnings |
| - nir: Add ir3-specific version of most SSBO intrinsics |
| - ir3/nir: Add a new pass 'ir3_nir_lower_io_offsets' |
| - ir3/compiler: Enable lower_io_offsets pass and handle new SSBO |
| intrinsics |
| - ir3/lower_io_offsets: Try propagate SSBO's SHR into a previous shift |
| instruction |
| |
| El Christianito (1): |
| |
| - drirc: add Budgie WM to adaptive-sync blacklist |
| |
| Eleni Maria Stea (6): |
| |
| - i965: Faking the ETC2 compression on Gen < 8 GPUs using two miptrees. |
| - i965: Fixed the CopyImageSubData for ETC2 on Gen < 8 |
| - i965: Enabled the OES_copy_image extension on Gen 7 GPUs |
| - i965: Removed the field etc_format from the struct intel_mipmap_tree |
| - i965: fixed clamping in set_scissor_bits when the y is flipped |
| - radv: consider MESA_VK_VERSION_OVERRIDE when setting the api version |
| |
| Elie Tournier (3): |
| |
| - virgl: Add a caps to advertise GLES backend |
| - virgl: Set PIPE_CAP_DOUBLES when running on GLES This is a lie but no |
| known app use fp64. |
| - virgl: Return an error if we use fp64 on top of GLES |
| |
| Emil Velikov (30): |
| |
| - vc4: Declare the last cpu pointer as being modified in NEON asm. |
| - docs: add release notes for 18.3.3 |
| - docs: add sha256 checksums for 18.3.3 |
| - docs: update calendar, add news item and link release notes for |
| 18.3.3 |
| - anv: wire up the state_pool_padding test |
| - docs: add release notes for 18.3.4 |
| - docs: add sha256 checksums for 18.3.4 |
| - docs: update calendar, add news item and link release notes for |
| 18.3.4 |
| - egl/dri: de-duplicate dri2_load_driver\* |
| - meson: egl: correctly manage loader/xmlconfig |
| - loader: use loader_open_device() to handle O_CLOEXEC |
| - egl/android: bump the number of drmDevices to 64 |
| - docs: mention "Allow commits from members who can merge..." |
| - egl/sl: split out swrast probe into separate function |
| - egl/sl: use drmDevice API to enumerate available devices |
| - egl/sl: use kms_swrast with vgem instead of a random GPU |
| - docs: add release notes for 18.3.5 |
| - docs: add sha256 checksums for 18.3.5 |
| - docs: update calendar, add news item and link release notes for |
| 18.3.5 |
| - docs: add release notes for 18.3.6 |
| - docs: add sha256 checksums for 18.3.6 |
| - docs: update calendar, add news item and link release notes for |
| 18.3.6 |
| - turnip: drop dead close(master_fd) |
| - vulkan/wsi: check if the display_fd given is master |
| - vulkan/wsi: don't use DUMB_CLOSE for normal GEM handles |
| - llvmpipe: add lp_fence_timedwait() helper |
| - llvmpipe: correctly handle waiting in llvmpipe_fence_finish |
| - egl/dri: flesh out and use dri2_create_drawable() |
| - mapi: add static_data offset to MaxShaderCompilerThreadsKHR |
| - mapi: correctly handle the full offset table |
| |
| Emmanuel Gil Peyrot (1): |
| |
| - docs: make bugs.html easier to find |
| |
| Eric Anholt (121): |
| |
| - v3d: Always enable the NEON utile load/store code. |
| - v3d: Fix a release build set-but-unused compiler warning. |
| - mesa: Skip partial InvalidateFramebuffer of packed depth/stencil. |
| - v3d: Fix image_load_store clamping of signed integer stores. |
| - nir: Move V3D's "the shader was TGSI, ignore FS output types" flag to |
| NIR. |
| - v3d: Fix precompile of FRAG_RESULT_DATA1 and higher outputs. |
| - v3d: Store the actual mask of color buffers present in the key. |
| - v3d: Fix dumping of shaders with alpha test. |
| - v3d: Fix pack/unpack of VFPACK operand unpacks. |
| - v3d: Fix input packing of .l for rounding/fdx/fdy. |
| - v3d: Fix copy-propagation of input unpacks. |
| - v3d: Whitespace consistency fix. |
| - nir: Move panfrost's isign lowering to nir_opt_algebraic. |
| - v3d: Use the NIR lowering for isign instead of rolling our own. |
| - intel: Use the NIR lowering for isign. |
| - freedreno: Use the NIR lowering for isign. |
| - v3d: Clear the GMP on initialization of the simulator. |
| - v3d: Sync indirect draws on the last rendering. |
| - v3d: Use the early_fragment_tests flag for the shader's disable-EZ |
| field. |
| - v3d: Fix incorrect flagging of ldtmu as writing r4 on v3d 4.x. |
| - v3d: Drop a perf note about merging unpack_half_*, which has been |
| implemented. |
| - v3d: Drop our hand-lowered nir_op_ffract. |
| - v3d: Add a helper function for getting a nop register. |
| - v3d: Refactor bcsel and if condition handling. |
| - v3d: Do bool-to-cond for discard_if as well. |
| - v3d: Kill off vir_PF(), which is hard to use right. |
| - v3d: Fix f2b32 behavior. |
| - v3d: Fix the check for "is the last thrsw inside control flow" |
| - v3d: Add a function to describe what the c->execute.file check means. |
| - v3d: Stop tracking num_inputs for VPM loads. |
| - v3d: Delay emitting ldvpm on V3D 4.x until it's actually used. |
| - v3d: Emit a simpler negate for the iabs implementation. |
| - v3d: Move i2b and f2b support into emit_comparison. |
| - kmsro: Add the rest of the current set of tinydrm drivers. |
| - nir: Just return when asked to rewrite uses of an SSA def to itself. |
| - v3d: Fix vir_is_raw_mov() for input unpacks. |
| - v3d: Dump the VIR after register spilling if we were forced to. |
| - v3d: Rematerialize MOVs of uniforms instead of spilling them. |
| - v3d: Fix build of NEON code with Mesa's cflags not targeting NEON. |
| - v3d: Restrict live intervals to the blocks reachable from any def. |
| - v3d: Stop treating exec masking specially. |
| - nir: Improve printing of load_input/store_output variable names. |
| - v3d: Translate f2i(fround_even) as FTOIN. |
| - v3d: Move the stores for fixed function VS output reads into NIR. |
| - v3d: Fix temporary leaks of temp_registers and when spilling. |
| - v3d: Do uniform rematerialization spilling before dropping |
| threadcount |
| - v3d: Switch implicit uniforms over to being any qinst->uniform != ~0. |
| - v3d: Add support for vir-to-qpu of ldunif instructions to a temp. |
| - v3d: Drop the old class bits splitting up the accumulators. |
| - v3d: Add support for register-allocating a ldunif to a QFILE_TEMP. |
| - v3d: Use ldunif instructions for uniforms. |
| - v3d: Eliminate the TLB and TLBU files. |
| - v3d: Drop the V3D 3.x vpm read dead code elimination. |
| - v3d: Include a count of register pressure in the RA failure dumps. |
| - st/dri: Set the PIPE_BIND_SHARED flag on create_image_with_modifiers. |
| - util: Add a DAG datastructure. |
| - vc4: Switch over to using the DAG datastructure for QIR scheduling. |
| - v3d: Reuse list_for_each_entry_rev(). |
| - vc4: Reuse list_for_each_entry_rev(). |
| - v3d: Use the DAG datastructure for QPU instruction scheduling. |
| - vc4: Switch the post-RA scheduler over to the DAG datastructure. |
| - v3d: Disable PIPE_CAP_BLIT_BASED_TEXTURE_TRANSFER. |
| - v3d: Fix leak of the mem_ctx after the DAG refactor. |
| - v3d: Fix leak of the renderonly struct on screen destruction. |
| - mesa/st: Make sure that prog_to_nir NIR gets freed. |
| - mesa/st: Fix leaks of TGSI tokens in VP variants. |
| - v3d: Always lay out shared tiled buffers with UIF_TOP set. |
| - v3d: Allow the UIF modifier with renderonly. |
| - v3d: Expose the dma-buf modifiers query. |
| - v3d: Rename v3d_tmu_config_data to v3d_unit_data. |
| - v3d: Move constant offsets to UBO addresses into the main uniform |
| stream. |
| - v3d: Upload all of UBO[0] if any indirect load occurs. |
| - v3d: Remove some dead members of struct v3d_compile. |
| - egl: Add a 565 pbuffer-only EGL config under X11. |
| - dri3: Return the current swap interval from glXGetSwapIntervalMESA(). |
| - v3d: Add support for handling OOM signals from the simulator. |
| - v3d: Bump the maximum texture size to 4k for V3D 4.x. |
| - v3d: Don't try to use the TFU blit path if a scissor is enabled. |
| - v3d: Add some more new packets for V3D 4.x. |
| - st: Lower uniforms in st in the !PIPE_CAP_PACKED_UNIFORMS case as |
| well. |
| - vc4: Don't forget to set the range when scalarizing our uniforms. |
| - vc4: Split UBO0 and UBO1 address uniform handling. |
| - vc4: Upload CS/VS UBO uniforms together. |
| - v3d: Add an optimization pass for redundant flags updates. |
| - nir: Drop comments about the constant_index slots for load/stores. |
| - nir: Drop remaining references to const_index in favor of the call to |
| use. |
| - nir: Add a comment about how intrinsic definitions work. |
| - v3d: Add and use a define for the number of channels in a QPU |
| invocation. |
| - v3d: Drop a note for the future about PIPE_CAP_PACKED_UNIFORMS. |
| - v3d: Include the number of max temps used in the shader-db output. |
| - v3d: Replace the old shader-db env var output with the |
| ARB_debug_output. |
| - v3d: Add Compute Shader compilation support. |
| - v3d: Add missing base offset to CS shared memory accesses. |
| - v3d: Add missing dumping for the spill offset/size uniforms. |
| - v3d: Detect the correct number of QPUs and use it to fix the spill |
| size. |
| - v3d: Use the new lower_to_scratch implementation for indirects on |
| temps. |
| - v3d: Only look up the 3rd texture gather offset for non-arrays. |
| - v3d: Always set up the qregs for CSD payload. |
| - v3d: Fix an invalid reuse of flags generation from before a thrsw. |
| - v3d: Fix atomic cmpxchg in shaders on hardware. |
| - nir: Fix deref offset calculation for structs. |
| - nir: Use the nir_builder \_imm helpers in setting up deref offsets. |
| - gallium: Remove the pool pipebuffer manager. |
| - gallium: Remove the ondemand pipebuffer manager. |
| - gallium: Remove the "alt" pipebuffer manager interface. |
| - gallium: Remove the malloc pipebuffer manager. |
| - st/mesa: Don't set atomic counter size != 0 if MAX_SHADER_BUFFERS == |
| 0. |
| - v3d: Disable SSBOs and atomic counters on vertex shaders. |
| - v3d: Fill in the ignored segment size fields to appease new |
| simulator. |
| - v3d: Apply the GFXH-930 workaround to the case where the VS loads |
| attrs. |
| - v3d: Assert that we do request the normal texturing return data. |
| - v3d: Use \_mesa_hash_table_remove_key() where appropriate. |
| - vc4: Use \_mesa_hash_table_remove_key() where appropriate. |
| - v3d: Add a note about i/o indirection for future performance work. |
| - v3d: Don't try to update the shadow texture for separate stencil. |
| - Revert "v3d: Disable PIPE_CAP_BLIT_BASED_TEXTURE_TRANSFER." |
| - v3d: Re-add support for memory_barrier_shared. |
| - v3d: Fix detection of the last ldtmu before a new TMU op. |
| - v3d: Fix detection of TMU write sequences in register spilling. |
| - kmsro: Add support for V3D. |
| - vc4: Fall back to renderonly if the vc4 driver doesn't have v3d. |
| |
| Eric Engestrom (142): |
| |
| - wsi/display: add comment |
| - egl: use coherent variable names |
| - gitlab-ci: add ubuntu container |
| - gitlab-ci: add a meson vulkan build |
| - gitlab-ci: add a make vulkan build |
| - gitlab-ci: add a scons no-llvm build |
| - gitlab-ci: add scons llvm 3.5 build |
| - gitlab-ci: add scons SWR build |
| - gitlab-ci: add meson loader/classic DRI build |
| - gitlab-ci: add meson gallium SWR build |
| - gitlab-ci: add meson gallium RadeonSI build |
| - gitlab-ci: add meson gallium "other drivers" build |
| - gitlab-ci: add meson gallium ST Clover (LLVM 5.0) build |
| - gitlab-ci: add meson gallium ST Clover (LLVM 6.0) build |
| - gitlab-ci: add meson gallium ST Clover (LLVM 7.0) build |
| - gitlab-ci: add meson gallium ST "Other" build |
| - gitlab-ci: add make loaders/classic DRI build |
| - gitlab-ci: add make Gallium Drivers SWR build |
| - gitlab-ci: add make Gallium Drivers RadeonSI build |
| - gitlab-ci: add make Gallium Drivers "Other" build |
| - gitlab-ci: add make Gallium ST Clover LLVM-3.9 build |
| - gitlab-ci: add make Gallium ST Clover LLVM-4.0 build |
| - gitlab-ci: add make Gallium ST Clover LLVM-5.0 build |
| - gitlab-ci: add make Gallium ST Clover LLVM-6.0 build |
| - gitlab-ci: add make Gallium ST Clover LLVM-7 build |
| - gitlab-ci: add make Gallium ST Other build |
| - travis: remove unused linux code path |
| - travis: remove unused scons code path |
| - gitlab-ci: add meson glvnd build |
| - xvmc: fix string comparison |
| - xvmc: fix string comparison |
| - meson: add script to print the options before configuring a builddir |
| - driconf: drop unused macro |
| - travis: fix osx make build |
| - gitlab-ci: workaround docker bug for users with uppercase characters |
| - wsi: query the ICD's max dimensions instead of hard-coding them |
| - gitlab-ci: limit ninja to 4 threads max |
| - drm-uapi/README: remove explicit list of driver names |
| - drm-uapi: use local files, not system libdrm |
| - gbm: drop duplicate #defines |
| - st/dri: drop duplicate #define |
| - etnaviv: drop duplicate #define |
| - anv/tests: compile to something sensible in release builds |
| - util/tests: compile to something sensible in release builds |
| - gitlab-ci: use ccache to speed up builds |
| - tegra/meson: add missing dep_libdrm |
| - tegra/autotools: add missing libdrm cflags |
| - gitlab-ci: limit the automatic CI to master and MRs |
| - gitlab-ci: automatically run the CI on pushes to \`ci/\*\` branches |
| - anv: sort extensions alphabetically |
| - anv: sort vendors extensions after KHR and EXT |
| - anv: make sure the extensions stay sorted |
| - anv: drop unused imports |
| - anv: use anv_shader_bin_write_to_blob()'s return value |
| - gitlab-ci: always run the containers build |
| - dri_interface: add missing #include |
| - driinfo: add DTD to allow the xml to be validated |
| - meson/swr: replace hard-coded path with current_build_dir() |
| - egl/android: replace magic 0=CbCr,1=CrCb with simple enum |
| - vulkan: use VkBase{In,Out}Structure instead of a custom struct |
| - driconf: add DTD to allow the drirc xml (00-mesa-defaults.conf) to be |
| validated |
| - gitlab-ci: install xmllint to validate 00-mesa-defaults.conf |
| - anv: simplify chained comparison |
| - anv: drop unused parameter |
| - anv: remove spaces around kwargs assignment |
| - anv: fix typo |
| - Revert "swr/rast: Archrast codegen updates" |
| - meson: avoid going back up the tree with include_directories() |
| - anv: use the platform defines in vk.xml instead of hard-coding them |
| - radv: use the platform defines in vk.xml instead of hard-coding them |
| - util: #define PATH_MAX when undefined (eg. Hurd) |
| - vulkan: import missing file from Khronos |
| - egl: fix libdrm-less builds |
| - vulkan: import vk_layer.h from Khronos |
| - gitlab-ci: drop job prefixes |
| - meson: fix with_dri2 definition for GNU Hurd |
| - meson: remove unused include_directories(vulkan) |
| - vulkan/util: use the platform defines in vk.xml instead of |
| hard-coding them |
| - vulkan/overlay: fix missing var rename in previous commit |
| - meson: don't build libGLES*.so with GLVND |
| - autotools: don't build libGLES*.so with GLVND |
| - travis: fix meson build by letting \`auto\` do its job |
| - travis: drop unused vars |
| - travis: clean up |
| - gitlab-ci: only build the default (=latest) and oldest llvm versions |
| - gitlab-ci: autotools needs to be told which llvm version to use |
| - r600: cast pointer to expected type |
| - build: make passing an incorrect pointer type a hard error |
| - gitlab-ci: fix llvm version (7 doesn't have a ".0") |
| - hgl/meson: drop unused include directory |
| - glx/meson: use full include path for dri_interface.h |
| - android: fix missing backslash for line continuation |
| - panfrost: fix tgsi_to_nir() call |
| - panfrost: move #include to fix compilation |
| - gitlab-ci: add panfrost to the gallium drivers build |
| - wsi: deduplicate get_current_time() functions between display and x11 |
| - wsi/display: s/#if/#ifdef/ to fix -Wundef |
| - wsi/wayland: fix pointer casting warning on 32bit |
| - wsi/x11: use WSI_FROM_HANDLE() instead of pointer casts |
| - turnip: use the platform defines in vk.xml instead of hard-coding |
| them |
| - travis: fix osx meson build |
| - nir: const \`nir_call_instr::callee\` |
| - gitlab-ci: add clang build |
| - gitlab-ci: drop most autotools builds |
| - util/disk_cache: close fd in the fallback path |
| - egl: hide entrypoints that shouldn't be exported when using glvnd |
| - meson: strip rpath from megadrivers |
| - gallium/hud: fix memory leaks |
| - gallium/hud: prevent buffer overflow |
| - gallium/hud: fix rounding error in nic bps computation |
| - simplify LLVM version string printing |
| - util/process: document memory leak |
| - vk/util: remove unneeded array index |
| - bin: drop unused import from install_megadrivers.py |
| - meson: remove meson-created megadrivers symlinks |
| - gitlab-ci: build gallium extra hud |
| - gitlab-ci: add lima to the build |
| - delete autotools .gitignore files |
| - delete autotools input files |
| - docs: remove unsupported GL function name mangling |
| - docs: drop autotools python information |
| - docs: replace autotools instructions with meson equivalent |
| - docs: use past tense when talking about autotools |
| - docs: haiku can be built using meson |
| - egl: fixup autotools-specific wording |
| - util: add os_read_file() helper |
| - anv: add support for VK_EXT_memory_budget |
| - radv: update to use the new features struct names |
| - turnip: update to use the new features struct names |
| - gitlab-ci: build vulkan drivers in clang build |
| - util: move #include out of #if linux |
| - wsi/wayland: document lack of vkAcquireNextImageKHR timeout support |
| - egl: hard-code destroy function instead of passing it around as a |
| pointer |
| - gitlab-ci: add scons windows build using mingw |
| - gitlab-ci: merge several meson jobs |
| - gitlab-ci: meson-gallium-radeonsi was a subset of |
| meson-gallium-clover-llvm |
| - gitlab-ci: simplify meson job names |
| - gitlab-ci: merge meson-glvnd into meson-swr |
| - travis: fix syntax, and drop unused stuff |
| - util/os_file: always use the 'grow' mechanism |
| - meson: expose glapi through osmesa |
| - util/os_file: actually return the error read() gave us |
| |
| Erico Nunes (5): |
| |
| - lima/ppir: support ppir_op_ceil |
| - nir/algebraic: add lowering for fsign |
| - lima: enable nir fsign lowering in ppir |
| - lima/gpir: add limit of max 512 instructions |
| - lima/ppir: support nir_op_ftrunc |
| |
| Erik Faye-Lund (79): |
| |
| - mesa: expose NV_conditional_render on GLES |
| - st/mesa: remove unused header-file |
| - swr/codegen: fix autotools build |
| - virgl: remove unused variables |
| - virgl: remove unused variable |
| - virgl: remove unused variable |
| - virgl: remove unused variable |
| - virgl: do not allow compressed formats for buffers |
| - virgl: stricter usage of compressed 3d textures |
| - virgl: also destroy all read-transfers |
| - virgl: use debug_printf instead of fprintf |
| - virgl: unsigned int -> unsigned |
| - virgl: only warn about unchecked flags |
| - virgl: do not warn about display-target binding |
| - virgl: use debug_printf instead of fprintf |
| - virgl: remove pointless transfer-counter |
| - virgl: tmp_resource -> templ |
| - virgl: track full virgl_resource instead of just virgl_hw_res |
| - virgl: simplify virgl_texture_transfer_unmap logic |
| - virgl: make unmap queuing a bit more straight-forward |
| - virgl: check for readback on correct resource |
| - virgl: wait for the right resource |
| - virgl: return error if allocating resolve_tmp fails |
| - virgl: rewrite core of virgl_texture_transfer_map |
| - virgl: use pipe_box for blit dst-rect |
| - virgl: support write-back with staged transfers |
| - virgl: make sure bind is set for non-buffers |
| - gallium/util: support translating between uint and sint formats |
| - virgl: get readback-formats from host |
| - virgl: only blit if resource is read |
| - virgl: do color-conversion when mapping transfer |
| - virgl: document potentially failing blit |
| - mesa/st: remove impossible error-check |
| - gallium/u_vbuf: support NULL-resources |
| - i915: support NULL-resources |
| - nouveau: support NULL-resources |
| - swr: support NULL-resources |
| - mesa/st: accept NULL and empty buffer objects |
| - mesa/st: remove always-false state |
| - softpipe: setup pixel_offset for all primitive types |
| - docs: normalize css-indent style |
| - docs: remove non-existent css attribute |
| - docs: remove long commented out css |
| - docs: add missing semicolon |
| - docs: avoid repeating the font |
| - docs: avoid repeating the color |
| - docs: remove spurious newline |
| - docs: use multiple background-images for header |
| - docs: simplify css-centering |
| - docs: do not hard-code header-height |
| - docs: properly escape '>' |
| - docs: properly escape ampersand |
| - docs: remove stray paragraph-close |
| - docs: use h2 instead of b-tag for headings |
| - docs: use dl/dd instead of blockquote for freedesktop link |
| - docs: open list-item before closing it |
| - docs: close paragraphs before lists |
| - docs: close lists |
| - docs: remove stray paragraph-close |
| - docs: close paragraphs before preformatted text |
| - docs: start paragraph before closing it |
| - docs: drop paragraph around preformatted text |
| - docs: fix incorrectly closed paragraph |
| - docs: don't pointlessly close and re-start definition lists |
| - docs: remove stray list-start |
| - docs: fixup bad paragraphing |
| - docs: add missing lists |
| - docs: fix closing of paragraphs |
| - docs: fixup list-item tags |
| - docs: fix closing of list-items |
| - docs: replace empty list with a none-paragraph |
| - docs: turn faq-index into an ordered list |
| - docs: drop centered heading for faq |
| - docs: reorder heading and notice |
| - meson: lift driver-collection out into parent build-file |
| - meson: give dri- and gallium-drivers separate vars |
| - meson: add build-summary |
| - docs: fixup mistake in contents |
| - draw: flush when setting stream-out targets |
| |
| Ernestas Kulik (2): |
| |
| - vc4: Fix leak in HW queries error path |
| - v3d: Fix leak in resource setup error path |
| |
| Francisco Jerez (6): |
| |
| - intel/dump_gpu: Disambiguate between BOs from different GEM handle |
| spaces. |
| - intel/fs: Exclude control sources from execution type and region |
| alignment calculations. |
| - intel/fs: Lower integer multiply correctly when destination stride |
| equals 4. |
| - intel/fs: Cap dst-aligned region stride to maximum representable |
| hstride value. |
| - intel/fs: Implement extended strides greater than 4 for IR source |
| regions. |
| - intel/fs: Rely on undocumented unrestricted regioning for 32x16-bit |
| integer multiply. |
| |
| Fritz Koenig (4): |
| |
| - freedreno: pass count to query_dmabuf_modifiers |
| - freedreno/a6xx: UBWC support |
| - freedreno: UBWC allocator |
| - freedreno/a6xx: Enable UBWC modifier |
| |
| Gert Wollny (35): |
| |
| - mesa/core: Enable EXT_texture_sRGB_R8 also for desktop GL |
| - radeonsi: release tokens after creating the shader program |
| - mesa: release references to image textures when a context is |
| destroyed |
| - virgl: Enable mixed color FBO attachments only when the host |
| supports it |
| - mesa/core: Enable EXT_depth_clamp for GLES >= 2.0 |
| - nir: Add possibility to not lower to source mod 'abs' for ops with |
| three sources |
| - mesa: Expose EXT_texture_query_lod and add support for its use in |
| shaders |
| - softpipe: Enable PIPE_CAP_MIXED_COLORBUFFER_FORMATS. It seems |
| softpipe actually supports this. This change lets additional piglit |
| tests pass without regressions in the gpu test set. |
| - virgl: Add a caps feature check version |
| - softpipe: Implement ATOMFADD and enable cap TGSI_ATOMFADD |
| - virgl: define MAX_VERTEX_STREAMS based on availability of TF3 |
| - softpipe: Use mag texture filter also for clamped lod == 0 |
| - softpipe: Don't use mag filter for gather op |
| - softpipe: raise number of bits used for X coordinate texture lookup |
| - softpipe: Add an extra code path for the buffer texel lookup |
| - softpipe: Enable PIPE_CAP_TEXTURE_BUFFER_OFFSET_ALIGNMENT |
| - Gallium: Add new CAP that indicates whether IO array definitions can |
| be shrunk |
| - virgl: Enable passing arrays as input to fragment shaders |
| - doc/features: Add a few extensions to the feature matrix |
| - softpipe: Factor gradient evaluation out of the lambda evaluation |
| - softpipe: Prepare handling explicit gradients |
| - softpipe: Pipe gather_comp through from st_tgsi_get_samples |
| - softpipe: Move selection of shadow values up and clean parameter list |
| - softpipe: tie in new code path for lod evaluation |
| - softpipe: keep input lod for explicit derivatives |
| - softpipe: evaluate the cube faces on a per-sample basis |
| - softpipe: Factor out evaluation of the source indices |
| - softpipe: Add a per-input array for interpolator correctors to |
| machine |
| - softpipe: Add (fake) support for TGSI_OPCODE_INTERP_SAMPLE |
| - softpipe: Add support for TGSI_OPCODE_INTERP_OFFSET |
| - softpipe: Add support for TGSI_OPCODE_INTERP_CENTROID |
| - softpipe: Increase the GLSL feature level |
| - doc: Update feature matrix |
| - softpipe/buffer: load only as many components as the buffer |
| resource type provides |
| - Revert "softpipe/buffer: load only as many components as the |
| buffer resource type provides" |
| |
| Greg V (3): |
| |
| - util: emulate futex on FreeBSD using umtx |
| - gallium/hud: add CPU usage support for FreeBSD |
| - gallium: enable dmabuf on BSD as well |
| |
| Grigori Goronzy (1): |
| |
| - glx: add support for GLX_ARB_create_context_no_error (v3) |
| |
| Guido Günther (4): |
| |
| - docs: Fix 19.0.x version numbers |
| - gallium: ddebug: Add missing fence related wrappers |
| - gallium/u_dump: util_dump_sampler_view: Dump u.tex.first_level |
| - gallium: trace: Add missing fence related wrappers |
| |
| Gurchetan Singh (44): |
| |
| - mesa/main: Expose EXT_texture_compression_s3tc_srgb |
| - i965: Set flag for EXT_texture_compression_s3tc_srgb |
| - st/mesa: expose EXT_texture_compression_s3tc_srgb |
| - docs: add GL_EXT_texture_compression_s3tc_srgb to release notes |
| - virgl: add ability to do finer grain dirty tracking |
| - virgl: use virgl_resource_dirty helper |
| - virgl: don't mark unclean after a flush |
| - virgl: track level cleanliness rather than resource cleanliness |
| - virgl: make alignment smaller when uploading index user buffers |
| - virgl: unmap uploader at flush time |
| - virgl: when creating / freeing transfers, pass slab pool directly |
| - virgl: add protocol for resource transfers |
| - virgl: use virgl_transfer in inline write |
| - virgl: limit command length to 16 bits |
| - virgl: keep track of number of computations |
| - virgl: pass virgl transfer to virgl_res_needs_flush_wait |
| - virgl: add extra checks in virgl_res_needs_flush_wait |
| - virgl: make winsys modifications for encoded transfers |
| - virgl: add encoder functions for new protocol |
| - virgl: introduce transfer queue |
| - virgl: use transfer queue |
| - virgl: use virgl_transfer_inline_write even less |
| - virgl/vtest: deprecate protocol version 1 |
| - egl/sl: also allow virtgpu to fallback to kms_swrast |
| - virgl: use uint16_t mask instead of separate booleans |
| - configure.ac / meson: depend on libnativewindow when appropriate |
| - anv: move anv_GetMemoryAndroidHardwareBufferANDROID up a bit |
| - anv: fix build on Nougat |
| - egl/android: move droid_image_loader_extension down a bit |
| - egl/android: move droid_open_device_drm_gralloc down a bit |
| - egl/android: droid_open_device_drm_gralloc --> droid_open_device |
| - egl/android: refactor droid_load_driver a bit |
| - egl/android: plumb swrast option |
| - egl/android: use swrast option in droid_load_driver |
| - egl/android: use software rendering when appropriate |
| - egl/android: choose node type based on swrast and preprocessor flags |
| - virgl: wait after a flush |
| - virgl/vtest: execute a transfer_get when flushing the front buffer |
| - virgl/vtest: add utilities for receiving fds |
| - virgl/vtest: plumb support for shared memory |
| - virgl/vtest: receive and handle shared memory fd |
| - virgl/vtest: modify sending and receiving data for shared memory |
| - virgl/vtest: wait after issuing a transfer get |
| - virgl/vtest: bump up protocol version + support encoded transfers |
| |
| Guttula, Suresh (1): |
| |
| - st/va: Add support for indirect manner by returning |
| VA_STATUS_ERROR_OPERATION_FAILED |
| |
| Hal Gentz (1): |
| |
| - glx: Fix synthetic error generation in \__glXSendError |
| |
| Heinrich (1): |
| |
| - gbm: Improve documentation of BO import |
| |
| Iago Toral Quiroga (39): |
| |
| - compiler/nir: add an is_conversion field to nir_op_info |
| - compiler/nir: add lowering option for 16-bit fmod |
| - compiler/nir: add lowering for 16-bit flrp |
| - compiler/nir: add lowering for 16-bit ldexp |
| - intel/compiler: add a NIR pass to lower conversions |
| - intel/compiler: split float to 64-bit opcodes from int to 64-bit |
| - intel/compiler: handle b2i/b2f with other integer conversion opcodes |
| - intel/compiler: assert restrictions on conversions to half-float |
| - intel/compiler: lower some 16-bit float operations to 32-bit |
| - intel/compiler: handle extended math restrictions for half-float |
| - intel/compiler: implement 16-bit fsign |
| - intel/compiler: drop unnecessary temporary from 32-bit fsign |
| implementation |
| - intel/compiler: add instruction setters for Src1Type and Src2Type. |
| - intel/compiler: add new half-float register type for 3-src |
| instructions |
| - intel/compiler: don't compact 3-src instructions with Src1Type or |
| Src2Type bits |
| - intel/compiler: allow half-float on 3-source instructions since gen8 |
| - intel/compiler: set correct precision fields for 3-source float |
| instructions |
| - intel/compiler: fix ddx and ddy for 16-bit float |
| - intel/compiler: fix ddy for half-float in Broadwell |
| - intel/compiler: workaround for SIMD8 half-float MAD in gen8 |
| - intel/compiler: split is_partial_write() into two variants |
| - intel/compiler: activate 16-bit bit-size lowerings also for 8-bit |
| - intel/compiler: rework conversion opcodes |
| - intel/compiler: ask for an integer type if requesting an 8-bit type |
| - intel/eu: force stride of 2 on NULL register for Byte instructions |
| - intel/compiler: generalize the combine constants pass |
| - intel/compiler: implement is_zero, is_one, is_negative_one for |
| 8-bit/16-bit |
| - intel/compiler: add a brw_reg_type_is_integer helper |
| - intel/compiler: fix cmod propagation for non 32-bit types |
| - intel/compiler: remove inexact algebraic optimizations from the |
| backend |
| - intel/compiler: skip MAD algebraic optimization for half-float or |
| mixed mode |
| - intel/compiler: implement SIMD16 restrictions for mixed-float |
| instructions |
| - intel/compiler: also set F execution type for mixed float mode in BDW |
| - intel/compiler: validate region restrictions for half-float |
| conversions |
| - intel/compiler: validate conversions between 64-bit and 8-bit types |
| - intel/compiler: validate region restrictions for mixed float mode |
| - compiler/spirv: move the check for Int8 capability |
| - anv/pipeline: support Float16 and Int8 SPIR-V capabilities in gen8+ |
| - anv/device: expose VK_KHR_shader_float16_int8 in gen8+ |
| |
| Ian Romanick (55): |
| |
| - nir: Silence zillions of unused parameter warnings in release builds |
| - intel/compiler: Silence warning about value that may be used |
| uninitialized |
| - nir: Document some fields of nir_loop_terminator |
| - nir: Refactor code that checks phi nodes in opt_peel_loop_initial_if |
| - nir: Select phi nodes using prev_block instead of continue_block |
| - nir: Split ALU instructions in loops that read phis |
| - nir: Convert a bcsel with only phi node sources to a phi node |
| - spirv: Add missing break |
| - nir/algebraic: Convert some f2u to f2i |
| - nir/algebraic: Simplify comparison with sequential integers starting |
| with 0 |
| - intel/vec4: Emit constants for some ALU sources as immediate values |
| - nir/algebraic: Replace i2b used by bcsel or if-statement with |
| comparison |
| - intel/fs: Relax type matching rules in cmod propagation from MOV |
| instructions |
| - intel/fs: Handle OR source modifiers in algebraic optimization |
| - intel/fs: Refactor ALU source and destination handling to a separate |
| function |
| - intel/fs: Emit logical-not of operands on Gen8+ |
| - intel/fs: Use De Morgan's laws to avoid logical-not of a logic result |
| on Gen8+ |
| - intel/fs: Emit better code for b2f(inot(a)) and b2i(inot(a)) |
| - nir/algebraic: Replace a bcsel of a b2f sources with a b2f(!(a \|\| |
| b)) |
| - intel/fs: Generate if instructions with inverted conditions |
| - nir/algebraic: Replace a-fract(a) with floor(a) |
| - intel/fs: Don't assert on b2f with a saturate modifier |
| - nir/algebraic: Optimize away an fsat of a b2f |
| - intel/compiler: Silence many unused parameter warnings in brw_eu.h |
| - intel/compiler: Silence unused parameter warning in |
| brw_interpolation_map.c |
| - intel/fs: nir_op_extract_i8 extracts a byte, not a word |
| - intel/fs: Fix extract_u8 of an odd byte from a 64-bit integer |
| - nir/algebraic: Fix up extract_[iu]8 after loop unrolling |
| - nir/algebraic: Remove redundant extract_[iu]8 patterns |
| - nir/algebraic: Add missing 64-bit extract_[iu]8 patterns |
| - nir/algebraic: Add missing 16-bit extract_[iu]8 patterns |
| - nir/algebraic: Fix up extract_[iu]8 after loop unrolling |
| - nir/algebraic: Remove redundant extract_[iu]8 patterns |
| - nir/algebraic: Add missing 64-bit extract_[iu]8 patterns |
| - nir/algebraic: Add missing 16-bit extract_[iu]8 patterns |
| - nir: Add nir_const_value_negative_equal |
| - nir: Add nir_alu_srcs_negative_equal |
| - nir: Add partial redundancy elimination for compares |
| - intel/compiler: Use partial redundancy elimination for compares |
| - intel/fs: Eliminate dead code first |
| - intel/fs: Refactor code generation for nir_op_fsign to its own |
| function |
| - intel/fs: Add a scale factor to emit_fsign |
| - intel/fs: Generate better code for fsign multiplied by a value |
| - nir/algebraic: Recognize open-coded copysign(1.0, a) |
| - nir/algebraic: Replace a pattern where iand with a Boolean is used as |
| a bcsel |
| - nir/algebraic: Fix some 1-bit Boolean weirdness |
| - nir/algebraic: Strength reduce some compares of x and -x |
| - intel/fs: Add support for float16 to the fsign optimizations |
| - glsl: Silence many unused parameter warnings in glsl/ir.h |
| - intel/compiler: Don't have separate, per-Gen nir_options |
| - intel/compiler: Lower ffma on Gen4 and Gen5 |
| - intel/fs: Fix D to W conversion in opt_combine_constants |
| - mesa: Add missing display list support for GL_FOG_COORDINATE_SOURCE |
| - nir: Saturating integer arithmetic is not associative |
| - Revert "nir: add late opt to turn inot/b2f combos back to bcsel" |
| |
| Icenowy Zheng (5): |
| |
| - lima: add dummy set_sample_mask function |
| - lima: make lima_context_framebuffer subtype of pipe_framebuffer_state |
| - lima: implement blit with util_blitter |
| - lima: lower bool to float when building shaders |
| - lima: add Android build |
| |
| Ilia Mirkin (14): |
| |
| - nv50,nvc0: add explicit settings for recent caps |
| - nvc0: add support for handling indirect draws with attrib conversion |
| - nvc0/ir: always use CG mode for loads from atomic-only buffers |
| - nvc0/ir: fix second tex argument after levelZero optimization |
| - nvc0: fix 3d images on kepler |
| - nv50,nvc0: use condition for occlusion queries when already complete |
| - nvc0: stick zero values for the compute invocation counts |
| - nvc0: we have 16k-sized framebuffers, fix default scissors |
| - swr: set PIPE_CAP_MAX_VARYINGS correctly |
| - mesa: add explicit enable for EXT_float_blend, and error condition |
| - st/mesa: enable GL_EXT_float_blend when possible |
| - i965: always enable EXT_float_blend |
| - nv50: disable compute |
| - glsl: fix recording of variables for XFB in TCS shaders |
| |
| Illia Iorin (1): |
| |
| - mesa/main: Fix multisample texture initialize |
| |
| James Zhu (12): |
| |
| - gallium/auxiliary/vl: Move dirty define to header file |
| - gallium/auxiliary/vl: Split vl_compositor graphic shaders from |
| vl_compositor API |
| - gallium/auxiliary/vl: Rename csc_matrix and increase its size. |
| - gallium/auxiliary/vl: Add compute shader to support video compositor |
| render |
| - gallium/auxiliary/vl: Add video compositor compute shader render |
| - gallium/auxiliary/vl: Fix transparent issue on compute shader with |
| rgba |
| - gallium/auxiliary/vl: Increase shader_params size |
| - gallium/auxiliary/vl: Change grid setting |
| - gallium/auxiliary/vl: Change weave compute shader implementation |
| - gallium/auxiliary/vl: Fixed blur issue with weave compute shader |
| - gallium/auxiliary/vl: Fixed blank issue with compute shader |
| - gallium/auxiliary/vl: Add barrier/unbind after compute shader launch. |
| |
| Jan Vesely (2): |
| |
| - Partially revert "gallium: fix autotools build of pipe_msm.la" |
| - gallium/aux: Report error if loading of a pipe driver fails. |
| |
| Jan Zielinski (1): |
| |
| - swr/rast: fix 32-bit compilation on Linux |
| |
| Jason Ekstrand (212): |
| |
| - spirv: Replace vtn_constant_value with vtn_constant_uint |
| - spirv: Rework handling of spec constant workgroup size built-ins |
| - spirv: Handle constants and types before execution modes |
| - spirv: Handle OpExecutionModeId |
| - spirv: Support LocalSizeId and LocalSizeHintId execution modes |
| - intel/nir: Add global support to lower_mem_access_bit_sizes |
| - intel/fs/cse: Split create_copy_instr into three cases |
| - intel/fs: Properly handle 64-bit types in LOAD_PAYLOAD |
| - intel/fs: Do the grf127 hack on SIMD8 instructions in SIMD16 mode |
| - intel/fs: Implement load/store_global with A64 untyped messages |
| - intel/fs: Use SENDS for A64 writes on gen9+ |
| - intel/fs: Implement nir_intrinsic_global_atomic\_\* |
| - anv: Implement VK_EXT_buffer_device_address |
| - relnotes: Add VK_EXT_buffer_device_address |
| - nir/deref: Drop zero ptr_as_array derefs |
| - README: Drop the badges from the readme |
| - intel/fs: Use enumerated array assignments in fb read TXF setup |
| - nir/deref: Rematerialize parents in |
| rematerialize_derefs_in_use_blocks |
| - nir: Silence a couple of warnings in release builds |
| - anv/blorp: Delete a pointless assert |
| - anv: Silence some compiler warnings in release builds |
| - intel/fs: Silence a compiler warning |
| - intel/fs: Bail in optimize_extract_to_float if we have modifiers |
| - nir/dead_cf: Inline cf_node_has_side_effects |
| - nir/dead_cf: Stop relying on liveness analysis |
| - compiler/types: Add a contains_64bit helper |
| - nir/xfb: Properly align 64-bit values |
| - nir: Rewrite lower_clip_cull_distance_arrays to do a lot less |
| lowering |
| - nir/xfb: Work in terms of components rather than slots |
| - nir/xfb: Handle compact arrays in gather_xfb_info |
| - nir: Fix a compile warning |
| - nir/lower_clip_cull: Fix an incorrect assert |
| - iris: Don't lower image formats for write-only images |
| - iris/compute: Don't increment the grid size offset |
| - iris/compute: Zero out the last grid size on indirect dispatches |
| - iris: Configure the L3$ on the compute context |
| - iris: Don't set constant read lengths at upload time |
| - iris: Allocate buffer resources separately |
| - iris: Copy anv's MI_MATH helpers for multiplication and division |
| - nir/split_vars: Don't compact vectors unnecessarily |
| - nir/builder: Don't emit no-op swizzles |
| - intel/eu: Add an EOT parameter to send_indirect_[split]_message |
| - intel/fs: Add an enum type for logical sampler inst sources |
| - intel/fs: Re-order logical surface arguments |
| - intel/fs: Drop the fs_surface_builder |
| - intel/vec4: Drop dead code for handling typed surface messages |
| - intel/fs: Get rid of the IMAGE_SIZE opcode |
| - intel/compiler: Drop unused surface opcodes |
| - intel/schedule_instructions: Move some comments |
| - intel/compiler: Re-prefix non-logical surface opcodes with VEC4 |
| - anv: Count surfaces for non-YCbCr images in |
| GetDescriptorSetLayoutSupport |
| - spirv: OpImageQueryLod requires a sampler |
| - intel,nir: Lower TXD with min_lod when the sampler index is not < 16 |
| - anv: Use an actual binding for gl_NumWorkgroups |
| - anv/pipeline: Drop anv_fill_binding_table |
| - anv/descriptor_set: Refactor alloc/free of descriptor sets |
| - anv: Rework arguments to anv_descriptor_set_write\_\* |
| - anv: Stop allocating buffer views for dynamic buffers |
| - anv: Count image param entries rather than images |
| - anv: Clean up descriptor set layouts |
| - anv: drop add_var_binding from anv_nir_apply_pipeline_layout.c |
| - anv: Refactor descriptor pushing a bit |
| - anv: Take references to push descriptor set layouts |
| - anv: Add a concept of a descriptor buffer |
| - spirv: Pull offset/stride from the pointer for OpArrayLength |
| - spirv: Use the generic dereference function for OpArrayLength |
| - spirv: Use the same types for resource indices as pointers |
| - anv: Implement VK_EXT_inline_uniform_block |
| - nir: Expose double and int64 op_to_options_mask helpers |
| - nir: Teach loop unrolling about 64-bit instruction lowering |
| - i965: Compile the fp64 program based on nir options |
| - intel/debug: Add a debug flag to force software fp64 |
| - intel/nir: Drop an unneeded lower_constant_initializers call |
| - glsl/nir: Add a shared helper for building float64 shaders |
| - glsl/nir: Inline functions in float64_funcs_to_nir |
| - nir/inline_functions: Break inlining into a builder helper |
| - nir/deref: Expose nir_opt_deref_impl |
| - nir/lower_doubles: Inline functions directly in lower_doubles |
| - intel/nir: Move 64-bit lowering later |
| - st/nir: Move 64-bit lowering later |
| - nir/builder: Emit better code for iadd/imul_imm |
| - nir/builder: Cast array indices in build_deref_follower |
| - nir/builder: Add a build_deref_array_imm helper |
| - intel/nir: Move lower_mem_access_bit_sizes to postprocess_nir |
| - anv/pipeline: Move lower_explicit_io much later |
| - nir: Add a pass for lowering IO back to vector when possible |
| - intel/nir: Vectorize all IO |
| - anv: Ignore VkRenderPassInputAttachmentAspectCreateInfo |
| - nir/loop_unroll: Fix out-of-bounds access handling |
| - glsl/list: Add a list variant of insert_after |
| - glsl/lower_vector_derefs: Don't use a temporary for TCS outputs |
| - anv: Stop using VK_TRUE/FALSE |
| - anv/pass: Flag the need for a RT flush for resolve attachments |
| - anv: Only set 3DSTATE_PS::VectorMaskEnable on gen8+ |
| - nir/algebraic: Add a couple optimizations for iabs and ishr |
| - nir/validate: Only require bare types to match for copy_deref |
| - nir/validate: Allow 32-bit boolean load/store intrinsics |
| - compiler/types: Add a new is_interface C wrapper |
| - compiler/types: Add a C wrapper to get full struct field data |
| - compiler/types: Add helpers to get explicit types for standard |
| layouts |
| - nir/deref: Consider COHERENT decorated var derefs as aliasing |
| - nir: Rename nir_address_format_vk_index_offset to not be vk |
| - nir/lower_io: Add a new buffer_array_length intrinsic and lowering |
| - glsl: Don't lower vector derefs for SSBOs, UBOs, and shared |
| - glsl/nir: Set explicit types on UBO/SSBO variables |
| - glsl/nir: Handle unlowered SSBO atomic and array_length intrinsics |
| - glsl/nir: Add a pass to lower UBO and SSBO access |
| - i965: Stop setting LowerBufferInterfaceBlocks |
| - st/mesa: Let NIR lower UBO and SSBO access when we have it |
| - nir/builder: Add a vector extract helper |
| - nir: Add a new pass to lower array dereferences on vectors |
| - intel/nir: Lower array-deref-of-vector UBO and SSBO loads |
| - anv: Implement VK_EXT_host_query_reset |
| - anv,radv: Implement VK_KHR_surface_capability_protected |
| - Revert "nir: const \`nir_call_instr::callee`" |
| - anv: Bump maxComputeWorkgroupInvocations |
| - nir: Constant values are per-column not per-component |
| - anv,radv,turnip: Lower TG4 offsets with nir_lower_tex |
| - spirv: Drop inline tg4 lowering |
| - nir/lower_io: Add a bounds-checked 64-bit global address format |
| - nir: Add a lowering pass for non-uniform resource access |
| - nir: Add texture sources and intrinsics for bindless |
| - nir: Add access flags to deref and SSBO atomics |
| - spirv: Handle the NonUniformEXT decoration |
| - Revert "anv/radv: release memory allocated by glsl types during |
| spirv_to_nir" |
| - nir: Lock around validation fail shader dumping |
| - nir/algebraic: Drop some @bool specifiers |
| - nir/algebraic: Add some logical OR and AND patterns |
| - vc4: Prefer nir_src_comp_as_uint over nir_src_as_const_value |
| - nir/search: Search for all combinations of commutative ops |
| - nir: Get rid of nir_register::is_packed |
| - nir: Get rid of global registers |
| - intel/common: Add a MI command builder |
| - intel/common: Add unit tests for gen_mi_builder |
| - anv: Use gen_mi_builder for CmdDrawIndirectByteCount |
| - anv: Use gen_mi_builder for computing resolve predicates |
| - anv: Use gen_mi_builder for indirect draw parameters |
| - anv: Use gen_mi_builder for indirect dispatch |
| - anv: Use gen_mi_builder for conditional rendering |
| - anv: Use gen_mi_builder for queries |
| - anv: Move mi_memcpy and mi_memset to gen_mi_builder |
| - anv/cmd_buffer: Use gen_mi_sub instead of gen_mi_add with a negative |
| - intel/common: Support bigger right-shifts with mi_builder |
| - anv/pipeline: Fix MEDIA_VFE_STATE::PerThreadScratchSpace on gen7 |
| - nir: Add a pass for selectively lowering variables to scratch space |
| - intel/nir: Take a nir_tex_instr and src index in brw_texture_offset |
| - nir/builder: Add a nir_imm_zero helper |
| - nir/print: Use nir_src_as_int for array indices |
| - nir/constant_folding: Get rid of a bit size switch statement |
| - spirv: Drop some unneeded bit size switch statements |
| - nir/load_const_to_scalar: Get rid of a bit size switch statement |
| - nir/validate: Require unused bits of nir_const_value to be zero |
| - vulkan: Update the XML and headers to 1.1.106 |
| - anv: Update to use the new features struct names |
| - nir/algebraic: Move the template closer to the render function |
| - nir/algebraic: Use a cache to avoid re-emitting structs |
| - intel/mi_builder: Re-order an initializer |
| - intel/mi_builder: Disable mem_mem tests on IVB |
| - nir: Drop "struct" from some nir\_\* declarations |
| - nir: Rework nir_src_as_alu_instr to not take a pointer |
| - nir: Add a nir_src_as_intrinsic() helper |
| - anv: Re-sort the GetPhysicalDeviceFeatures2 switch statement |
| - anv: Drop some unneeded ANV_FROM_HANDLE for physical devices |
| - intel/fs: Account for live range lengths in spill costs |
| - anv: Make all VkDeviceMemory BOs resident permanently |
| - anv: Put image params in the descriptor set buffer on gen8 and |
| earlier |
| - anv: Add a #define for the max binding table size |
| - anv/pipeline: Sort bindings by most used first |
| - anv/pipeline: Add skeleton support for spilling to bindless |
| - nir/lower_io: Expose some explicit I/O lowering helpers |
| - intel/nir: Re-run int64 lowering in postprocess_nir |
| - anv: Add a has_a64_buffer_access to anv_physical_device |
| - anv: Lower some SSBO operations in apply_pipeline_layout |
| - anv: Implement SSBOs bindings with GPU addresses in the descriptor BO |
| - anv: Implement VK_KHR_shader_atomic_int64 |
| - intel,nir: Lower TXD with a bindless sampler |
| - intel/fs: Add support for bindless texture ops |
| - anv: Count the number of planes in each descriptor binding |
| - anv: Use write_image_view to initialize immutable samplers |
| - anv: Pass the plane into lower_tex_deref |
| - anv: Use bindless textures and samplers |
| - intel/fs: Add support for bindless image load/store/atomic |
| - anv: Use bindless handles for images |
| - anv: Put binding flags in descriptor set layouts |
| - anv: Implement VK_EXT_descriptor_indexing |
| - nir: Add helpers for getting the type of an address format |
| - anv/nir: Add a central helper for figuring out SSBO address formats |
| - anv: Ignore descriptor binding flags if bindingCount == 0 |
| - anv: Rework the descriptor set layout create loop |
| - anv,radv: Update release notes for newly implemented extensions |
| - nir: Use the NIR_SRC_AS\_ macro to define nir_src_as_deref |
| - anv/descriptor_set: Unlink sets from the pool in set_destroy |
| - anv/descriptor_set: Destroy sets before pool finalization |
| - anv/descriptor_set: Only vma_heap_finish if we have a descriptor |
| buffer |
| - anv/descriptor_set: Properly align descriptor buffer to a page |
| - anv: Better handle 32-byte alignment of descriptor set buffers |
| - anv/descriptor_set: Don't fully destroy sets in pool destroy/reset |
| - nir/algebraic: Optimize integer cast-of-cast |
| - util/bitset: Return an actual bool from test macros |
| - anv: Stop including POS in FS input limits |
| - anv,i965: Stop warning about incomplete gen11 support |
| - nir: Add a SSA type gathering pass |
| - intel/fs/ra: Only add dest interference to sources that exist |
| - intel/fs/ra: Stop adding RA interference to too many SENDS nodes |
| - anv: Emulate texture swizzle in the shader when needed |
| - anv: Stop forcing bindless for images |
| - anv: Only consider minSampleShading when sampleShadingEnable is set |
| - iris: Don't assume UBO indices are constant |
| - intel/fs,vec4: Use g0 as the header for MFENCE |
| - intel/fs: Do a stalling MFENCE in endInvocationInterlock() |
| - nir/dead_cf: Call instructions aren't dead |
| - nir/propagate_invariant: Don't add NULL vars to the hash table |
| |
| Jian-Hong Pan (1): |
| |
| - intel: Fix the description of Coffeelake pci-id 0x3E98 |
| |
| Jiang, Sonny (1): |
| |
| - va: use a compute shader for the blit |
| |
| John Stultz (3): |
| |
| - mesa: android: freedreno: Fix build failure due to path change |
| - mesa: Makefile.sources: Add |
| ir3_nir_lower_load_barycentric_at_sample/offset to Makefile.sources |
| - mesa: Makefile.sources: Add nir_lower_fb_read.c to Makefile.sources |
| list |
| |
| Jon Turney (1): |
| |
| - meson: Force '.so' extension for DRI drivers |
| |
| Jonathan Marek (22): |
| |
| - nir: add missing vec opcodes in lower_bool_to_float |
| - freedreno: a2xx: fix fast clear |
| - freedreno: a2xx: don't write 4th vertex in mem2gmem |
| - freedreno: a2xx: add use_hw_binning function |
| - freedreno: a2xx: fix fast clear for some gmem configurations |
| - freedreno: a2xx: fix mipmapping for NPOT textures |
| - freedreno: use renderonly path for buffers allocated with modifiers |
| - freedreno: catch failing fd_blit and fallback to software blit |
| - mesa: add GL_AMD_compressed_ATC_texture support |
| - gallium: add ATC format support |
| - llvmpipe, softpipe: no support for ATC textures |
| - st/mesa: add ATC support |
| - freedreno: a3xx: add GL_AMD_compressed_ATC_texture support |
| - freedreno: a2xx: add GL_AMD_compressed_ATC_texture support |
| - svga: add new ATC formats to the format conversion table |
| - freedreno: a2xx: fix builtin blit program compilation |
| - freedreno: a2xx: disable PIPE_CAP_PACKED_UNIFORMS |
| - freedreno: a2xx: use nir_lower_io for TGSI shaders |
| - freedreno: a2xx: enable batch reordering |
| - freedreno: a2xx: same gmem2mem sequence for all tiles |
| - nir: improve convert_yuv_to_rgb |
| - freedreno/ir3: fix input ncomp for vertex shaders |
| |
| Jordan Justen (22): |
| |
| - iris: Set num_uniforms in bytes |
| - iris/compute: Set mask bits on PIPELINE_SELECT |
| - iris: Add IRIS_DIRTY_CONSTANTS_CS |
| - iris: Add iris_restore_compute_saved_bos |
| - iris/compute: Add MEDIA_STATE_FLUSH following WALKER |
| - iris/compute: Flush compute batches |
| - iris/compute: Get group counts from grid->grid |
| - iris/program: Don't try to push ubo ranges for compute |
| - iris/compute: Wait on compute batch when mapping |
| - iris/compute: Provide binding table entry for gl_NumWorkGroups |
| - iris/compute: Flush compute batch on memory-barriers |
| - iris/compute: Push subgroup-id |
| - iris/compute: Support indirect compute dispatch |
| - iris: Emit default L3 config for the render pipeline |
| - genxml/gen_bits_header.py: Use regex to strip no alphanum chars |
| - genxml: Remove extra space in gen4/45/5 field name |
| - iris: Add gitlab-ci build testing |
| - iris: Always use in-tree i915_drm.h |
| - nir: Add int64/doubles options into nir_shader_compiler_options |
| - intel/compiler: Move int64/doubles lowering options |
| - scons: Generate float64_glsl.h for glsl_to_nir fp64 lowering |
| - intel/genxml: Support base-16 in value & start fields in |
| gen_sort_tags.py |
| |
| Jose Maria Casanova Crespo (4): |
| |
| - iris: Enable ARB_shader_draw_parameters support |
| - glsl: fix typos in comments "transfor" -> "transform" |
| - glsl: TCS outputs can not be transform feedback candidates on GLES |
| - iris: setup EdgeFlag Vertex Element when needed. |
| |
| José Fonseca (1): |
| |
| - scons: Workaround failures with MSVC when using SCons 3.0.[2-4]. |
| |
| Juan A. Suarez Romero (22): |
| |
| - anv/cmd_buffer: check for NULL framebuffer |
| - nir: move ALU instruction before the jump instruction |
| - nir: remove jump from two merging jump-ending blocks |
| - genxml: add missing field values for 3DSTATE_SF |
| - anv: advertise 8 subpixel precision bits |
| - nir/spirv: return after emitting a branch in block |
| - anv: destroy descriptor sets when pool gets reset |
| - nir: deref only for OpTypePointer |
| - anv: advertise 8 subtexel/mipmap precision bits |
| - nir/xfb: do not use bare interface type |
| - meson: Add dependency on genxml to anvil genfiles |
| - Revert "intel/compiler: split is_partial_write() into two variants" |
| - spirv: add missing SPV_EXT_descriptor_indexing capabilities |
| - radv: enable descriptor indexing capabilities |
| - anv: enable descriptor indexing capabilities |
| - Update version to 19.1.0-rc1 |
| - Update version to 19.1.0-rc2 |
| - cherry-ignore: radeonsi: update buffer descriptors in all contexts |
| after buffer invalidation |
| - Update version to 19.1.0-rc3 |
| - Update version to 19.1.0-rc4 |
| - Update version to 19.1.0-rc5 |
| - Update version to 19.1.0 |
| |
| Julien Isorce (5): |
| |
| - gallium: add resource_get_info to pipe_screen |
| - radeonsi: implement resource_get_info |
| - st/va: properly set stride and offset in vlVaDeriveImage |
| - r600: implement resource_get_info |
| - st/va: check resource_get_info nullity in vlVaDeriveImage |
| |
| Józef Kucia (3): |
| |
| - mesa: Fix GL_NUM_DEVICE_UUIDS_EXT |
| - radv: Fix driverUUID |
| - radv: clear vertex bindings while resetting command buffer |
| |
| Karol Herbst (82): |
| |
| - nvc0/ir: replace cvt instructions with add to improve shader |
| performance |
| - gk104/ir: Use the new rcp/rsq in library |
| - gm107/ir: add fp64 rcp |
| - gm107/ir: add fp64 rsq |
| - gallium: add PIPE_CAP_MAX_VARYINGS |
| - st/mesa: require RGBA2, RGB4, and RGBA4 to be renderable |
| - glsl_type: initialize offset and location to -1 for glsl_struct_field |
| - nir/opt_if: don't mark progress if nothing changes |
| - clover: update ICD table to support everything up to 2.2 |
| - nir: replace magic numbers with M_PI |
| - nir/spirv: improve parsing of the memory model |
| - nir: add support for address bit sized system values |
| - nir/vtn: add support for SpvBuiltInGlobalLinearId |
| - nir/spirv: initial handling of OpenCL.std extension opcodes |
| - prog_to_nir: fix write from vps to FOG |
| - nvc0: print the shader type when dumping headers |
| - nv50/ir: move common converter code in base class |
| - nv50/ir: add lowering helper |
| - nouveau: add support for nir |
| - nouveau: fix nir and TGSI shader cache collision |
| - nv50/ir/nir: run some passes to make the conversion easier |
| - nv50/ir/nir: track defs and provide easy access functions |
| - nv50/ir/nir: add nir type helper functions |
| - nv50/ir/nir: run assignSlots |
| - nv50/ir/nir: add loadFrom and storeTo helper |
| - nv50/ir/nir: parse NIR shader info |
| - nv50/ir/nir: implement nir_load_const_instr |
| - nv50/ir/nir: add skeleton for nir_intrinsic_instr |
| - nv50/ir/nir: implement nir_alu_instr handling |
| - nv50/ir/nir: implement nir_intrinsic_load_uniform |
| - nv50/ir/nir: implement nir_intrinsic_store_(per_vertex\_)output |
| - nv50/ir/nir: implement load_(interpolated\_)input/output |
| - nv50/ir/nir: implement intrinsic_discard(_if) |
| - nv50/ir/nir: implement loading system values |
| - nv50/ir/nir: implement nir_ssa_undef_instr |
| - nv50/ir/nir: implement nir_instr_type_tex |
| - nv50/ir/nir: add skeleton getOperation for intrinsics |
| - nv50/ir/nir: implement vote and ballot |
| - nv50/ir/nir: implement variable indexing |
| - nv50/ir/nir: implement geometry shader nir_intrinsics |
| - nv50/ir/nir: implement nir_intrinsic_load_ubo |
| - nv50/ir/nir: implement ssbo intrinsics |
| - nv50/ir/nir: implement images |
| - nv50/ir/nir: add memory barriers |
| - nv50/ir/nir: implement load_per_vertex_output |
| - nv50/ir/nir: implement intrinsic shader_clock |
| - nv50/ir/nir: handle user clip planes for each emitted vertex |
| - nv50ir/nir: move immediates before use |
| - glsl: add packed for struct types |
| - glsl: add cl_size and cl_alignment |
| - nir/lower_locals_to_regs: cast array index to 32 bit |
| - nir/spirv: handle kernel function parameters |
| - nir/spirv: support physical pointers |
| - nir: add support for gather offsets |
| - nv50/ir/nir: support gather offsets |
| - nir/lower_tex: Add support for tg4 offsets lowering |
| - nir/print: fix printing the image_array intrinsic index |
| - nir/validate: validate that tex deref sources are actually derefs |
| - v3d: prefer using nir_src_comp_as_int over nir_src_as_const_value |
| - panfrost/midgard: use nir_src_is_const and nir_src_as_uint |
| - glsl/standalone: add GLES3.1 and GLES3.2 compatibility |
| - nir: move brw_nir_rewrite_image_intrinsic into common code |
| - glsl_to_nir: handle bindless textures |
| - glsl/nir: fetch the type for images from the deref instruction |
| - glsl/nir: add support for lowering bindless images_derefs |
| - nv50/ir/nir: handle bindless texture |
| - nv50/ir/nir: add support for bindless images |
| - nvc0/nir: enable bindless texture |
| - lima: add bool parameter to type_size function |
| - amd/nir: some cleanups |
| - radv: use nir constant helpers |
| - intel/nir: use nir_src_is_const and nir_src_as_uint |
| - freedreno/ir3: use nir_src_as_uint in a few places |
| - lima: use nir_src_as_float |
| - nir/builder: Move nir_imm_vec2 from blorp into the builder |
| - nir/loop_analyze: use nir_const_value.b for boolean results, not u32 |
| - spirv: reduce array size in vtn_handle_constant |
| - nir: make nir_const_value scalar |
| - vtn: handle bitcast with pointer src/dest |
| - nir: Add a nir_builder_alu variant which takes an array of components |
| - nir: Add nir_op_vec helper |
| - spirv/cl: support vload/vstore |
| |
| Kasireddy, Vivek (3): |
| |
| - nir/lower_tex: Add support for XYUV lowering |
| - dri: Add XYUV8888 format |
| - i965: Add support for sampling from XYUV images |
| |
| Kenneth Graunke (872): |
| |
| - st/mesa: Set pipe_image_view::shader_access in PBO readpixels. |
| - st/nir: Move varying setup code to a helper function. |
| - st/nir: Make new helpers for constructing built-in NIR shaders. |
| - st/mesa: Add a NIR version of the drawpixels/bitmap VS copy shader. |
| - st/mesa: Add NIR versions of the drawpixels Z/stencil fragment |
| shaders. |
| - st/mesa: Add NIR versions of the clear shaders. |
| - st/mesa: Add a NIR version of the OES_draw_texture built-in shaders. |
| - st/mesa: Add NIR versions of the PBO upload/download shaders. |
| - program: Use u_bit_scan64 in prog_to_nir. |
| - program: Extend prog_to_nir handle system values. |
| - nir: Record info->fs.pixel_center_integer in lower_system_values |
| - compiler: Mark clip/cull distance arrays as compact before lowering. |
| - nir: Bail on clip/cull distance lowering if GLSL IR already did it. |
| - nir: Avoid clip/cull distance lowering multiple times. |
| - nir: Avoid splitting compact arrays into per-element variables. |
| - st/nir: Call nir_lower_clip_cull_distance_arrays(). |
| - gallium: Add a PIPE_CAP_NIR_COMPACT_ARRAYS capability bit. |
| - nouveau: Silence unhandled cap warnings |
| - st/mesa: Limit GL_MAX_[NATIVE\_]PROGRAM_PARAMETERS_ARB to 2048 |
| - glsl: Allow gl_nir_lower_samplers*() without a gl_shader_program |
| - glsl: Don't look at sampler uniform storage for internal vars |
| - i965: Call nir_lower_samplers for ARB programs. |
| - st/nir: Pull sampler lowering into a helper function. |
| - st/nir: Lower sampler derefs for builtin shaders. |
| - st/nir: Use sampler derefs in built-in shaders. |
| - program: Make prog_to_nir create texture/sampler derefs. |
| - nir: Use sampler derefs in drawpixels and bitmap lowering. |
| - nir: Gather texture bitmasks in gl_nir_lower_samplers_as_deref. |
| - i965: Drop unnecessary 'and' with prog->SamplerUnits |
| - i965: Use info->textures_used instead of prog->SamplersUsed. |
| - mesa: Advertise EXT_float_blend in ES 3.0+ contexts. |
| - anv: Put MOCS in the correct location |
| - spirv: Eliminate dead input/output variables after translation. |
| - nir: Don't reassociate add/mul chains containing only constants |
| - compiler: Make is_64bit(GL_*) helper more broadly available |
| - mesa: Align doubles to a 64-bit starting boundary, even if packing. |
| - radeonsi: Go back to using llvm.pow intrinsic for nir_op_fpow |
| - st/mesa: Copy VP TGSI tokens if they exist, even for NIR shaders. |
| - nir: Don't forget if-uses in new nir_opt_dead_cf liveness check |
| - iris: Initial commit of a new 'iris' driver for Intel Gen8+ GPUs. |
| - iris: viewport state, sort of |
| - iris: port over batchbuffer updates |
| - iris: initial render state upload |
| - iris: packing with valgrind. |
| - iris: merge pack |
| - iris: initial gpu state, merges |
| - iris: RASTER + SF + some CLIP, fix DIRTY vs. NEW |
| - iris: scissors |
| - iris: SF_CLIP_VIEWPORT |
| - iris: Surfaces! |
| - iris: sampler views |
| - iris: stipples and vertex elements |
| - iris: framebuffers |
| - iris: don't segfault on !old_cso |
| - iris: fix SF_CL length |
| - iris: a bit of depth |
| - iris: some draw info, vbs, sample mask |
| - iris: fix crash - CSO binding can be NULL (when destroying context) |
| - iris: COLOR_CALC_STATE |
| - iris: sampler states |
| - iris: emit 3DSTATE_SAMPLER_STATE_POINTERS |
| - iris: basic push constant alloc |
| - iris: some program code |
| - iris: linear resources |
| - iris: maps |
| - iris: shader debug log |
| - iris: drop unused field |
| - iris: make an ice->render_batch field |
| - iris: disable execbuf for now |
| - iris: delete iris_pipe.c, shuffle code around |
| - iris: init the batch! |
| - iris: fix/rework line stipple |
| - iris: actually save VBs |
| - iris: msaa sample count packing problems |
| - iris: fix prim type |
| - iris: fix bogus index buffer reference |
| - iris: draw->restart_index is uninitialized if PR is not enabled |
| - iris: parse INTEL_DEBUG |
| - iris: reworks, FS compile pieces |
| - iris: import program cache code |
| - iris: do the FS...asserts because we don't lower uniforms yet |
| - iris: lower io |
| - iris: make iris_batch target a particular ring |
| - iris: kill iris_new_batch |
| - iris: move MAX defines to iris_batch.h |
| - iris: bit of SBA code |
| - iris: flag SBA updates when instruction BO changes |
| - iris: try and have an iris address |
| - iris: so, sba then. |
| - iris: reference VB BOs |
| - iris: VB addresses |
| - iris: DEBUG=bat |
| - iris: VB fixes |
| - iris: actually APPEND commands, not stomp over the top and never incr |
| - iris: actually flush the commands |
| - iris: actually advance forward when emitting commands |
| - iris: initialize dirty bits to ~0ull |
| - iris: hack to stop crashing on samplers for now |
| - iris: fix indentation |
| - iris: fix assert |
| - iris: fix VBs |
| - iris: vertex packet fixes |
| - iris: fix VF instancing length so we don't get garbage in batch |
| - iris: 3DPRIMITIVE fields |
| - iris: bind_state -> compute state |
| - iris: scissor slots |
| - iris: some shader bits |
| - iris: promote iris_program_cache_item to iris_compiled_shader |
| - iris: actually save derived state |
| - iris: emit shader packets |
| - iris: convert IRIS_DIRTY\_\* to #defines |
| - iris: don't forget about TE |
| - iris: reorganize commands to match brw |
| - iris: initial gpu state |
| - iris: WM. |
| - iris: index buffer BO |
| - iris: more comes from bits filled in |
| - iris: drop const from prog data parameters |
| - iris: softpin some things |
| - iris: use vtbl to avoid multiple symbols, fix state base address |
| - iris: fix SBA |
| - iris: move key pop to state module |
| - iris: bits of WM key |
| - iris: shuffle comments |
| - iris: no NEW_SBA |
| - iris: rewrite program cache to use u_upload_mgr |
| - iris: actually destroy the cache |
| - iris: actually softpin at an address |
| - iris: actually set KSP offsets |
| - iris: URB configs. |
| - iris: dummy constants |
| - iris: blend state |
| - iris: alpha testing in PSB |
| - iris: basic SBE code |
| - iris: warning fixes |
| - iris: fix silly unused batch with addr macro |
| - iris: render targets! |
| - iris: don't do samplers for disabled stages |
| - iris: smaller blend state |
| - iris: actually pin the instruction cache buffers |
| - iris: compctrl |
| - iris: more sketchy SBE |
| - iris: fix dmabuf retval comparisons |
| - iris: more SF CL VPs |
| - iris: catastrophic state pointer mistake |
| - iris: fix extents |
| - iris: write DISABLES are not write ENABLES...whoops |
| - iris: sample mask...not 0. |
| - iris: uniform bits...badly |
| - iris: warn if execbuf fails |
| - iris: NOOP pad batches correctly |
| - iris: decode batches if they fail to submit |
| - iris: enable a few more formats |
| - iris: set strides on transfers |
| - iris: stop adding 9 to our varyings |
| - iris: bufmgr updates. |
| - iris: some thinking about binding tables |
| - iris: Soft-pin the universe |
| - iris: fix icache memzone |
| - iris: dump gtt offset in dump_validation_list |
| - iris: Also set SUPPORTS_48B? Not sure if necessary. |
| - iris: more uploaders |
| - iris: rewrite to use memzones and not relocs |
| - iris: set EXEC_OBJECT_WRITE |
| - iris: include p_defines.h in iris_bufmgr.h |
| - iris: binders |
| - iris: hook up batch decoder |
| - iris: binder fixes |
| - iris: decoder fixes |
| - iris: update vb BO handling now that we have softpin |
| - iris: validation dumping improvements |
| - iris: canonicalize addresses. |
| - iris: delete more trash |
| - iris: allocate SURFACE_STATEs up front and stop streaming them |
| - iris: same treatment for sampler views |
| - iris: assemble SAMPLER_STATE table at bind time |
| - iris: fix a scissor bug |
| - iris: SBA once at context creation, not per batch |
| - iris: TES stash |
| - iris: isv freeing fixes |
| - iris: set sampler views |
| - iris: decoder fixes |
| - iris: better BT asserts |
| - iris: increase allocator alignment |
| - iris: fix index |
| - iris: port bug fix from i965 |
| - iris: fixes from i965 |
| - iris: fixes |
| - iris: crazy pipe control code |
| - iris: bo reuse |
| - iris: vma fixes - don't free binder address |
| - iris: vma - fix assert |
| - iris: better SBE |
| - iris: fix texturing! |
| - iris: Move get_command_space to iris_batch.c |
| - iris: Defines for base addresses rather than numbers everywhere |
| - iris: pull in newer comments |
| - iris: copy over i965's cache tracking |
| - iris: move bo_offset_from_sba |
| - iris: bits of blorp code |
| - iris: more blitting code to make readpixels work |
| - iris: drop bogus binder free |
| - iris: fix sampler view crashes |
| - iris: more blorp |
| - iris: fix blorp prog data crashes |
| - iris: add INTEL_DEBUG=reemit |
| - iris: drop the 48b printout, we never use anything else |
| - iris: hacky flushing for now |
| - iris: linear staging buffers - fast CPU access... |
| - iris: make blorp pin the binder |
| - iris: blorp URB |
| - iris: no more drawing rectangle in blorp |
| - iris: assert surf init |
| - iris: some depth stuff :( |
| - iris: bump GL version to 4.2 |
| - iris: uniforms for VS |
| - iris: proper length for VE packet? |
| - iris: proper # of uniforms |
| - iris: properly reject formats, fixes RGB32 rendering with texture |
| float |
| - iris: blorp bug fixes |
| - iris: delete growing code and just die for now |
| - iris: just turn batch reset_and_clear_caches into reset |
| - iris: chaining not growing |
| - iris: caps |
| - iris: fix batch chaining... |
| - iris: fix decoding and undo testing code |
| - iris: Lower the max number of decoded VBO lines |
| - iris: fix whitespace |
| - iris: fix 3DSTATE_VERTEX_ELEMENTS length |
| - iris: more depth stuffs... |
| - iris: fix VF INSTANCING length |
| - iris: util_copy_framebuffer_state (ported from Rob's v3d patches) |
| - iris: transfers |
| - iris: flush always |
| - iris: maybe slightly less boats uniforms |
| - iris: fix constant packet length to match i965 |
| - iris: better ubo handling |
| - iris: completely rewrite binder |
| - iris: have more than one const_offset |
| - iris: make surface states for cbufs |
| - iris: fill out pull constant buffers |
| - iris: fix pull bufs that aren't the first user upload |
| - iris: use u_transfer helpers for now |
| - iris: better VFI |
| - iris: fix release builds |
| - iris: drop assert for now |
| - iris: disable \__gen_validate_value in release mode |
| - iris: allow mapped buffers during execution (faster) |
| - iris: comment about reemitting and flushing |
| - iris: state cleaning |
| - iris: untested index buffer upload |
| - iris: delete some pointless STATIC_ASSERTS |
| - iris: untested SAMPLER_STATE pin BO fix |
| - iris: put back the always flush - fixes some things :( |
| - iris: save pointers to streamed state resources |
| - iris: fix the validation list on new batches |
| - iris: flag DIRTY_WM properly |
| - iris: bindings dirty tracking |
| - iris: some dirty fixes |
| - iris: clear dirty |
| - iris: plug leaks |
| - iris: more leak fixes |
| - iris: pc fixes |
| - iris: remove 4 bytes of padding in iris_compiled_shader |
| - iris: rzalloc iris_compiled_shader so memcmp works even if padding |
| creeps in |
| - iris: don't leak sampler state table resources |
| - iris: don't leak keyboxes when searching for an existing program |
| - iris: indentation |
| - iris: use pipe resources not direct BOs |
| - iris: clean up some warnings so I can see through the noise |
| - iris: print binder utilization in INTEL_DEBUG=submit |
| - iris: redo VB CSO a bit |
| - iris: print refcounts in INTEL_DEBUG=submit |
| - iris: support signed vertex buffer offsets |
| - iris: fix major refcounting bug with resources |
| - iris: fix caps so tests run again |
| - iris: avoid crashing on unbound constant resources |
| - iris: emit 3DSTATE_SBE_SWIZ |
| - iris: max VP index |
| - iris: fix viewport counts and settings |
| - iris: fix num viewports to be based on programs |
| - iris: fix VP iteration |
| - iris: scissor count fixes |
| - iris: actually init num_viewports |
| - iris: print second batch size separately |
| - iris: don't always flush |
| - iris: Handle batch submission failure "better" |
| - iris: bad inherited comments |
| - iris: colorize batchbuffer failures to make them stand out |
| - iris: iris - fix QWord aligned endings after batch chaining rework |
| - iris: tidy comments about mirroring modes |
| - iris: Disable unsupported mirror clamp modes |
| - iris: fix fragcoord ytransform |
| - iris: better boxing on maps |
| - iris: clears |
| - iris: rework DEBUG_REEMIT |
| - iris: shader dirty bits |
| - iris: clear fix |
| - iris: fall back to u_generate_mipmap |
| - iris: implement copy image |
| - iris: lightmodel flat |
| - iris: maybe-flush before blorp operations |
| - iris: fix provoking vertex ordering |
| - iris: larger polygon offset |
| - iris: TES uniform fixes |
| - iris: geometry shader support |
| - iris: don't emit garbage 3DSTATE_VERTEX_BUFFERS when there aren't any |
| - iris: fix 3DSTATE_VERTEX_ELEMENTS / VF_INSTANCING for 0 elements |
| - iris: fix GS dispatch mode |
| - iris: depth clears |
| - iris: null surface for unbound textures |
| - iris: state ref tuple |
| - iris: don't include binder in surface VMA range |
| - iris: border color memory zone :( |
| - iris: implement border color, fix other sampler nonsense |
| - iris: dead pointer |
| - iris: just malloc one iris_genx_state instead of a bunch of oddball |
| pieces |
| - iris: SBE change stash |
| - iris: fix zoffset asserts with 2DArray/Cube |
| - iris: rename map->stride |
| - iris: actually set cube bit properly |
| - iris: keep DISCARD_RANGE |
| - iris: actually handle array layers in blits |
| - iris: comment out l/a/i/la |
| - iris: fix clip flagging on fb changes |
| - iris: fix depth bounds clamp enables |
| - iris: don't crash on shader perf logs |
| - iris: slab allocate transfers |
| - iris: rearrange iris_resource.h |
| - iris: Implement 3DSTATE_SO_DECL_LIST |
| - iris: SO buffers |
| - iris: streamout |
| - iris: set even if no outputs |
| - iris: bother setting program_string_id... |
| - iris: fix SO_DECL_LIST |
| - iris: actually pin the buffers |
| - iris: fix sample mask for MSAA-off |
| - iris: disable 6x MSAA support |
| - iris: multislice transfer maps |
| - iris: fix CC_VIEWPORT |
| - iris: draw indirect support? |
| - iris: save query type |
| - iris: bits of multisample program key |
| - iris: s/hwcso/state/g |
| - iris: bind state helper function |
| - iris: NOS mechanics |
| - iris: record FS NOS |
| - iris: fix crash |
| - iris: fix sampler views of TBOs |
| - iris: fix texture buffer stride |
| - iris: TES program key inputs |
| - iris: compile a TCS...don't bother with passthrough yet |
| - iris: don't emit SO_BUFFERS and SO_DECL_LIST unless streamout is |
| enabled |
| - iris: vertex ID, instance ID |
| - iris: fix SGVS when there are no valid vertex elements |
| - iris: fill out MAX_PATCH_VERTICES |
| - iris: assert about passthrough shaders to make this easier to detect |
| - iris: fix EmitNoIndirect |
| - iris: fix Z24 |
| - iris: reemit blend state for alpha test function changes |
| - iris: point sprite enables |
| - iris: hack around samples confusion |
| - iris: fix blorp filters |
| - iris: expose more things that we already support |
| - iris: fix msaa flipping filters |
| - iris: export get_shader_info |
| - iris: implement set_shader_buffers |
| - iris: emit binding table for atomic counters and SSBOs |
| - iris: shorten loop |
| - iris: unbind compiled shaders if none are present |
| - iris: fix TBO alignment to match 965 |
| - iris: enable SSBOs |
| - iris: fix SSBO indexing |
| - iris: fix for disabling ssbos |
| - iris: update bindings when changing programs |
| - iris: drop unused bo parameter |
| - iris: implement texture/memory barriers |
| - iris: Don't reserve new binding table section unless things are dirty |
| - iris: update a todo comment |
| - iris: BIG OL' HACK for UBO updates |
| - iris: enable texture gather |
| - iris: Avoid croaking when trying to create FBO surfaces with bad |
| formats |
| - iris: fix GS output component limit |
| - iris: drop pipe_shader_state |
| - iris: fix sample mask |
| - iris: cube arrays are cubes too |
| - iris: we don't support textureGatherOffsets, need it lowered |
| - iris: add minor comments |
| - iris: comment everything |
| - iris: sync bugfixes from brw_bufmgr |
| - iris: remember to set bo->userptr |
| - iris: rename ring to engine |
| - iris: simplify batch len qword alignment |
| - iris: get angry about execbuf failures |
| - iris: fill out more caps |
| - iris: depth or stencil fixes |
| - iris: clear stencil |
| - iris: actually emit stencil packets |
| - iris: allow S8 as a stencil format |
| - iris: WTF transfers |
| - iris: use u_transfer_helper for depth stencil packing/unpacking |
| - iris: drop stencil handling now that u_transfer_helper does it |
| - iris: refcounting, who needs it? |
| - iris: actually do stencil blits |
| - iris: say no to more formats |
| - iris: deal with Marek's new MSAA caps |
| - iris: we can do multisample Z resolves |
| - iris: Convert RGBX to RGBA for rendering. |
| - iris: disallow RGB32 formats too |
| - iris: Fix tiled memcpy for cubes...and for array slices |
| - iris: blorp blit multiple slices |
| - iris: assert depth is 1 in resource_copy_region |
| - iris: call maybe_flush for each blorp operation |
| - iris: implement ARB_clear_texture |
| - iris: last VUE map NOS, handle > 16 FS inputs |
| - iris: drop dead assignments |
| - iris: drop pwrite |
| - iris: port non-bucket alignment bugfix |
| - iris: don't emit SBE all the time |
| - iris: rename pipe to base |
| - iris: Drop bogus sampler state saving |
| - iris: move iris_shader_state from ice->shaders.state to |
| ice->state.shaders |
| - iris: Move things to iris_shader_state |
| - iris: Move iris_sampler_view declaration to iris_resource.h |
| - iris: track depth/stencil writes enabled |
| - iris: use consistent copyright formatting |
| - iris: Move cache tracking to iris_resolve.c |
| - iris: proper cache tracking |
| - iris: precompute hashes for cache tracking |
| - iris: Reduce binder alignment from 64 to 32 |
| - iris: reenable R32G32B32 texture buffers |
| - iris: z_res -> s_res |
| - iris: implement get_sample_position |
| - iris: fix line-aa-width |
| - iris: try to hack around binder issue |
| - iris: fix sampler state setting |
| - iris: big old hack for tex-miplevel-selection |
| - iris: use linear for 1D textures |
| - iris: handle level/layer in direct maps |
| - iris: fix crash when binding optional shader for the first time |
| - iris: Skip primitive ID overrides if the shader wrote a custom value |
| - iris: fix blend state memcpy |
| - iris: new caps |
| - iris: use Eric's new caps helper |
| - iris: Allow inlining of require/get_command_space |
| - iris: skip over whole function if dirty == 0 |
| - iris: don't unconditionally emit 3DSTATE_VF / 3DSTATE_VF_TOPOLOGY |
| - iris: fix constant buffer 0 to be absolute |
| - iris: set EXEC_OBJECT_CAPTURE on all driver internal buffers |
| - iris: fix null FB and unbound tex surface state addresses |
| - iris: Support multiple binder BOs, update Surface State Base Address |
| - iris: fix SO offset writes for multiple streams |
| - iris: update comments for multibinder |
| - iris: move binder pinning outside the dirty == 0 check |
| - iris: re-pin binding table contents if we didn't re-emit them |
| - iris: enable ARB_enhanced_layouts |
| - iris: refactor LRIs in context setup |
| - iris: initialize "don't suck" bits, as Ben likes to call them |
| - iris: totally untested icelake support |
| - iris: refactor program CSO stuff |
| - iris: silence const warning |
| - iris: fix context restore of 3DSTATE_CONSTANT ranges |
| - iris: properly re-pin stencil buffers |
| - iris: delete bogus comment |
| - iris: inherit the index buffer properly |
| - iris: use 0 for TCS passthrough program string ID |
| - iris: rw_bo for pipe controls |
| - iris: LRM/SRM/SDI hooks |
| - iris: initial query code |
| - iris: gen10+ workarounds and break fix |
| - iris: results write |
| - iris: flush batch when asking for result via QBO |
| - iris: fix random failures via CS stall...but why? |
| - iris: gpr0 to bool |
| - iris: play chicken with timer queries for now |
| - iris: pipeline stats |
| - iris: primitives generated query support |
| - iris: drop explicit pinning |
| - iris: timestamps |
| - iris: ...and SO prims emitted queries |
| - iris: glGet timestamps, more correct timestamps |
| - iris: Need to \| 1 when asking for timestamps |
| - iris: 36-bit overflow fixes |
| - iris: early return properly |
| - iris: better query file comment |
| - iris: magic number 36 -> #define |
| - iris: Enable ARB_shader_vote |
| - iris: just mark snapshots_landed from the CPU |
| - iris: drop a bunch of pipe_sampler_state stuff we don't need |
| - iris: vma_free bo->size, not bo_size |
| - iris: don't mark contains_draw = false when chaining batches |
| - iris: fix Z32_S8 depth sampling |
| - iris: stencil texturing |
| - iris: force persample interp cap |
| - iris: pipe to scs -> iris_pipe.h |
| - iris: inline stage_from_pipe to avoid unused warnings |
| - iris: add gen11 to genX_call |
| - iris: Allow PIPE_CONTROL with Stall at Scoreboard and RT flush |
| - iris: rework format translation apis |
| - iris: Use R/RG instead of I/L/A when sampling |
| - iris: enable I/L formats |
| - iris: X32_S8X24 :/ |
| - iris: set the binding table size |
| - iris: lower storage image derefs |
| - iris: implement set_shader_images hook |
| - iris: bother with BTIs |
| - iris: set image access correctly |
| - iris: actually set image access |
| - iris: null for non-existent cbufs |
| - iris: move images next to textures in binding table |
| - iris: advertise GL_ARB_shader_texture_image_samples |
| - iris: Enable fb fetch |
| - iris: initial compute caps |
| - iris: yes |
| - iris: drop dead format //'s |
| - iris: drop XXX's about swizzling |
| - iris: little bits of compute basics |
| - iris: drop XXX that Jordan handled |
| - iris: drop unnecessary #ifdefs |
| - iris: leave XXX about unnecessary binding table uploads |
| - iris: bail if SLM is needed |
| - iris: fix whitespace |
| - iris: XXX for compute state tracking :/ |
| - iris: rewrite grid surface handling |
| - iris: better dirty checking |
| - iris: don't let render/compute contexts stomp each other's dirty bits |
| - iris: hack to avoid memorybarriers out the wazoo |
| - iris: do PIPELINE_SELECT for render engine, add flushes, GLK hacks |
| - iris: fix SBA flushing by refactoring code |
| - iris: try and avoid pointless compute submissions |
| - iris: fix UBOs with bindings that have an offset |
| - iris: flag CC_VIEWPORT when changing num viewports |
| - iris: fix SF_CLIP_VIEWPORT array indexing with multiple VPs |
| - iris: Fix texture buffer / image buffer sizes. |
| - iris: Clamp UBO and SSBO access to the actual BO size, for safety |
| - iris: Move snapshots_landed to the front. |
| - iris: Fix off by one in scissoring, empty scissors, default scissors |
| - iris: Fall back to 1x1x1 null surface if no framebuffer supplied |
| - iris: SO_DECL_LIST fix |
| - iris: Fix refcounting of grid surface |
| - iris: delete dead code |
| - iris: fix overhead regression from "don't stomp each other's dirty |
| bits" |
| - iris: allow binding a null vertex buffer |
| - iris: Flag constants dirty on program changes |
| - iris: Disable a PIPE_CONTROL workaround on Icelake |
| - iris: Enable ARB_shader_stencil_export |
| - iris: Enable A8/A16_UNORM in an inefficient manner |
| - iris: Drop B5G5R5X1 support |
| - iris: Use at least 1x1 size for null FB surface state. |
| - iris: Cross-link iris_batches so they can potentially flush each |
| other |
| - iris: cross batch flushing |
| - iris: Don't leak the compute batch |
| - iris: Actually create/destroy HW contexts |
| - iris: Enable msaa_map transfer helpers |
| - iris: tidy more warnings |
| - iris: implement scratch space! |
| - iris: Fix MSAA smooth points |
| - iris: Fix TextureBarrier |
| - iris: Fix multiple RTs with non-independent blending |
| - iris: partial set_query_active_state |
| - iris: Print the batch name when decoding |
| - iris: Clone the NIR |
| - iris: Defer cbuf0 upload to draw time |
| - iris: drop unnecessary param[] setup from iris_setup_uniforms |
| - iris: add param domain defines |
| - iris: fill out params array with built-ins, like clip planes |
| - iris: only bother with params if there are any... |
| - iris: lower user clip planes |
| - iris: hook up key stuff for clip plane lowering |
| - iris: fix system value remapping |
| - iris: dodge backend UCP lowering |
| - iris: bypass params and do it ourselves |
| - iris: actually upload clip planes. |
| - iris: fix num clip plane consts |
| - iris: fix more uniform setup |
| - iris: drop iris_setup_push_uniform_range |
| - iris: enable push constants if we have sysvals but no uniforms |
| - iris: regather info so we get CLIP_DIST slots, not CLIP_VERTEX |
| - iris: don't support pull constants. |
| - iris: don't trip on param asserts |
| - iris: drop param stuffs |
| - iris: don't forget to upload CS consts |
| - iris: fix sysval only binding tables |
| - iris: only clip lower if there's something to clip against |
| - iris: leave another TODO |
| - iris: Fix SourceAlphaBlendFactor |
| - iris: "Fix" transfer maps of buffers |
| - iris: Fix independent alpha blending. |
| - iris: more TODO |
| - iris: scissored and mirrored blits |
| - iris: more todo notes |
| - iris: Fix TCS/TES slot unification |
| - iris: properly pin stencil buffers |
| - iris: Fix SLM |
| - iris: Use iris_use_pinned_bo rather than add_exec_bo directly |
| - iris: Combine iris_use_pinned_bo and add_exec_bo |
| - iris: Avoid cross-batch synchronization on read/reads |
| - iris: Avoid synchronizing due to the workaround BO |
| - iris: replace vestiges of fence fds with newer exec_fence API |
| - iris: Drop vestiges of throttling code |
| - iris: Hang on to the last batch's sync-point, so we can wait on it |
| - iris: Add wait fences to properly sync between render/compute |
| - iris: leave a TODO |
| - iris: flush the compute batch too if border pool is redone |
| - iris: put render batch first in fence code |
| - iris: Put batches in an array |
| - iris: PIPE_CONTROL workarounds for GPGPU mode |
| - iris: RT flush for memorybarrier with texture bit |
| - iris: update comment |
| - iris: Enable ctx->Const.UseSTD430AsDefaultPacking |
| - iris: Lie about indirects |
| - iris: Fix buffer -> buffer copy_region |
| - iris: Fix VIEWPORT/LAYER in stream output info |
| - iris: Do the 48-bit vertex buffer address invalidation workaround |
| - iris: drop long dead XXX comment |
| - iris: Track a binding history for buffer resources |
| - iris: add iris_flush_and_dirty_for_history |
| - iris: Flush for history at various moments |
| - iris: Re-pin even if nothing is dirty |
| - iris: fix prototype warning |
| - iris: export iris_upload_shader |
| - iris: fix comment location |
| - iris: Use wrappers for create_xs_state rather than a switch statement |
| - iris: rework program cache interface |
| - iris: Enable precompiles |
| - iris: Use program's num textures not the state tracker's bound |
| - iris: drop pull constant binding table entry |
| - iris: add assertions about binding table starts |
| - iris: add an extra BT assert from Chris Wilson |
| - iris: actually flush for storage images |
| - iris: fix some SO overflow query bugs and tidy the code a bit |
| - iris: drop key_size_for_cache |
| - iris: for BLORP, only use the predicate enable bit when USE_BIT |
| - iris: check query first |
| - iris: fix conditional compute, don't stomp predicate for pipelined |
| queries |
| - iris: Rework tiling/modifiers handling |
| - iris: Fix failed to compile TCS message |
| - iris: Destroy transfer helper on screen teardown |
| - iris: Destroy the border color pool |
| - iris: Unref unbound_tex resource |
| - iris: Fix IRIS_MEMZONE_COUNT to exclude the border color pool |
| - iris: Destroy the bufmgr |
| - iris: Stop leaking iris_uncompiled_shaders like mad |
| - iris: move some non-buffer case code in a bit |
| - iris: Don't bother considering if the underlying surface is a cube |
| - iris: fix alpha channel for RGB BC1 formats |
| - iris: fix dma buf import strides |
| - iris: CS stall for stream out -> VB |
| - iris: make clipper statistics dynamic |
| - iris: reject all clipping when we can't use streamout render disabled |
| - iris: omask can kill |
| - iris: reemit SBE when sprite coord origin changes |
| - iris: re-pin inherited streamout buffers |
| - iris: Fix NOS mechanism |
| - iris: fix overhead regression from flushing for storage images |
| - iris: fix set_sampler_views to not unbind, be better about bounds |
| - iris: Fix set_sampler_views with start > 0 |
| - iris: Replace num_textures etc with a bitmask we can scan |
| - iris: Drop continues in resolve |
| - iris: Fix clear dimensions |
| - iris: Clamp viewport extents to the framebuffer dimensions |
| - iris: Enable guardband clipping |
| - iris: Fix primitive generated query active flag |
| - iris: Always do rasterizer discard in clipper |
| - iris: override alpha to one src1 blend factors |
| - iris: handle PatchVerticesIn as a system value. |
| - iris: rewrite set_vertex_buffer and VB handling |
| - iris: Reorder LRR parameters to have dst first. |
| - iris: Add \_MI_ALU helpers that don't paste |
| - iris: Don't bother packing 3DSTATE_SO_BUFFER at create time |
| - iris: Move iris_stream_output_target def to iris_context.h |
| - iris: only get space for one offset in stream output targets |
| - iris: Implement DrawTransformFeedback() |
| - iris: drop unnecessary genx->streamout field |
| - iris: Fix for PIPE_CAP_SIGNED_VERTEX_BUFFER_OFFSET |
| - iris: Fix the prototype for iris_bo_alloc_tiled |
| - iris: don't print the pointer in INTEL_DEBUG=submit |
| - iris: Use a surface state fill helper |
| - iris: Make a alloc_surface_state helper |
| - iris: whitespace fixes |
| - iris: Track blend enables, save outbound for resolve code |
| - iris: always pin the binder...in the compute context, too. |
| - iris: delete finished comments |
| - iris: pin and re-pin the scratch BO |
| - iris: more dead comments |
| - iris: only mark depth/stencil as writable if writes are actually |
| enabled |
| - iris: better MOCS |
| - iris: Fix scratch space allocation on Icelake. |
| - iris: Only resolve inputs for actual shader stages |
| - iris: Add a more long term TODO about timebase scaling |
| - iris: Fix compute scratch pinning |
| - iris: Delete bogus comment about cube array counting. |
| - iris: Fix framebuffer layer count |
| - iris: Don't enable push constants just because there are system |
| values |
| - iris: Don't make duplicate system values |
| - iris: Fill out brw_image_params for storage images on Broadwell |
| - iris: Fix surface states for Gen8 lowered-to-untype images |
| - iris: Leave a comment about why Broadwell images are broken |
| - iris: Implement multi-slice copy_region |
| - iris: Flush the render cache in flush_and_dirty_for_history |
| - iris: Handle PIPE_TRANSFER_DISCARD_WHOLE_RESOURCE somewhat |
| - iris: Don't check other batches for our batch BO |
| - iris: Drop a dead comment |
| - iris: Delete genx->bound_vertex_buffers |
| - iris: Fix Broadwell WaDividePSInvocationCountBy4 |
| - iris: Use new PIPE_STAT_QUERY enums rather than hardcoded numbers. |
| - iris: Switch to the new PIPELINE_STATISTICS_QUERY_SINGLE capability |
| - iris: fail to create screen for older unsupported HW |
| - iris: Allow sample mask of 0 |
| - iris: Don't enable smooth points when point sprites are enabled |
| - iris: Assert about blits with color masking |
| - iris: Pay attention to blit masks |
| - iris: CS stall on VF cache invalidate workarounds |
| - iris: Fix SO issue with INTEL_DEBUG=reemit, set fewer bits |
| - iris: Don't whack SO dirty bits when finishing a BLORP op |
| - iris: Fix memzone_for_address for the surface and binder zones |
| - iris: Do binder address allocations per-context, not globally. |
| - iris: Zero the compute predicate when changing the render condition |
| - iris: Remap stream output indexes back to VARYING_SLOT_*. |
| - iris: Enable PIPE_CAP_COMPACT_ARRAYS |
| - iris: Drop comment about ISP_DIS |
| - iris: Drop dead state_size hash table |
| - iris: Unreference some more things on state module teardown |
| - iris: minor tidying |
| - iris: Fix bug in bound vertex buffer tracking |
| - iris: Implement ALT mode for ARB_{vertex,fragment}_shader |
| - iris: Add a timeout_nsec parameter, rename check_syncpt to |
| wait_syncpt |
| - iris: Fix accidental busy-looping in query waits |
| - iris: Use READ_ONCE and WRITE_ONCE for snapshots_landed |
| - iris: Make a iris_batch_reference_signal_syncpt helper function. |
| - iris: Add PIPE_CAP_MAX_VARYINGS |
| - iris: rework num textures to util_lastbit |
| - iris: Stop chopping off the first nine characters of the renderer |
| string |
| - iris: Drop XXX about alpha testing |
| - iris: Set 3DSTATE_WM::ForceThreadDispatchEnable |
| - iris: Set HasWriteableRT correctly |
| - iris: Drop XXX about checking for swizzling |
| - iris: Move create and bind driver hooks to the end of iris_program.c |
| - iris: Make an IRIS_MAX_MIPLEVELS define |
| - iris: Simplify iris_get_depth_stencil_resources |
| - iris: Add missing depth cache flushes |
| - iris: Always emit at least one BLEND_STATE |
| - iris: Add iris_resource fields for aux surfaces |
| - iris: Fill out res->aux.possible_usages |
| - iris: Fill out SURFACE_STATE entries for each possible aux usage |
| - iris: create aux surface if needed |
| - iris: Initial import of resolve code |
| - iris: blorp using resolve hooks |
| - iris: add some draw resolve hooks |
| - iris: actually use the multiple surf states for aux modes |
| - iris: try to fix copyimage vs copybuffers |
| - iris: be sure to skip buffers in resolve code |
| - iris: resolve before transfer maps |
| - iris: pin the buffers |
| - iris: store modifier info in res |
| - iris: Make blit code use actual aux usages |
| - iris: consider framebuffer parameter for aux usages |
| - iris: Resolves for compute |
| - iris: disable aux for external things |
| - iris: some initial HiZ bits |
| - iris: don't use hiz for MSAA buffers |
| - iris: Set program key fields for MCS |
| - iris: make surface states for CCS_D too |
| - iris: do flush for buffers still |
| - iris: Allow disabling aux via INTEL_DEBUG options |
| - iris: Fix aux usage in render resolve code |
| - iris: Only resolve compute resources for compute shaders |
| - iris: Enable auxiliary buffer support |
| - iris: Enable -msse2 and -mstackrealign |
| - Revert "iris: Enable auxiliary buffer support" |
| - vulkan: Fix 32-bit build for the new overlay layer |
| - mesa: Fix RGBBuffers for renderbuffers with sized internal formats |
| - iris: Drop RGBX -> RGBA for storage image usages |
| - iris: Properly allow rendering to RGBX formats. |
| - i965: Implement threaded GL support. |
| - tgsi_to_nir: use sampler variables and derefs |
| - iris: Fix MOCS for blits and clears |
| - isl: Add a swizzle parameter to isl_buffer_fill_state() |
| - iris: Plumb through ISL_SWIZZLE_IDENTITY in buffer surface emitters |
| - iris: Defer uploading sampler state tables until draw time |
| - iris: Properly support alpha and luminance-alpha formats |
| - iris: Drop PIPE_CAP_BUFFER_SAMPLER_VIEW_RGBA_ONLY |
| - iris: Spruce up "are we using this engine?" checks for flushing |
| - iris: Export a copy_region helper that doesn't flush |
| - iris: Use copy_region and staging resources to avoid transfer stalls |
| - Revert MR 369 (Fix extract_i8 and extract_u8 for 64-bit integers) |
| - iris: Fix backface stencil write condition |
| - iris: Rework default tessellation level uploads |
| - iris: Fix TES gl_PatchVerticesIn handling. |
| - iris: Move depth/stencil flushes so they actually do something |
| - iris: Refactor depth/stencil buffer pinning into a helper. |
| - iris: Fix write enable in pinning of depth/stencil resources |
| - i965: Move some genX infrastructure to genX_boilerplate.h. |
| - i965: Rename ISP_DIS to INDIRECT_STATE_POINTERS_DISABLE. |
| - i965: Use genxml for emitting PIPE_CONTROL. |
| - i965: Reimplement all the PIPE_CONTROL rules. |
| - intel/fs: Fix opt_peephole_csel to not throw away saturates. |
| - iris: Don't mutate box in transfer map code |
| - iris: Don't flush the batch for unsynchronized mappings |
| - iris: Slightly better bounds on buffer sizes |
| - gallium: Add PIPE_BARRIER_UPDATE_BUFFER and UPDATE_TEXTURE bits. |
| - nvc0: Skip new update barrier bits |
| - nir: Record non-vector/scalar varyings as unmovable when compacting |
| - iris: Fix util_vma_heap_init size for IRIS_MEMZONE_SHADER |
| - iris: Skip input resolve handling if bindings haven't changed |
| - iris: Skip framebuffer resolve tracking if framebuffer isn't dirty |
| - iris: Skip resolves and flushes altogether if unnecessary |
| - iris: Fix batch chaining map_next increment. |
| - iris: Actually advertise some modifiers |
| - st/nir: Free the GLSL IR after linking. |
| - st/mesa: Fix blitting from GL_DEPTH_STENCIL to GL_STENCIL_INDEX |
| - iris: Fix blits with S8_UINT destination |
| - iris: Print the memzone name when allocating BOs with INTEL_DEBUG=buf |
| - iris: Save/restore MI_PREDICATE_RESULT, not MI_PREDICATE_DATA. |
| - iris: Silence unused variable warnings in release mode |
| - gallium/util: Add const to u_range_intersect |
| - iris: Actually pin the scratch BO. |
| - glsl: Set location on structure-split sampler uniform variables |
| - intel: Emit 3DSTATE_VF_STATISTICS dynamically |
| - iris: Actually mark blorp_copy_buffer destinations as written. |
| - iris: Preserve all PIPE_TRANSFER flags in xfer->usage |
| - iris: Fix FLUSH_EXPLICIT handling with staging buffers. |
| - iris: Make shader_perf_log print to stderr if INTEL_DEBUG=perf is set |
| - i965: Move program key debugging to the compiler. |
| - iris: Print the reason for shader recompiles. |
| - iris: Move iris_debug_recompile calls before uploading. |
| - iris: Change vendor and renderer strings |
| - iris: Add texture cache flushing hacks for blit and |
| resource_copy_region |
| - iris: Be less aggressive at postdraw work skipping |
| - iris: Add mechanism for iris-specific driconf options |
| - iris: Enable the dual_color_blend_by_location driconf option. |
| - iris: Track bound and writable SSBOs |
| - Revert "glsl: Set location on structure-split sampler uniform |
| variables" |
| - i965: Ignore uniform storage for samplers or images, use binding info |
| - i965: Tidy bogus indentation left by previous commit |
| - iris: Mark constants dirty on transfer unmap even if no flushes occur |
| - iris: Track bound constant buffers |
| - iris: Rework UBOs and SSBOs to use pipe_shader_buffer |
| - iris: Rework image views to store pipe_image_view. |
| - iris: Make a gl_shader_stage -> pipe_shader_stage helper function |
| - iris: Make memzone_for_address non-static |
| - iris: Replace buffer backing storage and rebind to update addresses. |
| - iris: Make a resource_is_busy() helper |
| - iris: Track valid data range and infer unsynchronized mappings. |
| - iris: Make some offset math helpers take a const isl_surf pointer |
| - iris: Fix DrawTransformFeedback math when there's a buffer offset |
| - iris: Prefer staging blits when destination supports CCS_E. |
| - iris: Actually put Mesa in GL_RENDERER string |
| - iris: Split iris_flush_and_dirty_for_history into two helpers. |
| - iris: Enable GL_AMD_depth_clamp_separate |
| - iris: Advertise EXT_texture_sRGB_R8 support |
| - iris: Some tidying for preemption support |
| - iris: Silence unused function warning |
| - iris: Fix zeroing of transform feedback offsets in strange cases. |
| - glsl/list: Add an exec_list_is_singular() helper. |
| - nir: Add a new nir_cf_list_is_empty_block() helper. |
| - intel/fs: Don't emit empty ELSE blocks. |
| - iris: Set XY Clipping correctly. |
| - iris: Only enable GL_AMD_depth_clamp_separate on Gen9+ |
| - iris: Fix imageBuffer and PBO download. |
| - iris: Disable dual source blending when shader doesn't handle it |
| - iris: Resolve textures used by the program, not merely bound textures |
| - iris: Fix 4GB memory zone heap sizes. |
| - iris: leave the top 4Gb of the high heap VMA unused |
| - iris: Force VMA alignment to be a multiple of the page size. |
| - iris: Delete bucketing allocators |
| - i965: Fix BRW_MEMZONE_LOW_4G heap size. |
| - i965: Force VMA alignment to be a multiple of the page size. |
| - i965: leave the top 4Gb of the high heap VMA unused |
| - i965: Fix memory leaks in brw_upload_cs_work_groups_surface(). |
| - iris: Use full ways for L3 cache setup on Icelake. |
| - egl/x11: calloc dri2_surf so it's properly zeroed |
| |
| Kevin Strasser (1): |
| |
| - egl/dri: Avoid out of bounds array access |
| |
| Khaled Emara (1): |
| |
| - freedreno: PIPE_CAP_SHADER_BUFFER_OFFSET_ALIGNMENT unreachable |
| statement |
| |
| Khem Raj (1): |
| |
| - winsys/svga/drm: Include sys/types.h |
| |
| Kishore Kadiyala (1): |
| |
| - android: static link with libexpat with Android O+ |
| |
| Konstantin Kharlamov (1): |
| |
| - mapi: work around GCC LTO dropping assembly-defined functions |
| |
| Kristian Høgsberg (49): |
| |
| - st/nir: Use src/ relative include path for autotools |
| - freedreno/a6xx: Emit blitter dst with OUT_RELOCW |
| - freedreno/a6xx: Use tiling for all resources |
| - freedreno/a6xx: regen headers |
| - freedreno/a6xx: Drop render condition check in blitter |
| - freedreno: Log number of draw for sysmem passes |
| - freedreno/a6xx: Use the right resource for separate stencil stride |
| - freedreno/a6xx: Combine emit_blit and fd6_blit |
| - freedreno: Consolidate u_blitter functions in freedreno_blitter.c |
| - freedreno: Don't tell the blitter what it can't do |
| - freedreno/a6xx: Move blit check so as to restore comment |
| - freedreno/a6xx: Support some depth/stencil blits on blitter |
| - freedreno/a6xx: Support y-inverted blits |
| - freedreno/a6xx: Add format argument to fd6_tex_swiz() |
| - freedreno/a6xx: Fall back to masked RGBA blits for depth/stencil |
| - freedreno/a6xx: Clean up mixed use of swap and swizzle for texture |
| state |
| - freedreno/a6xx: Update headers |
| - freedreno/a6xx: Front facing needs UNK3 bit |
| - freedreno/a6xx: Fix point coord |
| - .mailmap: Add a few more aliases for myself
| - freedreno: Update headers |
| - freedreno/a6xx: Copy stencil as R8_UINT |
| - freedreno/a6xx: Support MSAA resolve blits on blitter |
| - freedreno/a6xx: Only output MRT control for used framebuffers |
| - freedreno/a6xx: Don't zero SO buffer addresses |
| - freedreno: Fix a couple of warnings |
| - turnip: Only get bo offset when we need to mmap |
| - freedreno: Use c_vis_args and no_override_init_args |
| - freedreno/a6xx: Remove extra parens |
| - freedreno/ir3: Track whether shader needs derivatives |
| - freedreno/ir3: Fix operand order for DSX/DSY |
| - st/glsl_to_nir: Calculate num_uniforms from NumParameterValues |
| - freedreno/ir3: Enable PIPE_CAP_PACKED_UNIFORMS |
| - freedreno/ir3: Push UBOs to constant file |
| - freedreno/ir3: Don't access beyond available regs |
| - freedreno/ir3: Add workaround for VS samgq |
| - freedreno/ir3: Mark ir3_context_error() as NORETURN |
| - freedreno/a2xx: Fix redundant if statement |
| - freedreno: Use enum values from matching enum |
| - freedreno/a6xx: Add helper for incrementing regid |
| - freedreno: Fix format string warning |
| - .gitignore: Remove autotool artifacts |
| - tgsi: Mark tgsi_strings_check() unused |
| - glsl_to_nir: Initialize debug variable |
| - nir_opcodes.py: Saturate to expression that doesn't overflow |
| - ralloc: Fully qualify non-virtual destructor call |
| - egl/dri2: Mark potentially unused 'display' variable with |
| MAYBE_UNUSED |
| - gallium/auxiliary/vl: Fix a couple of warnings |
| - freedreno/drm: Quiet pointer to u64 conversion warning |
| |
| Leo Liu (6): |
| |
| - st/va: fix the incorrect max profiles report |
| - st/va/vp9: set max reference as default of VP9 reference number |
| - vl/dri3: remove the wait before getting back buffer |
| - radeon/vcn: add H.264 constrained baseline support |
| - radeon/vcn/vp9: search the render target from the whole list |
| - winsys/amdgpu: add VCN JPEG to no user fence group |
| |
| Lepton Wu (2): |
| |
| - virgl: close drm fd when destroying virgl screen. |
| - virgl: Set bind when creating temp resource. |
| |
| Lionel Landwerlin (127): |
| |
| - anv: assert that color attachment are valid |
| - radv: assert that colorAttachment is valid for CmdClearAttachment |
| - i965: scale factor changes should trigger recompile |
| - vulkan: Update the XML and headers to 1.1.101 |
| - anv: implement VK_EXT_depth_clip_enable |
| - build: move imgui out of src/intel/tools to be reused |
| - imgui: bump copy |
| - imgui: make sure our copy of imgui doesn't clash with others in the |
| same process |
| - vulkan: add an overlay layer |
| - intel: fix urb size for CFL GT1 |
| - anv: add support for INTEL_DEBUG=bat |
| - Revert "anv: add support for INTEL_DEBUG=bat" |
| - intel/aub_viewer: printout 48bits addresses |
| - intel/aub_viewer: silence compiler warning |
| - intel/aub_viewer: silence more compiler warnings |
| - vulkan/overlay: fix missing installation of layer |
| - vulkan/overlay: fix includes |
| - imgui: update commit |
| - imgui: update memory editor |
| - vulkan/overlay: install layer binary in libdir |
| - intel/compiler: use correct swizzle for replacement |
| - vulkan/overlay: fix min/max computations |
| - vulkan/overlay: rework option parsing |
| - vulkan/overlay: add support for fps output in file |
| - anv: add support for INTEL_DEBUG=bat |
| - vulkan: update headers/registry to 1.1.102 |
| - anv: update supported patch version |
| - radv: set num_components on vulkan_resource_index intrinsic |
| - vulkan/util: make header available from c++ |
| - vulkan/util: generate instance/device dispatch tables |
| - vulkan/overlay: drop dependency on validation layer headers |
| - intel/decoders: add address space indicator to get BOs |
| - intel/decoders: handle decoding MI_BBS from ring |
| - intel/decoders: limit number of decoded batchbuffers |
| - intel/aub_read: reuse defines from gen_context |
| - intel/aub_write: split comment section from HW setup |
| - intel/aub_write: write header in init |
| - intel/aub_write: break execlist write in 2 |
| - intel/aub_write: switch to use i915_drm engine classes |
| - intel/aub_write: log mmio writes |
| - intel/aub_write: store the physical page allocator in struct |
| - intel/aub_write: turn context images arrays into functions |
| - intel/aub_write: factorize context image/pphwsp/ring creation |
| - iris: fix decoder call |
| - iris: fix decode_get_bo callback |
| - intel/error2aub: build a list of BOs before writing them |
| - intel/error2aub: identify buffers by engine |
| - intel/error2aub: strengthen batchbuffer identifier marker
| - intel/error2aub: parse other buffer types |
| - intel/error2aub: annotate buffer with their address space |
| - intel/error2aub: store engine last ring buffer head/tail pointers |
| - intel/error2aub: write GGTT buffers into the aub file |
| - intel/error2aub: add a verbose option |
| - intel/error2aub: deal with GuC log buffer |
| - intel/error2aub: support older style engine names |
| - vulkan: factor out wsi dependencies |
| - anv: implement VK_EXT_pipeline_creation_feedback |
| - vulkan/overlay: properly register layer object with loader |
| - vulkan/overlay: silence validation layer warnings |
| - vulkan/overlay: check return value of swapchain get images |
| - vulkan/overlay: improve error reporting |
| - i965: perf: sklgt2: update a priority for register programming |
| - i965: perf: sklgt2: update compute metrics config |
| - i965: perf: sklgt2: update memory write config |
| - i965: perf: add PMA stall metrics |
| - i965: perf: chv: fixup counters names |
| - i965: perf: hsw: drop register programming not needed on HSW |
| - i965: perf: sklgt2: drop programming of an unused NOA register |
| - i965: perf: add Icelake metrics |
| - i965: perf: enable Icelake metrics |
| - i965: perf: add ring busyness metric for cfl gt2 |
| - i965: perf: update render basic configs for big core gen9/gen10 |
| - anv: implement VK_KHR_swapchain revision 70 |
| - intel: add dependency on genxml generated files |
| - genxml: add a sorting script |
| - genxml: sort xml files using new script |
| - anv: don't use default pipeline cache for hits for |
| VK_EXT_pipeline_creation_feedback |
| - anv: store heap address bounds when initializing physical device |
| - anv: leave the top 4Gb of the high heap VMA unused |
| - i965: store device revision in gen_device_info |
| - i965: extract performance query metrics |
| - i965: move mdapi data structure to intel/perf |
| - i965: move OA accumulation code to intel/perf |
| - i965: move brw_timebase_scale to device info |
| - i965: move mdapi result data format to intel/perf |
| - i965: move mdapi guid into intel/perf |
| - intel/perf: stub gen10/11 missing definitions |
| - i965: perf: add mdapi pipeline statistics queries on gen10/11 |
| - intel/perf: drop counter size field |
| - intel/perf: constify accumulator parameter
| - iris: implement WaEnableStateCacheRedirectToCS |
| - i965: implement WaEnableStateCacheRedirectToCS |
| - anv: implement WaEnableStateCacheRedirectToCS |
| - anv: fix uninitialized pthread cond clock domain |
| - intel/devinfo: fix missing num_thread_per_eu on ICL |
| - intel/devinfo: add basic sanity tests on device database |
| - anv: limit URB reconfigurations when using blorp |
| - intel: workaround VS fixed function issue on Gen9 GT1 parts |
| - anv: fix argument name for vkCmdEndQuery |
| - i965: fix icelake performance query enabling |
| - Revert "anv: limit URB reconfigurations when using blorp" |
| - vulkan/util: generate a helper function to return pNext struct sizes |
| - vulkan/overlay: update help printout |
| - vulkan/overlay: record stats in command buffers and accumulate on |
| exec/submit |
| - vulkan/overlay: add pipeline statistic & timestamps support |
| - vulkan/overlay: add no display option |
| - vulkan/overlay: add a margin to the size of the window |
| - vulkan/overlay: record all select metrics into output file |
| - vulkan/overlay: add a frame counter option |
| - vulkan/overlay: make overlay size configurable |
| - vulkan/overlay: make overridden functions static
| - vulkan/overlay: add TODO list |
| - anv: fix crash when application does not provide push constants |
| - anv: rework queries writes to ensure ordering memory writes |
| - anv: fix use after free |
| - anv: Use corresponding type from the vector allocation |
| - vulkan/overlay: keep allocating draw data until it can be reused |
| - nir: fix lower_non_uniform_access pass |
| - vulkan/overlay-layer: fix cast errors |
| - vulkan/overlay: fix truncating error on 32bit platforms |
| - nir: lower_non_uniform_access: iterate over instructions safely |
| - vulkan/overlay: fix timestamp query emission with no pipeline stats |
| - vulkan: fix build dependency issue with generated files |
| - anv: fix apply_pipeline_layout pass for arrays of YCbCr descriptors |
| - nir/lower_non_uniform: safely iterate over blocks |
| - intel/perf: fix EuThreadsCount value in performance equations |
| - intel/perf: improve dynamic loading config detection |
| |
| Lubomir Rintel (3): |
| |
| - kmsro: Extend to include armada-drm |
| - gallivm: guess CPU features also on ARM |
| - gallivm: disable NEON instructions if they are not supported |
| |
| Lucas Stach (3): |
| |
| - etnaviv: don't flush own context when updating resource use |
| - etnaviv: flush all pending contexts when accessing a resource with |
| the CPU |
| - etnaviv: only try to construct scanout resource when on KMS winsys |
| |
| Marek Olšák (121): |
| |
| - radeonsi: enable dithered alpha-to-coverage for better quality |
| - radeonsi: merge & rename texture BO metadata functions |
| - radeonsi: unify error paths in si_texture_create_object |
| - winsys/amdgpu: remove amdgpu_drm.h definitions |
| - r600: add -Wstrict-overflow=0 to meson to silence the warning |
| - radeonsi: fix a comment typo in si_fine_fence_set |
| - gallium: allow more PIPE_RESOURCE\_ driver flags |
| - meson: drop the xcb-xrandr version requirement |
| - radeonsi: handle render_condition_enable in |
| si_compute_clear_render_target |
| - radeonsi: fix crashing performance counters (division by zero) |
| - radeonsi: initialize textures using DCC to black when possible |
| - radeonsi: clear allocator_zeroed_memory with SDMA |
| - radeonsi: make allocator_zeroed_memory unmappable and use bigger |
| buffers |
| - radeonsi: don't leak an index buffer if draw_vbo fails |
| - radeonsi: use local ws variable in si_need_dma_space |
| - gallium/u_threaded: fix EXPLICIT_FLUSH for flush offsets > 0 |
| - radeonsi: fix EXPLICIT_FLUSH for flush offsets > 0 |
| - winsys/amdgpu: don't drop manually added fence dependencies |
| - winsys/amdgpu: unify fence list code |
| - winsys/amdgpu: use a separate fence list for syncobjs |
| - winsys/amdgpu: remove occurrence of INDIRECT_BUFFER_CONST
| - winsys/amdgpu: clean up IB buffer size computation |
| - winsys/amdgpu: cs_check_space sets the minimum IB size for future IBs |
| - radeonsi: add AMD_DEBUG env var as an alternative to R600_DEBUG |
| - radeonsi: use MEM instead of MEM_GRBM in COPY_DATA.DST_SEL |
| - radeonsi: add driconf option radeonsi_enable_nir |
| - radeonsi: always enable NIR for Civilization 6 to fix corruption |
| - driconf: add Civ6Sub executable for Civilization 6 |
| - st/mesa: always unmap the uploader in st_atom_array.c |
| - gallium/u_threaded: always unmap const_uploader |
| - gallium/u_upload_mgr: allow use of FLUSH_EXPLICIT with persistent |
| mappings |
| - radeonsi: use SDMA for uploading data through const_uploader |
| - tgsi: don't set tgsi_info::uses_bindless_images for constbufs and hw |
| atomics |
| - radeonsi: always use compute rings for clover on CI and newer (v2) |
| - gallium/u_tests: use a compute-only context to test GCN compute ring |
| - gallium: add pipe_grid_info::last_block |
| - omx: clean up enc_LoadImage_common |
| - omx: add a compute path in enc_LoadImage_common |
| - radeonsi: fix assertion failure by using the correct type |
| - mesa: implement ARB/KHR_parallel_shader_compile |
| - gallium: implement ARB/KHR_parallel_shader_compile |
| - util/queue: move thread creation into a separate function |
| - util/queue: add ability to kill a subset of threads |
| - util/queue: hold a lock when reading num_threads in util_queue_finish |
| - util/queue: add util_queue_adjust_num_threads |
| - radeonsi: implement ARB/KHR_parallel_shader_compile callbacks |
| - radeonsi: don't use PFP_SYNC_ME with compute-only contexts |
| - docs/relnotes: document parallel_shader_compile changes in 19.1.0, |
| not 19.0.0 |
| - amd/addrlib: fix uninitialized values for |
| Addr2ComputeDccAddrFromCoord |
| - radeonsi/gfx9: add support for PIPE_ALIGNED=0 |
| - radeonsi: add ability to bind images as image buffers |
| - radeonsi: add support for displayable DCC for 1 RB chips |
| - radeonsi: add support for displayable DCC for multi-RB chips |
| - radeonsi: enable displayable DCC on Ravens |
| - gallium: add writable_bitmask parameter into set_shader_buffers |
| - glsl: remember which SSBOs are not read-only and pass it to gallium |
| - radeonsi: set exact shader buffer read/write usage in CS |
| - tegra: fix the build after the set_shader_buffers change |
| - radeonsi: fix a crash when unbinding sampler states |
| - glsl: fix shader_storage_blocks_write_access for SSBO block arrays |
| - Revert "glsl: fix shader_storage_blocks_write_access for SSBO block |
| arrays" |
| - glsl: allow the #extension directive within code blocks for the dri |
| option |
| - mesa: don't overwrite existing shader files with |
| MESA_SHADER_CAPTURE_PATH |
| - radeonsi: set AC_FUNC_ATTR_READNONE for image opcodes where it was |
| missing |
| - ac: use the common helper ac_apply_fmask_to_sample |
| - ac: fix incorrect bindless atomic code in visit_image_atomic |
| - radeonsi: enable GL_EXT_shader_image_load_formatted |
| - nir: optimize gl_SampleMaskIn to gl_HelperInvocation for radeonsi |
| when possible |
| - winsys/amdgpu: don't set GTT with GDS & OA placements on APUs |
| - radeonsi/gfx9: use the correct condition for the DPBB + QUANT_MODE |
| workaround |
| - radeonsi: use CP DMA for the null const buffer clear on CIK |
| - tgsi/scan: add uses_drawid |
| - ac: add radeon_info::marketing_name, replacing the winsys callback |
| - ac: add radeon_info::is_pro_graphics |
| - ac: add ac_get_i1_sgpr_mask |
| - ac: add REWIND and GDS registers to register headers |
| - winsys/amdgpu: make IBs writable and expose their address |
| - winsys/amdgpu: reorder chunks, make BO_HANDLES first, IB and FENCE |
| last |
| - winsys/amdgpu: enable chaining for compute IBs |
| - winsys/amdgpu: clean up and remove nonsensical assertion |
| - radeonsi: add si_cp_copy_data |
| - radeonsi: add helper si_get_minimum_num_gfx_cs_dwords |
| - radeonsi: delay adding BOs at the beginning of IBs until the first |
| draw |
| - gallium: document conservative rasterization flags |
| - st/dri: simplify throttling code |
| - gallium: replace DRM_CONF_THROTTLE with PIPE_CAP_MAX_FRAMES_IN_FLIGHT |
| - gallium: replace DRM_CONF_SHARE_FD with PIPE_CAP_DMABUF |
| - gallium: replace drm_driver_descriptor::configuration with |
| driconf_xml |
| - gallium: set PIPE_CAP_MAX_FRAMES_IN_FLIGHT to 2 for all drivers |
| - gallium: add PIPE_CAP_PREFER_COMPUTE_BLIT_FOR_MULTIMEDIA |
| - util: fix a compile failure in u_compute.c on windows |
| - mesa: enable glGet for EXT_gpu_shader4 |
| - glsl: add \`unsigned int\` type for EXT_GPU_shader4 |
| - glsl: apply some 1.30 and other rules to EXT_gpu_shader4 as well |
| - glsl: add builtin variables for EXT_gpu_shader4 |
| - glsl: add arithmetic builtin functions for EXT_gpu_shader4 |
| - glsl: add texture builtin functions for EXT_gpu_shader4 |
| - glsl: allow "varying out" for fragment shader outputs with |
| EXT_gpu_shader4 |
| - mesa: expose EXT_texture_buffer_object |
| - mesa: only allow EXT_gpu_shader4 in the compatibility profile |
| - st/mesa: expose EXT_gpu_shader4 if GLSL 1.40 is supported |
| - glsl: handle interactions between EXT_gpu_shader4 and texture |
| extensions |
| - radeonsi: add BOs after need_cs_space |
| - radeonsi/gfx9: set that window_rectangles always roll the context |
| - radeonsi/gfx9: rework the gfx9 scissor bug workaround (v2) |
| - radeonsi: remove dirty slot masks from scissor and viewport states |
| - glsl: fix shader_storage_blocks_write_access for SSBO block arrays |
| (v2) |
| - radeonsi: don't ignore PIPE_FLUSH_ASYNC |
| - mesa: rework error handling in glDrawBuffers |
| - mesa: fix pbuffers because internally they are front buffers |
| - st/mesa: don't flush the front buffer if it's a pbuffer |
| - radeonsi: use new atomic LLVM helpers |
| - radeonsi: set sampler state and view functions for compute-only |
| contexts |
| - st/dri: decrease input lag by syncing sooner in SwapBuffers |
| - glsl: fix and clean up NV_compute_shader_derivatives support |
| - st/mesa: fix 2 crashes in st_tgsi_lower_yuv |
| - radeonsi: remove old_va parameter from si_rebind_buffer by |
| remembering offsets |
| - radeonsi: update buffer descriptors in all contexts after buffer |
| invalidation |
| - radeonsi: fix a regression in si_rebind_buffer |
| - u_blitter: don't fail mipmap generation for depth formats containing |
| stencil |
| - ac: fix a typo in ac_build_wg_scan_bottom |
| |
| Mario Kleiner (1): |
| |
| - drirc: Add sddm-greeter to adaptive_sync blacklist. |
| |
| Mark Janes (5): |
| |
| - mesa: properly report the length of truncated log messages |
| - mesa: rename logging functions to reflect that they format strings |
| - mesa: add logging function for formatted string |
| - intel/common: move gen_debug to intel/dev |
| - intel/tools: Remove redundant definitions of INTEL_DEBUG |
| |
| Mateusz Krzak (2): |
| |
| - panfrost: cast bo_handles pointer to uintptr_t first |
| - panfrost: use os_mmap and os_munmap |
| |
| Mathias Fröhlich (22): |
| |
| - st/mesa: Reduce array updates due to current changes. |
| - mesa: Track buffer object use also for VAO usage. |
| - st/mesa: Invalidate the gallium array atom only if needed. |
| - mesa: Implement helper functions to map and unmap a VAO. |
| - mesa: Factor out \_mesa_array_element. |
| - mesa: Use \_mesa_array_element in dlist save. |
| - mesa: Replace \_ae_{,un}map_vbos with \_mesa_vao_{,un}map_arrays |
| - mesa: Remove \_ae_{,un}map_vbos and dependencies. |
| - mesa: Use mapping tools in debug prints. |
| - vbo: Fix basevertex handling in display list compiles. |
| - vbo: Fix GL_PRIMITIVE_RESTART_FIXED_INDEX in display list compiles. |
| - mesa: Add assert to \_mesa_primitive_restart_index. |
| - mesa: Factor out index function that will have multiple use. |
| - mesa: Use glVertexAttrib*NV functions for fixed function attribs. |
| - mesa: Implement \_mesa_array_element by walking enabled arrays. |
| - mesa: Rip out now unused gl_context::aelt_context. |
| - mesa: Remove the now unused \_NEW_ARRAY state change flag. |
| - mesa: Constify static const array in api_arrayelt.c |
| - mesa: Remove the \_glapi_table argument from \_mesa_array_element. |
| - mesa: Set CurrentSavePrimitive in vbo_save_NotifyBegin. |
| - mesa: Correct the is_vertex_position decision for dlists. |
| - mesa: Leave aliasing of vertex and generic0 attribute to the dlist |
| code. |
| |
| Matt Turner (7): |
| |
| - intel/compiler/test: Set devinfo->gen = 7 |
| - intel/compiler: Avoid propagating inequality cmods if types are |
| different |
| - intel/compiler/test: Add unit test for mismatched signedness |
| comparison |
| - intel/compiler: Add commas on final values of compaction table arrays |
| - intel/compiler: Use SIMD16 instructions in fs saturate prop unit test |
| - intel/compiler: Add unit tests for sat prop for different exec sizes |
| - intel/compiler: Improve fix_3src_operand() |
| |
| Matthias Lorenz (1): |
| |
| - vulkan/overlay: Add fps counter |
| |
| Mauro Rossi (6): |
| |
| - android: intel/isl: remove redundant building rules |
| - android: anv: fix generated files dependencies (v2)
| - android: anv: fix libexpat shared dependency |
| - android: nouveau: add support for nir |
| - android: fix LLVM version string related building errors |
| - draw: fix building error in draw_gs_init() |
| |
| Maya Rashish (1): |
| |
| - configure: fix test portability |
| |
| Michel Dänzer (19): |
| |
| - loader/dri3: Use strlen instead of sizeof for creating VRR property |
| atom |
| - gitlab-ci: Re-use docker image from the main repo in forked repos |
| - gitlab-ci: List some longer-running jobs before others of the same |
| stage |
| - gitlab-ci: Use 8 CPU cores in autotools job |
| - gitlab-ci: Make sure clang job actually uses ccache |
| - gitlab-ci: Only pull/push cache contents in build+test stage jobs |
| - gitlab-ci: Automatically retry jobs after runner system failure |
| - gitlab-ci: Run CI pipeline for all branches in the main repository |
| - gitlab-ci: Use Debian stretch instead of Ubuntu bionic |
| - gitlab-ci: Use HTTPS for APT repositories |
| - gitlab-ci: Use Debian packages instead of pip ones for meson and |
| scons |
| - gitlab-ci: Install most packages from Debian buster |
| - gitlab-ci: Remove unneeded (stuff from) APT command lines
| - gitlab-ci: Remove unused Debian packages from Docker image |
| - gitlab-ci: Use clang 8 instead of 7 |
| - gitlab-ci: Drop unused clang 5/6 packages |
| - gitlab-ci: Do not use subshells for compiling dependencies |
| - gitlab-ci: Use LLVM 3.4 from Debian jessie for scons-llvm job |
| - gitlab-ci: Use meson buildtype debug instead of default |
| debugoptimized |
| |
| Mike Blumenkrantz (6): |
| |
| - iris: support INTEL_NO_HW environment variable |
| - gallium: add pipe cap for inner_coverage conservative raster mode |
| - st/mesa: indicate intel extension support for inner_coverage based on |
| cap |
| - iris: add support for INTEL_conservative_rasterization |
| - iris: add preemption support on gen9 |
| - iris: enable preemption support for gen10 |
| |
| Nanley Chery (3): |
| |
| - i965: Rename intel_mipmap_tree::r8stencil\_\* -> ::shadow\_\* |
| - anv: Fix some depth buffer sampling cases on ICL+ |
| - anv/cmd_buffer: Initialize the clear color struct for CNL+
| |
| Nataraj Deshpande (1): |
| |
| - anv: Fix check for isl_fmt in assert |
| |
| Neha Bhende (2): |
| |
| - st/mesa: Fix topogun-1.06-orc-84k-resize.trace crash |
| - draw: fix memory leak introduced 7720ce32a |
| |
| Nicolai Hähnle (9): |
| |
| - amd/surface: provide firstMipIdInTail for metadata surface |
| calculations |
| - radeonsi: add si_debug_options for convenient adding/removing of |
| options |
| - util/u_log: flush auto loggers before starting a new page |
| - ddebug: set thread name |
| - ddebug: log calls to pipe->flush |
| - ddebug: dump driver state into a separate file |
| - ddebug: expose some helper functions as non-inline |
| - radeonsi: add radeonsi_aux_debug option for aux context debug dumps |
| - radeonsi: add radeonsi_sync_compile option |
| |
| Oscar Blumberg (3): |
| |
| - intel/fs: Fix memory corruption when compiling a CS |
| - radeonsi: Fix guardband computation for large render targets |
| - glsl: Fix function return typechecking |
| |
| Patrick Lerda (1): |
| |
| - lima/ppir: fix pointer referenced after a free |
| |
| Patrick Rudolph (1): |
| |
| - d3dadapter9: Support software renderer on any DRI device |
| |
| Philipp Zabel (1): |
| |
| - etnaviv: fill missing offset in etna_resource_get_handle |
| |
| Pierre Moreau (12): |
| |
| - include/CL: Update to the latest OpenCL 2.2 headers |
| - clover: Avoid warnings from new OpenCL headers |
| - clover: Remove the TGSI backend as unused |
| - clover: Add a helper for checking if an IR is supported
| - clover/api: Rework the validation of devices for building |
| - clover/api: Fail if trying to build a non-executable binary |
| - clover: Disallow creating libraries from other libraries |
| - clover: Validate program and library linking options |
| - clover: Move device extensions definitions to core/device.cpp |
| - clover: Move platform extensions definitions to clover/platform.cpp |
| - clover: Only use devices supporting IR_NATIVE |
| - clover: Fix indentation issues |
| |
| Pierre-Eric Pelloux-Prayer (1): |
| |
| - radeonsi: init sctx->dma_copy before using it |
| |
| Plamena Manolova (3): |
| |
| - i965: Disable ARB_fragment_shader_interlock for platforms prior to |
| GEN9 |
| - isl: Set ClearColorConversionEnable. |
| - i965: Re-enable fast color clears for GEN11. |
| |
| Qiang Yu (9): |
| |
| - u_math: add ushort_to_float/float_to_ushort |
| - u_dynarray: add util_dynarray_grow_cap |
| - gallium/u_vbuf: export u_vbuf_get_minmax_index |
| - drm-uapi: add lima_drm.h |
| - gallium: add lima driver |
| - lima/gpir: fix compile fail when two slot node |
| - lima/gpir: fix alu check miss last store slot |
| - lima: fix lima_blit with non-zero level source resource |
| - lima: fix render to non-zero level texture |
| |
| Rafael Antognolli (45): |
| |
| - iris: Store internal_format when getting resource from handle. |
| - iris: Skip msaa16 on gen < 9. |
| - iris: Flush before hiz_exec. |
| - iris: Pin HiZ buffers when rendering. |
| - iris: Avoid leaking if we fail to allocate the aux buffer. |
| - iris/clear: Pass on render_condition_enabled. |
| - iris: Skip resolve if there's no context. |
| - iris: Flag ALL_DIRTY_BINDINGS on aux state change. |
| - iris: Add resolve on iris_flush_resource. |
| - iris: Convert RGBX to RGBA always. |
| - iris: Enable auxiliary buffer support again |
| - iris: Enable HiZ for multisampled depth surfaces. |
| - iris: Make intel_hiz_exec public. |
| - iris: Allocate buffer space for the fast clear color. |
| - iris: Use the clear depth when emitting 3DSTATE_CLEAR_PARAMS. |
| - iris: Fast clear depth buffers. |
| - iris: Add helper to convert fast clear color. |
| - iris: Add function to update clear color in surface state. |
| - iris: Bring back check for srgb and fast clear color. |
| - intel/isl: Add isl_format_has_color_component() function. |
| - intel/blorp: Make swizzle_color_value public. |
| - iris: Implement fast clear color. |
| - iris: Add iris_resolve_conditional_render(). |
| - iris: Stall on the CPU and resolve predication during fast clears. |
| - iris: Track fast clear color. |
| - iris: Let blorp update the clear color for us. |
| - i965/blorp: Remove unused parameter from blorp_surf_for_miptree. |
| - iris: Only update clear color for gens 8 and 9. |
| - iris/gen8: Re-emit the SURFACE_STATE if the clear color changed. |
| - iris: Manually apply fast clear color channel overrides. |
| - iris: Do not allocate clear_color_bo for gen8. |
| - iris: Add aux.sampler_usages. |
| - iris: Enable fast clears on gen8. |
| - intel/fs: Only propagate saturation if exec_size is the same. |
| - intel/fs: Move the scalar-region conversion to the generator. |
| - intel/fs: Add a lowering pass for linear interpolation. |
| - intel/fs: Remove fs_generator::generate_linterp from gen11+. |
| - intel/isl: Resize clear color buffer to full cacheline |
| - intel/genxml: Update MI_ATOMIC genxml definition. |
| - intel/blorp: Make blorp update the clear color in gen11. |
| - iris: Do not advertise multisampled image load/store. |
| - iris: Support sRGB fast clears even if the colorspaces differ. |
| - iris: Use the linear version of the surface format during fast |
| clears. |
| - iris: Update the surface state clear color address when available. |
| - iris: Enable fast clear colors on gen11. |
| |
| Ray Zhang (1): |
| |
| - glx: fix shared memory leak in X11 |
| |
| Rhys Kidd (1): |
| |
| - iris: Fix assertion in iris_resource_from_handle() tiling usage |
| |
| Rhys Perry (28): |
| |
| - nvc0: add compute invocation counter |
| - radv: bitcast 16-bit outputs to integers |
| - radv: ensure export arguments are always float |
| - ac/nir: implement 8-bit nir_load_const_instr |
| - ac/nir: fix 64-bit nir_op_f2f16_rtz |
| - ac/nir: make ac_build_clamp work on all bit sizes |
| - ac/nir: make ac_build_isign work on all bit sizes |
| - ac/nir: make ac_build_fdiv support 16-bit floats |
| - ac/nir: implement half-float nir_op_frcp |
| - ac/nir: implement half-float nir_op_frsq |
| - ac/nir: implement half-float nir_op_ldexp |
| - ac/nir: fix 16-bit ssbo stores |
| - ac/nir: implement 8-bit push constant, ssbo and ubo loads |
| - ac/nir: implement 8-bit ssbo stores |
| - ac/nir: add 8-bit types to glsl_base_to_llvm_type |
| - ac/nir: implement 8-bit conversions |
| - radv: enable VK_KHR_8bit_storage |
| - ac/nir: implement 16-bit pack/unpack opcodes |
| - radv: lower 16-bit flrp |
| - ac: add 16-bit support to ac_build_ddxy() |
| - nir,ac/nir: fix cube_face_coord |
| - gallium: add support for formatted image loads |
| - mesa, glsl: add support for EXT_shader_image_load_formatted |
| - st/mesa: add support for EXT_shader_image_load_formatted |
| - vc4: fix build |
| - ac,ac/nir: use a better sync scope for shared atomics |
| - radv: fix set_output_usage_mask() with composite and 64-bit types |
| - ac/nir: mark some texture intrinsics as convergent |
| |
| Rob Clark (135): |
| |
| - freedreno: fix release tarball |
| - freedreno: more fixing release tarball |
| - freedreno/a6xx: small compiler warning fix |
| - freedreno/ir3: fix varying packing vs. tex sharp edge |
| - freedreno/a6xx: move stream-out emit to helper |
| - freedreno/a6xx: clean up some open-coded bits |
| - freedreno/ir3: split out image helpers |
| - freedreno/ir3: split out a4xx+ instructions |
| - freedreno/ir3: fix ncomp for \_store_image() src |
| - freedreno/ir3: add image/ssbo <-> ibo/tex mapping |
| - freedreno/ir3: add a6xx instruction encoding |
| - freedreno/ir3: add a6xx+ SSBO/image support |
| - freedreno/ir3: HIGH reg w/a for a6xx |
| - freedreno/a6xx: border-color offset helper |
| - freedreno/a6xx: image/ssbo state emit |
| - freedreno/a6xx: compute support |
| - freedreno/a6xx: cache flush harder |
| - freedreno/a6xx: fix helper_invocation (sampler mask/id) |
| - freedreno/ir3: handle quirky atomic dst for a6xx |
| - freedreno/ir3: fix legalize for vecN inputs |
| - freedreno/ir3: fix crash in compile fail case |
| - freedreno/a6xx: 3d and cube image fixes |
| - freedreno: fix crash w/ masked non-SSA dst |
| - freedreno/ir3: rename put_dst() |
| - freedreno/ir3/a6xx: fix load_ssbo barrier type. |
| - freedreno/ir3: sync instr/disasm and add ldib encoding |
| - freedreno/ir3/a6xx: use ldib for ssbo reads |
| - freedreno/a6xx: samplerBuffer fixes |
| - freedreno/a6xx: enable tiled images |
| - freedreno: fix race condition |
| - freedreno/ir3: don't hardcode wrmask |
| - freedreno/a6xx: fix border-color offset |
| - freedreno/a6xx: cube image fix |
| - freedreno/a6xx: fix hangs with large shaders |
| - freedreno/ir3: use nopN encoding when possible |
| - freedreno/a6xx: fix ssbo alignment |
| - freedreno/ir3/a6xx: fix non-ssa atomic dst |
| - freedreno/a6xx: fix DRAW_IDX_INDIRECT max_indicies |
| - freedreno/a6xx: vertex_id is not \_zero_based |
| - freedreno/ir3/a6xx: fix atomic shader outputs |
| - freedreno/ir3: gsampler2DMSArray fixes |
| - freedreno/ir3: include nopN in expanded instruction count |
| - freedreno/ir3: add Sethi–Ullman numbering pass |
| - freedreno/ir3: track register pressure in sched |
| - freedreno: fix ir3_cmdline build |
| - freedreno/a6xx: remove astc_srgb workaround |
| - freedreno/a6xx: refactor fd6_tex_swiz() |
| - freedreno/a6xx: fix border-color swizzles |
| - freedreno/a6xx: perfcntrs |
| - freedreno/ir3: fix ir3_cmdline harder |
| - freedreno/ir3: turn on [iu]mul_high |
| - freedreno/a6xx: more bcolor fixes |
| - freedreno/ir3/cp: fix ldib bug |
| - freedreno/ir3/a6xx: fix ssbo comp_swap |
| - freedreno/ir3: better cat6 encoding detection |
| - freedreno/ir3/ra: fix half-class conflicts |
| - freedreno/ir3: fix sam.s2en decoding |
| - freedreno/ir3: fix sam.s2en encoding |
| - freedreno/ir3: fix regmask for merged regs |
| - nir: move glsl_type_get_{sampler,image}_count() |
| - freedreno/ir3: find # of samplers from uniform vars |
| - freedreno/ir3: enable indirect tex/samp (sam.s2en) |
| - freedreno/ir3: optimize sam.s2en to sam |
| - freedreno/ir3: additional lowering |
| - freedreno/ir3: fix bit_count |
| - freedreno/ir3: dynamic UBO indexing vs 64b pointers |
| - freedreno/ir3: rename has_kill to no_earlyz |
| - freedreno/ir3: disable early-z for SSBO/image writes |
| - gallium: add PIPE_CAP_ESSL_FEATURE_LEVEL |
| - mesa/st: use ESSL cap to enable gpu_shader5 |
| - freedreno: add ESSL cap |
| - docs: update freedreno status |
| - freedreno/a6xx: small cleanup |
| - freedreno/ir3: sched fix |
| - freedreno/ir3: reads/writes to unrelated arrays are not dependent |
| - freedreno/ir3: align const size to vec4 |
| - nir: print var name for load_interpolated_input too |
| - nir: add lower_all_io_to_elements |
| - freedreno/ir3: re-indent comment |
| - freedreno/ir3: rework varying packing |
| - freedreno/ir3: add pass to move varying loads |
| - freedreno/ir3: convert to "new style" frag inputs |
| - gallium/docs: clarify set_sampler_views (v2) |
| - iris: fix set_sampler_view |
| - freedreno/ir3: fix const assert |
| - freedreno/drm: update for robustness |
| - freedreno: add robustness support |
| - compiler: rename SYSTEM_VALUE_VARYING_COORD |
| - freedreno/ir3: fix rgetpos decoding |
| - freedreno/ir3: more emit-cat5 fixes |
| - freedreno/ir3: cleanup instruction builder macros |
| - freedreno: update generated headers |
| - freedreno/ir3: lower load_barycentric_at_sample |
| - freedreno/ir3: lower load_barycentric_at_offset |
| - freedreno/ir3: remove bogus assert |
| - freedreno/ir3: rename frag_vcoord -> ij_pixel |
| - freedreno/a6xx: add VALIDREG/CONDREG helper macros |
| - freedreno/ir3: fix load_interpolated_input slot |
| - freedreno: wire up core sample-shading support |
| - freedreno/ir3: sample-shading support |
| - freedreno/a6xx: sample-shading support |
| - docs/features: update GL too |
| - freedreno/ir3: switch fragcoord to sysval |
| - freedreno/a6xx: small texture emit cleanup |
| - freedreno/a6xx: pre-bake UBWC flags in texture-view |
| - freedreno/ir3: fixes for half reg in/out |
| - freedreno/ir3: fix shader variants vs UBO analysis |
| - freedreno/ir3: fix lowered ubo region alignment |
| - freedreno/ir3: add IR3_SHADER_DEBUG flag to disable ubo lowering |
| - freedreno/ir3: add some ubo range related asserts |
| - nir: rework tex instruction printing |
| - nir: fix lower_wpos_ytransform in load_frag_coord case |
| - nir: add pass to lower fb reads |
| - freedreno/drm: expose GMEM_BASE address |
| - freedreno/ir3: fb read support |
| - freedreno/a6xx: KHR_blend_equation_advanced support |
| - freedreno/a6xx: smaller hammer for fb barrier |
| - docs: mark KHR_blend_equation_advanced done on a6xx |
| - nir: fix nir tex print harder |
| - freedreno/ir3: remove assert |
| - freedreno/a6xx: OUT_RELOC vs OUT_RELOCW fixes |
| - freedreno: update generated headers |
| - freedreno/a6xx: UBWC fixes |
| - freedreno/a6xx: UBWC support for images |
| - freedreno: mark imported resources as valid |
| - freedreno/a6xx: buffer resources cannot be compressed |
| - freedreno: move UBWC color offset to fd_resource_offset() |
| - freedreno: add ubwc_enabled helper |
| - freedreno/a6xx: deduplicate a few lines |
| - freedreno: remove unused forward struct declaration |
| - freedreno/ir3: fix rasterflat/glxgears |
| - freedreno/ir3: set more barrier bits |
| - freedreno/a6xx: fix GPU crash on small render targets |
| - freedreno/a6xx: fix issues with gallium HUD |
| - freedreno/a6xx: fix hangs with newer sqe fw |
| |
| Rob Herring (2): |
| |
| - kmsro: Add lima renderonly support |
| - kmsro: Add platform support for exynos and sun4i |
| |
| Rodrigo Vivi (1): |
| |
| - intel: Add more PCI Device IDs for Coffee Lake and Ice Lake. |
| |
| Roland Scheidegger (2): |
| |
| - gallivm: fix bogus assert in get_indirect_index |
| - gallivm: fix saturated signed add / sub with llvm 9 |
| |
| Romain Failliot (1): |
| |
| - docs: changed "Done" to "DONE" in features.txt |
| |
| Ross Burton (1): |
| |
| - Revert "meson: drop GLESv1 .so version back to 1.0.0" |
| |
| Ryan Houdek (1): |
| |
| - panfrost: Adds Bifrost shader disassembler utility |
| |
| Sagar Ghuge (10): |
| |
| - iris: Don't allocate a BO per query object |
| - nir/glsl: Add another way of doing lower_imul64 for gen8+ |
| - glsl: [u/i]mulExtended optimization for GLSL |
| - nir/algebraic: Optimize low 32 bit extraction |
| - spirv: Allow [i/u]mulExtended to use new nir opcode |
| - iris: Refactor code to share 3DSTATE_URB\_\* packet |
| - iris: Track last VS URB entry size |
| - iris: Flag fewer dirty bits in BLORP |
| - intel/fs: Remove unused condition from opt_algebraic case |
| - intel/compiler: Fix assertions in brw_alu3 |
| |
| Samuel Iglesias Gonsálvez (4): |
| |
| - isl: remove the cache line size alignment requirement |
| - isl: the display engine requires 64B alignment for linear surfaces |
| - radv: don't overwrite results in vkGetQueryPoolResults() when queries |
| are not available |
| - radv: write availability status vkGetQueryPoolResults() when the data |
| is not available |
| |
| Samuel Pitoiset (147): |
| |
| - radv/winsys: fix hash when adding internal buffers |
| - radv: fix build |
| - radv: bail out when no image transitions will be performed |
| - radv: remove unused radv_render_pass_attachment::view_mask |
| - radv: remove useless MAYBE_UNUSED in CmdBeginRenderPass() |
| - radv: add radv_cmd_buffer_begin_subpass() helper |
| - radv: move subpass image transitions to |
| radv_cmd_buffer_begin_subpass() |
| - radv: store the list of attachments for every subpass |
| - radv: use the new attachments array when starting subpasses |
| - radv: determine the last subpass id for every attachments |
| - radv: handle final layouts at end of every subpass and render pass |
| - radv: move some render pass things to radv_render_pass_compile() |
| - radv: add radv_render_pass_add_subpass_dep() helper |
| - radv: track if subpasses have color attachments |
| - radv: handle subpass dependencies correctly |
| - radv: accumulate all ingoing external dependencies to the first |
| subpass |
| - radv: execute external subpass barriers after ending subpasses |
| - radv: drop useless checks when resolving subpass color attachments |
| - radv: do not set preserveAttachments for internal render passes |
| - radv: don't flush src stages when dstStageMask == BOTTOM_OF_PIPE |
| - radv: fix compiler issues with GCC 9 |
| - radv: gather more info about push constants |
| - radv: gather if shaders load dynamic offsets separately |
| - radv: keep track of the number of remaining user SGPRs |
| - radv: add support for push constants inlining when possible |
| - radv: fix using LOAD_CONTEXT_REG with old GFX ME firmwares on GFX8 |
| - radv/winsys: fix BO list creation when RADV_DEBUG=allbos is set |
| - radv: always export gl_SampleMask when the fragment shader uses it |
| - ac: make use of ac_build_expand_to_vec4() in visit_image_store() |
| - radv: use MAX_{VBS,VERTEX_ATTRIBS} when defining max vertex input |
| limits |
| - radv: store vertex attribute formats as pipeline keys |
| - radv: reduce the number of loaded channels for vertex input fetches |
| - radv: fix radv_fixup_vertex_input_fetches() |
| - radv: fix invalid element type when filling vertex input default |
| values |
| - ac: add ac_build_llvm8_tbuffer_load() helper |
| - ac: use new LLVM 8 intrinsic when loading 16-bit values |
| - radv: write the alpha channel of MRT0 when alpha coverage is enabled |
| - radv: remove unused variable in gather_push_constant_info() |
| - radv: fix writing the alpha channel of MRT0 when alpha coverage is |
| enabled |
| - radv: fix clearing attachments in secondary command buffers |
| - radv: fix out-of-bounds access when copying descriptors BO list |
| - radv: don't copy buffer descriptors list for samplers |
| - radv: use 32_AR instead of 32_ABGR when alpha coverage is required |
| - radv: allocate enough space in cmdbuf when starting a subpass |
| - radv: properly align the fence and EOP bug VA on GFX9 |
| - radv: enable lower_mul_2x32_64 |
| - Revert "radv: execute external subpass barriers after ending |
| subpasses" |
| - radv: fix pointSizeRange limits |
| - radv: set the maximum number of IBs per submit to 192 |
| - ac: rework typed buffers loads for LLVM 7 |
| - radv: store more vertex attribute infos as pipeline keys |
| - radv: use typed buffer loads for vertex input fetches |
| - ac: add ac_build_{struct,raw}_tbuffer_load() helpers |
| - ac: use the raw tbuffer version for 16-bit SSBO loads |
| - radv: always initialize HTILE when the src layout is UNDEFINED |
| - radv: always load 3 channels for formats that need to be shuffled |
| - ac: use llvm.amdgcn.fract intrinsic for nir_op_ffract |
| - radv: fix binding transform feedback buffers |
| - ac: make use of ac_get_store_intr_attribs() where possible |
| - ac/nir: set attrib flags for SSBO and image store operations |
| - ac: add ac_build_buffer_store_format() helper |
| - ac/nir: remove one useless check in visit_store_ssbo() |
| - ac/nir: use new LLVM 8 intrinsics for SSBO atomic operations |
| - ac/nir: use ac_build_buffer_load() for SSBO load operations |
| - ac/nir: use ac_build_buffer_store_dword() for SSBO store operations |
| - ac: use new LLVM 8 intrinsics in ac_build_buffer_load() |
| - ac: add ac_build_{struct,raw}_tbuffer_store() helpers |
| - ac: use new LLVM 8 intrinsic when storing 16-bit values |
| - ac: use new LLVM 8 intrinsics in ac_build_buffer_store_dword() |
| - ac: add various int8 definitions |
| - ac: add ac_build_tbuffer_load_byte() helper |
| - ac: add ac_build_tbuffer_store_byte() helper |
| - radv: add missing initializations since |
| VK_EXT_pipeline_creation_feedback |
| - ac: add f16_0 and f16_1 constants |
| - ac: add 16-bit support to fsign |
| - ac: add 16-bit support to fract |
| - ac: fix 16-bit shifts |
| - ac: fix incorrect argument type for tbuffer.{load,store} with LLVM 7 |
| - nir: use generic float types for frexp_exp and frexp_sig |
| - spirv,nir: lower frexp_exp/frexp_sig inside a new NIR pass |
| - nir: add nir_{load,store}_deref_with_access() helpers |
| - spirv: propagate the access flag for store and load derefs |
| - ac: use llvm.amdgcn.fmed3 intrinsic for nir_op_fmed3 |
| - ac: add ac_build_frexp_mant() helper and 16-bit/32-bit support |
| - ac: add ac_build_frex_exp() helper and 16-bit/32-bit support |
| - radv: do not lower frexp_exp and frexp_sig |
| - radv: enable VK_AMD_gpu_shader_int16 |
| - radv: skip updating depth/color metadata for conditional rendering |
| - radv: do not always initialize HTILE in compressed state |
| - ac: fix return type for llvm.amdgcn.frexp.exp.i32.64 |
| - ac/nir: fix nir_op_b2i16 |
| - ac: fix ac_build_bit_count() for 16-bit integer type |
| - ac: fix ac_build_bitfield_reverse() for 16-bit integer type |
| - ac: fix ac_find_lsb() for 16-bit integer type |
| - ac: fix ac_build_umsb() for 16-bit integer type |
| - ac/nir: add support for nir_op_b2i8 |
| - ac: add 8-bit support to ac_build_bit_count() |
| - ac: add 8-bit support to ac_find_lsb() |
| - ac: add 8-bit support to ac_build_umsb() |
| - ac: add 8-bit and 64-bit support to ac_build_bitfield_reverse() |
| - radv: partially enable VK_KHR_shader_float16_int8 |
| - nir: do not pack varying with different types |
| - ac/nir: fix intrinsic names for atomic operations with LLVM 9+ |
| - radv: fix getting the vertex strides if the bindings aren't |
| contiguous |
| - ac/nir: fix nir_op_b2f16 |
| - radv: enable VK_AMD_gpu_shader_half_float |
| - wsi: allow to override the present mode with MESA_VK_WSI_PRESENT_MODE |
| - ac/nir: make use of ac_build_imax() where possible |
| - ac/nir: make use of ac_build_imin() where possible |
| - ac/nir: make use of ac_build_umin() where possible |
| - ac: add ac_build_umax() and use it where possible |
| - ac: add ac_build_ddxy_interp() helper |
| - ac: add ac_build_load_helper_invocation() helper |
| - ac/nir: remove useless LLVMGetUndef for nir_op_pack_64_2x32_split |
| - ac/nir: remove useless integer cast in |
| adjust_sample_index_using_fmask() |
| - ac/nir: remove useless integer cast in visit_image_load() |
| - ac/nir: remove some useless integer casts for ALU operations |
| - spirv: add SpvCapabilityFloat16 support |
| - radv: enable VK_KHR_shader_float16_int8 |
| - radv: set ACCESS_NON_READABLE on stores for copy/fill/clear meta |
| shaders |
| - radv: enable shaderInt8 on SI and CIK |
| - radv: sort the shader capabilities alphabetically |
| - ac/nir: use new LLVM 8 intrinsics for SSBO atomics except cmpswap |
| - ac/nir: add 64-bit SSBO atomic operations support |
| - radv: add VK_KHR_shader_atomic_int64 but disable it for now |
| - ac: add support for more types with struct/raw LLVM intrinsics |
| - ac: use struct/raw load intrinsics for 8-bit/16-bit int with LLVM 9+ |
| - ac: use struct/raw store intrinsics for 8-bit/16-bit int with LLVM 9+ |
| - ac/nir: only use the new raw/struct image atomic intrinsics with LLVM |
| 9+ |
| - ac/nir: only use the new raw/struct SSBO atomic intrinsics with LLVM |
| 9+ |
| - ac/nir: use the new raw/struct SSBO atomic intrinsics for comp_swap |
| - radv: add VK_NV_compute_shader_derivatives support |
| - radv: add missing VEGA20 chip in radv_get_device_name() |
| - radv: do not need to force emit the TCS regs on Vega20 |
| - radv: fix color conversions for normalized uint/sint formats |
| - radv: implement a workaround for VK_EXT_conditional_rendering |
| - ac: tidy up ac_build_llvm8_tbuffer_{load,store} |
| - radv: set WD_SWITCH_ON_EOP=1 when drawing primitives from a stream |
| output buffer |
| - radv: only need to force emit the TCS regs on Vega10 and Raven1 |
| - radv: fix radv_get_aspect_format() for D+S formats |
| - radv: apply the indexing workaround for atomic buffer operations on |
| GFX9 |
| - radv: fix setting the number of rectangles when it's dynamic |
| - radv: add a workaround for Monster Hunter World and LLVM 7&8 |
| - radv: allocate more space in the CS when emitting events |
| - radv: do not use gfx fast depth clears for layered depth/stencil |
| images |
| - radv: fix alpha-to-coverage when there are unused color attachments |
| - radv: fix setting CB_SHADER_MASK for dual source blending |
| |
| Sergii Romantsov (4): |
| |
| - dri: meson: do not prefix user provided dri-drivers-path |
| - d3d: meson: do not prefix user provided d3d-drivers-path |
| - i965,iris/blorp: do not blit 0-sizes |
| - glsl: Fix input/output structure matching across shader stages |
| |
| Sonny Jiang (1): |
| |
| - radeonsi: use compute for clear_render_target when possible |
| |
| Tapani Pälli (42): |
| |
| - nir: add option to use scaling factor when sampling planes YUV |
| lowering |
| - dri: add P010, P012, P016 for 10bit/12bit/16bit YUV420 formats |
| - intel/compiler: add scale_factors to sampler_prog_key_data |
| - i965: add P0x formats and propagate required scaling factors |
| - drirc/i965: add option to disable 565 configs and visuals |
| - mesa: return NULL if we exceed MaxColorAttachments in |
| get_fb_attachment |
| - anv: refactor error handling in anv_shader_bin_write_to_blob() |
| - iris: add Android build |
| - nir: initialize value in copy_prop_vars_block |
| - nir: use nir_variable_create instead of open-coding the logic |
| - android: add liblog to libmesa_intel_common build |
| - android: make libbacktrace optional on USE_LIBBACKTRACE |
| - iris: add libmesa_iris_gen8 library to the build |
| - util: fix a warning when building against clang7 headers |
| - anv: retain the is_array state in create_plane_tex_instr_implicit |
| - anv: toggle on support for VK_EXT_ycbcr_image_arrays |
| - anv: use anv_gem_munmap in block pool cleanup |
| - anv: call blob_finish when done with it |
| - nir: free dead_ctx in case of no progress |
| - anv: destroy descriptor sets when pool gets destroyed |
| - anv: release memory allocated by bo_heap when descriptor pool is |
| destroyed |
| - anv: release memory allocated by glsl types during spirv_to_nir |
| - anv: revert "anv: release memory allocated by glsl types during |
| spirv_to_nir" |
| - i965: remove scaling factors from P010, P012 |
| - isl: fix automake build when sse41 is not supported |
| - android: Build fixes for OMR1 |
| - iris: initialize num_cbufs |
| - iris: mark switch case fallthrough |
| - anv/radv: release memory allocated by glsl types during spirv_to_nir |
| - st/mesa: fix compilation warning on storage_flags_to_buffer_flags |
| - st/mesa: fix warnings about implicit conversion on enumeration type |
| - spirv: fix a compiler warning |
| - st/nir: run st_nir_opts after 64bit ops lowering |
| - iris: move variable to the scope where it is being used |
| - iris: move iris_flush_resource so we can call it from get_handle |
| - iris: handle aux properly in iris_resource_get_handle |
| - egl: setup fds array correctly when exporting dmabuf |
| - compiler/glsl: handle case where we have multiple users for types |
| - android/iris: fix driinfo header filename |
| - nir: use braces around subobject in initializer |
| - glsl: use empty brace initializer |
| - anv: expose VK_EXT_queue_family_foreign on Android |
| |
| Thomas Hellstrom (5): |
| |
| - winsys/svga: Add an environment variable to force host-backed |
| operation |
| - winsys/svga: Enable the transfer_from_buffer GPU command for vgpu10 |
| - svga: Avoid bouncing buffer data in malloced buffers |
| - winsys/svga: Update the drm interface file |
| - winsys/svga: Don't abort on EBUSY errors from execbuffer |
| |
| Timo Aaltonen (1): |
| |
| - util/os_misc: Add check for PIPE_OS_HURD |
| |
| Timothy Arceri (72): |
| |
| - st/glsl_to_nir: remove dead local variables |
| - ac/radv/radeonsi: add ac_get_num_physical_sgprs() helper |
| - radv: take LDS into account for compute shader occupancy stats |
| - util: move BITFIELD macros to util/macros.h |
| - st/glsl_to_nir: call nir_remove_dead_variables() after lowering local |
| indirects |
| - nir: add support for marking used patches when packing varyings |
| - nir: add glsl_type_is_32bit() helper |
| - nir: add is_packing_supported_for_type() helper |
| - nir: rewrite varying component packing |
| - nir: prehash instruction in nir_instr_set_add_or_rewrite() |
| - nir: turn ssa check into an assert |
| - nir: turn an ssa check in nir_search into an assert |
| - nir: remove simple dead if detection from nir_opt_dead_cf() |
| - radeonsi/nir: set input_usage_mask properly |
| - radeonsi/nir: set colors_read properly |
| - radeonsi/nir: set shader_buffers_declared properly |
| - st/nir: use NIR for asm programs |
| - nir: remove non-ssa support from nir_copy_prop() |
| - nir: clone instruction set rather than removing individual entries |
| - nir: allow nir_lower_phis_to_scalar() on more src types |
| - radeonsi: fix query buffer allocation |
| - glsl: fix shader cache for packed param list |
| - radeonsi/nir: move si_lower_nir() call into compiler thread |
| - glsl: rename is_record() -> is_struct() |
| - glsl: rename get_record_instance() -> get_struct_instance() |
| - glsl: rename record_location_offset() -> struct_location_offset() |
| - glsl: rename record_types -> struct_types |
| - nir: rename glsl_type_is_struct() -> glsl_type_is_struct_or_ifc() |
| - glsl/freedreno/panfrost: pass gl_context to the standalone compiler |
| - glsl: use NIR function inlining for drivers that use glsl_to_nir() |
| - i965: stop calling nir_lower_returns() |
| - radeonsi/nir: stop calling nir_lower_returns() |
| - st/glsl: start spilling out common st glsl conversion code |
| - anv: add support for dumping shader info via VK_EXT_debug_report |
| - nir: add guess trip count support to loop analysis |
| - nir: add new partially_unrolled bool to nir_loop |
| - nir: add partial loop unrolling support |
| - nir: calculate trip count for more loops |
| - nir: unroll some loops with a variable limit |
| - nir: simplify the loop analysis trip count code a little |
| - nir: add helper to return inversion op of a comparison |
| - nir: add get_induction_and_limit_vars() helper to loop analysis |
| - nir: pass nir_op to calculate_iterations() |
| - nir: find induction/limit vars in iand instructions |
| - st/glsl_to_nir: fix incorrect array access |
| - radeonsi/nir: call some more var optimisation passes |
| - ac/nir_to_llvm: add assert to emit_bcsel() |
| - nir: only override previous alu during loop analysis if supported |
| - nir: fix opt_if_loop_last_continue() |
| - nir: add support for user defined loop control |
| - spirv: make use of the loop control support in nir |
| - nir: add support for user defined select control |
| - spirv: make use of the select control support in nir |
| - Revert "ac/nir: use new LLVM 8 intrinsics for SSBO atomic operations" |
| - nir: propagate known constant values into the if-then branch |
| - Revert "nir: propagate known constant values into the if-then branch" |
| - nir/radv: remove restrictions on opt_if_loop_last_continue() |
| - nir: initialise some variables in opt_if_loop_last_continue() |
| - nir/i965/freedreno/vc4: add a bindless bool to type size functions |
| - ac/nir_to_llvm: make get_sampler_desc() more generic and pass it the |
| image intrinsic |
| - ac/nir_to_llvm: add image bindless support |
| - nir: fix packing components with arrays |
| - radeonsi/nir: fix scanning of bindless images |
| - st/mesa/radeonsi: fix race between destruction of types and shader |
| compilation |
| - nir: fix nir_remove_unused_varyings() |
| - radeonsi/nir: create si_nir_opts() helper |
| - radeonsi/nir: call radeonsi nir opts before the scan pass |
| - util/drirc: add workarounds for bugs in Doom 3: BFG |
| - radeonsi: add config entry for Counter-Strike Global Offensive |
| - Revert "glx: Fix synthetic error generation in \__glXSendError" |
| - Revert "st/mesa: expose 0 shader binary formats for compat profiles |
| for Qt" |
| - st/glsl: make sure to propagate initialisers to driver storage |
| |
| Timur Kristóf (19): |
| |
| - radeonsi/nir: Use uniform location when calculating const_file_max. |
| - iris: implement clearing render target and depth stencil |
| - nir: Add ability for shaders to use window space coordinates. |
| - tgsi_to_nir: Fix the TGSI ARR translation by converting the result to |
| int. |
| - tgsi_to_nir: Fix TGSI LIT translation by using flt. |
| - tgsi_to_nir: Make the TGSI IF translation code more readable. |
| - tgsi_to_nir: Split to smaller functions. |
| - nir: Move nir_lower_uniforms_to_ubo to compiler/nir. |
| - nir: Add multiplier argument to nir_lower_uniforms_to_ubo. |
| - freedreno: Plumb pipe_screen through to irX_tgsi_to_nir. |
| - tgsi_to_nir: Produce optimized NIR for a given pipe_screen. |
| - tgsi_to_nir: Restructure system value loads. |
| - tgsi_to_nir: Extract ttn_emulate_tgsi_front_face into its own |
| function. |
| - tgsi_to_nir: Support FACE and POSITION properly. |
| - tgsi_to_nir: Improve interpolation modes. |
| - tgsi_to_nir: Set correct location for uniforms. |
| - radeonsi/nir: Only set window_space_position for vertex shaders. |
| - iris: Face should be a system value. |
| - gallium: fix autotools build of pipe_msm.la |
| |
| Tobias Klausmann (1): |
| |
| - vulkan/util: meson build - add wayland client include |
| |
| Tomasz Figa (1): |
| |
| - llvmpipe: Always return some fence in flush (v2) |
| |
| Tomeu Vizoso (19): |
| |
| - panfrost: Add gem_handle to panfrost_memory and panfrost_bo |
| - panfrost: Add backend targeting the DRM driver |
| - panfrost/midgard: Add support for MIDGARD_MESA_DEBUG |
| - panfrost: Add support for PAN_MESA_DEBUG |
| - panfrost: Set bo->size[0] in the DRM backend |
| - panfrost: Set bo->gem_handle when creating a linear BO |
| - panfrost: Adapt to uapi changes |
| - panfrost: Fix sscanf format options |
| - panfrost: Set the GEM handle for AFBC buffers |
| - panfrost: Also tell the kernel about the checksum_slab |
| - panfrost: Pass the context BOs to the kernel so they aren't unmapped |
| while in use |
| - panfrost: Wait for last job to finish in force_flush_fragment |
| - panfrost: split asserts in pandecode |
| - panfrost: Guard against reading past end of buffer |
| - panfrost/ci: Initial commit |
| - panfrost/midgard: Skip register allocation if there's no work to do |
| - panfrost/midgard: Skip liveness analysis for instructions without |
| dest |
| - panfrost: Fix two uninitialized accesses in compiler |
| - panfrost: Only take the fast paths on buffers aligned to block size |
| |
| Toni Lönnberg (8): |
| |
| - intel/genxml: Only handle instructions meant for render engine when |
| generating headers |
| - intel/genxml: Media instructions and structures for gen6 |
| - intel/genxml: Media instructions and structures for gen7 |
| - intel/genxml: Media instructions and structures for gen7.5 |
| - intel/genxml: Media instructions and structures for gen8 |
| - intel/genxml: Media instructions and structures for gen9 |
| - intel/genxml: Media instructions and structures for gen10 |
| - intel/genxml: Media instructions and structures for gen11 |
| |
| Topi Pohjolainen (2): |
| |
| - intel/compiler/icl: Use tcs barrier id bits 24:30 instead of 24:27 |
| - intel/compiler/fs/icl: Use dummy masked urb write for tess eval |
| |
| Vasily Khoruzhick (2): |
| |
| - lima: use individual tile heap for each GP job. |
| - lima: add support for depth/stencil fbo attachments and textures |
| |
| Vinson Lee (5): |
| |
| - gallium/auxiliary/vl: Fix duplicate symbol build errors. |
| - nir: Fix anonymous union initialization with older GCC. |
| - swr: Fix build with llvm-9.0. |
| - gallium: Fix autotools build with libxatracker.la. |
| - freedreno: Fix GCC build error. |
| |
| Vivek Kasireddy (1): |
| |
| - drm-uapi: Update headers from drm-next |
| |
| Xavier Bouchoux (1): |
| |
| - nir/spirv: Fix assert when unsampled OpTypeImage has unknown 'Depth' |
| |
| Yevhenii Kolesnikov (1): |
| |
| - i965: Fix allow_higher_compat_version workaround limited by OpenGL |
| 3.0 |
| |
| coypu (1): |
| |
| - gbm: don't return void |
| |
| davidbepo (1): |
| |
| - drirc: add Waterfox to adaptive-sync blacklist |
| |
| grmat (1): |
| |
| - drirc: add Spectacle, Falkon to a-sync blacklist |
| |
| pal1000 (1): |
| |
| - scons: Compatibility with Scons development version string |
| |
| suresh guttula (3): |
| |
| - vl: Add cropping flags for H264 |
| - radeon/vce: Add support for frame_cropping_flag of |
| VAEncSequenceParameterBufferH264 |
| - st/va/enc: Add support for frame_cropping_flag of |
| VAEncSequenceParameterBufferH264 |