21ffacff8c70c35649679fd67f2ee770245751e4 - platform/external/mesa3d

commit	21ffacff8c70c35649679fd67f2ee770245751e4	[log] [tgz]
author	Marcin Ślusarz <marcin.slusarz@intel.com>	Wed Oct 14 16:32:55 2020 +0200
committer	Marge Bot <eric+marge@anholt.net>	Tue Nov 03 10:49:04 2020 +0000
tree	f0a3c4d8134344f42cde436315b976959992a83f
parent	06764e0e5d5e37f9a3e00db7676b76d5472e305b [diff]

intel/compiler: remove branch weight heuristic

As a result of this patch, compiler chooses SIMD32 shaders more
frequently.

Current logic is designed to avoid regressions from enabling SIMD32 at
all cost, even though the cases where regression can happen are probably
for smaller draw calls (far away from the camera and though smaller).

In Intel perf CI this patch improves FPS in:
- gfxbench5 alu2:      21.92% (gen9), 23.7%  (gen11)
- synmark OglShMapVsm:  3.26% (gen9),  4.52% (gen11)
- gfxbench5 car chase:  1.34% (gen9),  1.32% (gen11)
No observed regressions there.

In my testing, it also improves FPS in:
- The Talos Principle:   2.9% (gen9)

The other 16 games I tested had very minor changes in performance
(2/3 positive, but not significant enough to list here).

Note: this patch harms synmark OglDrvState (which is not in Intel perf
CI) by ~2.9%, but this benchmark renders multiple scenes from other
workloads (including OglShMapVsm, which is helped in standalone mode)
in tiny rectangles. Rendering so small drastically changes branching
statistics, which favors smaller SIMD modes. I assume this matters
only in micro-benchmarks, as in real workloads more expensive (with
more uniform branching behavior) draw calls dominate.

Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com>
Acked-by: Francisco Jerez <currojerez@riseup.net>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7137>

src/intel/compiler/brw_ir_performance.cpp[diff]

1 file changed

tree: f0a3c4d8134344f42cde436315b976959992a83f