commit	cd46354dbd10820158edabe14dbd49d9f9010722	[log] [tgz]
author	SingleAccretion <62474226+SingleAccretion@users.noreply.github.com>	Tue Jun 24 21:40:47 2025 +0300
committer	GitHub <noreply@github.com>	Tue Jun 24 11:40:47 2025 -0700
tree	9642fadb5bae891dcec19be887530dfcb935354c
parent	de569ad6b845310335c37e19105e41f201c45dd9 [diff]

commit

cd46354dbd10820158edabe14dbd49d9f9010722

[log] [tgz]

author

SingleAccretion <62474226+SingleAccretion@users.noreply.github.com>

Tue Jun 24 21:40:47 2025 +0300

committer

GitHub <noreply@github.com>

Tue Jun 24 11:40:47 2025 -0700

tree

9642fadb5bae891dcec19be887530dfcb935354c

parent

de569ad6b845310335c37e19105e41f201c45dd9 [diff]

[WebAssembly] Enable a limited amount of stackification for debug code (#136510) This change is a step towards fixing one long-standing problem with LLVM's debug WASM codegen: excessive use of locals. One local for each temporary value in IR (roughly speaking). This has a lot of problems: 1) It makes it easy to hit engine limitations of 50K locals with certain code patterns and large functions. 2) It makes for larger binaries that are slower to load and slower to compile to native code. 3) It makes certain compilation strategies (spill all WASM locals to stack, for example) for debug code excessively expensive and makes debug WASM code either run very slow, or be less debuggable. 4) It slows down LLVM itself. This change addresses these partially by running a limited version of the stackification pass for unoptimized code, one that gets rid of the most 'obviously' unnecessary locals. Care needs to be taken to not impact LLVM's ability to produce high quality debug variable locations with this pass. To that end: 1) We only allow stackification when it doesn't require moving any instructions. 2) We disable stackification of any locals that are used in DEBUG_VALUEs, or as a frame base. I have verified on a moderately large example that the baseline and the diff produce the same kinds (local/global/stack) of locations, and the only differences are due to the shifting of instruction offsets, with many local.[get|set]s not being present anymore. Even with this quite conservative approach, the results are pretty good: 1) 30% reduction in raw code size, up to 10x reduction in the number of locals for select large methods (~1000 => ~100). 2) ~10% reduction in instructions retired for an "llc -O0" run on a moderately sized input.

tree: 9642fadb5bae891dcec19be887530dfcb935354c

README.md

The LLVM Compiler Infrastructure

Welcome to the LLVM project!

This repository contains the source code for LLVM, a toolkit for the construction of highly optimized compilers, optimizers, and run-time environments.

The LLVM project has multiple components. The core of the project is itself called “LLVM”. This contains all of the tools, libraries, and header files needed to process intermediate representations and convert them into object files. Tools include an assembler, disassembler, bitcode analyzer, and bitcode optimizer.

C-like languages use the Clang frontend. This component compiles C, C++, Objective-C, and Objective-C++ code into LLVM bitcode -- and from there into object files, using LLVM.

Other components include: the libc++ C++ standard library, the LLD linker, and more.

Getting the Source Code and Building LLVM

Consult the Getting Started with LLVM page for information on building and running LLVM.

For information on how to contribute to the LLVM project, please take a look at the Contributing to LLVM guide.

Getting in touch

Join the LLVM Discourse forums, Discord chat, LLVM Office Hours or Regular sync-ups.

The LLVM project has adopted a code of conduct for participants to all modes of communication within the project.