commit	03461c9c6e21e43a6e1c699bfb254ddb3d575c93	[log] [tgz]
author	Hsiangkai Wang <hsiangkai.wang@arm.com>	Thu Jun 19 07:56:30 2025 +0100
committer	GitHub <noreply@github.com>	Thu Jun 19 07:56:30 2025 +0100
tree	7af43d631e8ddde5261ad0343953532c7ff41399
parent	590066bee70db37636311881c5b232464d6d4aec [diff]

commit

03461c9c6e21e43a6e1c699bfb254ddb3d575c93

[log] [tgz]

author

Hsiangkai Wang <hsiangkai.wang@arm.com>

Thu Jun 19 07:56:30 2025 +0100

committer

GitHub <noreply@github.com>

Thu Jun 19 07:56:30 2025 +0100

tree

7af43d631e8ddde5261ad0343953532c7ff41399

parent

590066bee70db37636311881c5b232464d6d4aec [diff]

[mlir][gpu][spirv] Remove rotation semantics of gpu.shuffle up/down (#139105) From the description of gpu.shuffle operation, shuffle up/down rotates values in the subgroup because it applies modulo on the shifted value to calculate the result lane ID. It is inconsistent with the definition of SPIR-V shuffle up/down and NVVM data movement definitions within subgroup. In NVVM, it says "If the computed source lane index j is in range, the returned i32 value will be the value of %a from lane j; otherwise, it will be the the value of %a from the current thread." It will keep the original value if the result land ID is out of range. In SPIR-V OpGroupNonUniformShuffleUp and OpGroupNonUniformShuffleDown, it says "The resulting value is undefined if Delta is greater than the current invocation’s id within the scope or if the identified invocation is not in scope restricted tangle." It's an undefined value if the result land ID is out of range. Anyway, there is no circular movement in shuffle up/down from these 2 specifications. This patch removes the circular movement in gpu.shuffle up/down and lower gpu.shuffle up/down to SPIR-V OpGroupNonUniformShuffleUp and OpGroupNonUniformShuffleDown directly. Reference: https://docs.nvidia.com/cuda/archive/12.2.1/nvvm-ir-spec/index.html#data-movement https://registry.khronos.org/SPIR-V/specs/unified1/SPIRV.html#OpGroupNonUniformShuffleUp https://registry.khronos.org/SPIR-V/specs/unified1/SPIRV.html#OpGroupNonUniformShuffleDown

tree: 7af43d631e8ddde5261ad0343953532c7ff41399

README.md

The LLVM Compiler Infrastructure

Welcome to the LLVM project!

This repository contains the source code for LLVM, a toolkit for the construction of highly optimized compilers, optimizers, and run-time environments.

The LLVM project has multiple components. The core of the project is itself called “LLVM”. This contains all of the tools, libraries, and header files needed to process intermediate representations and convert them into object files. Tools include an assembler, disassembler, bitcode analyzer, and bitcode optimizer.

C-like languages use the Clang frontend. This component compiles C, C++, Objective-C, and Objective-C++ code into LLVM bitcode -- and from there into object files, using LLVM.

Other components include: the libc++ C++ standard library, the LLD linker, and more.

Getting the Source Code and Building LLVM

Consult the Getting Started with LLVM page for information on building and running LLVM.

For information on how to contribute to the LLVM project, please take a look at the Contributing to LLVM guide.

Getting in touch

Join the LLVM Discourse forums, Discord chat, LLVM Office Hours or Regular sync-ups.

The LLVM project has adopted a code of conduct for participants to all modes of communication within the project.