commit	83c308f014da00cadbbe9ac7c8fe8a48ff777b76	[log] [tgz]
author	Lucas Ramirez <11032120+lucas-rami@users.noreply.github.com>	Fri Aug 08 14:26:04 2025 +0200
committer	GitHub <noreply@github.com>	Fri Aug 08 14:26:04 2025 +0200
tree	58ad7a7d4b6c027bff2bd596a31af184c23fe044
parent	ab7281d8969152a2cb0f302fe645e99e58e6d281 [diff]

commit

83c308f014da00cadbbe9ac7c8fe8a48ff777b76

[log] [tgz]

author

Lucas Ramirez <11032120+lucas-rami@users.noreply.github.com>

Fri Aug 08 14:26:04 2025 +0200

committer

GitHub <noreply@github.com>

Fri Aug 08 14:26:04 2025 +0200

tree

58ad7a7d4b6c027bff2bd596a31af184c23fe044

parent

ab7281d8969152a2cb0f302fe645e99e58e6d281 [diff]

[AMDGPU][Scheduler] Consistent occupancy calculation during rematerialization (#149224) The `RPTarget`'s way of determining whether VGPRs are beneficial to save and whether the target has been reached w.r.t. VGPR usage currently assumes, if `CombinedVGPRSavings` is true, that free slots in one VGPR RC can always be used for the other. Implicitly, this makes the rematerialization stage (only current user of `RPTarget`) follow a different occupancy calculation than the "regular one" that the scheduler uses, one that assumes that ArchVGPR/AGPR usage can be balanced perfectly and at no cost, which is untrue in general. This ultimately yields suboptimal rematerialization decisions that require cross-VGPR-RC copies unnecessarily. This fixes that, making the `RPTarget`'s internal model of occupancy consistent with the regular one. The `CombinedVGPRSavings` flag is removed, and a form of cross-VGPR-RC saving implemented only for unified RFs, which is where it makes the most sense. Only when the amount of free VGPRs in a given VGPR RC (ArchVPGR or AGPR) is lower than the excess VGPR usage in the other VGPR RC does the `RPTarget` consider that a pressure reduction in the former will be beneficial to the latter.

tree: 58ad7a7d4b6c027bff2bd596a31af184c23fe044

README.md

The LLVM Compiler Infrastructure

Welcome to the LLVM project!

This repository contains the source code for LLVM, a toolkit for the construction of highly optimized compilers, optimizers, and run-time environments.

The LLVM project has multiple components. The core of the project is itself called “LLVM”. This contains all of the tools, libraries, and header files needed to process intermediate representations and convert them into object files. Tools include an assembler, disassembler, bitcode analyzer, and bitcode optimizer.

C-like languages use the Clang frontend. This component compiles C, C++, Objective-C, and Objective-C++ code into LLVM bitcode -- and from there into object files, using LLVM.

Other components include: the libc++ C++ standard library, the LLD linker, and more.

Getting the Source Code and Building LLVM

Consult the Getting Started with LLVM page for information on building and running LLVM.

For information on how to contribute to the LLVM project, please take a look at the Contributing to LLVM guide.

Getting in touch

Join the LLVM Discourse forums, Discord chat, LLVM Office Hours or Regular sync-ups.

The LLVM project has adopted a code of conduct for participants to all modes of communication within the project.