[VPlan] Move tail folding out of VPlanPredicator. NFC (#176143) Currently the logic for introducing a header mask and predicating the vector loop region is done inside introduceMasksAndLinearize. This splits the tail folding part out into an individual VPlan transform so that VPlanPredicator.cpp doesn't need to worry about tail folding, which seemed to be a temporary measure according to a comment in VPlanTransforms.h. To perform tail folding independently, this splits the "body" of the vector loop region between the phis in the header and the branch + iv increment in the latch: Before: ``` +-------------------------------------------+ |%iv = ... | |... | |%iv.next = add %iv, vfxuf | |branch-on-count %iv.next, vector-trip-count| +-------------------------------------------+ ``` After: ``` +-------------------------------------------+ |%iv = ... | |%wide.iv = widen-canonical-iv ... | |%header-mask = icmp ule %wide.iv, BTC |---+ |branch-on-cond %header-mask | | +-------------------------------------------+ | | | v | +-------------------------------------------+ | |... | | +-------------------------------------------+ | | | v | +-------------------------------------------+ | |%iv.next = add %iv, vfxuf |<--+ |branch-on-count %iv.next, vector-trip-count| +-------------------------------------------+ ``` Phis are then inserted in the latch for any value in the loop body that have outside uses, with poison as their incoming value from the header edge. The motivation for this is to allow us to share the same "predicate all successor blocks" type of predication we do for tail folding, but for early-exit loops in #172454. This may also allow us to directly emit an EVL based header mask, instead of having to match + transform the existing header mask in addExplicitVectorLength. This also allows us to eventually handle recurrences in the same transform, avoiding the need to special case tail folding in addReductionResultComputation.

tree: f5059edcf30e18034d5ec6e10a66593136879e99

README.md

The LLVM Compiler Infrastructure

Welcome to the LLVM project!

This repository contains the source code for LLVM, a toolkit for the construction of highly optimized compilers, optimizers, and run-time environments.

The LLVM project has multiple components. The core of the project is itself called “LLVM”. This contains all of the tools, libraries, and header files needed to process intermediate representations and convert them into object files. Tools include an assembler, disassembler, bitcode analyzer, and bitcode optimizer.

C-like languages use the Clang frontend. This component compiles C, C++, Objective-C, and Objective-C++ code into LLVM bitcode -- and from there into object files, using LLVM.

Other components include: the libc++ C++ standard library, the LLD linker, and more.

Getting the Source Code and Building LLVM

Consult the Getting Started with LLVM page for information on building and running LLVM.

For information on how to contribute to the LLVM project, please take a look at the Contributing to LLVM guide.

Getting in touch

Join the LLVM Discourse forums, Discord chat, LLVM Office Hours or Regular sync-ups.

The LLVM project has adopted a code of conduct for participants to all modes of communication within the project.