c3b4f60ae291e7eb677d8b15986e12100fa2299d - llvm-project/llvm

commit: c3b4f60ae291e7eb677d8b15986e12100fa2299d
author: Sanjay Patel <spatel@rotateright.com>
Sun Jul 19 10:03:55 2020 -0400
committer: Copybara-Service <copybara-worker@google.com>
Tue Oct 27 05:32:51 2020 -0700
tree: 5ed6bd1536e910df1520af5903591b6521570d6c
parent: 1466c390f7e2f69905541d204a968ac387d315a2

[x86] split FMA with fast-math-flags to avoid libcall

fma reassoc A, B, C --> fadd (fmul A, B), C (when target has no FMA hardware)

C/C++ code may use explicit fma() calls (which become LLVM fma
intrinsics in IR) but then gets compiled with -ffast-math or similar.
For targets that do not have FMA hardware, we don't want to go out to
the math library for a precise but slow FMA result.

I tried this as a generic DAGCombine, but it caused infinite looping
on more than 1 other target, so there's likely some over-reaching fma
formation happening.

There's also a potential intersection of strict FP with fast-math here.
Deferring to current behavior for that case (assuming that strict-ness
overrides fast-ness).

Differential Revision: https://reviews.llvm.org/D83981

GitOrigin-RevId: 50afa18772daca0b6de253a7c5311c81b0a46682

2 files changed
