39bed5ede9c59c6f653e9a90135f88a0ef0e72f2 - clang

commit	39bed5ede9c59c6f653e9a90135f88a0ef0e72f2	[log] [tgz]
author	Craig Topper <craig.topper@intel.com>	Tue Jun 19 19:13:54 2018 +0000
committer	Craig Topper <craig.topper@intel.com>	Tue Jun 19 19:13:54 2018 +0000
tree	6e542b614cb8d1610f2b2770fbf9541394897642
parent	78e14633a4c4dad11a3327e56bda2703f4767491 [diff]

[X86] Rewrite the max and min reduction intrinsics to make better use of other functions and to reduce width to 256 and 128 bits were possible.

We only need to use 512 bit vectors all the way through v8i64 reductions since those max instructions are new to avx512f and only available in 512 bits until SKX.

For v16i32 and floating point we have legacy 128/256 bit instructions we can use.

I've tried to use other intrinsics to reduce the verbosity of the code and avoid having to mention all the shuffles. I've also removed all the -1 shuffle indices so the output sequence is fully specified and not left to backend optimization.

Differential Revision: https://reviews.llvm.org/D47401

git-svn-id: https://llvm.org/svn/llvm-project/cfe/trunk@335070 91177308-0d34-0410-b5e6-96231b3b80d8

2 files changed

tree: 6e542b614cb8d1610f2b2770fbf9541394897642