[X86][AVX] Shuffle->Permute+Blend if we have one v4f64/v4i64 shuffle input in place

Even on AVX1 we can fairly cheaply permute a single v4f64/v4i64 input (VPERM2F128+VSHUFPD; on AVX2 it's just a single VPERMPD) and then combine it with the in-place input using a BLENDPD.
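
As a rough illustration of the decomposition (a standalone sketch, not the code in this patch), the following splits a 4-element two-input shuffle mask into a unary permute of the input that is not in place plus a BLENDPD-style immediate. The names `Mask4`, `PermuteBlend` and `splitPermuteAndBlend` are hypothetical, and the mask convention (0..3 = first input, 4..7 = second input, -1 = undef) is assumed to mirror the usual shuffle mask layout.

```cpp
#include <array>
#include <cassert>

// Hypothetical types for illustration only; these are not LLVM APIs.
// Mask convention (assumed): 0..3 pick from V1, 4..7 pick from V2, -1 is undef.
using Mask4 = std::array<int, 4>;

struct PermuteBlend {
  Mask4 Permute;     // Unary permute for the input that is NOT in place
                     // (indices 0..3, -1 means "don't care").
  unsigned BlendImm; // Bit i set => lane i is taken from V2, as in BLENDPD.
  bool PermuteV2;    // true: permute V2, false: permute V1.
};

// Returns true and fills Out if at least one input is already in place,
// i.e. every lane taken from that input keeps its original position.
bool splitPermuteAndBlend(const Mask4 &Mask, PermuteBlend &Out) {
  bool V1InPlace = true, V2InPlace = true;
  for (int i = 0; i < 4; ++i) {
    int M = Mask[i];
    assert(M >= -1 && M < 8 && "mask index out of range");
    if (M < 0)
      continue; // Undef lanes never break the in-place property.
    if (M < 4 && M != i)
      V1InPlace = false;
    if (M >= 4 && M != i + 4)
      V2InPlace = false;
  }
  if (!V1InPlace && !V2InPlace)
    return false; // Both inputs need permuting; this split does not apply.

  // If V1 is in place we permute V2, otherwise V2 is in place and we permute V1.
  Out.Permute = {{-1, -1, -1, -1}};
  Out.BlendImm = 0;
  Out.PermuteV2 = V1InPlace;
  for (int i = 0; i < 4; ++i) {
    int M = Mask[i];
    if (M < 0)
      continue;
    bool FromV2 = M >= 4;
    if (FromV2)
      Out.BlendImm |= 1u << i;
    if (FromV2 == Out.PermuteV2)
      Out.Permute[i] = M % 4; // Source lane within the permuted input.
  }
  return true;
}
```

For example, under these assumptions the mask {0, 6, 2, 5} leaves V1 in place, so V2 would be permuted with {-1, 2, -1, 1} and blended in with immediate 0b1010. Whether the permute then costs one or two instructions depends on the target, as described above.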

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@354729 91177308-0d34-0410-b5e6-96231b3b80d8
3 files changed