HomePhabricator

[x86] allow insert/extract when matching horizontal ops

Description

[x86] allow insert/extract when matching horizontal ops

Previously, we limited this transform to cases where the
extraction into the build vector happens from vectors of
the same type as the build vector, but that's not required.

There's a slight potential regression seen in the AVX512
result for phadd -- we're using the 256-bit flavor of the
instruction now even though the 128-bit subset is sufficient.
The same problem could already be seen in the AVX2 result.
Follow-up patches will attempt to narrow that back down.

Details

Committed
spatelFri, Jan 11, 6:27 AM
Parents
rL350927: [llvm-objcopy] [COFF] Implmement --strip-unneeded and -x/--discard-all for…
Branches
Unknown
Tags
Unknown