A small step towards combining shuffles across vector sizes: this recognises when a shuffle's operands are all extracted from the same larger source and tries to combine them into a unary shuffle of that source instead. Fixes one of the test cases from PR34380.
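As a rough illustration of the mask arithmetic behind this idea (not the code in this patch), the sketch below remaps a two-input shuffle mask onto the wider source when both operands are subvectors extracted from it. The helper name `foldToUnaryMask`, its parameters, and the convention of padding the widened mask with undef (-1) lanes are all assumptions made for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper, not an LLVM API.
// mask:        two-input shuffle mask of width W; -1 means undef,
//              [0, W) selects from operand 0, [W, 2W) from operand 1.
// srcOffset0:  element index in the wide source where operand 0 starts.
// srcOffset1:  element index in the wide source where operand 1 starts.
// srcNumElts:  number of elements in the wide source.
// Returns a unary mask over the wide source; lanes beyond W stay undef.
std::vector<int> foldToUnaryMask(const std::vector<int> &mask,
                                 int srcOffset0, int srcOffset1,
                                 std::size_t srcNumElts) {
  assert(mask.size() <= srcNumElts && "source must be at least as wide");
  const int width = static_cast<int>(mask.size());

  // Start with every lane undef, then fill in the lanes the shuffle defines.
  std::vector<int> wide(srcNumElts, -1);
  for (int i = 0; i < width; ++i) {
    const int m = mask[i];
    if (m < 0)
      continue; // undef lane stays undef
    assert(m < 2 * width && "mask index out of range");
    // Redirect the index from the extracted operand to the wide source.
    wide[i] = (m < width) ? srcOffset0 + m : srcOffset1 + (m - width);
  }
  return wide;
}
```

For example, with an 8-element source, operand 0 taken from the upper half (offset 4) and operand 1 from the lower half (offset 0), the mask `{0, 5, 2, 7}` becomes the unary mask `{4, 1, 6, 3, -1, -1, -1, -1}` on the source; the low four lanes of that result are the original shuffle's output.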
Details
Diff Detail
- Repository
- rL LLVM
Event Timeline
test/CodeGen/X86/avx512-shuffles/partial_permute.ll | ||
---|---|---|
4411 | This is annoying - I'd be happier if the mask in zmm2 was at least reduced to a <1,4> xmm load (or broadcasted) but that's a common issue in a lot of the masks generated in this patch. |
LGTM
test/CodeGen/X86/avx512-shuffles/partial_permute.ll | ||
---|---|---|
4411 | Mark this with a TODO and/or file a bug report, so we don't lose track? |