This patch extends LoopVectorize to handle the vectorization of interleaved
memory accesses with scalable vectors when mask is required or/and predicated
tail folding is enabled.
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Unit Tests
Time | Test | |
---|---|---|
700 ms | x64 debian > lld.ELF::arm-data-relocs.s |
Event Timeline
I think it would be worthwhile to have a test covering conditional execution inside the original source loop as well as the active lane mask for tail folding. See llvm/test/Transforms/LoopVectorize/interleaved-accesses-masked-group.ll for examples.
llvm/lib/Target/AArch64/AArch64TargetTransformInfo.cpp | ||
---|---|---|
2885–2886 | We could use SVE masking for fixed length as well, but that can be done at a later date if you want. |
llvm/lib/Transforms/Vectorize/LoopVectorize.cpp | ||
---|---|---|
2679 | could you add an assert here: "Unsupported deinterleave factor for scalable vectors"); there is one in line 2731m but this function is called before reaching that line. |
LGTM.
llvm/lib/Transforms/Vectorize/LoopVectorize.cpp | ||
---|---|---|
2688 | Nit: when passing a null or boolean literal into a function, it's customary in LLVM to add an inline comment before it indicating the parameter name, as below. /*FMFSource=*/nullptr would work. |
We could use SVE masking for fixed length as well, but that can be done at a later date if you want.