In the canonical form of the shuffle the poison/undef operand is the
second operand, the patch tries to emit canonical form for partial
vectorization of the buildvector sequence.
Also, this patch starts emitting freeze instruction for shuffles with undef indices if the second shuffle operan is undef, not poison. It is an initial step to D93818, where undef mask element are treated as returning poison value.
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
llvm/test/Transforms/SLPVectorizer/X86/PR35865.ll | ||
---|---|---|
15 | I'm not sure I understand well why freeze is generated here. I likely missed something but the LangRef says: It would be nice if you extend patch description with some information about what is deemed to be a canonical form of a shuffle. |
llvm/test/Transforms/SLPVectorizer/X86/PR35865.ll | ||
---|---|---|
15 | It is going to be changed soon to be poisoned instead of undefined. |
llvm/test/Transforms/SLPVectorizer/X86/PR35865.ll | ||
---|---|---|
15 | Ah, okay. Hence there must be a discussion about that somewhere. Can you please point out to the discussion? Since this is yet anticipated change the patch description should definitely be more detailed. |
llvm/test/Transforms/SLPVectorizer/X86/PR35865.ll | ||
---|---|---|
15 |
llvm/test/Transforms/SLPVectorizer/X86/PR35865.ll | ||
---|---|---|
15 | Done |
Hi @ABataev, we have an internal test where this seemed to cause the codegen for a horizontal add test to get worse (I think) after your change. I have filed the issue as #58177, can you take a look?
Looks like need to add the analysis of freeze instructions to the shuffles emission/cost estimation function, will do it later today.
I'm not sure I understand well why freeze is generated here. I likely missed something but the LangRef says:
"If the shuffle mask is undefined, the result vector is undefined. If the shuffle mask selects an undefined element from one of the input vectors, the resulting element is undefined. An undefined element in the mask vector specifies that the resulting element is undefined. An undefined element in the mask vector prevents a poisoned vector element from propagating."
So here the first two lanes of VECINS_I_5_I1 are taken from TMP6, all the rest is undef. The for TMP6 the first two lanes taken from TMP5, all the rest of TM5 is undef, TMP5 is a bitcast of TMP4, both lines of which are normal values. It means that all lanes of VECINS_I_5_I1 are either normal values or undef. So why freeze is here?
It would be nice if you extend patch description with some information about what is deemed to be a canonical form of a shuffle.