The code did not check if operand was undef before casting it to Instruction.
Details
Diff Detail
Event Timeline
test/Transforms/SLPVectorizer/X86/crash_reordering_undefs.ll | ||
---|---|---|
25 | Can you cleanup the test var names (-instnamer might do this for you)? |
lib/Transforms/Vectorize/SLPVectorizer.cpp | ||
---|---|---|
775 | I don't think we are having this issue here, because ReorderingMode::Load only accepts loads. If we run into an Undef while in ReorderingMode::Load, getBestOperand() will return none, and the mode will change to RerorderingMode::Failed in line 1013. On the other hand ReorderingMode::Opcode accepts both Instructions and undefs. | |
792 | I guess we could accept that too. Let me try it out. |
lib/Transforms/Vectorize/SLPVectorizer.cpp | ||
---|---|---|
792 | If the OpLastLane is the instruction and Op is Undef, maybe OpIdx must be considered as BestOp.Idx? Or this handled already by good|best scores? |
lib/Transforms/Vectorize/SLPVectorizer.cpp | ||
---|---|---|
792 | Yes, we will accept the Undef, if the other Op is not an Instruction. This is what the good/best scores are used for: they give higher priority to VLs with instructions only, as opposed to those with both instructions and undefs. So if let's say OpLastLane was an instruction and we have two candidate Ops, one Undef and one Instruction, we will choose the instruction, because it will get the BestScore. In a similar situation, if we have to choose between two Ops, one instruction and one undef, but OpLastLane is an undef, then I am not sure which one would be better. |
Thanks for fixing!
The original case where I saw the crash compiles successfully with this patch.
test/Transforms/SLPVectorizer/X86/crash_reordering_undefs.ll | ||
---|---|---|
2 | Often people prefer to have at least a few CHECKs that the output is reasonable in some way instead of just seeing that we don't get a crash. |
test/Transforms/SLPVectorizer/X86/crash_reordering_undefs.ll | ||
---|---|---|
4 | I think you can remove this from the test since the parameter with the triple is provided |
Thank you for the reviews, and @uabelho for reporting the issue.
Please commit the patch as I don't have commit access.
Probably, here you also may have a problem with OpLastLane as UndefValue, no?