The bug is in Low <-> High boundaries calculation. The High boundary should be "last memory access pointer + element size".
(It may be "last memory access pointer + element size -1", since we use "<=" in bounds condition)
Paths
| Differential D23176
[LoopVectorizer] Fixed a bug in memory conflict run-time check ClosedPublic Authored by delena on Aug 4 2016, 1:15 PM.
Details Summary The bug is in Low <-> High boundaries calculation. The High boundary should be "last memory access pointer + element size". (It may be "last memory access pointer + element size -1", since we use "<=" in bounds condition)
Diff Detail
Event Timeline
Comment Actions Did you forget to add the test?
Comment Actions
No, I didn't. I fixed the existing lit tests that explore the change. Do you want one more lit test? Comment Actions
Yes, please. I would certainly like to have the testcase included in the patch that triggers the bug with an explanation why. You should be able to construct a testcase by checking the memcheck part of the output of LAA. Once the other bug is fixed you can add an additional RUN line for -loop-vectorize and then check the compares directly. delena edited edge metadata. Comment Actions
delena marked 3 inline comments as done. Comment ActionsAdded more explanations to the code according to Adam's remarks.
Comment Actions I am happy with this now with the comments addressed as long as Ayal also likes it. Thanks for the fix!
Closed by commit rL279930: [Loop Vectorizer] Fixed memory confilict checks. (authored by delena). · Explain WhyAug 28 2016, 2:02 AM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 67147 ../lib/Analysis/LoopAccessAnalysis.cpp
../test/Analysis/LoopAccessAnalysis/memcheck-off-by-one-error.ll../test/Analysis/LoopAccessAnalysis/number-of-memchecks.ll
../test/Analysis/LoopAccessAnalysis/reverse-memcheck-bounds.ll
../test/Transforms/LoopVectorize/runtime-check-readonly.ll
../test/Transforms/LoopVectorize/tbaa-nodep.ll
../test/Transforms/LoopVersioningLICM/loopversioningLICM1.ll
|
can add:
"which is clearly the complement of having the intervals disjoint/ordered:
= !((P2.Start >= P1.End) || (P1.Start >= P2.End))"