This is an archive of the discontinued LLVM Phabricator instance.

Reassociate: reprocess RedoInsts after each instruction
Needs ReviewPublic

Authored by aditya_nandakumar on Jan 15 2015, 3:36 AM.

Details

Reviewers
majnemer
Summary

Previously the RedoInsts was processed at the end of the block.
However it was possible that it left behind some instructions that
were not canonicalized.
This should guarantee that any previous instruction in the basic
block is canonicalized before we process a new instruction.

Diff Detail

Event Timeline

mehdi_amini retitled this revision from to Reassociate: reprocess RedoInsts after each instruction.
mehdi_amini updated this object.
mehdi_amini edited the test plan for this revision. (Show Details)
mehdi_amini added a subscriber: Unknown Object (MLST).
majnemer added inline comments.
lib/Transforms/Scalar/Reassociate.cpp
2279

Please insert a space between the if and the parenthesis.

test/Transforms/Reassociate/crash2.ll
4–19 ↗(On Diff #18219)

Your test has no CHECK lines, please consider adding some.

Taken comments into account: clang-format and CHECK line in the test.

Why did the old approach fall down on these testcases?

Hi David,

Let’s consider:

%_0 = add i32 %in, 1
%_1 = mul i32 %in, -2
%_2 = add i32 %_0, %_1
%_3 = add i32 %_1, %_2
%_4 = add i32 %_3, 1
%_5 = add i32 %in, %_3
ret i32 %_5

Now when processing %3, the expression is:

"%1 + %in + 1 + %1”

and it is factorized to:

“%in + 1 + ( 2 * %1)”

the IR becomes:

%_0 = add i32 %in, 1
%_1 = mul i32 %in, -2
%factor = mul i32 %_1, 2
%_2 = add i32 %in, 1
%_3 = add i32 %_2, %factor
%_4 = add i32 %_3, 1
%_5 = add i32 %in, %_3
ret i32 %_5

And %factor was added to the RedoInsts list because we known it might not be canonicalized.
And indeed in this case it is 2*(%1*-2) which can be turned into -4*%1.

Let continue to %5, it will consider

%in + %in + 1 + %factor

It will try to factorize, and list for all of the operands the factors to find some in common.
When processing %factor, the list of factors is

%in * -2 * 2

However this contains two constant and is forbidden because it should have been folded earlier. Because RedoInsts was processed at the end of the block instead of at the end of a transformation this canonicalization would have happened later.

Does it makes sense?

Thanks.

I still don't understand why this is the correct approach? Are we missing an optimization without your change?

Hi Chad,

No we are not missing an optimization without my approach, we are just hitting an assert...

I think it is the right thing to do anyway because I don't see the point of growing a long list of instructions to reprocess for later if you can do it now.

Mehdi

mehdi_amini planned changes to this revision.Jan 21 2015, 10:46 AM

My fuzzer found a case where we hit this same assert with this patch but not without. So it seems I still have to work on it.... Hold on for the review.

mehdi_amini retitled this revision from Reassociate: reprocess RedoInsts after each instruction to Reassociate: inst's operands must be processed before the inst itself .Jan 22 2015, 10:10 PM
mehdi_amini retitled this revision from Reassociate: inst's operands must be processed before the inst itself to Reassociate: reprocess RedoInsts after each instruction.

I had to made significant changes, this is ready for review now!

majnemer added inline comments.Jan 23 2015, 2:03 AM
lib/Transforms/Scalar/Reassociate.cpp
1957

I think this comment needs to be expanded a bit. Why must we refrain from optimizing the PHI operands?

1958

Please format this a little nicer.

2269

Why is topological in quotes?

2275–2287

Instead of having both RedoInsts and Worklist, what if we had a single SetVector that held both?

2281–2284

This would probably be better as:

SmallSet<Instruction *, 8> Worklist(BI->begin(), BI->end());

Thanks for your comment. I'll update the patch soon. I found out that I could use the RankMap to simplify what I'm trying to achieve.
I also found another but where we end up with an instruction moved to a new block (and hit an assertion).

lib/Transforms/Scalar/Reassociate.cpp
1957

Because the operand might be locate in a block that wasn't already processed.
I'll relax that by allowing operands that are located in block already processed.

2269

Because we don't have a DAG.

2275–2287

I don't see how would it be possible?

2281–2284

That was my first intention, but it seems that SmallSet does not provide such constructor.

I also tried:

SmallSet<Instruction *, 8> Worklist;
Worklist.insert(BI->begin(), BI->end());

But was bitten by: "candidate function not viable: no known conversion from 'llvm::Instruction' to 'llvm::Instruction *' for 1st argument; take the address of the argument with &"

Improve the filtering of instructions to reprocess based on basic block rank.
Add a new "crashing" test case that triggered this bug.

Reuse the RPOT iterator with BuildRankMap() for efficiency purpose.

Note, this patch has been ran over the weekend on my fuzzer without any
new crash.

Clear the RedoInsts immediately, even when erasing an instruction.

FYI: Still 3 of these tests are crashing as of now (r253245).

(Wow, this is from a while ago.)

If three of these tests are crashing, shouldn't we just commit the fix?
(Does it still apply?)

Chad, do you still have concerns? Is it about compile time or something
else?

Hi Mehdi,

Please consider merging the test files. They all have the same test line and as far as I know you are fixing the source of one crash, not eight.
Also please:

  • Add check lines.
  • Use opt -instnamer to get rid of the %[num] variables.

My 2c.

Thanks,
-Quentin

Only 3 test are still crashing now, but at the time I wrote the patch, I iterated on a fuzzer and these six tests (not eight) were stressing different patterns and different part of reassociate, so they're absolutely not redundant.

I'm not sure either what CHECK line to put for a compiler crash non-regression, there are multiple cases in the test suite like this where FileCheck is not involved.

(and the fact that 3 tests were fixed in the meantime shows that they're independent)

Chad, what needs to be done here?

What Quentin meant in his review was that you should concatenate all of the tests in a single file and run them that way, in general that's "accepted practice" for the project. I'm adding David as a reviewer as he's been doing that. Once you merge the testcases and David says ok, then we're good.

Thanks!

Sorry, this one fell completely off my radar.

Per Quentin/Eric, I would merge all the tests into a single file. Quentin's suggestion to use the instnamer seems fairly reasonable as well.

I'd like to hear David's comments before providing a LGTM as he has actually reviewed this patch in earnest.

I assume this should also be merged into the 3.8 branch, if we're actually crashing without this fix.

lib/Transforms/Scalar/Reassociate.cpp
2262

A "Topological" what? A "Topological" ranking... The sentences seems incomplete.

2263

were processed before -> have been processed. (Or something similar)

2278

No need for extra curly brackets.

2287

No need for extra curly brackets.

aditya_nandakumar commandeered this revision.Jan 15 2016, 10:49 AM
aditya_nandakumar updated this revision to Diff 45013.
aditya_nandakumar added a reviewer: mehdi_amini.

Updated based on feedback.

majnemer edited edge metadata.Jan 19 2016, 9:40 AM

Sorry for the delay, I'll take a look today.

majnemer added inline comments.Jan 20 2016, 1:20 PM
lib/Transforms/Scalar/Reassociate.cpp
2264

ben -> been ?

2284–2285

Is it possible for I to equal II ? Removing this check doesn't seem to make any tests fail.

test/Transforms/Reassociate/prev_insts_canonicalized.ll
1 ↗(On Diff #45013)

I'd follow the pattern used by the other tests:

; RUN: opt < %s -reassociate -S | FileCheck %s

This will ensure that the test's filename will not show up in the test's output.

aditya_nandakumar edited edge metadata.

Updated based on feedback

aditya_nandakumar marked 3 inline comments as done.Jan 22 2016, 10:55 AM