This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/lib/Transforms/InstCombine/
-
lib/
-
Transforms/
-
InstCombine/
1/1
InstructionCombining.cpp

Differential D83160

[InstCombine] Lower infinite combine loop detection thresholds
ClosedPublic

Authored by lebedev.ri on Jul 4 2020, 9:32 AM.

Download Raw Diff

Details

Reviewers

spatel
nikic
kuhar

Commits

rGcd7f8051ac7b: [InstCombine] Lower infinite combine loop detection thresholds

Summary

1000 iteratons is still kinda a lot.
Would it make sense to iteratively lower it, until it becomes 2,
with some delay inbetween in order to let users actually potentially encounter it?

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

lebedev.ri created this revision.Jul 4 2020, 9:32 AM

Herald added a subscriber: hiraditya. · View Herald TranscriptJul 4 2020, 9:32 AM

Harbormaster failed remote builds in B62906: Diff 275505!Jul 4 2020, 10:45 AM

Overall, I think this is a good direction, but I'd like to understand better why InstCombine needs more than a few iterations and how 'fix' these cases without bailing out after a fixed number of iterations.
I'd be interested to find out if we can make an IR pattern generator that forces InstCombine to run N iterations. Are there any algorithmic guarantees of the current implementation which we could use to show that InstCombine doesn't go into an infinite loop?

In D83160#2131473, @kuhar wrote:

Overall, I think this is a good direction, but I'd like to understand better why InstCombine needs more than a few iterations and how 'fix' these cases without bailing out after a fixed number of iterations.
I'd be interested to find out if we can make an IR pattern generator that forces InstCombine to run N iterations. Are there any algorithmic guarantees of the current implementation which we could use to show that InstCombine doesn't go into an infinite loop?

@nikic can say more, but the general idea is that instcombine traditionally didn't pay much attention
to adding instructions-to-be-revisited back into worklist after changing something,
so we wouldn't revisit some instructions until we do the next iteration with worklist containing all function's instructions.

The main fix is to ensure that we consistently replenish worklist.
The fix is NOT to bailout after a number of iterations.

In D83160#2131474, @lebedev.ri wrote:

In D83160#2131473, @kuhar wrote:

Overall, I think this is a good direction, but I'd like to understand better why InstCombine needs more than a few iterations and how 'fix' these cases without bailing out after a fixed number of iterations.
I'd be interested to find out if we can make an IR pattern generator that forces InstCombine to run N iterations. Are there any algorithmic guarantees of the current implementation which we could use to show that InstCombine doesn't go into an infinite loop?

@nikic can say more, but the general idea is that instcombine traditionally didn't pay much attention
to adding instructions-to-be-revisited back into worklist after changing something,
so we wouldn't revisit some instructions until we do the next iteration with worklist containing all function's instructions.

The main fix is to ensure that we consistently replenish worklist.
The fix is NOT to bailout after a number of iterations.

Right. I think the three most common sources of additional InstCombine iterations are:

Dead instructions are not added to the worklist.
Changed/dependent instructions are not added to the worklist.
Instruction scans are performed forward instead of backward.

I spent some time a few months ago eliminating such issues and IIRC got most/all InstCombine tests to run in 3 iterations. I haven't found any cases where InstCombine genuinely needed many iterations and it was not just an issue of worklist management.

Incrementally dropping the limit sounds like a good idea to me and makes it more likely that we'll become aware of issues (and probably also more likely that fuzzing can find them). As suggested inline, I would limit this to assertion-enabled builds, to avoid impacting end users hitting pathological cases.

llvm/lib/Transforms/InstCombine/InstructionCombining.cpp
133	Not really important for 100, which is still a very conservative value, but going forward I would split this up according to NDEBUG: static constexpr unsigned InstCombineDefaultMaxIterations = 1000; #ifndef NDEBUG static constexpr unsigned InstCombineDefaultInfiniteLoopThreshold = 100; #else static constexpr unsigned InstCombineDefaultInfiniteLoopThreshold = 1000; #endif In particular, I think we want to avoid making the end user compiler crash if they hit cases where instcombine is very slow, but still converges.

Addressing review notes.

Harbormaster completed remote builds in B62948: Diff 275580.Jul 5 2020, 4:32 PM

LGTM

This revision is now accepted and ready to land.Jul 6 2020, 12:22 AM

Closed by commit rGcd7f8051ac7b: [InstCombine] Lower infinite combine loop detection thresholds (authored by lebedev.ri). · Explain WhyJul 6 2020, 3:20 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

lib/

Transforms/

InstCombine/

InstructionCombining.cpp

5 lines

Diff 275634

llvm/lib/Transforms/InstCombine/InstructionCombining.cpp

	Show First 20 Lines • Show All 117 Lines • ▼ Show 20 Lines
	STATISTIC(NumDeadInst , "Number of dead inst eliminated");			STATISTIC(NumDeadInst , "Number of dead inst eliminated");
	STATISTIC(NumSunkInst , "Number of instructions sunk");			STATISTIC(NumSunkInst , "Number of instructions sunk");
	STATISTIC(NumExpand, "Number of expansions");			STATISTIC(NumExpand, "Number of expansions");
	STATISTIC(NumFactor , "Number of factorizations");			STATISTIC(NumFactor , "Number of factorizations");
	STATISTIC(NumReassoc , "Number of reassociations");			STATISTIC(NumReassoc , "Number of reassociations");
	DEBUG_COUNTER(VisitCounter, "instcombine-visit",			DEBUG_COUNTER(VisitCounter, "instcombine-visit",
	"Controls which instructions are visited");			"Controls which instructions are visited");

				// FIXME: these limits eventually should be as low as 2.
	static constexpr unsigned InstCombineDefaultMaxIterations = 1000;			static constexpr unsigned InstCombineDefaultMaxIterations = 1000;
				#ifndef NDEBUG
				static constexpr unsigned InstCombineDefaultInfiniteLoopThreshold = 100;
				#else
	static constexpr unsigned InstCombineDefaultInfiniteLoopThreshold = 1000;			static constexpr unsigned InstCombineDefaultInfiniteLoopThreshold = 1000;
				#endif

				nikicUnsubmitted Done Reply Inline Actions Not really important for 100, which is still a very conservative value, but going forward I would split this up according to NDEBUG: static constexpr unsigned InstCombineDefaultMaxIterations = 1000; #ifndef NDEBUG static constexpr unsigned InstCombineDefaultInfiniteLoopThreshold = 100; #else static constexpr unsigned InstCombineDefaultInfiniteLoopThreshold = 1000; #endif In particular, I think we want to avoid making the end user compiler crash if they hit cases where instcombine is very slow, but still converges. nikic: Not really important for 100, which is still a very conservative value, but going forward I…
	static cl::opt<bool>			static cl::opt<bool>
	EnableCodeSinking("instcombine-code-sinking", cl::desc("Enable code sinking"),			EnableCodeSinking("instcombine-code-sinking", cl::desc("Enable code sinking"),
	cl::init(true));			cl::init(true));

	static cl::opt<unsigned> LimitMaxIterations(			static cl::opt<unsigned> LimitMaxIterations(
	"instcombine-max-iterations",			"instcombine-max-iterations",
	cl::desc("Limit the maximum number of instruction combining iterations"),			cl::desc("Limit the maximum number of instruction combining iterations"),
	cl::init(InstCombineDefaultMaxIterations));			cl::init(InstCombineDefaultMaxIterations));
	▲ Show 20 Lines • Show All 3,735 Lines • Show Last 20 Lines