This is an archive of the discontinued LLVM Phabricator instance.

[LCSSA] Handle PHI insertion in disjoint loops
ClosedPublic

Authored by bruno on Dec 11 2014, 11:52 AM.

Download Raw Diff

Details

Reviewers

chandlerc
hfinkel

Commits

rGbad65c3b708a: [LCSSA] Handle PHI insertion in disjoint loops

Summary

Take two disjoint Loops L1 and L2.

LoopSimplify skips to simplify some loops (e.g. when indirect branches are involved). In such situations, it can happen that an exit for L1 is the header of L2. Thus, when we create PHIs in one of such exits we are also inserting PHIs in L2 header.

This could break LCSSA form for L2 because these inserted PHIs can also have uses in L2 exits, which are never handled in the current implementation. Provide a fix for this corner case and test that we don't assert/crash on that.

Diff Detail

Event Timeline

bruno updated this revision to Diff 17179.Dec 11 2014, 11:52 AM

bruno retitled this revision from to [LCSSA] Handle PHI insertion in disjoint loops.

bruno updated this object.

bruno edited the test plan for this revision. (Show Details)

bruno added reviewers: chandlerc, hfinkel.

bruno set the repository for this revision to rL LLVM.

bruno added a subscriber: Unknown Object (MLST).

Ping!

chandlerc added inline comments.Dec 15 2014, 11:19 AM

lib/Transforms/Utils/LCSSA.cpp
97	Rather than a set pointing into AddedPHIs, just keep a list? We don't put PHI nodes into AddedPHIs more than once, so there is no need for set logic, just a separate list for the ones needing post-processing?
136–137	I would say "LoopSimplify might fail to simplify some loops (...)."
139	"PHIs in such an exit block, we are also inserting PHIs into L2's header." reads a bit better to me.
141	"can also have uses outside of L2."? It doesn't matter where the uses are, the key is that we're inserting a loop value in the form of a PHI node in the header.
182–192	Add a comment that we specifically recurse here? Note that I can probably blow out the stack here with a suitably horrible crafted IR: simply set up a chain of loops such as this that is arbitrarily long. It's pretty contrived, but at least leave a FIXME that we should really convert this entire thing to a worklist approach where we process a vector of instructions...

Thanks for the review Chandler, patch updated with your suggestions.

Ping!

One totally nit-picky comment, and one substantial comment. Feel free to commit once addressed.

lib/Transforms/Utils/LCSSA.cpp
144–146	I would write this as: if (auto OtherLoop = LI->getLoopFor(ExitBB)) if (!L.contains(OtherLoop)) PostProcessPHIs.push_back(PN)
177–181	Ow. This is quadratic in all cases AFAICT. Either we move all remaining elements in PostProcessPHIs down a slot, or we skip over an element in every trip throuh AddedPHIs (AddedPHIs is a superset and ordered the same). Instead of all this, how about we do the post processing first, and then this loop? You can add a similar guard to skip post processing of phis with empty use lists. That avoids all the complexity here.

This revision is now accepted and ready to land.Dec 22 2014, 11:28 AM

bruno added inline comments.Dec 22 2014, 11:56 AM

lib/Transforms/Utils/LCSSA.cpp
144–146	Neat!
177–181	Yep, I was trying to avoid it by using SmallPtrSet in the first patch, thought you had noticed the tradeoff, should have made that explicit. But this solution looks better anyway. Thanks again. :-)

bruno closed this revision.Dec 22 2014, 2:39 PM

Revision Contents

Path

Size

include/

llvm/

Transforms/

Utils/

LoopUtils.h

5 lines

lib/

Transforms/

Scalar/

LICM.cpp

3 lines

Utils/

LCSSA.cpp

56 lines

LoopUnroll.cpp

2 lines

test/

Transforms/

LCSSA/

indirectbr.ll

40 lines

Diff 17332

include/llvm/Transforms/Utils/LoopUtils.h

Context not available.
	/// If ScalarEvolution is passed in, it will be preserved.	/// If ScalarEvolution is passed in, it will be preserved.
	///	///
	/// Returns true if any modifications are made to the loop.	/// Returns true if any modifications are made to the loop.
	bool formLCSSA(Loop &L, DominatorTree &DT, ScalarEvolution *SE = nullptr);	bool formLCSSA(Loop &L, DominatorTree &DT, LoopInfo *LI,
		ScalarEvolution *SE = nullptr);

	/// \brief Put a loop nest into LCSSA form.	/// \brief Put a loop nest into LCSSA form.
	///	///
Context not available.
	/// If ScalarEvolution is passed in, it will be preserved.	/// If ScalarEvolution is passed in, it will be preserved.
	///	///
	/// Returns true if any modifications are made to the loop.	/// Returns true if any modifications are made to the loop.
	bool formLCSSARecursively(Loop &L, DominatorTree &DT,	bool formLCSSARecursively(Loop &L, DominatorTree &DT, LoopInfo *LI,
	ScalarEvolution *SE = nullptr);	ScalarEvolution *SE = nullptr);
	}	}

Context not available.

lib/Transforms/Scalar/LICM.cpp

Context not available.
	// SSAUpdater strategy during promotion that was LCSSA aware and reformed	// SSAUpdater strategy during promotion that was LCSSA aware and reformed
	// it as it went.	// it as it went.
	if (Changed)	if (Changed)
	formLCSSARecursively(L, DT, getAnalysisIfAvailable<ScalarEvolution>());	formLCSSARecursively(L, DT, LI,
		getAnalysisIfAvailable<ScalarEvolution>());
	}	}

	// Check that neither this loop nor its parent have had LCSSA broken. LICM is	// Check that neither this loop nor its parent have had LCSSA broken. LICM is
Context not available.

lib/Transforms/Utils/LCSSA.cpp

Context not available.
	/// uses.	/// uses.
	static bool processInstruction(Loop &L, Instruction &Inst, DominatorTree &DT,	static bool processInstruction(Loop &L, Instruction &Inst, DominatorTree &DT,
	const SmallVectorImpl<BasicBlock *> &ExitBlocks,	const SmallVectorImpl<BasicBlock *> &ExitBlocks,
	PredIteratorCache &PredCache) {	PredIteratorCache &PredCache, LoopInfo *LI) {
	SmallVector<Use *, 16> UsesToRewrite;	SmallVector<Use *, 16> UsesToRewrite;

	BasicBlock *InstBB = Inst.getParent();	BasicBlock *InstBB = Inst.getParent();
Context not available.
	DomTreeNode *DomNode = DT.getNode(DomBB);	DomTreeNode *DomNode = DT.getNode(DomBB);

	SmallVector<PHINode *, 16> AddedPHIs;	SmallVector<PHINode *, 16> AddedPHIs;
		SmallVector<PHINode *, 8> PostProcessPHIs;
		chandlercUnsubmitted Not Done Reply Inline Actions Rather than a set pointing into AddedPHIs, just keep a list? We don't put PHI nodes into AddedPHIs more than once, so there is no need for set logic, just a separate list for the ones needing post-processing? chandlerc: Rather than a set pointing into AddedPHIs, just keep a list? We don't put PHI nodes into…

	SSAUpdater SSAUpdate;	SSAUpdater SSAUpdate;
	SSAUpdate.Initialize(Inst.getType(), Inst.getName());	SSAUpdate.Initialize(Inst.getType(), Inst.getName());
Context not available.

	// Remember that this phi makes the value alive in this block.	// Remember that this phi makes the value alive in this block.
	SSAUpdate.AddAvailableValue(ExitBB, PN);	SSAUpdate.AddAvailableValue(ExitBB, PN);

		// LoopSimplify might fail to simplify some loops (e.g. when indirect
		// branches are involved). In such situations, it might happen that an exit
		chandlercUnsubmitted Not Done Reply Inline Actions I would say "LoopSimplify might fail to simplify some loops (...)." chandlerc: I would say "LoopSimplify might fail to simplify some loops (...)."
		// for Loop L1 is the header of a disjoint Loop L2. Thus, when we create
		// PHIs in such an exit block, we are also inserting PHIs into L2's header.
		chandlercUnsubmitted Not Done Reply Inline Actions "PHIs in such an exit block, we are also inserting PHIs into L2's header." reads a bit better to me. chandlerc: "PHIs in such an exit block, we are also inserting PHIs into L2's header." reads a bit better…
		// This could break LCSSA form for L2 because these inserted PHIs can also
		// have uses outside of L2. Remember all PHIs in such situation as to
		chandlercUnsubmitted Not Done Reply Inline Actions "can also have uses outside of L2."? It doesn't matter where the uses are, the key is that we're inserting a loop value in the form of a PHI node in the header. chandlerc: "can also have uses outside of L2."? It doesn't matter where the uses are, the key is that…
		// revisit than later on. FIXME: Remove this if indirectbr support into
		// LoopSimplify gets improved.
		Loop *OtherLoop = LI->getLoopFor(ExitBB);
		if (OtherLoop && !L.contains(OtherLoop))
		PostProcessPHIs.push_back(PN);
		chandlercUnsubmitted Not Done Reply Inline Actions I would write this as: if (auto OtherLoop = LI->getLoopFor(ExitBB)) if (!L.contains(OtherLoop)) PostProcessPHIs.push_back(PN) chandlerc: I would write this as: if (auto OtherLoop = LI->getLoopFor(ExitBB)) if (!L.contains…
		brunoAuthorUnsubmitted Not Done Reply Inline Actions Neat! bruno: ------ Neat!
	}	}

	// Rewrite all uses outside the loop in terms of the new PHIs we just	// Rewrite all uses outside the loop in terms of the new PHIs we just
Context not available.

	// Remove PHI nodes that did not have any uses rewritten.	// Remove PHI nodes that did not have any uses rewritten.
	for (unsigned i = 0, e = AddedPHIs.size(); i != e; ++i) {	for (unsigned i = 0, e = AddedPHIs.size(); i != e; ++i) {
	if (AddedPHIs[i]->use_empty())	if (!AddedPHIs[i]->use_empty())
	AddedPHIs[i]->eraseFromParent();	continue;
		for (unsigned j = 0, ee = PostProcessPHIs.size(); j != ee; ++j)
		if (PostProcessPHIs[j] == AddedPHIs[i]) {
		PostProcessPHIs.erase(PostProcessPHIs.begin()+j);
		break;
		}
		chandlercUnsubmitted Not Done Reply Inline Actions Ow. This is quadratic in all cases AFAICT. Either we move all remaining elements in PostProcessPHIs down a slot, or we skip over an element in every trip throuh AddedPHIs (AddedPHIs is a superset and ordered the same). Instead of all this, how about we do the post processing first, and then this loop? You can add a similar guard to skip post processing of phis with empty use lists. That avoids all the complexity here. chandlerc: Ow. This is quadratic in all cases AFAICT. Either we move all remaining elements in…
		brunoAuthorUnsubmitted Not Done Reply Inline Actions Yep, I was trying to avoid it by using SmallPtrSet in the first patch, thought you had noticed the tradeoff, should have made that explicit. But this solution looks better anyway. Thanks again. :-) bruno: ---------- Yep, I was trying to avoid it by using SmallPtrSet in the first patch, thought you…
		AddedPHIs[i]->eraseFromParent();
		}

		// Post process PHI instructions that were inserted into another disjoint loop
		// and update their exits properly.
		for (auto *I : PostProcessPHIs) {
		BasicBlock *PHIBB = I->getParent();
		Loop *OtherLoop = LI->getLoopFor(PHIBB);
		SmallVector<BasicBlock *, 8> EBs;
		OtherLoop->getExitBlocks(EBs);
		if (EBs.empty())
		chandlercUnsubmitted Not Done Reply Inline Actions Add a comment that we specifically recurse here? Note that I can probably blow out the stack here with a suitably horrible crafted IR: simply set up a chain of loops such as this that is arbitrarily long. It's pretty contrived, but at least leave a FIXME that we should really convert this entire thing to a worklist approach where we process a vector of instructions... chandlerc: Add a comment that we specifically recurse here? Note that I can probably blow out the stack…
		continue;

		// Recurse and re-process each PHI instruction. FIXME: we should really
		// convert this entire thing to a worklist approach where we process a
		// vector of instructions...
		processInstruction(OtherLoop, I, DT, EBs, PredCache, LI);
	}	}

	return true;	return true;
Context not available.
	return false;	return false;
	}	}

	bool llvm::formLCSSA(Loop &L, DominatorTree &DT, ScalarEvolution *SE) {	bool llvm::formLCSSA(Loop &L, DominatorTree &DT, LoopInfo *LI,
		ScalarEvolution *SE) {
	bool Changed = false;	bool Changed = false;

	// Get the set of exiting blocks.	// Get the set of exiting blocks.
Context not available.
	!isa<PHINode>(I->user_back())))	!isa<PHINode>(I->user_back())))
	continue;	continue;

	Changed \|= processInstruction(L, *I, DT, ExitBlocks, PredCache);	Changed \|= processInstruction(L, *I, DT, ExitBlocks, PredCache, LI);
	}	}
	}	}

Context not available.
	}	}

	/// Process a loop nest depth first.	/// Process a loop nest depth first.
	bool llvm::formLCSSARecursively(Loop &L, DominatorTree &DT,	bool llvm::formLCSSARecursively(Loop &L, DominatorTree &DT, LoopInfo *LI,
	ScalarEvolution *SE) {	ScalarEvolution *SE) {
	bool Changed = false;	bool Changed = false;

	// Recurse depth-first through inner loops.	// Recurse depth-first through inner loops.
	for (Loop::iterator LI = L.begin(), LE = L.end(); LI != LE; ++LI)	for (Loop::iterator I = L.begin(), E = L.end(); I != E; ++I)
	Changed \|= formLCSSARecursively(**LI, DT, SE);	Changed \|= formLCSSARecursively(**I, DT, LI, SE);

	Changed \|= formLCSSA(L, DT, SE);	Changed \|= formLCSSA(L, DT, LI, SE);
	return Changed;	return Changed;
	}	}

Context not available.

	// Simplify each loop nest in the function.	// Simplify each loop nest in the function.
	for (LoopInfo::iterator I = LI->begin(), E = LI->end(); I != E; ++I)	for (LoopInfo::iterator I = LI->begin(), E = LI->end(); I != E; ++I)
	Changed \|= formLCSSARecursively(*I, DT, SE);	Changed \|= formLCSSARecursively(*I, DT, LI, SE);

	return Changed;	return Changed;
	}	}
Context not available.

lib/Transforms/Utils/LoopUnroll.cpp

Context not available.
	while (OuterL->getParentLoop() != LatchLoop)	while (OuterL->getParentLoop() != LatchLoop)
	OuterL = OuterL->getParentLoop();	OuterL = OuterL->getParentLoop();

	formLCSSARecursively(OuterL, DT, SE);	formLCSSARecursively(OuterL, DT, LI, SE);
	}	}
	}	}

Context not available.

test/Transforms/LCSSA/indirectbr.ll

	; RUN: opt < %s -lcssa -verify-loop-info -verify-dom-info -disable-output	; RUN: opt < %s -loop-simplify -lcssa -verify-loop-info -verify-dom-info -S \| FileCheck %s
	; PR5437

	; LCSSA should work correctly in the case of an indirectbr that exits	; LCSSA should work correctly in the case of an indirectbr that exits
	; the loop, and the loop has exits with predecessors not within the loop	; the loop, and the loop has exits with predecessors not within the loop
	; (and btw these edges are unsplittable due to the indirectbr).	; (and btw these edges are unsplittable due to the indirectbr).
		; PR5437
	define i32 @js_Interpret() nounwind {	define i32 @test0() nounwind {
		; CHECK-LABEL: @test0
	entry:	entry:
	br i1 undef, label %"4", label %"3"	br i1 undef, label %"4", label %"3"

Context not available.
	"1862": ; preds = %"1836", %"692"	"1862": ; preds = %"1836", %"692"
	unreachable	unreachable
	}	}

		; An exit for Loop L1 may be the header of a disjoint Loop L2. Thus, when we
		; create PHIs in one of such exits we are also inserting PHIs in L2 header. This
		; could break LCSSA form for L2 because these inserted PHIs can also have uses
		; in L2 exits. Test that we don't assert/crash on that.
		define void @test1() {
		; CHECK-LABEL: @test1
		br label %lab1

		lab1:
		%tmp21 = add i32 undef, 677038203
		br i1 undef, label %lab2, label %exit

		lab2:
		indirectbr i8* undef, [label %lab1, label %lab3]

		lab3:
		; CHECK: %tmp21.lcssa1 = phi i32 [ %tmp21.lcssa1, %lab4 ], [ %tmp21, %lab2 ]
		%tmp12 = phi i32 [ %tmp21, %lab2 ], [ %tmp12, %lab4 ]
		br i1 undef, label %lab5, label %lab4

		lab4:
		br label %lab3

		lab5:
		; CHECK: %tmp21.lcssa1.lcssa = phi i32 [ %tmp21.lcssa1, %lab3 ]
		%tmp15 = add i32 %tmp12, undef
		br label %exit

		exit:
		ret void
		}
Context not available.

This is an archive of the discontinued LLVM Phabricator instance.

[LCSSA] Handle PHI insertion in disjoint loopsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 17332

include/llvm/Transforms/Utils/LoopUtils.h

lib/Transforms/Scalar/LICM.cpp

lib/Transforms/Utils/LCSSA.cpp

lib/Transforms/Utils/LoopUnroll.cpp

test/Transforms/LCSSA/indirectbr.ll

[LCSSA] Handle PHI insertion in disjoint loops
ClosedPublic