This is an archive of the discontinued LLVM Phabricator instance.

lib/Transforms/Utils/LoopUnroll.cpp
386–387 ↗	(On Diff #66747)	So, this is done because we leave the first copy of the loop alone when unrolling? If not, shouldn't we handle this below so that we don't add inner loops that won't actually exist after unrolling?
692 ↗	(On Diff #66747)	Related to the above TODO comment, the ways in which this can break loop simplified form seem fairly constrained. If this is a compile time issue, its likely a more targeted approach could be used... Happy to just leave a comment here if you don't yet know whether we need to consider the compile time here.

mzolotukhin added inline comments.Aug 3 2016, 7:48 PM

lib/Transforms/Utils/LoopUnroll.cpp
386–387 ↗	(On Diff #66747)	this is done because we leave the first copy of the loop alone when unrolling? Yes, exactly. When we unroll something like this: outer_loop { inner_loop } we could get something like this: outer_loop { inner_loop inner_loop2 } or even inner_loop inner_loop2 in the case of complete unrolling. `inner_loop2` is a new loop, created by loop-unrolling (see lines 417-418). `inner_loop` existed in the original IR and remains (almost) untouched. It still needs to be simplified, as it'll be using the same exit block, as the cloned loops.
692 ↗	(On Diff #66747)	the ways in which this can break loop simplified form seem fairly constrained This is true, that's why I left the TODO there. It should be definitely possible to fix only broken parts (i.e. insert preheaders and split exit-edges). On the other hand, we've been calling `simplifyLoop` on `OuterL` without any issues before, which also simplifies all nested loops - that was my motivation for keeping it simple for now. My idea was that comment from above relates to this spot as well, just didn't want to replicate the same TODO.

Patch LGTM.

Would it make sense to only fill in LoopsToSimplify if OuterL is null? Either way, feel free to land.

chandlerc accepted this revision.Aug 8 2016, 11:45 AM

chandlerc edited edge metadata.

This revision is now accepted and ready to land.Aug 8 2016, 11:45 AM

Closed by commit rL278038: [LoopUnroll] Simplify loops created by unrolling. (authored by mzolotukhin). · Explain WhyAug 8 2016, 12:10 PM

This revision was automatically updated to reflect the committed changes.

Thanks, committed in r278038.

Would it make sense to only fill in LoopsToSimplify if OuterL is null?

I think the checks will clutter the code for not so big benefit, so I decided to go without them.

Revision Contents

Path

Size

llvm/

trunk/

lib/

Transforms/

Utils/

LoopUnroll.cpp

19 lines

Diff 67214

llvm/trunk/lib/Transforms/Utils/LoopUnroll.cpp

Show First 20 Lines • Show All 371 Lines • ▼ Show 20 Lines	bool llvm::UnrollLoop(Loop *L, unsigned Count, unsigned TripCount, bool Force,
LoopBlocksDFS DFS(L);		LoopBlocksDFS DFS(L);
DFS.perform(LI);		DFS.perform(LI);

// Stash the DFS iterators before adding blocks to the loop.		// Stash the DFS iterators before adding blocks to the loop.
LoopBlocksDFS::RPOIterator BlockBegin = DFS.beginRPO();		LoopBlocksDFS::RPOIterator BlockBegin = DFS.beginRPO();
LoopBlocksDFS::RPOIterator BlockEnd = DFS.endRPO();		LoopBlocksDFS::RPOIterator BlockEnd = DFS.endRPO();

std::vector<BasicBlock*> UnrolledLoopBlocks = L->getBlocks();		std::vector<BasicBlock*> UnrolledLoopBlocks = L->getBlocks();

		// Loop Unrolling might create new loops. While we do preserve LoopInfo, we
		// might break loop-simplified form for these loops (as they, e.g., would
		// share the same exit blocks). We'll keep track of loops for which we can
		// break this so that later we can re-simplify them.
		SmallSetVector<Loop *, 4> LoopsToSimplify;
		for (Loop SubLoop : L)
		LoopsToSimplify.insert(SubLoop);

for (unsigned It = 1; It != Count; ++It) {		for (unsigned It = 1; It != Count; ++It) {
std::vector<BasicBlock*> NewBlocks;		std::vector<BasicBlock*> NewBlocks;
SmallDenseMap<const Loop , Loop , 4> NewLoops;		SmallDenseMap<const Loop , Loop , 4> NewLoops;
NewLoops[L] = L;		NewLoops[L] = L;

for (LoopBlocksDFS::RPOIterator BB = BlockBegin; BB != BlockEnd; ++BB) {		for (LoopBlocksDFS::RPOIterator BB = BlockBegin; BB != BlockEnd; ++BB) {
ValueToValueMapTy VMap;		ValueToValueMapTy VMap;
BasicBlock New = CloneBasicBlock(BB, VMap, "." + Twine(It));		BasicBlock New = CloneBasicBlock(BB, VMap, "." + Twine(It));
Show All 14 Lines	for (LoopBlocksDFS::RPOIterator BB = BlockBegin; BB != BlockEnd; ++BB) {
assert(*BB == OldLoop->getHeader() &&		assert(*BB == OldLoop->getHeader() &&
"Header should be first in RPO");		"Header should be first in RPO");

Loop *NewLoopParent = NewLoops.lookup(OldLoop->getParentLoop());		Loop *NewLoopParent = NewLoops.lookup(OldLoop->getParentLoop());
assert(NewLoopParent &&		assert(NewLoopParent &&
"Expected parent loop before sub-loop in RPO");		"Expected parent loop before sub-loop in RPO");
NewLoop = new Loop;		NewLoop = new Loop;
NewLoopParent->addChildLoop(NewLoop);		NewLoopParent->addChildLoop(NewLoop);
		LoopsToSimplify.insert(NewLoop);

// Forget the old loop, since its inputs may have changed.		// Forget the old loop, since its inputs may have changed.
if (SE)		if (SE)
SE->forgetLoop(OldLoop);		SE->forgetLoop(OldLoop);
}		}
NewLoop->addBasicBlockToLoop(New, *LI);		NewLoop->addBasicBlockToLoop(New, *LI);
}		}

▲ Show 20 Lines • Show All 235 Lines • ▼ Show 20 Lines	bool llvm::UnrollLoop(Loop *L, unsigned Count, unsigned TripCount, bool Force,
// If we have a pass and a DominatorTree we should re-simplify impacted loops		// If we have a pass and a DominatorTree we should re-simplify impacted loops
// to ensure subsequent analyses can rely on this form. We want to simplify		// to ensure subsequent analyses can rely on this form. We want to simplify
// at least one layer outside of the loop that was unrolled so that any		// at least one layer outside of the loop that was unrolled so that any
// changes to the parent loop exposed by the unrolling are considered.		// changes to the parent loop exposed by the unrolling are considered.
if (DT) {		if (DT) {
if (!OuterL && !CompletelyUnroll)		if (!OuterL && !CompletelyUnroll)
OuterL = L;		OuterL = L;
if (OuterL) {		if (OuterL) {
		// OuterL includes all loops for which we can break loop-simplify, so
		// it's sufficient to simplify only it (it'll recursively simplify inner
		// loops too).
		// TODO: That potentially might be compile-time expensive. We should try
		// to fix the loop-simplified form incrementally.
simplifyLoop(OuterL, DT, LI, SE, AC, PreserveLCSSA);		simplifyLoop(OuterL, DT, LI, SE, AC, PreserveLCSSA);

// LCSSA must be performed on the outermost affected loop. The unrolled		// LCSSA must be performed on the outermost affected loop. The unrolled
// loop's last loop latch is guaranteed to be in the outermost loop after		// loop's last loop latch is guaranteed to be in the outermost loop after
// LoopInfo's been updated by markAsRemoved.		// LoopInfo's been updated by markAsRemoved.
Loop *LatchLoop = LI->getLoopFor(Latches.back());		Loop *LatchLoop = LI->getLoopFor(Latches.back());
if (!OuterL->contains(LatchLoop))		if (!OuterL->contains(LatchLoop))
while (OuterL->getParentLoop() != LatchLoop)		while (OuterL->getParentLoop() != LatchLoop)
OuterL = OuterL->getParentLoop();		OuterL = OuterL->getParentLoop();

if (NeedToFixLCSSA)		if (NeedToFixLCSSA)
formLCSSARecursively(OuterL, DT, LI, SE);		formLCSSARecursively(OuterL, DT, LI, SE);
else		else
assert(OuterL->isLCSSAForm(*DT) &&		assert(OuterL->isLCSSAForm(*DT) &&
"Loops should be in LCSSA form after loop-unroll.");		"Loops should be in LCSSA form after loop-unroll.");
		} else {
		// Simplify loops for which we might've broken loop-simplify form.
		for (Loop *SubLoop : LoopsToSimplify)
		simplifyLoop(SubLoop, DT, LI, SE, AC, PreserveLCSSA);
}		}
}		}

return true;		return true;
}		}

/// Given an llvm.loop loop id metadata node, returns the loop hint metadata		/// Given an llvm.loop loop id metadata node, returns the loop hint metadata
/// node with the given name (for example, "llvm.loop.unroll.count"). If no		/// node with the given name (for example, "llvm.loop.unroll.count"). If no
Show All 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[LoopUnroll] Simplify loops created by unrolling.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 67214

llvm/trunk/lib/Transforms/Utils/LoopUnroll.cpp

[LoopUnroll] Simplify loops created by unrolling.
ClosedPublic