This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Analysis/
-
llvm/
-
Analysis/
2/3
LoopNestAnalysis.h
-
lib/Analysis/
-
Analysis/
8/9
LoopInfo.cpp
1/2
LoopNestAnalysis.cpp
-
test/Analysis/LoopNestAnalysis/
-
Analysis/
-
LoopNestAnalysis/
2/2
imperfectnest.ll
-
perfectnest.ll
-
unittests/Analysis/
-
Analysis/
-
LoopInfoTest.cpp

Differential D94717

[LoopNest] Consider loop nest with inner loop guard using outer loop induction variable to be perfect
ClosedPublic

Authored by Whitney on Jan 14 2021, 1:37 PM.

Download Raw Diff

Details

Reviewers

Meinersbur
bmahjour
fhahn
sidbav

Commits

rG1006ac3963ea: [LoopNest] Consider loop nest with inner loop guard using outer loop

Summary

This patch allow more conditional branches to be considered as loop guard, and so more loop nests can be considered perfect.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

sidbav created this revision.Jan 14 2021, 1:37 PM

Herald added a subscriber: hiraditya. · View Herald TranscriptJan 14 2021, 1:37 PM

sidbav requested review of this revision.Jan 14 2021, 1:37 PM

Herald added a project: Restricted Project. · View Herald TranscriptJan 14 2021, 1:37 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Whitney added inline comments.Jan 14 2021, 2:00 PM

llvm/test/Analysis/LoopNestAnalysis/imperfectnest.ll
428	This test case should be considered as imperfect, as the user branch is not the inner loop guard.

Harbormaster completed remote builds in B85236: Diff 316766.Jan 14 2021, 2:46 PM

bmahjour requested changes to this revision.Apr 28 2021, 1:19 PM

bmahjour added inline comments.

llvm/lib/Analysis/LoopInfo.cpp
405	apart from checking that the control flow can be simplified, we also need to check to make sure that the chain of BBs don't contain unsafe instructions (or at least check that they are empty).
llvm/test/Analysis/LoopNestAnalysis/imperfectnest.ll
428	the user branch is behaving as a guard so we can allow it per the current definition of a loop guard.

This revision now requires changes to proceed.Apr 28 2021, 1:19 PM

Ensure the chain of BBs are empty blocks.

sidbav marked 3 inline comments as done.Apr 29 2021, 9:59 AM

Whitney accepted this revision.Apr 29 2021, 10:43 AM

Harbormaster completed remote builds in B101667: Diff 341557.Apr 29 2021, 10:53 AM

bmahjour added inline comments.Apr 29 2021, 12:02 PM

llvm/lib/Analysis/LoopInfo.cpp
398	[nit] this comment needs to be reworded to be more clear.
401	AFAICS `skipEmptyBlockUntil` doesn't check that the unique successor blocks have unique predecessors. Don't we need to check for that too?

Whitney added inline comments.Apr 29 2021, 12:16 PM

llvm/lib/Analysis/LoopInfo.cpp
401	I thought about that too, but it seems to be questionable... if (cond) goto label; if (0 < N) { for (int i = 0; i < N; ++i) {...} label: } Should we consider `if (0 < N)` to be a loop guard? It actually guarded the loop, but it is not a single entry single exit region. If we decided to check for unique predecessor, it may make sense to do it in `skipEmptyBlockUntil`.

bmahjour added inline comments.Apr 29 2021, 1:01 PM

llvm/lib/Analysis/LoopInfo.cpp
401	I think we should keep the control-flow cases that are considered "guard-like" fairly simple (otherwise transforms will be faced with too many canonical forms to have to deal with). I'd say we do not consider `if (0 < N)` in the example above as a loop guard, given that `label:` is also being guarded by that condition. There may be legitimate use cases for the current semantics of `skipEmptyBlockUntil`, so I think we should create another version of that function (or pass a flag to it) to additionally check for existence of unique predecessors. Then we could use that version of the function here to decide whether the control flow structure should be considered a guard. @sidbav could you please do that?

Modify skipEmptyBlockUntil to also consider unique predecessors.

sidbav marked 4 inline comments as done.Apr 29 2021, 4:42 PM

sidbav added inline comments.

llvm/lib/Analysis/LoopInfo.cpp
401	Yes, I intended on adding adding it, but I ran into LIT test issues so I did not put those changes in the patch.... Just took another look at it, and I realized I made a mistake in the initial implementation. It is working now.

Whitney added inline comments.Apr 29 2021, 4:48 PM

llvm/include/llvm/Analysis/LoopNestAnalysis.h
64–66	Please update the description.
llvm/lib/Analysis/LoopInfo.cpp
401	@sidbav Can you please add a unit test in llvm/unittests/Analysis/LoopInfoTest.cpp for a test case like if (cond) goto label; if (0 < N) { for (int i = 0; i < N; ++i) {...} label: } where the branch `if (0 < N)` should be not a guard.

Harbormaster completed remote builds in B101781: Diff 341708.Apr 29 2021, 7:53 PM

bmahjour added inline comments.Apr 30 2021, 8:14 AM

llvm/include/llvm/Analysis/LoopNestAnalysis.h
69	[nit] rename UniquePred -> CheckUniquePred.
llvm/lib/Analysis/LoopInfo.cpp
403	[nit] rename UniquePred -> CheckUniquePred.
llvm/lib/Analysis/LoopNestAnalysis.cpp
210	[nit] rename UniquePred -> CheckUniquePred.

address review comments

sidbav marked 5 inline comments as done.Apr 30 2021, 2:33 PM

Whitney accepted this revision.Apr 30 2021, 2:35 PM

bmahjour accepted this revision.Apr 30 2021, 2:39 PM

This revision is now accepted and ready to land.Apr 30 2021, 2:39 PM

The previous Solution to bug was more of a hack rather than a fix. This patch actually resolves the bug

Whitney accepted this revision.Apr 30 2021, 2:52 PM

I think this changes the semantics for the cases that we used to handle before. For example wouldn't this require the exit successors to be empty? If so I think lcssa phis can prevent us from detecting guards. We need to test for those cases as well.

This revision now requires changes to proceed.Apr 30 2021, 3:10 PM

Harbormaster completed remote builds in B102023: Diff 342044.Apr 30 2021, 4:54 PM

Harbormaster completed remote builds in B102028: Diff 342051.Apr 30 2021, 4:57 PM

In D94717#2730582, @bmahjour wrote:

I think this changes the semantics for the cases that we used to handle before. For example wouldn't this require the exit successors to be empty? If so I think lcssa phis can prevent us from detecting guards. We need to test for those cases as well.

LCSSA phis should be in the loop exit block, not exit successors.

In D94717#2730948, @Whitney wrote:

In D94717#2730582, @bmahjour wrote:

I think this changes the semantics for the cases that we used to handle before. For example wouldn't this require the exit successors to be empty? If so I think lcssa phis can prevent us from detecting guards. We need to test for those cases as well.

LCSSA phis should be in the loop exit block, not exit successors.

int foo(char * restrict aa, int N) {
  int sum = 0;
  for (int i = 0; i < N; i++) {
    sum += aa[i];
  }
  return sum;
}

The corresponding IR would be:

define dso_local signext i32 @foo(i8* noalias %aa, i32 signext %N) #0 {
entry:
  %cmp1 = icmp sgt i32 %N, 0
  br i1 %cmp1, label %for.body.preheader, label %for.end

for.body.preheader:                               ; preds = %entry
  %wide.trip.count = zext i32 %N to i64
  br label %for.body

for.body:                                         ; preds = %for.body.preheader, %for.body
  %indvars.iv = phi i64 [ 0, %for.body.preheader ], [ %indvars.iv.next, %for.body ]
  %sum.02 = phi i32 [ %add, %for.body ], [ 0, %for.body.preheader ]
  %arrayidx = getelementptr inbounds i8, i8* %aa, i64 %indvars.iv
  %0 = load i8, i8* %arrayidx, align 1, !tbaa !2
  %conv = zext i8 %0 to i32
  %add = add nuw nsw i32 %sum.02, %conv
  %indvars.iv.next = add nuw nsw i64 %indvars.iv, 1
  %exitcond = icmp ne i64 %indvars.iv.next, %wide.trip.count
  br i1 %exitcond, label %for.body, label %for.end.loopexit, !llvm.loop !5

for.end.loopexit:                                 ; preds = %for.body
  %add.lcssa = phi i32 [ %add, %for.body ]
  br label %for.end

for.end:                                          ; preds = %for.end.loopexit, %entry
  %sum.0.lcssa = phi i32 [ 0, %entry ], [ %add.lcssa, %for.end.loopexit ]
  ret i32 %sum.0.lcssa
}

As you can see for.end is not empty and so this loop's guard won't be detected!

As you can see for.end is not empty and so this loop's guard won't be detected!

From looking at skipEmptyBlockUntil, End is not require to be empty, so skipEmptyBlockUntil(for.end.loopexit, for.end) would return for.end.

llvm/lib/Analysis/LoopInfo.cpp
383–384	These 3 lines can be removed.
llvm/lib/Analysis/LoopNestAnalysis.cpp
223–224	Should change this to `const BasicBlock *PredBB = From;`

In D94717#2737167, @Whitney wrote:

As you can see for.end is not empty and so this loop's guard won't be detected!

From looking at skipEmptyBlockUntil, End is not require to be empty, so skipEmptyBlockUntil(for.end.loopexit, for.end) would return for.end.

True, although you can have another bb in between that has code, and then we won't be able to detect the guard. I don't think we would want to limit our ability to detect a guard when there is code in the epilogue. There is also the question of what "code" is considered neutral. Perhaps we need a way to distinguish between guards that are "dedicated" and those that are not.

Whitney commandeered this revision.May 5 2021, 1:55 PM

Whitney mentioned this in D101943: [LoopNest] Consider loop nest with inner loop guard using outer loop induction variable to be perfect.

Whitney edited reviewers, added: sidbav; removed: Whitney.

Addressed review comments.

So if I understand correctly, we still allow code in the exit block and the other target of the guard branch like before, so this is purely an improvement over what we had before. Although, as I said before I think we should allow non-empty blocks as well, but that can come as future extensions.

llvm/include/llvm/Analysis/LoopNestAnalysis.h
65	[nit] has an unique -> has a unique

This revision is now accepted and ready to land.May 5 2021, 2:14 PM

In D94717#2740268, @bmahjour wrote:

So if I understand correctly, we still allow code in the exit block and the other target of the guard branch like before, so this is purely an improvement over what we had before. Although, as I said before I think we should allow non-empty blocks as well, but that can come as future extensions.

Yes, our understanding are the same.

Harbormaster completed remote builds in B102841: Diff 343184.May 5 2021, 3:01 PM

Addressed review comment, and fixed a bug in detecting perfect nest using skipEmptyBlockUntil.

Harbormaster completed remote builds in B102904: Diff 343275.May 5 2021, 8:34 PM

LGTM

Closed by commit rG1006ac3963ea: [LoopNest] Consider loop nest with inner loop guard using outer loop (authored by Whitney). · Explain WhyMay 7 2021, 9:04 AM

This revision was automatically updated to reflect the committed changes.

Whitney added a commit: rG1006ac3963ea: [LoopNest] Consider loop nest with inner loop guard using outer loop.

MaskRay mentioned this in rG724604901a10: [unittest] Fix -Wunused-variable after D94717.May 7 2021, 11:42 AM

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

LoopNestAnalysis.h

8 lines

lib/

Analysis/

LoopInfo.cpp

17 lines

LoopNestAnalysis.cpp

16 lines

test/

Analysis/

LoopNestAnalysis/

imperfectnest.ll

67 lines

perfectnest.ll

145 lines

unittests/

Analysis/

LoopInfoTest.cpp

48 lines

Diff 343697

llvm/include/llvm/Analysis/LoopNestAnalysis.h

Show First 20 Lines • Show All 55 Lines • ▼ Show 20 Lines	public:
/// for(j) // loop at level 2		/// for(j) // loop at level 2
/// <code>		/// <code>
/// for(k) // loop at level 3		/// for(k) // loop at level 3
/// \endcode		/// \endcode
/// getMaxPerfectDepth(Loop_i) would return 2.		/// getMaxPerfectDepth(Loop_i) would return 2.
static unsigned getMaxPerfectDepth(const Loop &Root, ScalarEvolution &SE);		static unsigned getMaxPerfectDepth(const Loop &Root, ScalarEvolution &SE);

/// Recursivelly traverse all empty 'single successor' basic blocks of \p From		/// Recursivelly traverse all empty 'single successor' basic blocks of \p From
/// (if there are any). Return the last basic block found or \p End if it was		/// (if there are any). When \p CheckUniquePred is set to true, check if
/// reached during the search.		/// each of the empty single successors has a unique predecessor. Return
		bmahjourUnsubmitted Not Done Reply Inline Actions [nit] has an unique -> has a unique bmahjour: [nit] has an unique -> has a unique
		/// the last basic block found or \p End if it was reached during the search.
		WhitneyAuthorUnsubmitted Done Reply Inline Actions Please update the description. Whitney: Please update the description.
static const BasicBlock &skipEmptyBlockUntil(const BasicBlock *From,		static const BasicBlock &skipEmptyBlockUntil(const BasicBlock *From,
const BasicBlock *End);		const BasicBlock *End,
		bool CheckUniquePred = false);
		bmahjourUnsubmitted Done Reply Inline Actions [nit] rename UniquePred -> CheckUniquePred. bmahjour: [nit] rename UniquePred -> CheckUniquePred.

/// Return the outermost loop in the loop nest.		/// Return the outermost loop in the loop nest.
Loop &getOutermostLoop() const { return *Loops.front(); }		Loop &getOutermostLoop() const { return *Loops.front(); }

/// Return the innermost loop in the loop nest if the nest has only one		/// Return the innermost loop in the loop nest if the nest has only one
/// innermost loop, and a nullptr otherwise.		/// innermost loop, and a nullptr otherwise.
/// Note: the innermost loop returned is not necessarily perfectly nested.		/// Note: the innermost loop returned is not necessarily perfectly nested.
Loop *getInnermostLoop() const {		Loop *getInnermostLoop() const {
▲ Show 20 Lines • Show All 105 Lines • Show Last 20 Lines

llvm/lib/Analysis/LoopInfo.cpp

Show All 14 Lines

#include "llvm/Analysis/LoopInfo.h"		#include "llvm/Analysis/LoopInfo.h"
#include "llvm/ADT/DepthFirstIterator.h"		#include "llvm/ADT/DepthFirstIterator.h"
#include "llvm/ADT/ScopeExit.h"		#include "llvm/ADT/ScopeExit.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/Analysis/IVDescriptors.h"		#include "llvm/Analysis/IVDescriptors.h"
#include "llvm/Analysis/LoopInfoImpl.h"		#include "llvm/Analysis/LoopInfoImpl.h"
#include "llvm/Analysis/LoopIterator.h"		#include "llvm/Analysis/LoopIterator.h"
		#include "llvm/Analysis/LoopNestAnalysis.h"
#include "llvm/Analysis/MemorySSA.h"		#include "llvm/Analysis/MemorySSA.h"
#include "llvm/Analysis/MemorySSAUpdater.h"		#include "llvm/Analysis/MemorySSAUpdater.h"
#include "llvm/Analysis/ScalarEvolutionExpressions.h"		#include "llvm/Analysis/ScalarEvolutionExpressions.h"
#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/Config/llvm-config.h"		#include "llvm/Config/llvm-config.h"
#include "llvm/IR/CFG.h"		#include "llvm/IR/CFG.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/DebugLoc.h"		#include "llvm/IR/DebugLoc.h"
▲ Show 20 Lines • Show All 343 Lines • ▼ Show 20 Lines	BranchInst *Loop::getLoopGuardBranch() const {
if (!isRotatedForm())		if (!isRotatedForm())
return nullptr;		return nullptr;

// Disallow loops with more than one unique exit block, as we do not verify		// Disallow loops with more than one unique exit block, as we do not verify
// that GuardOtherSucc post dominates all exit blocks.		// that GuardOtherSucc post dominates all exit blocks.
BasicBlock *ExitFromLatch = getUniqueExitBlock();		BasicBlock *ExitFromLatch = getUniqueExitBlock();
if (!ExitFromLatch)		if (!ExitFromLatch)
return nullptr;		return nullptr;

BasicBlock *ExitFromLatchSucc = ExitFromLatch->getUniqueSuccessor();
if (!ExitFromLatchSucc)
return nullptr;

BasicBlock *GuardBB = Preheader->getUniquePredecessor();		BasicBlock *GuardBB = Preheader->getUniquePredecessor();
		WhitneyAuthorUnsubmitted Not Done Reply Inline Actions These 3 lines can be removed. Whitney: These 3 lines can be removed.
if (!GuardBB)		if (!GuardBB)
return nullptr;		return nullptr;

assert(GuardBB->getTerminator() && "Expecting valid guard terminator");		assert(GuardBB->getTerminator() && "Expecting valid guard terminator");

BranchInst *GuardBI = dyn_cast<BranchInst>(GuardBB->getTerminator());		BranchInst *GuardBI = dyn_cast<BranchInst>(GuardBB->getTerminator());
if (!GuardBI \|\| GuardBI->isUnconditional())		if (!GuardBI \|\| GuardBI->isUnconditional())
return nullptr;		return nullptr;

BasicBlock *GuardOtherSucc = (GuardBI->getSuccessor(0) == Preheader)		BasicBlock *GuardOtherSucc = (GuardBI->getSuccessor(0) == Preheader)
? GuardBI->getSuccessor(1)		? GuardBI->getSuccessor(1)
: GuardBI->getSuccessor(0);		: GuardBI->getSuccessor(0);
return (GuardOtherSucc == ExitFromLatchSucc) ? GuardBI : nullptr;
		// Check if ExitFromLatch (or any BasicBlock which is an empty unique
		bmahjourUnsubmitted Done Reply Inline Actions [nit] this comment needs to be reworded to be more clear. bmahjour: [nit] this comment needs to be reworded to be more clear.
		// successor of ExitFromLatch) is equal to GuardOtherSucc. If
		// skipEmptyBlockUntil returns GuardOtherSucc, then the guard branch for the
		// loop is GuardBI (return GuardBI), otherwise return nullptr.
		bmahjourUnsubmitted Done Reply Inline Actions AFAICS `skipEmptyBlockUntil` doesn't check that the unique successor blocks have unique predecessors. Don't we need to check for that too? bmahjour: AFAICS `skipEmptyBlockUntil` doesn't check that the unique successor blocks have unique…
		sidbavUnsubmitted Done Reply Inline Actions Yes, I intended on adding adding it, but I ran into LIT test issues so I did not put those changes in the patch.... Just took another look at it, and I realized I made a mistake in the initial implementation. It is working now. sidbav: Yes, I intended on adding adding it, but I ran into LIT test issues so I did not put those…
		WhitneyAuthorUnsubmitted Done Reply Inline Actions I thought about that too, but it seems to be questionable... if (cond) goto label; if (0 < N) { for (int i = 0; i < N; ++i) {...} label: } Should we consider `if (0 < N)` to be a loop guard? It actually guarded the loop, but it is not a single entry single exit region. If we decided to check for unique predecessor, it may make sense to do it in `skipEmptyBlockUntil`. Whitney: I thought about that too, but it seems to be questionable... ``` if (cond) goto label; if (0…
		bmahjourUnsubmitted Done Reply Inline Actions I think we should keep the control-flow cases that are considered "guard-like" fairly simple (otherwise transforms will be faced with too many canonical forms to have to deal with). I'd say we do not consider `if (0 < N)` in the example above as a loop guard, given that `label:` is also being guarded by that condition. There may be legitimate use cases for the current semantics of `skipEmptyBlockUntil`, so I think we should create another version of that function (or pass a flag to it) to additionally check for existence of unique predecessors. Then we could use that version of the function here to decide whether the control flow structure should be considered a guard. @sidbav could you please do that? bmahjour: I think we should keep the control-flow cases that are considered "guard-like" fairly simple…
		WhitneyAuthorUnsubmitted Done Reply Inline Actions @sidbav Can you please add a unit test in llvm/unittests/Analysis/LoopInfoTest.cpp for a test case like if (cond) goto label; if (0 < N) { for (int i = 0; i < N; ++i) {...} label: } where the branch `if (0 < N)` should be not a guard. Whitney: @sidbav Can you please add a unit test in llvm/unittests/Analysis/LoopInfoTest.cpp for a test…
		if (&LoopNest::skipEmptyBlockUntil(ExitFromLatch, GuardOtherSucc,
		/CheckUniquePred=/true) ==
		bmahjourUnsubmitted Done Reply Inline Actions [nit] rename UniquePred -> CheckUniquePred. bmahjour: [nit] rename UniquePred -> CheckUniquePred.
		GuardOtherSucc)
		return GuardBI;
		bmahjourUnsubmitted Done Reply Inline Actions apart from checking that the control flow can be simplified, we also need to check to make sure that the chain of BBs don't contain unsafe instructions (or at least check that they are empty). bmahjour: apart from checking that the control flow can be simplified, we also need to check to make sure…
		else
		return nullptr;
}		}

bool Loop::isCanonical(ScalarEvolution &SE) const {		bool Loop::isCanonical(ScalarEvolution &SE) const {
InductionDescriptor IndDesc;		InductionDescriptor IndDesc;
if (!getInductionDescriptor(SE, IndDesc))		if (!getInductionDescriptor(SE, IndDesc))
return false;		return false;

ConstantInt *Init = dyn_cast_or_null<ConstantInt>(IndDesc.getStartValue());		ConstantInt *Init = dyn_cast_or_null<ConstantInt>(IndDesc.getStartValue());
▲ Show 20 Lines • Show All 740 Lines • Show Last 20 Lines

llvm/lib/Analysis/LoopNestAnalysis.cpp

Show First 20 Lines • Show All 200 Lines • ▼ Show 20 Lines	while (SubLoops->size() == 1) {
SubLoops = &CurrentLoop->getSubLoops();		SubLoops = &CurrentLoop->getSubLoops();
++CurrentDepth;		++CurrentDepth;
}		}

return CurrentDepth;		return CurrentDepth;
}		}

const BasicBlock &LoopNest::skipEmptyBlockUntil(const BasicBlock *From,		const BasicBlock &LoopNest::skipEmptyBlockUntil(const BasicBlock *From,
const BasicBlock *End) {		const BasicBlock *End,
		bool CheckUniquePred) {
		bmahjourUnsubmitted Done Reply Inline Actions [nit] rename UniquePred -> CheckUniquePred. bmahjour: [nit] rename UniquePred -> CheckUniquePred.
assert(From && "Expecting valid From");		assert(From && "Expecting valid From");
assert(End && "Expecting valid End");		assert(End && "Expecting valid End");

if (From == End \|\| !From->getUniqueSuccessor())		if (From == End \|\| !From->getUniqueSuccessor())
return *From;		return *From;

auto IsEmpty = [](const BasicBlock *BB) {		auto IsEmpty = [](const BasicBlock *BB) {
return (BB->getInstList().size() == 1);		return (BB->getInstList().size() == 1);
};		};

// Visited is used to avoid running into an infinite loop.		// Visited is used to avoid running into an infinite loop.
SmallPtrSet<const BasicBlock *, 4> Visited;		SmallPtrSet<const BasicBlock *, 4> Visited;
const BasicBlock *BB = From->getUniqueSuccessor();		const BasicBlock *BB = From->getUniqueSuccessor();
const BasicBlock *PredBB = BB;		const BasicBlock *PredBB = From;
		WhitneyAuthorUnsubmitted Not Done Reply Inline Actions Should change this to `const BasicBlock PredBB = From;` Whitney:* Should change this to `const BasicBlock *PredBB = From;`
while (BB && BB != End && IsEmpty(BB) && !Visited.count(BB)) {		while (BB && BB != End && IsEmpty(BB) && !Visited.count(BB) &&
		(!CheckUniquePred \|\| BB->getUniquePredecessor())) {
Visited.insert(BB);		Visited.insert(BB);
PredBB = BB;		PredBB = BB;
BB = BB->getUniqueSuccessor();		BB = BB->getUniqueSuccessor();
}		}

return (BB == End) ? End : PredBB;		return (BB == End) ? End : PredBB;
}		}

▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	if (&SingleSucc != InnerLoopPreHeader) {
});		});
return false;		return false;
}		}
}		}
}		}

// Ensure the inner loop exit block lead to the outer loop latch possibly		// Ensure the inner loop exit block lead to the outer loop latch possibly
// through empty blocks.		// through empty blocks.
const BasicBlock &SuccInner =		if ((!ExtraPhiBlock \|\|
LoopNest::skipEmptyBlockUntil(InnerLoop.getExitBlock(), OuterLoopLatch);		&LoopNest::skipEmptyBlockUntil(InnerLoop.getExitBlock(),
if (&SuccInner != OuterLoopLatch && &SuccInner != ExtraPhiBlock) {		ExtraPhiBlock) != ExtraPhiBlock) &&
		(&LoopNest::skipEmptyBlockUntil(InnerLoop.getExitBlock(),
		OuterLoopLatch) != OuterLoopLatch)) {
DEBUG_WITH_TYPE(		DEBUG_WITH_TYPE(
VerboseDebug,		VerboseDebug,
dbgs() << "Inner loop exit block " << *InnerLoopExit		dbgs() << "Inner loop exit block " << *InnerLoopExit
<< " does not directly lead to the outer loop latch.\n";);		<< " does not directly lead to the outer loop latch.\n";);
return false;		return false;
}		}

return true;		return true;
Show All 32 Lines

llvm/test/Analysis/LoopNestAnalysis/imperfectnest.ll

Show First 20 Lines • Show All 418 Lines • ▼ Show 20 Lines	for.inc11:
br i1 %cmp, label %imperf_nest_5_loop_i, label %for.cond.for.end13_crit_edge		br i1 %cmp, label %imperf_nest_5_loop_i, label %for.cond.for.end13_crit_edge

for.cond.for.end13_crit_edge:		for.cond.for.end13_crit_edge:
br label %for.end13		br label %for.end13

for.end13:		for.end13:
ret void		ret void
}		}

; Test an imperfect loop nest of the form:
WhitneyAuthorUnsubmitted Done Reply Inline Actions This test case should be considered as imperfect, as the user branch is not the inner loop guard. Whitney: This test case should be considered as imperfect, as the user branch is not the inner loop…
bmahjourUnsubmitted Done Reply Inline Actions the user branch is behaving as a guard so we can allow it per the current definition of a loop guard. bmahjour: the user branch is behaving as a guard so we can allow it per the current definition of a loop…
; for (int i = 0; i < nx; ++i)
; if (i > 5) { // user branch
; for (int j = 1; j <= 5; j+=2)
; y[j][i] = x[i][j] + j;
; }

define void @imperf_nest_6(i32 %y, i32 %x, i32 signext %nx, i32 signext %ny) {
; CHECK-LABEL: IsPerfect=false, Depth=2, OutermostLoop: imperf_nest_6_loop_i, Loops: ( imperf_nest_6_loop_i imperf_nest_6_loop_j )
entry:
%cmp2 = icmp slt i32 0, %nx
br i1 %cmp2, label %imperf_nest_6_loop_i.lr.ph, label %for.end13

imperf_nest_6_loop_i.lr.ph:
br label %imperf_nest_6_loop_i

imperf_nest_6_loop_i:
%i.0 = phi i32 [ 0, %imperf_nest_6_loop_i.lr.ph ], [ %inc12, %for.inc11 ]
%cmp1 = icmp sgt i32 %i.0, 5
br i1 %cmp1, label %imperf_nest_6_loop_j.lr.ph, label %if.end

imperf_nest_6_loop_j.lr.ph:
br label %imperf_nest_6_loop_j

imperf_nest_6_loop_j:
%j.0 = phi i32 [ 1, %imperf_nest_6_loop_j.lr.ph ], [ %inc, %for.inc ]
%idxprom = sext i32 %i.0 to i64
%arrayidx = getelementptr inbounds i32, i32* %x, i64 %idxprom
%0 = load i32, i32* %arrayidx, align 8
%idxprom5 = sext i32 %j.0 to i64
%arrayidx6 = getelementptr inbounds i32, i32* %0, i64 %idxprom5
%1 = load i32, i32* %arrayidx6, align 4
%add = add nsw i32 %1, %j.0
%idxprom7 = sext i32 %j.0 to i64
%arrayidx8 = getelementptr inbounds i32, i32* %y, i64 %idxprom7
%2 = load i32, i32* %arrayidx8, align 8
%idxprom9 = sext i32 %i.0 to i64
%arrayidx10 = getelementptr inbounds i32, i32* %2, i64 %idxprom9
store i32 %add, i32* %arrayidx10, align 4
br label %for.inc

for.inc:
%inc = add nsw i32 %j.0, 2
%cmp3 = icmp sle i32 %inc, 5
br i1 %cmp3, label %imperf_nest_6_loop_j, label %for.cond2.for.end_crit_edge

for.cond2.for.end_crit_edge:
br label %for.end

for.end:
br label %if.end

if.end:
br label %for.inc11

for.inc11:
%inc12 = add nsw i32 %i.0, 1
%cmp = icmp slt i32 %inc12, %nx
br i1 %cmp, label %imperf_nest_6_loop_i, label %for.cond.for.end13_crit_edge

for.cond.for.end13_crit_edge:
br label %for.end13

for.end13:
ret void
}

llvm/test/Analysis/LoopNestAnalysis/perfectnest.ll

	Show First 20 Lines • Show All 316 Lines • ▼ Show 20 Lines
	for.cond.for.end7_crit_edge:			for.cond.for.end7_crit_edge:
	%split7 = phi i32 [ %x.addr.1.lcssa, %for.inc5 ]			%split7 = phi i32 [ %x.addr.1.lcssa, %for.inc5 ]
	br label %for.end7			br label %for.end7

	for.end7:			for.end7:
	%x.addr.0.lcssa = phi i32 [ %split7, %for.cond.for.end7_crit_edge ], [ %x, %entry ]			%x.addr.0.lcssa = phi i32 [ %split7, %for.cond.for.end7_crit_edge ], [ %x, %entry ]
	ret i32 %x.addr.0.lcssa			ret i32 %x.addr.0.lcssa
	}			}

				; Test a perfect loop nest of the form:
				; for (int i = 0; i < nx; ++i)
				; if (i < ny) { // guard branch for the j-loop
				; for (int j=i; j < ny; j+=1)
				; y[j][i] = x[i][j] + j;
				; }
				define double @perf_nest_guard_branch(i32 %y, i32 %x, i32 signext %nx, i32 signext %ny) {
				; CHECK-LABEL: IsPerfect=true, Depth=1, OutermostLoop: test6Loop2, Loops: ( test6Loop2 )
				; CHECK-LABEL: IsPerfect=true, Depth=2, OutermostLoop: test6Loop1, Loops: ( test6Loop1 test6Loop2 )
				entry:
				%cmp2 = icmp slt i32 0, %nx
				br i1 %cmp2, label %test6Loop1.lr.ph, label %for.end13

				test6Loop1.lr.ph: ; preds = %entry
				br label %test6Loop1

				test6Loop1: ; preds = %test6Loop1.lr.ph, %for.inc11
				%i.0 = phi i32 [ 0, %test6Loop1.lr.ph ], [ %inc12, %for.inc11 ]
				%cmp1 = icmp slt i32 %i.0, %ny
				br i1 %cmp1, label %test6Loop2.lr.ph, label %if.end

				test6Loop2.lr.ph: ; preds = %if.then
				br label %test6Loop2

				test6Loop2: ; preds = %test6Loop2.lr.ph, %for.inc
				%j.0 = phi i32 [ %i.0, %test6Loop2.lr.ph ], [ %inc, %for.inc ]
				%idxprom = sext i32 %i.0 to i64
				%arrayidx = getelementptr inbounds i32, i32* %x, i64 %idxprom
				%0 = load i32, i32* %arrayidx, align 8
				%idxprom5 = sext i32 %j.0 to i64
				%arrayidx6 = getelementptr inbounds i32, i32* %0, i64 %idxprom5
				%1 = load i32, i32* %arrayidx6, align 4
				%add = add nsw i32 %1, %j.0
				%idxprom7 = sext i32 %j.0 to i64
				%arrayidx8 = getelementptr inbounds i32, i32* %y, i64 %idxprom7
				%2 = load i32, i32* %arrayidx8, align 8
				%idxprom9 = sext i32 %i.0 to i64
				%arrayidx10 = getelementptr inbounds i32, i32* %2, i64 %idxprom9
				store i32 %add, i32* %arrayidx10, align 4
				br label %for.inc

				for.inc: ; preds = %test6Loop2
				%inc = add nsw i32 %j.0, 1
				%cmp3 = icmp slt i32 %inc, %ny
				br i1 %cmp3, label %test6Loop2, label %for.cond2.for.end_crit_edge

				for.cond2.for.end_crit_edge: ; preds = %for.inc
				br label %for.end

				for.end: ; preds = %for.cond2.for.end_crit_edge, %if.then
				br label %if.end

				if.end: ; preds = %for.end, %test6Loop1
				br label %for.inc11

				for.inc11: ; preds = %if.end
				%inc12 = add nsw i32 %i.0, 1
				%cmp = icmp slt i32 %inc12, %nx
				br i1 %cmp, label %test6Loop1, label %for.cond.for.end13_crit_edge

				for.cond.for.end13_crit_edge: ; preds = %for.inc11
				br label %for.end13

				for.end13: ; preds = %for.cond.for.end13_crit_edge, %entry
				%arrayidx14 = getelementptr inbounds i32, i32* %y, i64 0
				%3 = load i32, i32* %arrayidx14, align 8
				%arrayidx15 = getelementptr inbounds i32, i32* %3, i64 0
				%4 = load i32, i32* %arrayidx15, align 4
				%conv = sitofp i32 %4 to double
				ret double %conv
				}

				; Test a perfect loop nest of the form:
				; for (int i = 0; i < nx; ++i)
				; if (i < ny) { // guard branch for the j-loop
				; for (int j=i; j < ny; j+=1)
				; y[j][i] = x[i][j] + j;
				; }

				define double @test6(i32 %y, i32 %x, i32 signext %nx, i32 signext %ny) {
				; CHECK-LABEL: IsPerfect=true, Depth=1, OutermostLoop: test6Loop2, Loops: ( test6Loop2 )
				; CHECK-LABEL: IsPerfect=true, Depth=2, OutermostLoop: test6Loop1, Loops: ( test6Loop1 test6Loop2 )
				entry:
				%cmp2 = icmp slt i32 0, %nx
				br i1 %cmp2, label %test6Loop1.lr.ph, label %for.end13

				test6Loop1.lr.ph: ; preds = %entry
				br label %test6Loop1

				test6Loop1: ; preds = %test6Loop1.lr.ph, %for.inc11
				%i.0 = phi i32 [ 0, %test6Loop1.lr.ph ], [ %inc12, %for.inc11 ]
				%cmp1 = icmp slt i32 %i.0, %ny
				br i1 %cmp1, label %test6Loop2.lr.ph, label %if.end

				test6Loop2.lr.ph: ; preds = %if.then
				br label %test6Loop2

				test6Loop2: ; preds = %test6Loop2.lr.ph, %for.inc
				%j.0 = phi i32 [ %i.0, %test6Loop2.lr.ph ], [ %inc, %for.inc ]
				%idxprom = sext i32 %i.0 to i64
				%arrayidx = getelementptr inbounds i32, i32* %x, i64 %idxprom
				%0 = load i32, i32* %arrayidx, align 8
				%idxprom5 = sext i32 %j.0 to i64
				%arrayidx6 = getelementptr inbounds i32, i32* %0, i64 %idxprom5
				%1 = load i32, i32* %arrayidx6, align 4
				%add = add nsw i32 %1, %j.0
				%idxprom7 = sext i32 %j.0 to i64
				%arrayidx8 = getelementptr inbounds i32, i32* %y, i64 %idxprom7
				%2 = load i32, i32* %arrayidx8, align 8
				%idxprom9 = sext i32 %i.0 to i64
				%arrayidx10 = getelementptr inbounds i32, i32* %2, i64 %idxprom9
				store i32 %add, i32* %arrayidx10, align 4
				br label %for.inc

				for.inc: ; preds = %test6Loop2
				%inc = add nsw i32 %j.0, 1
				%cmp3 = icmp slt i32 %inc, %ny
				br i1 %cmp3, label %test6Loop2, label %for.cond2.for.end_crit_edge

				for.cond2.for.end_crit_edge: ; preds = %for.inc
				br label %for.end

				for.end: ; preds = %for.cond2.for.end_crit_edge, %if.then
				br label %if.end

				if.end: ; preds = %for.end, %test6Loop1
				br label %for.inc11

				for.inc11: ; preds = %if.end
				%inc12 = add nsw i32 %i.0, 1
				%cmp = icmp slt i32 %inc12, %nx
				br i1 %cmp, label %test6Loop1, label %for.cond.for.end13_crit_edge

				for.cond.for.end13_crit_edge: ; preds = %for.inc11
				br label %for.end13

				for.end13: ; preds = %for.cond.for.end13_crit_edge, %entry
				%arrayidx14 = getelementptr inbounds i32, i32* %y, i64 0
				%3 = load i32, i32* %arrayidx14, align 8
				%arrayidx15 = getelementptr inbounds i32, i32* %3, i64 0
				%4 = load i32, i32* %arrayidx15, align 4
				%conv = sitofp i32 %4 to double
				ret double %conv
				}

llvm/unittests/Analysis/LoopInfoTest.cpp

Show First 20 Lines • Show All 1,494 Lines • ▼ Show 20 Lines	runWithLoopInfo(*M, "foo", [&](Function &F, LoopInfo &LI) {

// This loop is in simplified form.		// This loop is in simplified form.
EXPECT_TRUE(L->isLoopSimplifyForm());		EXPECT_TRUE(L->isLoopSimplifyForm());

// This loop is not rotated.		// This loop is not rotated.
EXPECT_FALSE(L->isRotatedForm());		EXPECT_FALSE(L->isRotatedForm());
});		});
}		}

		TEST(LoopInfoTest, LoopUserBranch) {
		const char *ModuleStr =
		"target datalayout = \"e-m:o-i64:64-f80:128-n8:16:32:64-S128\"\n"
		"define void @foo(i32* %B, i64 signext %nx, i1 %cond) {\n"
		"entry:\n"
		" br i1 %cond, label %bb, label %guard\n"
		"guard:\n"
		" %cmp.guard = icmp slt i64 0, %nx\n"
		" br i1 %cmp.guard, label %for.i.preheader, label %for.end\n"
		"for.i.preheader:\n"
		" br label %for.i\n"
		"for.i:\n"
		" %i = phi i64 [ 0, %for.i.preheader ], [ %inc13, %for.i ]\n"
		" %Bi = getelementptr inbounds i32, i32* %B, i64 %i\n"
		" store i32 0, i32* %Bi, align 4\n"
		" %inc13 = add nsw i64 %i, 1\n"
		" %cmp = icmp slt i64 %inc13, %nx\n"
		" br i1 %cmp, label %for.i, label %for.i.exit\n"
		"for.i.exit:\n"
		" br label %bb\n"
		"bb:\n"
		" br label %for.end\n"
		"for.end:\n"
		" ret void\n"
		"}\n";

		// Parse the module.
		LLVMContext Context;
		std::unique_ptr<Module> M = makeLLVMModule(Context, ModuleStr);

		runWithLoopInfo(*M, "foo", [&](Function &F, LoopInfo &LI) {
		Function::iterator FI = F.begin();
		FI = ++FI;
		BasicBlock Guard = &FI;
		assert(Guard->getName() == "guard");

		FI = ++FI;
		BasicBlock Header = &(++FI);
		assert(Header->getName() == "for.i");

		Loop *L = LI.getLoopFor(Header);
		EXPECT_NE(L, nullptr);

		// L should not have a guard branch
		EXPECT_EQ(L->getLoopGuardBranch(), nullptr);
		});
		}

This is an archive of the discontinued LLVM Phabricator instance.

[LoopNest] Consider loop nest with inner loop guard using outer loop induction variable to be perfect ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 343697

llvm/include/llvm/Analysis/LoopNestAnalysis.h

llvm/lib/Analysis/LoopInfo.cpp

llvm/lib/Analysis/LoopNestAnalysis.cpp

llvm/test/Analysis/LoopNestAnalysis/imperfectnest.ll

llvm/test/Analysis/LoopNestAnalysis/perfectnest.ll

llvm/unittests/Analysis/LoopInfoTest.cpp

[LoopNest] Consider loop nest with inner loop guard using outer loop induction variable to be perfect
ClosedPublic