This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/llvm/
-
llvm/
-
InitializePasses.h
-
LinkAllPasses.h
-
Transforms/
-
Scalar.h
-
lib/Transforms/Scalar/
-
Transforms/
-
Scalar/
-
CMakeLists.txt
19
LoopSimplifyCFG.cpp
-
Scalar.cpp
-
test/Transforms/LoopSimplifyCFG/
-
Transforms/
-
LoopSimplifyCFG/
3
merge-header.ll

Differential D16382

Add LoopSimplifyCFG pass
ClosedPublic

Authored by escha on Jan 20 2016, 4:55 PM.

Download Raw Diff

Details

Reviewers

chandlerc
mzolotukhin
escha
resistor
hfinkel

Summary

super short version: this is a loop pass that does trivial CFG simplification on a loop, as requested by Chandler as the solution to the real problem below. it isn't used in the pass manager yet. right now it only merges consecutive blocks; it doesn't do anything fancier, but could in the future.

Details:

This IR has a perfectly reasonable nested loop that rotate -> unroll does not actually unroll all the way:

define i32 @foo(i32* %P, i64 *%Q) {
entry:
  br label %outer

outer:
  %y.2 = phi i32 [ 0, %entry ], [ %y.inc2, %outer.latch2 ]
  br label %inner

inner:
  %x.2 = phi i32 [ 0, %outer ], [ %inc2, %inner ]
  %inc2 = add nsw i32 %x.2, 1
  %exitcond2 = icmp eq i32 %inc2, 3
  store i32 %x.2, i32* %P
  br i1 %exitcond2, label %outer.latch, label %inner

outer.latch:
  %y.inc2 = add nsw i32 %y.2, 1
  %exitcond.outer = icmp eq i32 %y.inc2, 3
  store i32 %y.2, i32* %P
  br i1 %exitcond.outer, label %exit, label %outer.latch2

outer.latch2:
  %t = sext i32 %y.inc2 to i64 
  store i64 %t, i64* %Q
  br label %outer

exit:
  ret i32 0
}

This is because after unrolling the inner loop, the outer loop has two header blocks, which while valid and canonical in terms of LCSSA, is not what loop rotate understands. The hack solution is to run rotate -> unroll -> simplifycfg-> rotate -> unroll, which is bad. The slightly less hack is to put this simplification into LoopSimplify, which Chandler argues is a bad idea because LoopSimplify specifically simplifies in ways that maintain the canonical form, and nothing else (and we may want to run LoopSimplifyCFG in other places for other reasons). Chandler suggests that the most general solution is just to add a much-needed LoopSimplifyCFG, which I did.

The problem with using this right now is that in practice, you need a pipeline that looks like this to make use of it:

LoopPassManager:

Loop SimplifyCFG
Loop Rotate
Loop Unroll

And currently the PassManagerBuilder causes the LPMs to be split up due to analyses that are required being inserted in between (which chandler is working on). However, with a shim to require the associated analyses, this does work in practice in our pipeline out of tree, and a test just for this pass is included.

This is important to us because we have critical benchmark code that takes a form similar to this and similarly fails to unroll, resulting in catastrophic performance regressions.

Diff Detail

Repository: rL LLVM

Event Timeline

escha updated this revision to Diff 45465.Jan 20 2016, 4:55 PM

escha retitled this revision from to Add LoopSimplifyCFG pass.

escha updated this object.

escha added reviewers: chandlerc, resistor, mzolotukhin, hfinkel.

escha set the repository for this revision to rL LLVM.

escha added a subscriber: llvm-commits.

Herald added a subscriber: sanjoy. · View Herald TranscriptJan 20 2016, 4:55 PM

escha updated this object.Jan 20 2016, 4:55 PM

escha updated this object.Jan 20 2016, 4:58 PM

Hi,

First of all, thanks for working on this! One question on the test below.

Michael

test/Transforms/LoopSimplifyCFG/merge-header.ll
5–6	Which basic blocks do we merge here? From the check-lines it looks like we merge `%entry` with `%outer`, which doesn't sound right, since we want to merge only blocks inside loops. Am I misreading something here?

Minor drop by comments inline.

lib/Transforms/Scalar/LoopSimplifyCFG.cpp
38	Minor: convention is to not indent namespaces: http://llvm.org/docs/CodingStandards.html#namespace-indentation Also, why not make this a struct?
91	Minor: coding style is to name this `Changed`.
106	Won't this invalidate `E`?

escha added inline comments.Jan 20 2016, 9:13 PM

lib/Transforms/Scalar/LoopSimplifyCFG.cpp
38	I was just copying LoopDeletion to make this pass; has the style changed since then?
91	Okay, will change this on the next update.
106	I don't think so? This only modifies things earlier in the list, not later. Should I recalculate E on each iteration just to be sure?
test/Transforms/LoopSimplifyCFG/merge-header.ll
5–6	We want to merge Entry and Outer here, yes. Entry is the loop header, Outer + Inner are the body, Outer.Latch2 is the latch, and Exit is the loop exit. Entry + Outer can be merged in this case.

majnemer added a subscriber: majnemer.Jan 20 2016, 9:40 PM

majnemer added inline comments.

lib/Transforms/Scalar/LoopSimplifyCFG.cpp
106	`Loop::block_end` accesses the end iterator of a vector. `LoopInfo::removeBlock` calls `Loop::removeBlockFromLoop` which will `erase` an element from the vector. `std::vector::erase` invalidates all iterators at or beyond the point of the erase including the vector's `end` iterator.

sanjoy added inline comments.Jan 21 2016, 8:14 AM

lib/Transforms/Scalar/LoopSimplifyCFG.cpp
38	My guess would be that that indentation rule was added after `LoopDeletion` was added to LLVM. In any case, it is in the coding standard now, so we should just follow that. :) If you have spare time, I think fixing the indentation in LoopDeletion in a separate change will also be a appropriate thing to do.
106	Now that I think of it, is the `++I` safe? Won't `I` be invalidated by the `erase`?

chandlerc added inline comments.Jan 21 2016, 8:53 AM

lib/Transforms/Scalar/LoopSimplifyCFG.cpp
38	Feel free to just throw some clang-format on this code since it is "new", it'll do the right thing for you. =]
106	My suggestion would be to copy the loop blocks into a local smallvector, and then just loop over that so you don't have to worry about this and can freely update LoopInfo as you go.

Updated diff with suggested changes.

mzolotukhin added inline comments.Jan 21 2016, 12:32 PM

lib/Transforms/Scalar/LoopSimplifyCFG.cpp
89	Nitpick: s/changed/Changed/
test/Transforms/LoopSimplifyCFG/merge-header.ll
6–7	Hmm, the header here is `%outer`, `%entry` is a preheader: $ opt -analyze -loops < merge-header.ll Printing analysis 'Natural Loop Information' for function 'foo': Loop at depth 1 containing: %outer<header>,%inner<exiting>,%outer.latch2<latch> That said, I think the test is correct in current form (I misread it first time), but I think we might want to make it more explicit. Like, we can check something like this: CHECK: entry: CHECK-NEXT: br label %[[LOOP:%.]] CHECK: [[LOOP]: CHECK-NEXT: phi CHECK-NOT: br label CHECK: br i1 This way we make sure that the blocks are actually merged.

sanjoy added inline comments.Jan 21 2016, 12:34 PM

lib/Transforms/Scalar/LoopSimplifyCFG.cpp
110	This is going to `free` `Pred`, right? Is it possible that `Pred` is after `Succ` in `Blocks` and you'll end up trying to examine a `free`'ed block later?

I also like the idea.

I guess one more difference between this pass and LoopSimplify that LoopSimplify is a function pass whereas this one is a loop pass. So you do get the nice inner-to-outer propagation you're looking for.

escha added inline comments.Jan 21 2016, 3:40 PM

lib/Transforms/Scalar/LoopSimplifyCFG.cpp
110	Ugh, I guess this is possible. How do we avoid this given iterator invalidation issues?

sanjoy added inline comments.Jan 21 2016, 3:45 PM

lib/Transforms/Scalar/LoopSimplifyCFG.cpp
110	I think things would Just Work(TM) if you have `SmallVector<WeakVH, 16> Blocks`, and check if an entry has been null'ed out before you do anything with it.

zzheng added a subscriber: zzheng.Jan 21 2016, 4:05 PM

Updated per comments.

A meta concern here: how do we share code between this pass and the existing SimplifyCFG pass? It would be really unfortunate if we ended up duplicating large chunks of SimplifyCFG here. The simple unconditional edge folding case here is fine as a starting point and the code duplication isn't too terrible, but how do we avoid problems going forward?

The long-term plan is that if large chunks of SimplifyCFG are moved here for any reason, they'll be factored out into utility functions (to avoid duplicating things).

ping?

I have a couple of comments inline, after which this looks good to me.

lib/Transforms/Scalar/LoopSimplifyCFG.cpp
70	Why do you need to depend on `TargetTransformInfoWrapperPass` and `AssumptionCacheTracker`?
97	I think you're copying `ValueHandle` s here (which will call `ValueHandleBase::AddToExistingUseList`). Can you change this to `const auto &` or `auto &` ?
100	You should be able to use `cast_or_null` here.

escha added inline comments.Jan 29 2016, 2:13 PM

lib/Transforms/Scalar/LoopSimplifyCFG.cpp
70	I guess I don't fully understand this section. Are these "dependencies" things we need to run, or are these "dependencies" things we can pass on to the next pass (preserved), or...? I notice a lot of passes seem to have things here that aren't listed as Required. How does this section relate to the Required/Preserves bit?

Nitpicks resolved and approved by Justin offline (he explained the pass dependencies bit to me as well).

escha accepted this revision.Jan 29 2016, 2:40 PM

escha added a reviewer: escha.

This revision is now accepted and ready to land.Jan 29 2016, 2:40 PM

escha closed this revision.Jan 29 2016, 2:41 PM

Revision Contents

Path

Size

include/

llvm/

InitializePasses.h

1 line

LinkAllPasses.h

1 line

Transforms/

Scalar.h

7 lines

lib/

Transforms/

Scalar/

CMakeLists.txt

1 line

LoopSimplifyCFG.cpp

117 lines

Scalar.cpp

5 lines

test/

Transforms/

LoopSimplifyCFG/

merge-header.ll

34 lines

Diff 45715

include/llvm/InitializePasses.h

Context not available.
	void initializeLoopInstSimplifyPass(PassRegistry&);	void initializeLoopInstSimplifyPass(PassRegistry&);
	void initializeLoopRotatePass(PassRegistry&);	void initializeLoopRotatePass(PassRegistry&);
	void initializeLoopSimplifyPass(PassRegistry&);	void initializeLoopSimplifyPass(PassRegistry&);
		void initializeLoopSimplifyCFGPass(PassRegistry&);
	void initializeLoopStrengthReducePass(PassRegistry&);	void initializeLoopStrengthReducePass(PassRegistry&);
	void initializeGlobalMergePass(PassRegistry&);	void initializeGlobalMergePass(PassRegistry&);
	void initializeLoopRerollPass(PassRegistry&);	void initializeLoopRerollPass(PassRegistry&);
Context not available.

include/llvm/LinkAllPasses.h

Context not available.
	(void) llvm::createLoopExtractorPass();	(void) llvm::createLoopExtractorPass();
	(void) llvm::createLoopInterchangePass();	(void) llvm::createLoopInterchangePass();
	(void) llvm::createLoopSimplifyPass();	(void) llvm::createLoopSimplifyPass();
		(void) llvm::createLoopSimplifyCFGPass();
	(void) llvm::createLoopStrengthReducePass();	(void) llvm::createLoopStrengthReducePass();
	(void) llvm::createLoopRerollPass();	(void) llvm::createLoopRerollPass();
	(void) llvm::createLoopUnrollPass();	(void) llvm::createLoopUnrollPass();
Context not available.

include/llvm/Transforms/Scalar.h

Context not available.
	//	//
	FunctionPass *createLoopLoadEliminationPass();	FunctionPass *createLoopLoadEliminationPass();

		//===----------------------------------------------------------------------===//
		//
		// LoopSimplifyCFG - This pass performs basic CFG simplification on loops,
		// primarily to help other loop passes.
		//
		Pass *createLoopSimplifyCFGPass();

	} // End llvm namespace	} // End llvm namespace

	#endif	#endif
Context not available.

lib/Transforms/Scalar/CMakeLists.txt

Context not available.
	LoopLoadElimination.cpp	LoopLoadElimination.cpp
	LoopRerollPass.cpp	LoopRerollPass.cpp
	LoopRotation.cpp	LoopRotation.cpp
		LoopSimplifyCFG.cpp
	LoopStrengthReduce.cpp	LoopStrengthReduce.cpp
	LoopUnrollPass.cpp	LoopUnrollPass.cpp
	LoopUnswitch.cpp	LoopUnswitch.cpp
Context not available.

lib/Transforms/Scalar/LoopSimplifyCFG.cpp

				//===--------- LoopSimplifyCFG.cpp - Loop CFG Simplification Pass ---------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// This file implements the Loop SimplifyCFG Pass. This pass is responsible for
				// basic loop CFG cleanup, primarily to assist other loop passes. If you
				// encounter a noncanonical CFG construct that causes another loop pass to
				// perform suboptimally, this is the place to fix it up.
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/Transforms/Scalar.h"
				#include "llvm/ADT/SmallVector.h"
				#include "llvm/ADT/Statistic.h"
				#include "llvm/Analysis/AliasAnalysis.h"
				#include "llvm/Analysis/BasicAliasAnalysis.h"
				#include "llvm/Analysis/AssumptionCache.h"
				#include "llvm/Analysis/DependenceAnalysis.h"
				#include "llvm/Analysis/GlobalsModRef.h"
				#include "llvm/Analysis/LoopInfo.h"
				#include "llvm/Analysis/LoopPass.h"
				#include "llvm/Analysis/ScalarEvolution.h"
				#include "llvm/Analysis/ScalarEvolutionAliasAnalysis.h"
				#include "llvm/Analysis/TargetTransformInfo.h"
				#include "llvm/IR/Dominators.h"
				#include "llvm/Transforms/Utils/Local.h"
				using namespace llvm;

				#define DEBUG_TYPE "loop-simplifycfg"

				namespace {
				class LoopSimplifyCFG : public LoopPass {
				public:
				sanjoyUnsubmitted Not Done Reply Inline Actions Minor: convention is to not indent namespaces: http://llvm.org/docs/CodingStandards.html#namespace-indentation Also, why not make this a struct? sanjoy: Minor: convention is to not indent namespaces: http://llvm.org/docs/CodingStandards.
				eschaAuthorUnsubmitted Not Done Reply Inline Actions I was just copying LoopDeletion to make this pass; has the style changed since then? escha: I was just copying LoopDeletion to make this pass; has the style changed since then?
				sanjoyUnsubmitted Not Done Reply Inline Actions My guess would be that that indentation rule was added after `LoopDeletion` was added to LLVM. In any case, it is in the coding standard now, so we should just follow that. :) If you have spare time, I think fixing the indentation in LoopDeletion in a separate change will also be a appropriate thing to do. sanjoy: My guess would be that that indentation rule was added after `LoopDeletion` was added to LLVM.
				chandlercUnsubmitted Not Done Reply Inline Actions Feel free to just throw some clang-format on this code since it is "new", it'll do the right thing for you. =] chandlerc: Feel free to just throw some clang-format on this code since it is "new", it'll do the right…
				static char ID; // Pass ID, replacement for typeid
				LoopSimplifyCFG() : LoopPass(ID) {
				initializeLoopSimplifyCFGPass(*PassRegistry::getPassRegistry());
				}

				bool runOnLoop(Loop *L, LPPassManager &) override;

				void getAnalysisUsage(AnalysisUsage &AU) const override {
				AU.addRequired<DominatorTreeWrapperPass>();
				AU.addRequired<LoopInfoWrapperPass>();

				AU.addPreserved<DominatorTreeWrapperPass>();
				AU.addPreserved<LoopInfoWrapperPass>();
				AU.addPreserved<GlobalsAAWrapperPass>();
				AU.addPreserved<BasicAAWrapperPass>();
				AU.addPreserved<AAResultsWrapperPass>();
				AU.addPreserved<ScalarEvolutionWrapperPass>();
				AU.addPreserved<SCEVAAWrapperPass>();
				AU.addPreserved<DependenceAnalysis>();
				AU.addPreservedID(LoopSimplifyID);
				AU.addPreservedID(LCSSAID);
				}
				};
				}

				char LoopSimplifyCFG::ID = 0;
				INITIALIZE_PASS_BEGIN(LoopSimplifyCFG, "loop-simplifycfg", "Simplify loop CFG",
				false, false)
				INITIALIZE_PASS_DEPENDENCY(DominatorTreeWrapperPass)
				INITIALIZE_PASS_DEPENDENCY(LoopInfoWrapperPass)
				INITIALIZE_PASS_DEPENDENCY(ScalarEvolutionWrapperPass)
				INITIALIZE_PASS_DEPENDENCY(TargetTransformInfoWrapperPass)
				sanjoyUnsubmitted Not Done Reply Inline Actions Why do you need to depend on `TargetTransformInfoWrapperPass` and `AssumptionCacheTracker`? sanjoy: Why do you need to depend on `TargetTransformInfoWrapperPass` and `AssumptionCacheTracker`?
				eschaAuthorUnsubmitted Not Done Reply Inline Actions I guess I don't fully understand this section. Are these "dependencies" things we need to run, or are these "dependencies" things we can pass on to the next pass (preserved), or...? I notice a lot of passes seem to have things here that aren't listed as Required. How does this section relate to the Required/Preserves bit? escha: I guess I don't fully understand this section. Are these "dependencies" things we need to run…
				INITIALIZE_PASS_DEPENDENCY(AssumptionCacheTracker)
				INITIALIZE_PASS_DEPENDENCY(LoopSimplify)
				INITIALIZE_PASS_DEPENDENCY(LCSSA)
				INITIALIZE_PASS_DEPENDENCY(SCEVAAWrapperPass)
				INITIALIZE_PASS_DEPENDENCY(BasicAAWrapperPass)
				INITIALIZE_PASS_DEPENDENCY(GlobalsAAWrapperPass)
				INITIALIZE_PASS_END(LoopSimplifyCFG, "loop-simplifycfg", "Simplify loop CFG",
				false, false)

				Pass *llvm::createLoopSimplifyCFGPass() { return new LoopSimplifyCFG(); }

				/// runOnLoop - Perform basic CFG simplifications to assist other loop passes.
				/// For now, this only attempts to merge blocks in the trivial case.
				bool LoopSimplifyCFG::runOnLoop(Loop *L, LPPassManager &) {
				if (skipOptnoneFunction(L))
				return false;

				bool Changed = false;
				DominatorTree *DT = &getAnalysis<DominatorTreeWrapperPass>().getDomTree();
				mzolotukhinUnsubmitted Not Done Reply Inline Actions Nitpick: s/changed/Changed/ mzolotukhin: Nitpick: s/changed/Changed/
				LoopInfo *LI = &getAnalysis<LoopInfoWrapperPass>().getLoopInfo();

				sanjoyUnsubmitted Not Done Reply Inline Actions Minor: coding style is to name this `Changed`. sanjoy: Minor: coding style is to name this `Changed`.
				eschaAuthorUnsubmitted Not Done Reply Inline Actions Okay, will change this on the next update. escha: Okay, will change this on the next update.
				// Copy blocks into a temporary array to avoid iterator invalidation issues
				// as we remove them.
				SmallVector<WeakVH, 16> Blocks;
				Blocks.append(L->block_begin(), L->block_end());

				for (auto Block : Blocks) {
				sanjoyUnsubmitted Not Done Reply Inline Actions I think you're copying `ValueHandle` s here (which will call `ValueHandleBase::AddToExistingUseList`). Can you change this to `const auto &` or `auto &` ? sanjoy: I think you're copying `ValueHandle` s here (which will call `ValueHandleBase…
				// Attempt to merge blocks in the trivial case. Don't modify blocks which
				// belong to other loops.
				BasicBlock *Succ = dyn_cast_or_null<BasicBlock>(Block);
				sanjoyUnsubmitted Not Done Reply Inline Actions You should be able to use `cast_or_null` here. sanjoy: You should be able to use `cast_or_null` here.
				if (!Succ)
				continue;

				BasicBlock *Pred = Succ->getSinglePredecessor();
				if (!Pred \|\| !Pred->getSingleSuccessor() \|\| LI->getLoopFor(Pred) != L)
				continue;
				sanjoyUnsubmitted Not Done Reply Inline Actions Won't this invalidate `E`? sanjoy: Won't this invalidate `E`?
				eschaAuthorUnsubmitted Not Done Reply Inline Actions I don't think so? This only modifies things earlier in the list, not later. Should I recalculate E on each iteration just to be sure? escha: I don't think so? This only modifies things earlier in the list, not later. Should I…
				majnemerUnsubmitted Not Done Reply Inline Actions `Loop::block_end` accesses the end iterator of a vector. `LoopInfo::removeBlock` calls `Loop::removeBlockFromLoop` which will `erase` an element from the vector. `std::vector::erase` invalidates all iterators at or beyond the point of the erase including the vector's `end` iterator. majnemer: `Loop::block_end` accesses the end iterator of a vector. `LoopInfo::removeBlock` calls `Loop…
				sanjoyUnsubmitted Not Done Reply Inline Actions Now that I think of it, is the `++I` safe? Won't `I` be invalidated by the `erase`? sanjoy: Now that I think of it, is the `++I` safe? Won't `I` be invalidated by the `erase`?
				chandlercUnsubmitted Not Done Reply Inline Actions My suggestion would be to copy the loop blocks into a local smallvector, and then just loop over that so you don't have to worry about this and can freely update LoopInfo as you go. chandlerc: My suggestion would be to copy the loop blocks into a local smallvector, and then just loop…

				// Pred is going to disappear, so we need to update the loop info.
				if (L->getHeader() == Pred)
				L->moveToHeader(Succ);
				sanjoyUnsubmitted Not Done Reply Inline Actions This is going to `free` `Pred`, right? Is it possible that `Pred` is after `Succ` in `Blocks` and you'll end up trying to examine a `free`'ed block later? sanjoy: This is going to `free` `Pred`, right? Is it possible that `Pred` is after `Succ` in `Blocks`…
				eschaAuthorUnsubmitted Not Done Reply Inline Actions Ugh, I guess this is possible. How do we avoid this given iterator invalidation issues? escha: Ugh, I guess this is possible. How do we avoid this given iterator invalidation issues?
				sanjoyUnsubmitted Not Done Reply Inline Actions I think things would Just Work(TM) if you have `SmallVector<WeakVH, 16> Blocks`, and check if an entry has been null'ed out before you do anything with it. sanjoy: I think things would Just Work(TM) if you have `SmallVector<WeakVH, 16> Blocks`, and check if…
				LI->removeBlock(Pred);
				MergeBasicBlockIntoOnlyPred(Succ, DT);
				Changed = true;
				}

				return Changed;
				}

lib/Transforms/Scalar/Scalar.cpp

Context not available.
	initializeFloat2IntPass(Registry);	initializeFloat2IntPass(Registry);
	initializeLoopDistributePass(Registry);	initializeLoopDistributePass(Registry);
	initializeLoopLoadEliminationPass(Registry);	initializeLoopLoadEliminationPass(Registry);
		initializeLoopSimplifyCFGPass(Registry);
	}	}

	void LLVMInitializeScalarOpts(LLVMPassRegistryRef R) {	void LLVMInitializeScalarOpts(LLVMPassRegistryRef R) {
Context not available.
	unwrap(PM)->add(createLoopRerollPass());	unwrap(PM)->add(createLoopRerollPass());
	}	}

		void LLVMAddLoopSimplifyCFGPass(LLVMPassManagerRef PM) {
		unwrap(PM)->add(createLoopSimplifyCFGPass());
		}

	void LLVMAddLoopUnrollPass(LLVMPassManagerRef PM) {	void LLVMAddLoopUnrollPass(LLVMPassManagerRef PM) {
	unwrap(PM)->add(createLoopUnrollPass());	unwrap(PM)->add(createLoopUnrollPass());
	}	}
Context not available.

test/Transforms/LoopSimplifyCFG/merge-header.ll

				; RUN: opt -S -loop-simplifycfg < %s \| FileCheck %s

				; CHECK-LABEL: foo
				; CHECK: entry:
				; CHECK-NEXT: br label %[[LOOP:[a-z]+]]
				; CHECK: [[LOOP]]:
				mzolotukhinUnsubmitted Not Done Reply Inline Actions Which basic blocks do we merge here? From the check-lines it looks like we merge `%entry` with `%outer`, which doesn't sound right, since we want to merge only blocks inside loops. Am I misreading something here? mzolotukhin: Which basic blocks do we merge here? From the check-lines it looks like we merge `%entry` with…
				eschaAuthorUnsubmitted Not Done Reply Inline Actions We want to merge Entry and Outer here, yes. Entry is the loop header, Outer + Inner are the body, Outer.Latch2 is the latch, and Exit is the loop exit. Entry + Outer can be merged in this case. escha: We want to merge Entry and Outer here, yes. Entry is the loop header, Outer + Inner are the…
				; CHECK-NEXT: phi
				mzolotukhinUnsubmitted Not Done Reply Inline Actions Hmm, the header here is `%outer`, `%entry` is a preheader: $ opt -analyze -loops < merge-header.ll Printing analysis 'Natural Loop Information' for function 'foo': Loop at depth 1 containing: %outer<header>,%inner<exiting>,%outer.latch2<latch> That said, I think the test is correct in current form (I misread it first time), but I think we might want to make it more explicit. Like, we can check something like this: CHECK: entry: CHECK-NEXT: br label %[[LOOP:%.]] CHECK: [[LOOP]: CHECK-NEXT: phi CHECK-NOT: br label CHECK: br i1 This way we make sure that the blocks are actually merged. mzolotukhin: Hmm, the header here is `%outer`, `%entry` is a preheader: ``` $ opt -analyze -loops < merge…
				; CHECK-NOT: br label
				; CHECK: br i1
				define i32 @foo(i32* %P, i64* %Q) {
				entry:
				br label %outer

				outer: ; preds = %outer.latch2, %entry
				%y.2 = phi i32 [ 0, %entry ], [ %y.inc2, %outer.latch2 ]
				br label %inner

				inner: ; preds = %outer
				store i32 0, i32* %P
				store i32 1, i32* %P
				store i32 2, i32* %P
				%y.inc2 = add nsw i32 %y.2, 1
				%exitcond.outer = icmp eq i32 %y.inc2, 3
				store i32 %y.2, i32* %P
				br i1 %exitcond.outer, label %exit, label %outer.latch2

				outer.latch2: ; preds = %inner
				%t = sext i32 %y.inc2 to i64
				store i64 %t, i64* %Q
				br label %outer

				exit: ; preds = %inner
				ret i32 0
				}

This is an archive of the discontinued LLVM Phabricator instance.

Add LoopSimplifyCFG passClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 45715

include/llvm/InitializePasses.h

include/llvm/LinkAllPasses.h

include/llvm/Transforms/Scalar.h

lib/Transforms/Scalar/CMakeLists.txt

lib/Transforms/Scalar/LoopSimplifyCFG.cpp

lib/Transforms/Scalar/Scalar.cpp

test/Transforms/LoopSimplifyCFG/merge-header.ll

Add LoopSimplifyCFG pass
ClosedPublic