This is an archive of the discontinued LLVM Phabricator instance.

Differential D110922

[LoopPeel] Peel loops with exits followed by an unreachable or deopt block
Closed Public

Authored by dmakogon on Oct 1 2021, 4:59 AM.


Details

Reviewers

mkazantsev
fhahn
reames
skatkov
nikic

Commits

rGe09958d5eb74: [LoopPeel] Peel loops with exits followed by an unreachable or deopt block
rGd68b59f3ebb2: Recommit "[LoopPeel] Peel loops with deoptimizing exits"
rG8a959625c433: [LoopPeel] Peel loops with deoptimizing exits

Summary

Added support for peeling loops with exits that are followed either by an unreachable-terminated block or by a block that has a terminating deoptimize call. All blocks in the sequence must have a unique successor, except possibly the last one.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dmakogon created this revision. Oct 1 2021, 4:59 AM

Herald added subscribers: zzheng, hiraditya. Oct 1 2021, 4:59 AM

dmakogon requested review of this revision. Oct 1 2021, 4:59 AM

Herald added a project: Restricted Project. Oct 1 2021, 4:59 AM

Herald added a subscriber: llvm-commits.

lebedev.ri added reviewers: fhahn, reames, skatkov. Oct 1 2021, 5:05 AM

lebedev.ri added a subscriber: lebedev.ri.

lebedev.ri added inline comments.

llvm/lib/Transforms/Utils/BasicBlockUtils.cpp
511	Nice, I was about to ask for that. But doesn't this lead to the issues with branch weights discussed in D108108?

dmakogon edited the summary of this revision. Oct 1 2021, 5:10 AM

dmakogon mentioned this in D110924: [LoopUtils] Consider unreachable-terminated blocks as deoptimizing.

dmakogon added a child revision: D110924: [LoopUtils] Consider unreachable-terminated blocks as deoptimizing.

mkazantsev requested changes to this revision. Oct 1 2021, 5:12 AM

mkazantsev added inline comments.

llvm/include/llvm/Transforms/Utils/BasicBlockUtils.h
132	This comment doesn't correspond to what you really want. Imagine that a block has 2 successors, one being deoptimizing and another is just a regular block. It falls into category "any of its children is deoptimizing", but it's definitely not what you are looking for. What I suggest is to rewrite it like "Check if we can prove that all paths starting from this block will converge to a block that is terminated by unreachable".
llvm/lib/Transforms/Utils/BasicBlockUtils.cpp
497	What is `DeoptimizingBlocks` used for? To me it looks like a complete duplicate of `VisitedBlocks`. Also, there is no good reason for it to be a parameter.
500	Use `SmallPtrSet`. `contains` checks in vector are expensive.
501	Looks like it may be done much easier.
    while (BB && VisitedBlocks.insert(BB).second) {
      if ([terminates with unreachable](BB))
        return true;
      BB = BB->getSingleSuccessor();
    }
    return false;
Please check if it's what you meant here, and simplify the code.

This revision now requires changes to proceed. Oct 1 2021, 5:12 AM

mkazantsev added inline comments. Oct 1 2021, 5:25 AM

llvm/lib/Transforms/Utils/BasicBlockUtils.cpp
511	Well, it doesn't introduce a new bug, it just expands the scope of the existing problem. I'm pretty sure it's relatively safe to always set "many:1" metadata for unreached blocks, but need more digging to understand the impact of this.
519	Compile-time-wise, I'd prefer to have the traversal limit set by an option. In the worst case, you'll need to walk dozens of blocks for each exit, and here is where things may go astray.

Rewritten IsBlockDeoptimizing in a simpler way with the use of SmallPtrSet.
Added an option to limit the maximum traversed block chain depth.
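
A minimal, self-contained sketch of the traversal described in this update, assuming a toy CFG model rather than the LLVM API. `ToyBlock`, `uniqueSuccessor`, and `followedByDeoptOrUnreachable` are hypothetical names used only for illustration, and the depth limit is passed as a parameter instead of being read from a command-line option:

```cpp
#include <cassert>
#include <unordered_set>
#include <vector>

// Toy stand-in for a basic block; NOT the LLVM BasicBlock class.
struct ToyBlock {
  std::vector<const ToyBlock *> Succs; // successor blocks
  bool EndsInUnreachable = false;      // terminator is `unreachable`
  bool EndsInDeoptCall = false;        // has a terminating deoptimize call
};

// Returns the only distinct successor of BB, or nullptr if BB has zero or
// several distinct successors.
const ToyBlock *uniqueSuccessor(const ToyBlock *BB) {
  const ToyBlock *Only = nullptr;
  for (const ToyBlock *S : BB->Succs) {
    if (Only && S != Only)
      return nullptr;
    Only = S;
  }
  return Only;
}

// Walks the single-successor chain starting at BB, visiting at most MaxDepth
// blocks. Returns true only if the chain reaches a block that ends in a
// deoptimize call or in `unreachable`.
bool followedByDeoptOrUnreachable(const ToyBlock *BB, unsigned MaxDepth) {
  std::unordered_set<const ToyBlock *> Visited; // guards against cycles
  unsigned Depth = 0;
  while (BB && Depth++ < MaxDepth && Visited.insert(BB).second) {
    if (BB->EndsInDeoptCall || BB->EndsInUnreachable)
      return true;
    BB = uniqueSuccessor(BB);
  }
  return false;
}
```

In this model a block with two different successors stops the walk, so "one successor deoptimizes, the other is a regular block" is rejected, which is the distinction raised in the review of the header comment. A chain longer than the limit is also rejected, which keeps the check conservative.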

dmakogon marked 5 inline comments as done. Oct 4 2021, 4:20 AM

mkazantsev added inline comments. Oct 5 2021, 1:28 AM

llvm/lib/Transforms/Utils/LoopPeel.cpp
847	Instead of this, please fill a SmallVector with DomTreeUpdates and then `DT->applyUpdates` once. It's a more canonical way of doing this.

mkazantsev added inline comments. Oct 5 2021, 1:37 AM

llvm/lib/Transforms/Utils/BasicBlockUtils.cpp
495	This name is misleading. It's not always deoptimizing (if it ends with unreachable), and this block itself is not deoptimizing either. Maybe something like `IsBlockFollowedByDeoptOrUnreached`?
503	There is one thing that bugs me. Imagine a situation:
    loop_exit:
      call foo() // will not return, but is not a deopt
      unreachable
I'm not sure if it's bad actually. But maybe we should consider checking all other (non-deopt) instructions with `isGuaranteedToTransferExecutionToSuccessor`. Opinions?

lebedev.ri added inline comments. Oct 5 2021, 2:12 AM

llvm/lib/Transforms/Utils/BasicBlockUtils.cpp
503	This is actually exactly the situation I would have wanted to ask to be supported.

Renamed IsBlockDeoptimizing to IsBlockFollowedByDeoptOrUnreachable
DT is now updated using applyUpdates

dmakogon marked 2 inline comments as done. Oct 5 2021, 8:57 AM

Harbormaster completed remote builds in B127097: Diff 377270. Oct 5 2021, 9:40 AM

mkazantsev added inline comments. Oct 5 2021, 8:29 PM

llvm/lib/Transforms/Utils/BasicBlockUtils.cpp
503	Do you mean "supported" as "let's peel it" or "let's NOT peel it unless we know unreachable WILL execute"? :)

lebedev.ri added inline comments. Oct 6 2021, 12:26 AM

llvm/lib/Transforms/Utils/BasicBlockUtils.cpp
503	"let's peel it"

In that case everything seems done; @dmakogon pls add one more test with single-exit non-throwing unreachable exit and I think we can go with it, unless someone has cons.

llvm/lib/Transforms/Utils/BasicBlockUtils.cpp
503	Ok, we peel it now. :)

This revision is now accepted and ready to land. Oct 6 2021, 3:14 AM

lebedev.ri added inline comments. Oct 6 2021, 3:19 AM

llvm/lib/Transforms/Utils/BasicBlockUtils.cpp
503	:)

Closed by commit rG8a959625c433: [LoopPeel] Peel loops with deoptimizing exits (authored by mkazantsev). Oct 7 2021, 8:32 PM

This revision was automatically updated to reflect the committed changes.

mkazantsev added a commit: rG8a959625c433: [LoopPeel] Peel loops with deoptimizing exits.

mkazantsev mentioned this in D108114: [LoopPeel] Peel if it turns invariant loads dereferenceable. Oct 7 2021, 10:38 PM

This causes lots of lit test failures in an LLVM_ENABLE_EXPENSIVE_CHECKS build. Can you please fix or revert?

Failed Tests (26):
  LLVM :: ThinLTO/X86/function_entry_count.ll
  LLVM :: Transforms/LoopFusion/guarded_peel.ll
  LLVM :: Transforms/LoopFusion/peel.ll
  LLVM :: Transforms/LoopUnroll/Hexagon/peel-small-loop.ll
  LLVM :: Transforms/LoopUnroll/dce.ll
  LLVM :: Transforms/LoopUnroll/peel-loop-conditions-pgo-1.ll
  LLVM :: Transforms/LoopUnroll/peel-loop-conditions-pgo-2.ll
  LLVM :: Transforms/LoopUnroll/peel-loop-conditions.ll
  LLVM :: Transforms/LoopUnroll/peel-loop-inner.ll
  LLVM :: Transforms/LoopUnroll/peel-loop-nests.ll
  LLVM :: Transforms/LoopUnroll/peel-loop-noalias-scope-decl.ll
  LLVM :: Transforms/LoopUnroll/peel-loop-not-forced.ll
  LLVM :: Transforms/LoopUnroll/peel-loop-pgo-deopt-idom-2.ll
  LLVM :: Transforms/LoopUnroll/peel-loop-pgo-deopt-idom.ll
  LLVM :: Transforms/LoopUnroll/peel-loop-pgo-deopt.ll
  LLVM :: Transforms/LoopUnroll/peel-loop-pgo.ll
  LLVM :: Transforms/LoopUnroll/peel-loop-scev-invalidate.ll
  LLVM :: Transforms/LoopUnroll/peel-loop.ll
  LLVM :: Transforms/LoopUnroll/peel-loop2.ll
  LLVM :: Transforms/LoopUnroll/peel-multiple-unreachable-exits.ll
  LLVM :: Transforms/LoopUnroll/pr33437.ll
  LLVM :: Transforms/LoopUnroll/pr45939-peel-count-and-complete-unroll.ll
  LLVM :: Transforms/LoopUnroll/unroll-after-peel.ll
  LLVM :: Transforms/LoopUnroll/unroll-heuristics-pgo.ll
  LLVM :: Transforms/LoopUnroll/wrong_assert_in_peeling.ll
  LLVM :: Transforms/PhaseOrdering/X86/peel-before-lv-to-enable-vectorization.ll

mkazantsev added a reverting change: rG48a5a2d1af25: Revert "[LoopPeel] Peel loops with deoptimizing exits". Oct 8 2021, 2:08 AM

Reverted, Dima pls investigate.

Fixed the tests that failed with expensive checks enabled. I forgot to remove the DT->verify check that ran on every iteration. That check ran after all needed DT updates, which used to be applied immediately; with this patch some updates are stored and applied all at once later.

This revision was landed with ongoing or failed builds. Oct 8 2021, 4:10 AM

mkazantsev added a commit: rGd68b59f3ebb2: Recommit "[LoopPeel] Peel loops with deoptimizing exits".

Harbormaster completed remote builds in B127723: Diff 378159. Oct 8 2021, 4:47 AM

This causes crashes, reverting

$ ./build/rel/bin/opt -passes=loop-unroll-full /tmp/b.ll -disable-output
opt: ../../llvm/include/llvm/Support/CFGUpdate.h:87: void llvm::cfg::LegalizeUpdates(ArrayRef<Update<NodePtr>>, SmallVectorImpl<Update<NodePtr>> &, bool, bool) [NodePtr = llvm::BasicBlock *]: Assertion `std::abs(NumInsertions) <= 1 && "Unbalanced operations!"' failed.
PLEASE submit a bug report to https://bugs.llvm.org/ and include the crash backtrace.
Stack dump:
0.      Program arguments: ./build/rel/bin/opt -passes=loop-unroll-full /tmp/b.ll -disable-output
 #0 0x0000000001f01953 llvm::sys::PrintStackTrace(llvm::raw_ostream&, int) /usr/local/google/home/aeubanks/repos/llvm-project/build/rel/../../llvm/lib/Support/Unix/Signals.inc:565:13
 #1 0x0000000001eff7ce llvm::sys::RunSignalHandlers() /usr/local/google/home/aeubanks/repos/llvm-project/build/rel/../../llvm/lib/Support/Signals.cpp:98:18
 #2 0x0000000001f01cbf SignalHandler(int) /usr/local/google/home/aeubanks/repos/llvm-project/build/rel/../../llvm/lib/Support/Unix/Signals.inc:407:1
 #3 0x00007f998f9e7140 __restore_rt (/lib/x86_64-linux-gnu/libpthread.so.0+0x14140)
 #4 0x00007f998f4c6ce1 raise ./signal/../sysdeps/unix/sysv/linux/raise.c:51:1
 #5 0x00007f998f4b0537 abort ./stdlib/abort.c:81:7
 #6 0x00007f998f4b040f get_sysdep_segment_value ./intl/loadmsgcat.c:509:8
 #7 0x00007f998f4b040f _nl_load_domain ./intl/loadmsgcat.c:970:34
 #8 0x00007f998f4bf662 (/lib/x86_64-linux-gnu/libc.so.6+0x34662)
 #9 0x0000000001539fd3 AdvancePastEmptyBuckets /usr/local/google/home/aeubanks/repos/llvm-project/build/rel/../../llvm/include/llvm/ADT/DenseMap.h:1281:5
#10 0x0000000001539fd3 operator++ /usr/local/google/home/aeubanks/repos/llvm-project/build/rel/../../llvm/include/llvm/ADT/DenseMap.h:1271:5
#11 0x0000000001539fd3 void llvm::cfg::LegalizeUpdates<llvm::BasicBlock*>(llvm::ArrayRef<llvm::cfg::Update<llvm::BasicBlock*> >, llvm::SmallVectorImpl<llvm::cfg::Update<llvm::BasicBlock*> >&, bool, bool) /usr/local/google/home/aeubanks/repos/llvm-project/build/rel/../../llvm/include/llvm/Support/CFGUpdate.h:85:17
#12 0x0000000001537265 llvm::GraphDiff<llvm::BasicBlock*, false>::GraphDiff(llvm::ArrayRef<llvm::cfg::Update<llvm::BasicBlock*> >, bool) /usr/local/google/home/aeubanks/repos/llvm-project/build/rel/../../llvm/include/llvm/Support/CFGDiff.h:0:5
#13 0x00000000025a0809 applyUpdates /usr/local/google/home/aeubanks/repos/llvm-project/build/rel/../../llvm/include/llvm/Support/GenericDomTree.h:547:5
#14 0x00000000025a0809 llvm::peelLoop(llvm::Loop*, unsigned int, llvm::LoopInfo*, llvm::ScalarEvolution*, llvm::DominatorTree*, llvm::AssumptionCache*, bool) /usr/local/google/home/aeubanks/repos/llvm-project/build/rel/../../llvm/lib/Transforms/Utils/LoopPeel.cpp:803:7

Attachment: b.ll (3 KB)

aeubanks reopened this revision. Oct 8 2021, 10:53 AM

This revision is now accepted and ready to land. Oct 8 2021, 10:53 AM

aeubanks added a reverting change: rG9405217999ef: Revert "Recommit "[LoopPeel] Peel loops with deoptimizing exits"". Oct 8 2021, 10:53 AM

mkazantsev mentioned this in rG49ca01047f0c: [Test] Add commit justifying revert of D110922. Oct 9 2021, 12:34 AM

Thanks @aeubanks! I've checked in your test as test/Transforms/LoopUnroll/revert-D110922.ll to not skip this again. @dmakogon pls investigate.

This revision now requires changes to proceed. Oct 9 2021, 12:35 AM

Fixed the crash on test/Transforms/LoopUnroll/revert-D110922.ll. The same edges were being added to the DomTree more than once, because one of the loop exiting blocks had a switch terminator with multiple edges from that exiting block to an exit. Now we construct a set from the loop exiting edges and add edge insertion updates only for edges from the set.
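
The fix can be illustrated with a small, self-contained sketch on a toy model; it is not the code from the patch. Blocks are plain integer ids, and `ToyEdge`, `insertionsAreBalanced`, and `uniqueExitEdges` are hypothetical names. The balance check is only loosely modeled on the "Unbalanced operations!" assertion shown in the stack trace: in a batch of insert-only updates, each edge may appear at most once.

```cpp
#include <cassert>
#include <cstdlib>
#include <map>
#include <set>
#include <utility>
#include <vector>

// Toy edge between blocks identified by integer ids (hypothetical model,
// not the LLVM DominatorTree::UpdateType).
using ToyEdge = std::pair<int, int>; // (exiting block, exit block)

// Loosely models the balance assertion: counts insertions per edge and
// rejects a batch in which any edge is inserted more than once.
bool insertionsAreBalanced(const std::vector<ToyEdge> &Insertions) {
  std::map<ToyEdge, int> NumInsertions;
  for (const ToyEdge &E : Insertions)
    ++NumInsertions[E];
  for (const auto &Entry : NumInsertions)
    if (std::abs(Entry.second) > 1)
      return false;
  return true;
}

// Keeps the first occurrence of each (from, to) pair, preserving order.
std::vector<ToyEdge> uniqueExitEdges(const std::vector<ToyEdge> &Edges) {
  std::set<ToyEdge> Seen;
  std::vector<ToyEdge> Result;
  for (const ToyEdge &E : Edges)
    if (Seen.insert(E).second) // true only the first time E is seen
      Result.push_back(E);
  return Result;
}
```

A switch with two cases targeting the same exit contributes the pair twice; passing the list through `uniqueExitEdges` before building the batch leaves one insertion per edge.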

Two drive-by comments.

On the surface, this seems like it should be two changes. One to do an NFC restructuring of the domtree updates. One to actually enable the broader scope.

The description of the change also needs to be updated. An exit reaching an unreachable is *not* a "deoptimizing" exit.

Harbormaster completed remote builds in B128110: Diff 378675. Oct 11 2021, 8:59 AM

+1 to splitting off the dom tree update, especially since it was causing trouble.

aeubanks removed a subscriber: aeubanks. Oct 11 2021, 9:33 AM

dmakogon mentioned this in D111611: [NFC] [LoopPeel] Update IDoms of non-loop blocks dominated by the loop. Oct 12 2021, 12:08 AM

Split the patch into 2:

D111611 changes the way DT is updated,
This patch adds support for peeling loops with exits followed by unreachable or deopt blocks.

Also changed this patch's title and description to address Philip's comment.

dmakogon added a parent revision: D111611: [NFC] [LoopPeel] Update IDoms of non-loop blocks dominated by the loop. Oct 12 2021, 12:38 AM

Do I understand correctly that the reason for the crash was duplicated exiting edges, which caused duplicated DT updates, and that this is now fixed in D111611?

Harbormaster completed remote builds in B128287: Diff 378905. Oct 12 2021, 1:24 AM

mkazantsev mentioned this in rGfa16329ae072: [NFC] [LoopPeel] Change the way DT is updated for loop exits. Oct 17 2021, 8:53 PM

mkazantsev requested changes to this revision. Oct 17 2021, 9:01 PM

mkazantsev added inline comments.

llvm/lib/Transforms/Utils/BasicBlockUtils.cpp
57	Please rename the option and change description, taking Philip's comment into account. Unreachable-terminated block is not a deopt block.

This revision now requires changes to proceed. Oct 17 2021, 9:01 PM

Rebase.

Updated name of the option which limits the maximum path length when checking whether a BB is followed by an unreachable or deoptimizing block.

Harbormaster completed remote builds in B130620: Diff 382211. Oct 26 2021, 1:01 AM

dmakogon added a reviewer: nikic. Oct 27 2021, 12:43 AM

LGTM

This revision is now accepted and ready to land. Nov 1 2021, 12:03 AM

Closed by commit rGe09958d5eb74: [LoopPeel] Peel loops with exits followed by an unreachable or deopt block (authored by dmakogon). Nov 2 2021, 9:13 AM

This revision was automatically updated to reflect the committed changes.

dmakogon added a commit: rGe09958d5eb74: [LoopPeel] Peel loops with exits followed by an unreachable or deopt block.

Revision Contents

Path                                                                Size
llvm/include/llvm/Transforms/Utils/BasicBlockUtils.h                7 lines
llvm/lib/Transforms/Utils/BasicBlockUtils.cpp                       21 lines
llvm/lib/Transforms/Utils/LoopPeel.cpp                              14 lines
llvm/test/Transforms/LoopUnroll/peel-multiple-unreachable-exits.ll  44 lines

Diff 384134

llvm/include/llvm/Transforms/Utils/BasicBlockUtils.h

[123 lines not shown]
 /// instruction.
 void ReplaceInstWithInst(BasicBlock::InstListType &BIL,
                          BasicBlock::iterator &BI, Instruction *I);

 /// Replace the instruction specified by From with the instruction specified by
 /// To. Copies DebugLoc from BI to I, if I doesn't already have a DebugLoc.
 void ReplaceInstWithInst(Instruction *From, Instruction *To);

+/// Check if we can prove that all paths starting from this block converge
+/// to a block that either has a @llvm.experimental.deoptimize call
+/// prior to its terminating return instruction or is terminated by unreachable.
+/// All blocks in the traversed sequence must have an unique successor, maybe
+/// except for the last one.
+bool IsBlockFollowedByDeoptOrUnreachable(const BasicBlock *BB);
+
 /// Option class for critical edge splitting.
 ///
 /// This provides a builder interface for overriding the default options used
 /// during critical edge splitting.
 struct CriticalEdgeSplittingOptions {
   DominatorTree *DT;
   PostDominatorTree *PDT;
   LoopInfo *LI;
[438 lines not shown]

llvm/lib/Transforms/Utils/BasicBlockUtils.cpp

[33 lines not shown]
 #include "llvm/IR/IntrinsicInst.h"
 #include "llvm/IR/LLVMContext.h"
 #include "llvm/IR/PseudoProbe.h"
 #include "llvm/IR/Type.h"
 #include "llvm/IR/User.h"
 #include "llvm/IR/Value.h"
 #include "llvm/IR/ValueHandle.h"
 #include "llvm/Support/Casting.h"
+#include "llvm/Support/CommandLine.h"
 #include "llvm/Support/Debug.h"
 #include "llvm/Support/raw_ostream.h"
 #include "llvm/Transforms/Utils/Local.h"
 #include <cassert>
 #include <cstdint>
 #include <string>
 #include <utility>
 #include <vector>

 using namespace llvm;

 #define DEBUG_TYPE "basicblock-utils"

+static cl::opt<unsigned> MaxDeoptOrUnreachableSuccessorCheckDepth(
+    "max-deopt-or-unreachable-succ-check-depth", cl::init(8), cl::Hidden,
+    cl::desc("Set the maximum path length when checking whether a basic block "
+             "is followed by a block that either has a terminating "
+             "deoptimizing call or is terminated with an unreachable"));
+
 void llvm::DetatchDeadBlocks(
     ArrayRef<BasicBlock *> BBs,
     SmallVectorImpl<DominatorTree::UpdateType> *Updates,
     bool KeepOneInputPHIs) {
   for (auto *BB : BBs) {
     // Loop through all of our successors and make sure they know that one
     // of their predecessors is going away.
     SmallPtrSet<BasicBlock *, 4> UniqueSuccessors;
[417 lines not shown] void llvm::ReplaceInstWithInst(BasicBlock::InstListType &BIL,

   // Replace all uses of the old instruction, and delete it.
   ReplaceInstWithValue(BIL, BI, I);

   // Move BI back to point to the newly inserted instruction
   BI = New;
 }

+bool llvm::IsBlockFollowedByDeoptOrUnreachable(const BasicBlock *BB) {
+  // Remember visited blocks to avoid infinite loop
+  SmallPtrSet<const BasicBlock *, 8> VisitedBlocks;
+  unsigned Depth = 0;
+  while (BB && Depth++ < MaxDeoptOrUnreachableSuccessorCheckDepth &&
+         VisitedBlocks.insert(BB).second) {
+    if (BB->getTerminatingDeoptimizeCall() ||
+        isa<UnreachableInst>(BB->getTerminator()))
+      return true;
+    BB = BB->getUniqueSuccessor();
+  }
+  return false;
+}
+
 void llvm::ReplaceInstWithInst(Instruction *From, Instruction *To) {
   BasicBlock::iterator BI(From);
   ReplaceInstWithInst(From->getParent()->getInstList(), BI, To);
 }

 BasicBlock *llvm::SplitEdge(BasicBlock *BB, BasicBlock *Succ, DominatorTree *DT,
                             LoopInfo *LI, MemorySSAUpdater *MSSAU,
                             const Twine &BBName) {
   unsigned SuccNum = GetSuccessorNumber(BB, Succ);

   Instruction *LatchTerm = BB->getTerminator();

   CriticalEdgeSplittingOptions Options =
       CriticalEdgeSplittingOptions(DT, LI, MSSAU).setPreserveLCSSA();

   if ((isCriticalEdge(LatchTerm, SuccNum, Options.MergeIdenticalEdges))) {
     // If it is a critical edge, and the succesor is an exception block, handle
     // the split edge logic in this specific function
     if (Succ->isEHPad())
[1,287 lines not shown]

llvm/lib/Transforms/Utils/LoopPeel.cpp

[97 lines not shown] bool llvm::canPeel(Loop *L) {

   // Peeling is only supported if the latch is a branch.
   if (!isa<BranchInst>(Latch->getTerminator()))
     return false;

   SmallVector<BasicBlock *, 4> Exits;
   L->getUniqueNonLatchExitBlocks(Exits);
   // The latch must either be the only exiting block or all non-latch exit
-  // blocks have either a deopt or unreachable terminator. Both deopt and
-  // unreachable terminators are a strong indication they are not taken. Note
-  // that this is a profitability check, not a legality check. Also note that
-  // LoopPeeling currently can only update the branch weights of latch blocks
-  // and branch weights to blocks with deopt or unreachable do not need
+  // blocks have either a deopt or unreachable terminator or compose a chain of
+  // blocks where the last one is either deopt or unreachable terminated. Both
+  // deopt and unreachable terminators are a strong indication they are not
+  // taken. Note that this is a profitability check, not a legality check. Also
+  // note that LoopPeeling currently can only update the branch weights of latch
+  // blocks and branch weights to blocks with deopt or unreachable do not need
   // updating.
   return all_of(Exits, [](const BasicBlock *BB) {
-    return BB->getTerminatingDeoptimizeCall() ||
-           isa<UnreachableInst>(BB->getTerminator());
+    return IsBlockFollowedByDeoptOrUnreachable(BB);
   });
 }

 // This function calculates the number of iterations after which the given Phi
 // becomes an invariant. The pre-calculated values are memorized in the map. The
 // function (shortcut is I) is calculated according to the following definition:
 // Given %x = phi <Inputs from above the loop>, ..., [%y, %back.edge].
 // If %y is a loop invariant, then I(%x) = 1.
[716 lines not shown] for (unsigned Iter = 0; Iter < PeelCount; ++Iter) {
     // Remap to use values from the current iteration instead of the
     // previous one.
     remapInstructionsInBlocks(NewBlocks, VMap);

     if (DT) {
       // Update IDoms of the blocks reachable through exits.
       if (Iter == 0)
         for (auto BBIDom : NonLoopBlocksIDom)
           DT->changeImmediateDominator(BBIDom.first,
                                        cast<BasicBlock>(LVMap[BBIDom.second]));
 #ifdef EXPENSIVE_CHECKS
       assert(DT->verify(DominatorTree::VerificationLevel::Fast));
 #endif
     }

     auto *LatchBRCopy = cast<BranchInst>(VMap[LatchBR]);
     updateBranchWeights(InsertBot, LatchBRCopy, ExitWeight, FallThroughWeight);
[49 lines not shown]

llvm/test/Transforms/LoopUnroll/peel-multiple-unreachable-exits.ll

[187 lines not shown]
 unreachable.exit:
   call void @foo()
   unreachable
 }

 define void @peel_exits_to_blocks_branch_to_unreachable_block(i32* %ptr, i32 %N, i32 %x, i1 %c.1) {
 ; CHECK-LABEL: @peel_exits_to_blocks_branch_to_unreachable_block(
 ; CHECK-NEXT: entry:
+; CHECK-NEXT: br label [[LOOP_HEADER_PEEL_BEGIN:%.*]]
+; CHECK: loop.header.peel.begin:
+; CHECK-NEXT: br label [[LOOP_HEADER_PEEL:%.*]]
+; CHECK: loop.header.peel:
+; CHECK-NEXT: [[C_PEEL:%.*]] = icmp ult i32 1, 2
+; CHECK-NEXT: br i1 [[C_PEEL]], label [[THEN_PEEL:%.*]], label [[ELSE_PEEL:%.*]]
+; CHECK: else.peel:
+; CHECK-NEXT: [[C_2_PEEL:%.*]] = icmp eq i32 1, [[X:%.*]]
+; CHECK-NEXT: br i1 [[C_2_PEEL]], label [[EXIT_2:%.*]], label [[LOOP_LATCH_PEEL:%.*]]
+; CHECK: then.peel:
+; CHECK-NEXT: br i1 [[C_1:%.*]], label [[EXIT_1:%.*]], label [[LOOP_LATCH_PEEL]]
+; CHECK: loop.latch.peel:
+; CHECK-NEXT: [[M_PEEL:%.*]] = phi i32 [ 0, [[THEN_PEEL]] ], [ [[X]], [[ELSE_PEEL]] ]
+; CHECK-NEXT: [[GEP_PEEL:%.*]] = getelementptr i32, i32* [[PTR:%.*]], i32 1
+; CHECK-NEXT: store i32 [[M_PEEL]], i32* [[GEP_PEEL]], align 4
+; CHECK-NEXT: [[IV_NEXT_PEEL:%.*]] = add nuw nsw i32 1, 1
+; CHECK-NEXT: [[C_3_PEEL:%.*]] = icmp ult i32 1, 1000
+; CHECK-NEXT: br i1 [[C_3_PEEL]], label [[LOOP_HEADER_PEEL_NEXT:%.*]], label [[EXIT:%.*]]
+; CHECK: loop.header.peel.next:
+; CHECK-NEXT: br label [[LOOP_HEADER_PEEL_NEXT1:%.*]]
+; CHECK: loop.header.peel.next1:
+; CHECK-NEXT: br label [[ENTRY_PEEL_NEWPH:%.*]]
+; CHECK: entry.peel.newph:
 ; CHECK-NEXT: br label [[LOOP_HEADER:%.*]]
 ; CHECK: loop.header:
-; CHECK-NEXT: [[IV:%.*]] = phi i32 [ 1, [[ENTRY:%.*]] ], [ [[IV_NEXT:%.*]], [[LOOP_LATCH:%.*]] ]
-; CHECK-NEXT: [[C:%.*]] = icmp ult i32 [[IV]], 2
-; CHECK-NEXT: br i1 [[C]], label [[THEN:%.*]], label [[ELSE:%.*]]
+; CHECK-NEXT: [[IV:%.*]] = phi i32 [ [[IV_NEXT_PEEL]], [[ENTRY_PEEL_NEWPH]] ], [ [[IV_NEXT:%.*]], [[LOOP_LATCH:%.*]] ]
+; CHECK-NEXT: br i1 false, label [[THEN:%.*]], label [[ELSE:%.*]]
 ; CHECK: then:
-; CHECK-NEXT: br i1 [[C_1:%.*]], label [[EXIT_1:%.*]], label [[LOOP_LATCH]]
+; CHECK-NEXT: br i1 [[C_1]], label [[EXIT_1_LOOPEXIT:%.*]], label [[LOOP_LATCH]]
 ; CHECK: else:
-; CHECK-NEXT: [[C_2:%.*]] = icmp eq i32 [[IV]], [[X:%.*]]
-; CHECK-NEXT: br i1 [[C_2]], label [[EXIT_2:%.*]], label [[LOOP_LATCH]]
+; CHECK-NEXT: [[C_2:%.*]] = icmp eq i32 [[IV]], [[X]]
+; CHECK-NEXT: br i1 [[C_2]], label [[EXIT_2_LOOPEXIT:%.*]], label [[LOOP_LATCH]]
 ; CHECK: loop.latch:
 ; CHECK-NEXT: [[M:%.*]] = phi i32 [ 0, [[THEN]] ], [ [[X]], [[ELSE]] ]
-; CHECK-NEXT: [[GEP:%.*]] = getelementptr i32, i32* [[PTR:%.*]], i32 [[IV]]
+; CHECK-NEXT: [[GEP:%.*]] = getelementptr i32, i32* [[PTR]], i32 [[IV]]
 ; CHECK-NEXT: store i32 [[M]], i32* [[GEP]], align 4
 ; CHECK-NEXT: [[IV_NEXT]] = add nuw nsw i32 [[IV]], 1
 ; CHECK-NEXT: [[C_3:%.*]] = icmp ult i32 [[IV]], 1000
-; CHECK-NEXT: br i1 [[C_3]], label [[LOOP_HEADER]], label [[EXIT:%.*]]
+; CHECK-NEXT: br i1 [[C_3]], label [[LOOP_HEADER]], label [[EXIT_LOOPEXIT:%.*]], !llvm.loop [[LOOP2:![0-9]+]]
+; CHECK: exit.loopexit:
+; CHECK-NEXT: br label [[EXIT]]
 ; CHECK: exit:
 ; CHECK-NEXT: ret void
+; CHECK: exit.1.loopexit:
+; CHECK-NEXT: br label [[EXIT_1]]
 ; CHECK: exit.1:
 ; CHECK-NEXT: call void @foo()
 ; CHECK-NEXT: br label [[UNREACHABLE_TERM:%.*]]
+; CHECK: exit.2.loopexit:
+; CHECK-NEXT: br label [[EXIT_2]]
 ; CHECK: exit.2:
 ; CHECK-NEXT: call void @bar()
 ; CHECK-NEXT: br label [[UNREACHABLE_TERM]]
 ; CHECK: unreachable.term:
 ; CHECK-NEXT: call void @baz()
 ; CHECK-NEXT: unreachable
 ;
 entry:
[188 lines not shown]
