This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/CodeGen/
-
CodeGen/
27/31
CallBrPrepare.cpp
-
test/CodeGen/AArch64/
-
CodeGen/
-
AArch64/
2/2
callbr-prepare.ll

Differential D139872

[llvm][CallBrPrepare] split critical edges
ClosedPublic

Authored by nickdesaulniers on Dec 12 2022, 12:01 PM.

Download Raw Diff

Details

Reviewers

aeubanks
efriedma
void
jyknight

Commits

rG0a39af0eb72d: [llvm][CallBrPrepare] split critical edges

Summary

If we have a CallBrInst with output that's used, we need to split
critical edges so that we have some place to insert COPYs for physregs
to virtregs.

Part 2a of
https://discourse.llvm.org/t/rfc-syncing-asm-goto-with-outputs-with-gcc/65453/8.

Test cases and logic re-purposed from D138078.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

nickdesaulniers created this revision.Dec 12 2022, 12:01 PM

Herald added a project: Restricted Project. · View Herald TranscriptDec 12 2022, 12:01 PM

Herald added a subscriber: hiraditya. · View Herald Transcript

nickdesaulniers requested review of this revision.Dec 12 2022, 12:01 PM

Herald added a project: Restricted Project. · View Herald TranscriptDec 12 2022, 12:01 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

nickdesaulniers added a parent revision: D139861: [llvm] boilerplate for new callbrprepare codegen IR pass.Dec 12 2022, 12:01 PM

nickdesaulniers added reviewers: aeubanks, efriedma.

nickdesaulniers added subscribers: void, craig.topper.

nickdesaulniers added a subscriber: jyknight.

Harbormaster completed remote builds in B202650: Diff 482218.Dec 12 2022, 12:02 PM

nickdesaulniers updated this revision to Diff 482229.Dec 12 2022, 12:40 PM

remove setPreservesAll

Harbormaster completed remote builds in B202661: Diff 482229.Dec 12 2022, 12:40 PM

nickdesaulniers added inline comments.Dec 12 2022, 12:41 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
72–104	@aeubanks does this delta look correct?
139–140	@aeubanks does this look correct?

nickdesaulniers added reviewers: void, jyknight.Dec 12 2022, 12:43 PM

nickdesaulniers edited subscribers, added: arsenm, nhaehnle; removed: jyknight, void.

nickdesaulniers added a child revision: D139883: [llvm][CallBrPrepare] add llvm.callbr.landingpad intrinsic.Dec 12 2022, 2:16 PM

nickdesaulniers planned changes to this revision.Dec 12 2022, 2:20 PM

nickdesaulniers added inline comments.

llvm/test/CodeGen/AArch64/callbr-prepare.ll
140–142	looks like this should have been split...

nickdesaulniers added inline comments.Dec 12 2022, 2:22 PM

llvm/test/CodeGen/AArch64/callbr-prepare.ll
140–142	oh, right, no uses of the output in the indirect edge.

aeubanks added inline comments.Dec 12 2022, 2:32 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
65–67	this needs to be changed to be like this to initialize dom tree (annoying legacy-PM boilerplate)
107	both `ShouldRun` can be static
111	I'd just return the list of CallBrInsts and check if it's empty in the caller, that's a lot more explicit
123–124	moving these variables into the loop makes this less errorprone
139–140	yup, lg
147	unnecessary if

add test case about missing use in indirect branch

Harbormaster completed remote builds in B202691: Diff 482271.Dec 12 2022, 2:35 PM

INITIALIZE_PASS_DEPENDENCY
return SmallVector (NVRO)
mark ShouldRun static
create new SmallPtrSet, SmallVector per loop iter

llvm/lib/CodeGen/CallBrPrepare.cpp
147	`SplitKnownCriticalEdge` is fallible. If we don't end up splitting the critical edge, then we shouldn't mark denote that the pass has made changes, right?

Harbormaster completed remote builds in B202692: Diff 482276.Dec 12 2022, 2:56 PM

aeubanks added inline comments.Dec 12 2022, 3:02 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
147	there's an assert right above that `Synth != nullptr`, it doesn't make sense to assert on something then check if it's true looking at `SplitKnownCriticalEdge`, it only returns `nullptr` for the options we're passing it when `DestBB->isEHPad()`, can that happen?

nickdesaulniers added inline comments.Dec 12 2022, 3:22 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
147	there's an assert right above that Synth != nullptr, it doesn't make sense to assert on something then check if it's true If assertions are disabled, we should still denote correctly whether the pass made modifications. looking at SplitKnownCriticalEdge, it only returns nullptr for the options we're passing it when DestBB->isEHPad(), can that happen? If someone was deranged enough to mix C++ structured exception handling with asm goto, perhaps. int y; asm goto ("":"=r"(y):::out); try {} catch (int x) { out: } Though looks like the frontend rejects it: https://godbolt.org/z/dKGnjKK49. I've also tried: define i32 @x() { %out = callbr i32 asm "", "=r,!i"() to label %direct [label %lp] direct: br label %lp lp: %foo = landingpad { ptr, i32 } cleanup ret i32 42 } which fails the verifier check Block containing LandingPadInst must be jumped to only by the unwind edge of an invoke. %foo = landingpad { ptr, i32 } cleanup LandingPadInst needs to be in a function with a personality. %foo = landingpad { ptr, i32 } cleanup I don't know enough about the rest of the EHPad instructions to know if someone could conjure up such an abomination, hence extra checks. "Let's support those if we actually ever do see them in the wild."

nickdesaulniers added inline comments.Dec 12 2022, 4:12 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
139–140	Oh, I guess later on (in a later commit) I will need to query `DT.dominates`. If DOM tree info isn't available, wat do?

nickdesaulniers added inline comments.Dec 12 2022, 4:16 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
139–140	Perhaps I would manually need to construct a DomTreeUpdater or something? I guess there's prior art in SafeStackLegacyPass::runOnFunction

efriedma added inline comments.Dec 12 2022, 4:23 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
139–140	If you need a domtree, just explicitly request one. With LegacyPM, something like `AU.addRequired<DominatorTreeWrapperPass>();` in CallBrPrepare::getAnalysisUsage

require domtree

Harbormaster completed remote builds in B202880: Diff 482524.Dec 13 2022, 9:34 AM

nickdesaulniers added subscribers: davide, ab, pcc, lebedev.ri.Dec 15 2022, 2:23 PM

nickdesaulniers added inline comments.

llvm/lib/CodeGen/CallBrPrepare.cpp
139–140	So I think the issue with `AU.addRequired<DominatorTreeWrapperPass>();` is that it might now introduce a `Dominator Tree Construction` into `-O0` pipelines pessimistically. The idea being that if we scanned the IR for `callbr`s (which are highly unlikely to exist in most programs outside of the Linux kernel and tcmalloc), we could lazily compute the DOMTree only if we needed (I think that's why `SafeStackLegacyPass::runOnFunction` has that pattern? cc @ab @pcc @davide @lebedev.ri )

nickdesaulniers mentioned this in D140160: [llvm][SelectionDAGBuilder] codegen callbr.landingpad intrinsic.Dec 15 2022, 2:43 PM

efriedma added inline comments.Dec 15 2022, 2:47 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
139–140	Oh, hmm, I see what you mean. Yes, that's what the code in SafeStackLegacyPass involving the `std::optional<DominatorTree>` is doing: if an existing domtree is available, it uses it, otherwise it only computes the domtree if it's needed.

nickdesaulniers planned changes to this revision.Dec 15 2022, 5:17 PM

nickdesaulniers added inline comments.Dec 20 2022, 3:13 PM

llvm/lib/CodeGen/CallBrPrepare.cpp

139–140

So it looks like there's a tradeoff (FWICT) with that pattern:

+  // It's highly likely that most programs do not contain CallBrInsts. Follow a
+  // similar pattern from SafeStackLegacyPass::runOnFunction to reuse previous
+  // domtree analysis if available, otherwise compute it lazily. This avoids
+  // forcing Dominator Tree Construction at -O0 for programs that likely do not
+  // contain CallBrInsts. It does pessimize programs with callbr at higher
+  // optimization levels, as the DominatorTree created here is not reused by
+  // subsequent passes.

lazily compute domtree

Harbormaster completed remote builds in B204259: Diff 484401.Dec 20 2022, 3:23 PM

Ready for review:

nickdesaulniers mentioned this in D139883: [llvm][CallBrPrepare] add llvm.callbr.landingpad intrinsic.Dec 21 2022, 2:28 PM

nickdesaulniers mentioned this in D139970: [llvm][CallBrPrepare] use SSAUpdater to use intrinsic value.

nickdesaulniers mentioned this in D140180: [llvm] add CallBrPrepare pass to pipelines.

nickdesaulniers mentioned this in D136497: [Clang] support for outputs along indirect edges of asm goto.

void added inline comments.Dec 21 2022, 3:05 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
147	I agree with @aeubanks. This sequence looks bad. I know that asserts aren't enabled for a release, but it's either bad or it isn't. If it's bad, we should at the very least report it.

nickdesaulniers added inline comments.Dec 22 2022, 11:26 AM

llvm/lib/CodeGen/CallBrPrepare.cpp

147

Is this really better looking?

diff --git a/llvm/lib/CodeGen/CallBrPrepare.cpp b/llvm/lib/CodeGen/CallBrPrepare.cpp
index 9835b5b41e2f..752e2d6f442f 100644
--- a/llvm/lib/CodeGen/CallBrPrepare.cpp
+++ b/llvm/lib/CodeGen/CallBrPrepare.cpp
@@ -117,8 +117,8 @@ bool CallBrPrepare::SplitCriticalEdges(ArrayRef<CallBrInst *> CBRs,
     for (unsigned i : UniqueCriticalSuccessorIndices) {
       BasicBlock *Synth = SplitKnownCriticalEdge(CBR, i, Options);
       assert(Synth && "Failed to split a known critical edge from callbr");
-      if (Synth)
-        Changed = true;
+      (void)Synth;
+      Changed = true;
     }
   }
   return Changed;

void added inline comments.Dec 22 2022, 12:45 PM

llvm/lib/CodeGen/CallBrPrepare.cpp

147

That's how it's done in other places. E.g. in llvm/include/llvm/CodeGen/LiveVariables.h:

/// removeVirtualRegisterKilled - Remove the specified kill of the virtual
/// register from the live variable information. Returns true if the
/// variable was marked as killed by the specified instruction,
/// false otherwise.
bool removeVirtualRegisterKilled(Register Reg, MachineInstr &MI) {
  if (!getVarInfo(Reg).removeKill(MI))
    return false;

  bool Removed = false;
  for (MachineOperand &MO : MI.operands()) {
    if (MO.isReg() && MO.isKill() && MO.getReg() == Reg) {
      MO.setIsKill(false);
      Removed = true;
      break;
    }
  }

  assert(Removed && "Register is not used by this instruction!");
  (void)Removed;
  return true;
}

It's not particularly satisfying, but at least it appears to be consistent with the rest of the code base.

remove unnecessary if, as per @aeubanks and @void

Harbormaster completed remote builds in B204660: Diff 484941.Dec 22 2022, 1:04 PM

In D139872#4014003, @nickdesaulniers wrote:

remove unnecessary if, as per @aeubanks and @void

I've been thinking about this and I'm not really sure why LLVM hasn't created their own version of assert yet. It could be something as simple as this:

#define LLVM_ASSERT(cond, msg)  if (!cond) assert(false && msg)

But that's a separate issue.

This pass looks okay to me. I'll accept, but I think others who commented should also accept.

This revision is now accepted and ready to land.Jan 3 2023, 3:16 PM

rebase, format

Harbormaster completed remote builds in B206880: Diff 487921.Jan 10 2023, 3:20 PM

rebase

efriedma accepted this revision.Jan 18 2023, 11:37 AM

Harbormaster completed remote builds in B208558: Diff 490238.Jan 18 2023, 12:22 PM

LGTM. One suggestion that you can ignore if you wish.

llvm/lib/CodeGen/CallBrPrepare.cpp
147	You can iterate over the indirect destinations. Something like: for (BasicBlock *BB : CBR->getIndirectDests()) { }

nickdesaulniers added inline comments.Jan 18 2023, 3:54 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
147	I would have preferred to use range-for iteration here. IIRC, the issue with that approach was that `isCriticalEdge` takes an `unsigned` as the successor number (rather than the `BasicBlock` successor itself. If it did, that would probably have improved the ergonomics here. Let me know if you think I should add an overload of `isCriticalEdge` perhaps, or if I missed something else?

void added inline comments.Jan 18 2023, 4:08 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
147	`isCriticalEdge` can also take a BB: llvm/include/llvm/Analysis/CFG.h: bool isCriticalEdge(const Instruction TI, const BasicBlock Succ, bool AllowIdenticalEdges = false);

nickdesaulniers added inline comments.Jan 19 2023, 10:04 AM

llvm/lib/CodeGen/CallBrPrepare.cpp
147	Ah, it was `SplitKnownCriticalEdge` used later outside of this loop that expected the index of the successor number. That's why this loop builds up a vector of indices; the next loop will then split them. So I don't think I can change this first loop, but let me know if I'm still missing something.

jyknight added inline comments.Jan 19 2023, 11:01 AM

llvm/lib/CodeGen/CallBrPrepare.cpp
108–127	Doesn't seem to add anything to have "ShouldRun" as a separate function. I'd just inline the check here. If not that, rename it; the name "ShouldRun" is not adding any understandability here.
143–144	I don't understand this comment. With a single loop, if we iterate over the successors in order, and successors 1 and 3 are both to the same block, the `MergeIdenticalEdges` option should ensure that when we call `SplitKnownCriticalEdge` on 1, it'll also rewrite 3. Oh, I see -- I think you needed `isCriticalEdge(CBR, i, /AllowIdenticalEdges=/true)`, so that we don't report the edge as critical when it's only used from a single block (and won't split the already-split edge!)

nickdesaulniers added inline comments.Jan 19 2023, 2:13 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
143–144	if we iterate over the successors in order, and successors 1 and 3 are both to the same block, the MergeIdenticalEdges option should ensure that when we call SplitKnownCriticalEdge on 1, it'll also rewrite 3. Correct, see `@split_me1` in llvm/test/CodeGen/AArch64/callbr-prepare.ll which covers this part of the code. I think you needed isCriticalEdge(CBR, i, /AllowIdenticalEdges=/true), so that we don't report the edge as critical when it's only used from a single block (and won't split the already-split edge!) I don't think setting `AllowIdenticalEdges` to `true` works. I have vague memories of trying to use it, but I can't recall specifically why I wasn't able to use it and haven't been able to rework this block via setting `AllowIdenticalEdges` to `true`. The comment also mentions `SplitCriticalEdge` rather than `SplitKnownCriticalEdge`. That may be an artifact from an earlier revision. At the least, I should fix that part of the comment. Do you have thoughts on how I could better rephrase the comment block you didn't understand (assuming now that you do)? Are you asking that I try to use `AllowIdenticalEdges` set to `true`?

nickdesaulniers added inline comments.Jan 19 2023, 2:22 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
147	It looks like `llvm::GetSuccessorNumber` could be used. So even if `SplitKnownCriticalEdge` needs an index, I could use range-for then `llvm::GetSuccessorNumber`. That may be more readable, but it looks like it would be O(N^2) rather than two sequential O(N) loops? I guess N is likely too small to matter though.

nickdesaulniers added inline comments.Jan 19 2023, 2:29 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
143–144	For instance, I would have like for this code to simply have been something like: for (BasicBlock BB : CBR->getIndirectDests()) if (BasicBlock Synth = SplitCriticalEdge(CBR, GetSuccessorNumber(CBR->getParent(), BB), Options)) Changed = true; but that doesn't work, hence the two loops and (obtuse) commentary.

inline ShouldRun, as per @jyknight
update comment s/SplitCriticalEdge/SplitKnownCriticalEdge/

nickdesaulniers added inline comments.Jan 19 2023, 2:53 PM

llvm/lib/CodeGen/CallBrPrepare.cpp
143–144	So I think the issue here is what happens if the direct and indirect destination are the same. Maybe `@split_me5` test case added later, where we don't want to split both edges, vs two indirect branches that are the same, in case we'd like to split both of them `@split_me1` but only once.

Harbormaster completed remote builds in B208861: Diff 490666.Jan 19 2023, 4:06 PM

nickdesaulniers added inline comments.Jan 24 2023, 11:19 AM

llvm/lib/CodeGen/CallBrPrepare.cpp

143–144

@jyknight what are your thoughts on rewording the comment block as such (sorry, the block has shifted around from edits since your comment; we're referring to the comment blocks in CallBrPrepare::SplitCriticalEdges)

Conceptually, these two loops are doing:

for (BasicBlock *Dest : CBR->getIndirectDests())
  if (BasicBlock *Synth = SplitCriticalEdge(CBR, GetSuccessorNumber(CBR->getParent(), Dest)))
    Changed = true;

But there is a problem resulting in two special cases; a single callbr instruction may have multiple edges to the
same destination.  Consider the first case:

  %0 = callbr ... to label %x [label %y, label %y]

We want to split the edge to %y once, not twice so that both instance of %y go to the new destination.
Now consider the second case:

  %0 = callbr ... to label %x [label %x]

We want to split the indirect edge, but not the direct edge (since the direct edge doesn't
have the problem of where to store copies).  This pair of sequential loops is necessary to
handle what is logically a pass over indirect edges to split them.

(with necessary word wrapping, since I haven't typed it out in code yet) (I would then delete all existing comments in that method).

Thoughts?

refactor and significantly simply CallBrPrepare::SplitCriticalEdges.

@jyknight figured out how to properly use AllowIdenticalEdges together with
MergeIdenticalEdges. Use code he sent me and add a comment.

Harbormaster completed remote builds in B209740: Diff 491898.Jan 24 2023, 5:08 PM

Thanks, looks good now!

rebase

Harbormaster completed remote builds in B212128: Diff 495169.Feb 6 2023, 11:09 AM

final rebase

Harbormaster completed remote builds in B214260: Diff 498151.Feb 16 2023, 4:08 PM

Closed by commit rG0a39af0eb72d: [llvm][CallBrPrepare] split critical edges (authored by nickdesaulniers). · Explain WhyFeb 16 2023, 6:04 PM

This revision was automatically updated to reflect the committed changes.

nickdesaulniers added a commit: rG0a39af0eb72d: [llvm][CallBrPrepare] split critical edges.

nathanchance added a subscriber: nathanchance.Feb 17 2023, 9:30 AM

Revision Contents

Path

Size

llvm/

lib/

CodeGen/

CallBrPrepare.cpp

79 lines

test/

CodeGen/

AArch64/

callbr-prepare.ll

201 lines

Diff 498215

llvm/lib/CodeGen/CallBrPrepare.cpp

	Show All 25 Lines
	// Ideally, this could be done inside SelectionDAG, or in the			// Ideally, this could be done inside SelectionDAG, or in the
	// MachineInstruction representation, without the use of an IR-level intrinsic.			// MachineInstruction representation, without the use of an IR-level intrinsic.
	// But, within the current framework, it’s simpler to implement as an IR pass.			// But, within the current framework, it’s simpler to implement as an IR pass.
	// (If support for callbr in GlobalISel is implemented, it’s worth considering			// (If support for callbr in GlobalISel is implemented, it’s worth considering
	// whether this is still required.)			// whether this is still required.)
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

				#include "llvm/ADT/ArrayRef.h"
				#include "llvm/ADT/SmallVector.h"
				#include "llvm/Analysis/CFG.h"
	#include "llvm/CodeGen/Passes.h"			#include "llvm/CodeGen/Passes.h"
	#include "llvm/IR/BasicBlock.h"			#include "llvm/IR/BasicBlock.h"
				#include "llvm/IR/Dominators.h"
	#include "llvm/IR/Function.h"			#include "llvm/IR/Function.h"
	#include "llvm/IR/Instructions.h"			#include "llvm/IR/Instructions.h"
	#include "llvm/InitializePasses.h"			#include "llvm/InitializePasses.h"
	#include "llvm/Pass.h"			#include "llvm/Pass.h"
				#include "llvm/Transforms/Utils/BasicBlockUtils.h"

	using namespace llvm;			using namespace llvm;

	#define DEBUG_TYPE "callbrprepare"			#define DEBUG_TYPE "callbrprepare"

	namespace {			namespace {

	class CallBrPrepare : public FunctionPass {			class CallBrPrepare : public FunctionPass {
				bool SplitCriticalEdges(ArrayRef<CallBrInst *> CBRs, DominatorTree &DT);

	public:			public:
	CallBrPrepare() : FunctionPass(ID) {}			CallBrPrepare() : FunctionPass(ID) {}
	static char ID;
	void getAnalysisUsage(AnalysisUsage &AU) const override;			void getAnalysisUsage(AnalysisUsage &AU) const override;
	bool runOnFunction(Function &Fn) override;			bool runOnFunction(Function &Fn) override;
				static char ID;
	};			};

	} // end anonymous namespace			} // end anonymous namespace

	char CallBrPrepare::ID = 0;			char CallBrPrepare::ID = 0;
	INITIALIZE_PASS(CallBrPrepare, DEBUG_TYPE, "Prepare callbr", false, false)			INITIALIZE_PASS_BEGIN(CallBrPrepare, DEBUG_TYPE, "Prepare callbr", false, false)
				INITIALIZE_PASS_DEPENDENCY(DominatorTreeWrapperPass)
				INITIALIZE_PASS_END(CallBrPrepare, DEBUG_TYPE, "Prepare callbr", false, false)
				aeubanksUnsubmitted Done Reply Inline Actions this needs to be changed to be like this to initialize dom tree (annoying legacy-PM boilerplate) aeubanks: this needs to be changed to be like [this](https://github.com/llvm/llvm…

	FunctionPass *llvm::createCallBrPass() { return new CallBrPrepare(); }			FunctionPass *llvm::createCallBrPass() { return new CallBrPrepare(); }

	void CallBrPrepare::getAnalysisUsage(AnalysisUsage &AU) const {			void CallBrPrepare::getAnalysisUsage(AnalysisUsage &AU) const {
	AU.setPreservesAll();			AU.addPreserved<DominatorTreeWrapperPass>();
				}

				static SmallVector<CallBrInst *, 2> FindCallBrs(Function &Fn) {
				SmallVector<CallBrInst *, 2> CBRs;
				for (BasicBlock &BB : Fn)
				if (auto *CBR = dyn_cast<CallBrInst>(BB.getTerminator()))
				if (!CBR->getType()->isVoidTy() && !CBR->use_empty())
				CBRs.push_back(CBR);
				return CBRs;
				}

				bool CallBrPrepare::SplitCriticalEdges(ArrayRef<CallBrInst *> CBRs,
				DominatorTree &DT) {
				bool Changed = false;
				CriticalEdgeSplittingOptions Options(&DT);
				Options.setMergeIdenticalEdges();

				// The indirect destination might be duplicated between another parameter...
				// %0 = callbr ... [label %x, label %x]
				// ...hence MergeIdenticalEdges and AllowIndentical edges, but we don't need
				// to split the default destination if it's duplicated between an indirect
				// destination...
				// %1 = callbr ... to label %x [label %x]
				// ...hence starting at 1 and checking against successor 0 (aka the default
				// destination).
				for (CallBrInst *CBR : CBRs)
				for (unsigned i = 1, e = CBR->getNumSuccessors(); i != e; ++i)
				if (CBR->getSuccessor(i) == CBR->getSuccessor(0) \|\|
				isCriticalEdge(CBR, i, /AllowIdenticalEdges/ true))
				if (SplitKnownCriticalEdge(CBR, i, Options))
				Changed = true;
				return Changed;
				nickdesaulniersAuthorUnsubmitted Not Done Reply Inline Actions @aeubanks does this delta look correct? nickdesaulniers: @aeubanks does this delta look correct?
	}			}

	bool CallBrPrepare::runOnFunction(Function &Fn) {			bool CallBrPrepare::runOnFunction(Function &Fn) {
				aeubanksUnsubmitted Done Reply Inline Actions both `ShouldRun` can be static aeubanks: both `ShouldRun` can be static
	for (BasicBlock &BB : Fn) {			bool Changed = false;
	auto *CBR = dyn_cast<CallBrInst>(BB.getTerminator());			SmallVector<CallBrInst *, 2> CBRs = FindCallBrs(Fn);
	if (!CBR)
	continue;			if (CBRs.empty())
				aeubanksUnsubmitted Done Reply Inline Actions I'd just return the list of CallBrInsts and check if it's empty in the caller, that's a lot more explicit aeubanks: I'd just return the list of CallBrInsts and check if it's empty in the caller, that's a lot…
	// TODO: something interesting.			return Changed;
	// https://discourse.llvm.org/t/rfc-syncing-asm-goto-with-outputs-with-gcc/65453/8
				// It's highly likely that most programs do not contain CallBrInsts. Follow a
				// similar pattern from SafeStackLegacyPass::runOnFunction to reuse previous
				// domtree analysis if available, otherwise compute it lazily. This avoids
				// forcing Dominator Tree Construction at -O0 for programs that likely do not
				// contain CallBrInsts. It does pessimize programs with callbr at higher
				// optimization levels, as the DominatorTree created here is not reused by
				// subsequent passes.
				DominatorTree *DT;
				std::optional<DominatorTree> LazilyComputedDomTree;
				if (auto *DTWP = getAnalysisIfAvailable<DominatorTreeWrapperPass>())
				DT = &DTWP->getDomTree();
				aeubanksUnsubmitted Done Reply Inline Actions moving these variables into the loop makes this less errorprone aeubanks: moving these variables into the loop makes this less errorprone
				else {
				LazilyComputedDomTree.emplace(Fn);
				DT = &*LazilyComputedDomTree;
				jyknightUnsubmitted Done Reply Inline Actions Doesn't seem to add anything to have "ShouldRun" as a separate function. I'd just inline the check here. If not that, rename it; the name "ShouldRun" is not adding any understandability here. jyknight: Doesn't seem to add anything to have "ShouldRun" as a separate function. I'd just inline the…
	}			}
	return false;
				if (SplitCriticalEdges(CBRs, *DT))
				Changed = true;

				return Changed;
	}			}
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions @aeubanks does this look correct? nickdesaulniers: @aeubanks does this look correct?
				aeubanksUnsubmitted Done Reply Inline Actions yup, lg aeubanks: yup, lg
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions Oh, I guess later on (in a later commit) I will need to query `DT.dominates`. If DOM tree info isn't available, wat do? nickdesaulniers: Oh, I guess later on (in a later commit) I will need to query `DT.dominates`. If DOM tree info…
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions Perhaps I would manually need to construct a DomTreeUpdater or something? I guess there's prior art in SafeStackLegacyPass::runOnFunction nickdesaulniers: Perhaps I would manually need to construct a DomTreeUpdater or something? I guess there's…
				efriedmaUnsubmitted Done Reply Inline Actions If you need a domtree, just explicitly request one. With LegacyPM, something like `AU.addRequired<DominatorTreeWrapperPass>();` in CallBrPrepare::getAnalysisUsage efriedma: If you need a domtree, just explicitly request one. With LegacyPM, something like `AU.
				nickdesaulniersAuthorUnsubmitted Not Done Reply Inline Actions So I think the issue with `AU.addRequired<DominatorTreeWrapperPass>();` is that it might now introduce a `Dominator Tree Construction` into `-O0` pipelines pessimistically. The idea being that if we scanned the IR for `callbr`s (which are highly unlikely to exist in most programs outside of the Linux kernel and tcmalloc), we could lazily compute the DOMTree only if we needed (I think that's why `SafeStackLegacyPass::runOnFunction` has that pattern? cc @ab @pcc @davide @lebedev.ri ) nickdesaulniers: So I think the issue with `AU.addRequired<DominatorTreeWrapperPass>();` is that it might now…
				efriedmaUnsubmitted Not Done Reply Inline Actions Oh, hmm, I see what you mean. Yes, that's what the code in SafeStackLegacyPass involving the `std::optional<DominatorTree>` is doing: if an existing domtree is available, it uses it, otherwise it only computes the domtree if it's needed. efriedma: Oh, hmm, I see what you mean. Yes, that's what the code in SafeStackLegacyPass involving the…
				nickdesaulniersAuthorUnsubmitted Not Done Reply Inline Actions So it looks like there's a tradeoff (FWICT) with that pattern: + // It's highly likely that most programs do not contain CallBrInsts. Follow a + // similar pattern from SafeStackLegacyPass::runOnFunction to reuse previous + // domtree analysis if available, otherwise compute it lazily. This avoids + // forcing Dominator Tree Construction at -O0 for programs that likely do not + // contain CallBrInsts. It does pessimize programs with callbr at higher + // optimization levels, as the DominatorTree created here is not reused by + // subsequent passes. nickdesaulniers: So it looks like there's a tradeoff (FWICT) with that pattern: ``` + // It's highly likely…
				aeubanksUnsubmitted Done Reply Inline Actions unnecessary if aeubanks: unnecessary if
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions `SplitKnownCriticalEdge` is fallible. If we don't end up splitting the critical edge, then we shouldn't mark denote that the pass has made changes, right? nickdesaulniers: `SplitKnownCriticalEdge` is fallible. If we don't end up splitting the critical edge, then we…
				aeubanksUnsubmitted Done Reply Inline Actions there's an assert right above that `Synth != nullptr`, it doesn't make sense to assert on something then check if it's true looking at `SplitKnownCriticalEdge`, it only returns `nullptr` for the options we're passing it when `DestBB->isEHPad()`, can that happen? aeubanks: there's an assert right above that `Synth != nullptr`, it doesn't make sense to assert on…
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions there's an assert right above that Synth != nullptr, it doesn't make sense to assert on something then check if it's true If assertions are disabled, we should still denote correctly whether the pass made modifications. looking at SplitKnownCriticalEdge, it only returns nullptr for the options we're passing it when DestBB->isEHPad(), can that happen? If someone was deranged enough to mix C++ structured exception handling with asm goto, perhaps. int y; asm goto ("":"=r"(y):::out); try {} catch (int x) { out: } Though looks like the frontend rejects it: https://godbolt.org/z/dKGnjKK49. I've also tried: define i32 @x() { %out = callbr i32 asm "", "=r,!i"() to label %direct [label %lp] direct: br label %lp lp: %foo = landingpad { ptr, i32 } cleanup ret i32 42 } which fails the verifier check Block containing LandingPadInst must be jumped to only by the unwind edge of an invoke. %foo = landingpad { ptr, i32 } cleanup LandingPadInst needs to be in a function with a personality. %foo = landingpad { ptr, i32 } cleanup I don't know enough about the rest of the EHPad instructions to know if someone could conjure up such an abomination, hence extra checks. "Let's support those if we actually ever do see them in the wild." nickdesaulniers: > there's an assert right above that Synth != nullptr, it doesn't make sense to assert on…
				voidUnsubmitted Done Reply Inline Actions I agree with @aeubanks. This sequence looks bad. I know that asserts aren't enabled for a release, but it's either bad or it isn't. If it's bad, we should at the very least report it. void: I agree with @aeubanks. This sequence looks bad. I know that asserts aren't enabled for a…
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions Is this really better looking? diff --git a/llvm/lib/CodeGen/CallBrPrepare.cpp b/llvm/lib/CodeGen/CallBrPrepare.cpp index 9835b5b41e2f..752e2d6f442f 100644 --- a/llvm/lib/CodeGen/CallBrPrepare.cpp +++ b/llvm/lib/CodeGen/CallBrPrepare.cpp @@ -117,8 +117,8 @@ bool CallBrPrepare::SplitCriticalEdges(ArrayRef<CallBrInst > CBRs, for (unsigned i : UniqueCriticalSuccessorIndices) { BasicBlock Synth = SplitKnownCriticalEdge(CBR, i, Options); assert(Synth && "Failed to split a known critical edge from callbr"); - if (Synth) - Changed = true; + (void)Synth; + Changed = true; } } return Changed; nickdesaulniers: Is this really better looking? ``` diff --git a/llvm/lib/CodeGen/CallBrPrepare.cpp…
				voidUnsubmitted Done Reply Inline Actions That's how it's done in other places. E.g. in `llvm/include/llvm/CodeGen/LiveVariables.h`: /// removeVirtualRegisterKilled - Remove the specified kill of the virtual /// register from the live variable information. Returns true if the /// variable was marked as killed by the specified instruction, /// false otherwise. bool removeVirtualRegisterKilled(Register Reg, MachineInstr &MI) { if (!getVarInfo(Reg).removeKill(MI)) return false; bool Removed = false; for (MachineOperand &MO : MI.operands()) { if (MO.isReg() && MO.isKill() && MO.getReg() == Reg) { MO.setIsKill(false); Removed = true; break; } } assert(Removed && "Register is not used by this instruction!"); (void)Removed; return true; } It's not particularly satisfying, but at least it appears to be consistent with the rest of the code base. void: That's how it's done in other places. E.g. in `llvm/include/llvm/CodeGen/LiveVariables.h`: ```…
				voidUnsubmitted Done Reply Inline Actions You can iterate over the indirect destinations. Something like: for (BasicBlock BB : CBR->getIndirectDests()) { } void:* You can iterate over the indirect destinations. Something like: ``` for (BasicBlock *BB : CBR…
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions I would have preferred to use range-for iteration here. IIRC, the issue with that approach was that `isCriticalEdge` takes an `unsigned` as the successor number (rather than the `BasicBlock` successor itself. If it did, that would probably have improved the ergonomics here. Let me know if you think I should add an overload of `isCriticalEdge` perhaps, or if I missed something else? nickdesaulniers: I would have preferred to use range-for iteration here. IIRC, the issue with that approach was…
				voidUnsubmitted Done Reply Inline Actions `isCriticalEdge` can also take a BB: llvm/include/llvm/Analysis/CFG.h: bool isCriticalEdge(const Instruction TI, const BasicBlock Succ, bool AllowIdenticalEdges = false); void: `isCriticalEdge` can also take a BB: ``` llvm/include/llvm/Analysis/CFG.h: bool isCriticalEdge…
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions Ah, it was `SplitKnownCriticalEdge` used later outside of this loop that expected the index of the successor number. That's why this loop builds up a vector of indices; the next loop will then split them. So I don't think I can change this first loop, but let me know if I'm still missing something. nickdesaulniers: Ah, it was `SplitKnownCriticalEdge` used later outside of this loop that expected the index of…
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions It looks like `llvm::GetSuccessorNumber` could be used. So even if `SplitKnownCriticalEdge` needs an index, I could use range-for then `llvm::GetSuccessorNumber`. That may be more readable, but it looks like it would be O(N^2) rather than two sequential O(N) loops? I guess N is likely too small to matter though. nickdesaulniers: It looks like `llvm::GetSuccessorNumber` could be used. So even if `SplitKnownCriticalEdge`…
				jyknightUnsubmitted Done Reply Inline Actions I don't understand this comment. With a single loop, if we iterate over the successors in order, and successors 1 and 3 are both to the same block, the `MergeIdenticalEdges` option should ensure that when we call `SplitKnownCriticalEdge` on 1, it'll also rewrite 3. Oh, I see -- I think you needed `isCriticalEdge(CBR, i, /AllowIdenticalEdges=/true)`, so that we don't report the edge as critical when it's only used from a single block (and won't split the already-split edge!) jyknight: I don't understand this comment. With a single loop, if we iterate over the successors in order…
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions if we iterate over the successors in order, and successors 1 and 3 are both to the same block, the MergeIdenticalEdges option should ensure that when we call SplitKnownCriticalEdge on 1, it'll also rewrite 3. Correct, see `@split_me1` in llvm/test/CodeGen/AArch64/callbr-prepare.ll which covers this part of the code. I think you needed isCriticalEdge(CBR, i, /AllowIdenticalEdges=/true), so that we don't report the edge as critical when it's only used from a single block (and won't split the already-split edge!) I don't think setting `AllowIdenticalEdges` to `true` works. I have vague memories of trying to use it, but I can't recall specifically why I wasn't able to use it and haven't been able to rework this block via setting `AllowIdenticalEdges` to `true`. The comment also mentions `SplitCriticalEdge` rather than `SplitKnownCriticalEdge`. That may be an artifact from an earlier revision. At the least, I should fix that part of the comment. Do you have thoughts on how I could better rephrase the comment block you didn't understand (assuming now that you do)? Are you asking that I try to use `AllowIdenticalEdges` set to `true`? nickdesaulniers: > if we iterate over the successors in order, and successors 1 and 3 are both to the same block…
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions For instance, I would have like for this code to simply have been something like: for (BasicBlock BB : CBR->getIndirectDests()) if (BasicBlock Synth = SplitCriticalEdge(CBR, GetSuccessorNumber(CBR->getParent(), BB), Options)) Changed = true; but that doesn't work, hence the two loops and (obtuse) commentary. nickdesaulniers: For instance, I would have like for this code to simply have been something like: ``` for…
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions So I think the issue here is what happens if the direct and indirect destination are the same. Maybe `@split_me5` test case added later, where we don't want to split both edges, vs two indirect branches that are the same, in case we'd like to split both of them `@split_me1` but only once. nickdesaulniers: So I think the issue here is what happens if the direct and indirect destination are the same.
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions @jyknight what are your thoughts on rewording the comment block as such (sorry, the block has shifted around from edits since your comment; we're referring to the comment blocks in `CallBrPrepare::SplitCriticalEdges`) Conceptually, these two loops are doing: for (BasicBlock Dest : CBR->getIndirectDests()) if (BasicBlock Synth = SplitCriticalEdge(CBR, GetSuccessorNumber(CBR->getParent(), Dest))) Changed = true; But there is a problem resulting in two special cases; a single callbr instruction may have multiple edges to the same destination. Consider the first case: %0 = callbr ... to label %x [label %y, label %y] We want to split the edge to %y once, not twice so that both instance of %y go to the new destination. Now consider the second case: %0 = callbr ... to label %x [label %x] We want to split the indirect edge, but not the direct edge (since the direct edge doesn't have the problem of where to store copies). This pair of sequential loops is necessary to handle what is logically a pass over indirect edges to split them. (with necessary word wrapping, since I haven't typed it out in code yet) (I would then delete all existing comments in that method). Thoughts? nickdesaulniers: @jyknight what are your thoughts on rewording the comment block as such (sorry, the block has…

llvm/test/CodeGen/AArch64/callbr-prepare.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt %s -callbrprepare -S -o - \| FileCheck %s			; RUN: opt %s -callbrprepare -S -o - \| FileCheck %s

	; TODO: update this test to split critical edges.
	define i32 @test0() {			define i32 @test0() {
	; CHECK-LABEL: @test0(			; CHECK-LABEL: @test0(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[OUT:%.*]] = callbr i32 asm "# $0", "=r,!i"()			; CHECK-NEXT: [[OUT:%.*]] = callbr i32 asm "# $0", "=r,!i"()
	; CHECK-NEXT: to label [[DIRECT:%.*]] [label %indirect]			; CHECK-NEXT: to label [[DIRECT:%.*]] [label %entry.indirect_crit_edge]
				; CHECK: entry.indirect_crit_edge:
				; CHECK-NEXT: br label [[INDIRECT:%.*]]
	; CHECK: direct:			; CHECK: direct:
	; CHECK-NEXT: [[OUT2:%.*]] = callbr i32 asm "# $0", "=r,!i"()			; CHECK-NEXT: [[OUT2:%.*]] = callbr i32 asm "# $0", "=r,!i"()
	; CHECK-NEXT: to label [[DIRECT2:%.*]] [label %indirect]			; CHECK-NEXT: to label [[DIRECT2:%.*]] [label %direct.indirect_crit_edge]
				; CHECK: direct.indirect_crit_edge:
				; CHECK-NEXT: br label [[INDIRECT]]
	; CHECK: direct2:			; CHECK: direct2:
	; CHECK-NEXT: ret i32 0			; CHECK-NEXT: ret i32 0
	; CHECK: indirect:			; CHECK: indirect:
	; CHECK-NEXT: [[OUT3:%.]] = phi i32 [ [[OUT]], [[ENTRY:%.]] ], [ [[OUT2]], [[DIRECT]] ]			; CHECK-NEXT: [[OUT3:%.]] = phi i32 [ [[OUT]], [[ENTRY_INDIRECT_CRIT_EDGE:%.]] ], [ [[OUT2]], [[DIRECT_INDIRECT_CRIT_EDGE:%.*]] ]
	; CHECK-NEXT: ret i32 [[OUT3]]			; CHECK-NEXT: ret i32 [[OUT3]]
	;			;
	entry:			entry:
	%out = callbr i32 asm "# $0", "=r,!i"()			%out = callbr i32 asm "# $0", "=r,!i"()
	to label %direct [label %indirect]			to label %direct [label %indirect]
	direct:			direct:
	%out2 = callbr i32 asm "# $0", "=r,!i"()			%out2 = callbr i32 asm "# $0", "=r,!i"()
	to label %direct2 [label %indirect]			to label %direct2 [label %indirect]
	direct2:			direct2:
	ret i32 0			ret i32 0
	indirect:			indirect:
	%out3 = phi i32 [%out, %entry], [%out2, %direct]			%out3 = phi i32 [%out, %entry], [%out2, %direct]
	ret i32 %out3			ret i32 %out3
	}			}

				; Don't split edges unless they are critical, and callbr produces output, and
				; that output is used.
				; Here we have none of the above.
				define i32 @dont_split0() {
				; CHECK-LABEL: @dont_split0(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: callbr void asm "", "!i"()
				; CHECK-NEXT: to label [[X:%.*]] [label %y]
				; CHECK: x:
				; CHECK-NEXT: ret i32 42
				; CHECK: y:
				; CHECK-NEXT: ret i32 0
				;
				entry:
				callbr void asm "", "!i"()
				to label %x [label %y]

				x:
				ret i32 42

				y:
				ret i32 0
				}

				; Don't split edges unless they are critical, and callbr produces output, and
				; that output is used.
				; Here we have output, but no critical edge.
				define i32 @dont_split1() {
				; CHECK-LABEL: @dont_split1(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[TMP0:%.*]] = callbr i32 asm "", "=r,!i"()
				; CHECK-NEXT: to label [[X:%.*]] [label %y]
				; CHECK: x:
				; CHECK-NEXT: ret i32 42
				; CHECK: y:
				; CHECK-NEXT: ret i32 [[TMP0]]
				;
				entry:
				%0 = callbr i32 asm "", "=r,!i"()
				to label %x [label %y]

				x:
				ret i32 42

				y:
				ret i32 %0
				}

				; Don't split edges unless they are critical, and callbr produces output, and
				; that output is used.
				; Here we have a critical edge along an indirect branch, but no output.
				define i32 @dont_split2() {
				; CHECK-LABEL: @dont_split2(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: callbr void asm "", "!i"()
				; CHECK-NEXT: to label [[X:%.*]] [label %y]
				; CHECK: x:
				; CHECK-NEXT: br label [[Y:%.*]]
				; CHECK: y:
				; CHECK-NEXT: [[TMP0:%.]] = phi i32 [ 0, [[X]] ], [ 42, [[ENTRY:%.]] ]
				; CHECK-NEXT: ret i32 [[TMP0]]
				;
				entry:
				callbr void asm "", "!i"()
				to label %x [label %y]

				x:
				br label %y

				y:
				%0 = phi i32 [ 0, %x ], [ 42, %entry ]
				ret i32 %0
				}

				; Don't split edges unless they are critical, and callbr produces output, and
				; that output is used.
				; Here we're missing a use.
				define i32 @dont_split3() {
				; CHECK-LABEL: @dont_split3(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[TMP0:%.*]] = callbr i32 asm "", "=r,!i"()
				; CHECK-NEXT: to label [[X:%.*]] [label %v]
				; CHECK: x:
				; CHECK-NEXT: br label [[V:%.*]]
				; CHECK: v:
				; CHECK-NEXT: ret i32 42
				;
				entry:
				%0 = callbr i32 asm "", "=r,!i"() to label %x [label %v]

				x:
				br label %v

				v:
				ret i32 42
				}

				; Don't split edges unless they are critical, and callbr produces output, and
				; that output is used.
				; Here we have output and a critical edge along an indirect branch.
				define i32 @split_me0() {
				; CHECK-LABEL: @split_me0(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: [[TMP0:%.*]] = callbr i32 asm "", "=r,!i"()
				; CHECK-NEXT: to label [[X:%.*]] [label %entry.y_crit_edge]
				; CHECK: entry.y_crit_edge:
				; CHECK-NEXT: br label [[Y:%.*]]
				; CHECK: x:
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions looks like this should have been split... nickdesaulniers: looks like this should have been split...
				nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions oh, right, no uses of the output in the indirect edge. nickdesaulniers: oh, right, no uses of the output in the indirect edge.
				; CHECK-NEXT: br label [[Y]]
				; CHECK: y:
				; CHECK-NEXT: [[TMP1:%.]] = phi i32 [ [[TMP0]], [[ENTRY_Y_CRIT_EDGE:%.]] ], [ 42, [[X]] ]
				; CHECK-NEXT: ret i32 [[TMP1]]
				;
				entry:
				%0 = callbr i32 asm "", "=r,!i"()
				to label %x [label %y]

				x:
				br label %y

				y:
				%1 = phi i32 [ %0, %entry ], [ 42, %x ]
				ret i32 %1
				}

				; Here we have output and a critical edge along an indirect branch.
				; Ensure that if we repeat the indirect destination, that we only split it
				; once.
				define i32 @split_me1(i1 %z) {
				; CHECK-LABEL: @split_me1(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: br i1 [[Z:%.]], label [[W:%.]], label [[V:%.*]]
				; CHECK: w:
				; CHECK-NEXT: [[TMP0:%.*]] = callbr i32 asm "", "=r,!i,!i"()
				; CHECK-NEXT: to label [[X:%.]] [label [[W_V_CRIT_EDGE:%.]], label %w.v_crit_edge]
				; CHECK: w.v_crit_edge:
				; CHECK-NEXT: br label [[V]]
				; CHECK: x:
				; CHECK-NEXT: ret i32 42
				; CHECK: v:
				; CHECK-NEXT: [[TMP1:%.]] = phi i32 [ [[TMP0]], [[W_V_CRIT_EDGE]] ], [ undef, [[ENTRY:%.]] ]
				; CHECK-NEXT: ret i32 [[TMP1]]
				;
				entry:
				br i1 %z, label %w, label %v

				w:
				%0 = callbr i32 asm "", "=r,!i,!i"()
				to label %x [label %v, label %v]

				x:
				ret i32 42

				v:
				%1 = phi i32 [%0, %w], [%0, %w], [undef, %entry]
				ret i32 %1
				}

				; A more interessting case of @split_me1. Check that we still only split the
				; critical edge from w to v once.
				define i32 @split_me2(i1 %z) {
				; CHECK-LABEL: @split_me2(
				; CHECK-NEXT: entry:
				; CHECK-NEXT: br i1 [[Z:%.]], label [[W:%.]], label [[V:%.*]]
				; CHECK: w:
				; CHECK-NEXT: [[TMP0:%.*]] = callbr i32 asm "", "=r,!i,!i"()
				; CHECK-NEXT: to label [[X:%.]] [label [[W_V_CRIT_EDGE:%.]], label %w.v_crit_edge]
				; CHECK: w.v_crit_edge:
				; CHECK-NEXT: br label [[V]]
				; CHECK: x:
				; CHECK-NEXT: ret i32 42
				; CHECK: v:
				; CHECK-NEXT: [[TMP1:%.]] = phi i32 [ [[TMP0]], [[W_V_CRIT_EDGE]] ], [ 42, [[ENTRY:%.]] ]
				; CHECK-NEXT: ret i32 [[TMP1]]
				;
				entry:
				br i1 %z, label %w, label %v

				w:
				%0 = callbr i32 asm "", "=r,!i,!i"()
				to label %x [label %v, label %v]

				x:
				ret i32 42

				v:
				%1 = phi i32 [ %0, %w ], [ 42, %entry ], [ %0, %w ]
				ret i32 %1
				}

This is an archive of the discontinued LLVM Phabricator instance.

[llvm][CallBrPrepare] split critical edgesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 498215

llvm/lib/CodeGen/CallBrPrepare.cpp

llvm/test/CodeGen/AArch64/callbr-prepare.ll

[llvm][CallBrPrepare] split critical edges
ClosedPublic