This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/Utils/
-
Transforms/
-
Utils/
10/10
LoopUnroll.cpp
-
test/Transforms/LoopUnroll/
-
Transforms/
-
LoopUnroll/
2/2
callbr.ll

Differential D64101

[LoopUnroll] fix cloning callbr
AbandonedPublic

Authored by nickdesaulniers on Jul 2 2019, 2:19 PM.

Download Raw Diff

Details

Reviewers

fhahn
hfinkel
efriedma

Summary

There is currently a correctness issue when unrolling loops containing
callbr's where their indirect targets are being updated correctly to the
newly created labels, but their operands are not. This manifests in
unrolled loops where the second and subsequent copies of callbr
instructions have blockaddresses of the label from the first instance of
the unrolled loop, which would result in nonsensical runtime control
flow.

When cloning a callbr, update its blockaddress operands if they were cloned, too.

Link: https://bugs.llvm.org/show_bug.cgi?id=42489
Link: https://groups.google.com/forum/#!topic/clang-built-linux/z-hRWP9KqPI

Diff Detail

Repository

rG LLVM Github Monorepo

Build Status

Buildable 34303
Build 34302: arc lint + arc unit

Event Timeline

nickdesaulniers created this revision.Jul 2 2019, 2:19 PM

Herald added a project: Restricted Project. · View Herald TranscriptJul 2 2019, 2:19 PM

Herald added subscribers: llvm-commits, dmgreen, zzheng, hiraditya. · View Herald Transcript

Harbormaster completed remote builds in B34217: Diff 207626.Jul 2 2019, 2:20 PM

nickdesaulniers added a reviewer: craig.topper.Jul 2 2019, 2:22 PM

nickdesaulniers added a subscriber: kees.Jul 2 2019, 2:27 PM

nathanchance added a subscriber: nathanchance.Jul 2 2019, 2:28 PM

Here's how I might modify the above test if for when we can loop unroll correctly: https://gist.github.com/nickdesaulniers/7216f6e5a17c7064285190440cb88f1d

E5ten added a subscriber: E5ten.Jul 2 2019, 3:18 PM

I have a more complete fix for this whole issue outright. Let me get the test case working, then I'll upload a v2.

fix the loop unroller

Harbormaster completed remote builds in B34233: Diff 207666.Jul 2 2019, 5:22 PM

nickdesaulniers retitled this revision from [LoopUnroll] do not unroll loops containing callbr to [LoopUnroll] fix cloning callbr.Jul 2 2019, 5:24 PM

nickdesaulniers edited the summary of this revision. (Show Details)

nickdesaulniers marked an inline comment as done.Jul 2 2019, 5:26 PM

nickdesaulniers added inline comments.

llvm/lib/Transforms/Utils/LoopUnroll.cpp
104	Notes to reviewers: let me know if the comments are excessive and I'll remove them. Also note that I manually peeled this loop out from the above in this function. I could place the `isa<CallBrInst>` and rest of this block in there. Just let me know WDYT?

srhines added inline comments.Jul 2 2019, 5:34 PM

llvm/lib/Transforms/Utils/LoopUnroll.cpp
106	Fold this line into the if statement for LLVM style.

style nit

nickdesaulniers marked an inline comment as done.Jul 2 2019, 5:42 PM

Harbormaster completed remote builds in B34236: Diff 207673.Jul 2 2019, 5:47 PM

nickdesaulniers marked an inline comment as done.Jul 2 2019, 6:08 PM

nickdesaulniers added inline comments.

llvm/lib/Transforms/Utils/LoopUnroll.cpp
112	Sorry, I think this should be a `dyn_cast` (we're "downcasting" from a Value* to a BasicBlock*, which may fail)

dyn_cast

Harbormaster completed remote builds in B34237: Diff 207684.Jul 2 2019, 6:10 PM

reroll loops, prefer conciseness

Harbormaster completed remote builds in B34238: Diff 207685.Jul 2 2019, 6:19 PM

fhahn added a subscriber: fhahn.Jul 3 2019, 3:26 AM

fhahn added inline comments.

llvm/lib/Transforms/Utils/LoopUnroll.cpp
101	IIUC, for block addresses, we now set the operand first by the code above and then also by this block here. It might be slightly better to first check for the special case and `continue` in case we hit it. Also, a brief comment why we need special handling here would be helpful IMO.
104	Is it possible for It->second to not be a BB here? I think if the original operand was a BB, the new one also should be a BB. So this could be a regular cast?
llvm/test/Transforms/LoopUnroll/callbr.ll
2	The test only checks the emitted IR and no debug output. So the 2>&1 should not be needed, right?
24	nit: could just be a void function.

check early, continue, fix test nits

llvm/lib/Transforms/Utils/LoopUnroll.cpp
104	Is it possible for It->second to not be a BB here? I'm pretty sure `It->second` can ONLY be a `BasicBlock` here, since the values stored in `VMap` were created by `llvm::CloneBasicBlock` (see `VMap` there). That said, I'm not super confident in my understanding of `dyn_cast` vs `static_cast`. The `VMap` is an instance of `ValueToValueMapTy`, which simply maps `Value` to `Value`. As `Value` is the base class of a lot of stuff in LLVM, we end up storing all kinds of pointers to derived classes in `VMap`. My understanding of when to use `dyn_cast` (or even `dynamic_cast`) vs `static_cast` is that it's always ok to `static_cast` from derived class to base class, but not vice versa. That's why `dyn_cast` or `dynamic_cast` might result in `nullptr` at runtime, and thus need to be checked. Since `It->second` should ALWAYS (IIUC) be a `BasicBlock*` in this case, I would have thought we still need a checked call to `dyn_cast`. Maybe I'm misunderstanding and that it's ok to `static_cast` from base to derived if you're certain that base is always derived?

nickdesaulniers added a reviewer: fhahn.Jul 3 2019, 10:47 AM

Harbormaster completed remote builds in B34303: Diff 207842.Jul 3 2019, 10:48 AM

nickdesaulniers added inline comments.Jul 3 2019, 10:50 AM

llvm/lib/Transforms/Utils/LoopUnroll.cpp
104	Ah, a quick reread of http://llvm.org/docs/ProgrammersManual.html#the-isa-cast-and-dyn-cast-templates makes it seem like `cast<BasicBlock>` should match with what I expect. Brb, enabling assertions in my build...

fhahn added inline comments.Jul 3 2019, 10:58 AM

llvm/lib/Transforms/Utils/LoopUnroll.cpp
104	Yep, `cast<BasicBlock>` will assert that it actually is a basic block with assertions enabled. That should be sufficient cover.

prefer cast<> to dyn_cast<>

nickdesaulniers marked 4 inline comments as done.Jul 3 2019, 11:12 AM

Harbormaster completed remote builds in B34308: Diff 207850.Jul 3 2019, 11:13 AM

LGTM, thanks!

llvm/lib/Transforms/Utils/LoopUnroll.cpp
609	nit: unrelated change?

This revision is now accepted and ready to land.Jul 3 2019, 1:06 PM

nickdesaulniers removed reviewers: void, glider, chandlerc, craig.topper.Jul 3 2019, 1:37 PM

nickdesaulniers added subscribers: craig.topper, chandlerc, glider, void.

Great! Thanks for the code review.

nickdesaulniers marked an inline comment as done.Jul 3 2019, 1:40 PM

nickdesaulniers added inline comments.

llvm/lib/Transforms/Utils/LoopUnroll.cpp
609	Intentional; helps readability to declare as late as possible.

Is this transform safe? The inline asm could stash the address of a destination in a variable in one loop iteration, and use it in a later loop iteration. Or is that not legal?

In D64101#1569218, @efriedma wrote:

Is this transform safe? The inline asm could stash the address of a destination in a variable in one loop iteration, and use it in a later loop iteration. Or is that not legal?

Isn't that an issue in inline assembly regardless of whether or not it's asm goto?
GCC seems to unroll the loop, even when capturing the induction variable:
https://godbolt.org/z/HVUTC4
Should the presence of inline assembly be such an optimization barrier?

If there's some rule that distinguishes blockaddresses used in callbr from general blockaddresses, we should state that explicitly somewhere in LangRef.

Maybe we could make the semantics of callbr a bit more explicit in LangRef?

Currently it only mentions that the only use is to implement asm goto's like in GCC. IIUC, asm goto cannot have outputs and the inputs are explicit, those can be updated while unrolling. According to https://gcc.gnu.org/onlinedocs/gcc/Extended-Asm.html 6.47.2.7 Goto Labels, if the assembly in the asm goto modifies anything, it needs a "memory" clobber. I think this should be handled like regular inline assembly already in LLVM?

it needs a "memory" clobber. I think this should be handled like regular inline assembly already in LLVM?

Neither GCC nor Clang prevent loop unrolling loops containing inline asm w/ "memory" clobber. You can observe this with my godbolt link above and manually add "memory" to the 4 position in the asm statement.

If there's some rule that distinguishes blockaddresses used in callbr from general blockaddresses, we should state that explicitly somewhere in LangRef.

I don't really understand what you're looking for @eli.friedman . Should I add a statment along the lines of "blockaddresses may be rewritten during optimization passes to refer to the address of newly created blocks"?

The question is whether something like the following is legal:

void bar(void) {
    long temp = 0;
    #pragma GCC unroll 3
    for (int j = 0; j < 3; ++j) {
        asm goto("lea %l2(%%rip), %%rax\n"
                 "cmp $0, %0\n"
                 "jne 1f\n"
                 "mov %%rax, %0\n"
                 "1:\n"
                 "jmp *%0\n" :: "m"(temp), "r"(j) :"memory","rax": baz);
        baz:;
    }
}
int main() { bar(); }

I guess the answer is that no, it isn't legal, given gcc's behavior. But there's nothing in LangRef that forbids it... we need some rule that ties the value passed to the "X" constraint to the actual successor.

Or actually, it might make more sense to change the way we generate/lower callbr, to make the label parameters implicit; instead of modeling them with blockaddress, we never write them in IR at all, and automatically generate them later based on the successor list.

In D64101#1569642, @efriedma wrote:

Or actually, it might make more sense to change the way we generate/lower callbr, to make the label parameters implicit; instead of modeling them with blockaddress, we never write them in IR at all, and automatically generate them later based on the successor list.

That's an appealing idea.

We do need to be careful, AFAIK, because the goto targets can be passed into the asm as parameters (where the block addresses come from the labels-as-values extension). However, in theory we already shouldn't unroll loops with blocks with their address taken, so I imagine that case is irrelevant here (*).

(*) LoopUnroll.cpp actually only seems to check Header->hasAddressTaken(), not all of the blocks, so maybe that's wrong?

In D64101#1569682, @hfinkel wrote:

In D64101#1569642, @efriedma wrote:

Or actually, it might make more sense to change the way we generate/lower callbr, to make the label parameters implicit; instead of modeling them with blockaddress, we never write them in IR at all, and automatically generate them later based on the successor list.

That's an appealing idea.

I kind of agree; looking at the arguments to callbr, the label list (the final part in []) feels like duplicate information of the earlier blockaddress operands. The blockaddress operands in the common case SHOULD match or be the address of blocks in the label list, except when you're explicitly passing the address of labels around (see below link). In that first case, why specify explicitly the blockaddress at all? Surely other transforms may update the label list but not the blockaddresses, just like this patch w/ LoopUnroll. I feel like the blockaddress could be implicit based on the label list, and generated as late as possible and only when needed.

That said, I'd really like to land this patch as it addresses an observable and bad bug in the Linux kernel, and I'd like to do so to make the clang-9 release train, rather than try to rearchitect callbr here.

We do need to be careful, AFAIK, because the goto targets can be passed into the asm as parameters (where the block addresses come from the labels-as-values extension). However, in theory we already shouldn't unroll loops with blocks with their address taken, so I imagine that case is irrelevant here (*).

(*) LoopUnroll.cpp actually only seems to check Header->hasAddressTaken(), not all of the blocks, so maybe that's wrong?

Great point. Looking at GCC:
https://godbolt.org/z/lKa_HD

Notice: it does unroll/duplicate the label within the loop when the label is passed in via the final label list in the asm goto. It does not duplicate the address of label passed in, even when the induction variable is captured.

So my patch as it stands in the current revision is incorrect or will differ from GCC; I need to differentiate between is this BlockAddress operand explicitly passed in as an input, or is it from the label list. I don't think it's hard for me to change what I have to at least match GCC. To @eli.friedman 's point; then yes blockaddress is indeed handled differently in this case. Let me fix up the patch and add a test based on the godbolt link.

And thanks to everyone so far for the feedback and discussion.

Here's another interesting case for me to add a test for: https://godbolt.org/z/WwKXaI

The same address being using in the label list AND explicitly passed in as an address of label. Note the difference in behavior there.

In D64101#1574332, @nickdesaulniers wrote:

Here's another interesting case for me to add a test for: https://godbolt.org/z/WwKXaI

The same address being using in the label list AND explicitly passed in as an address of label. Note the difference in behavior there.

Sorry, that test case would rely on https://reviews.llvm.org/D64167 first. ;)

Marking as requiring changes while the comments are being addressed.

That said, I'd really like to land this patch as it addresses an observable and bad bug in the Linux kernel, and I'd like to do so to make the clang-9 release train, rather than try to rearchitect callbr here.

IIUC the patch as is fixes an issue with unrolling asm goto, in cases unrolling is legal. But we need additional checks to prevent unrolling in the cases Eli mentioned. Maybe a safer approach would be to start with a patch to restrict the unrolling of loops with asm goto (that should also fix the bug in the linux kernel, right?) and then allow unrolling for the safe subset of cases where it is legal.

Sorry, that test case would rely on https://reviews.llvm.org/D64167 first. ;)

It's fine to have this patch depend on D64167

This revision now requires changes to proceed.Jul 8 2019, 2:34 PM

In D64101#1574429, @fhahn wrote:

Marking as requiring changes while the comments are being addressed.

That said, I'd really like to land this patch as it addresses an observable and bad bug in the Linux kernel, and I'd like to do so to make the clang-9 release train, rather than try to rearchitect callbr here.

IIUC the patch as is fixes an issue with unrolling asm goto, in cases unrolling is legal. But we need additional checks to prevent unrolling in the cases Eli mentioned. Maybe a safer approach would be to start with a patch to restrict the unrolling of loops with asm goto (that should also fix the bug in the linux kernel, right?) and then allow unrolling for the safe subset of cases where it is legal.

Sure thing, forked off v1 of this patch into: https://reviews.llvm.org/D64368

nickdesaulniers added a parent revision: D64368: [LoopUnroll+LoopUnswitch] do not transform loops containing callbr.Jul 8 2019, 3:02 PM

nickdesaulniers added a parent revision: D64167: [TargetLowering] support BlockAddress as "i" inline asm constraint.

Note to self: this needs to be rebased on top of https://reviews.llvm.org/rL366130 and a test case for LoopUnswitch needs to be added.

nickdesaulniers added a subscriber: nikic.Jul 18 2022, 10:21 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 18 2022, 10:21 AM

Superseded by: https://reviews.llvm.org/D129993

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Utils/

LoopUnroll.cpp

21 lines

test/

Transforms/

LoopUnroll/

callbr.ll

47 lines

Diff 207842

llvm/lib/Transforms/Utils/LoopUnroll.cpp

Show First 20 Lines • Show All 59 Lines • ▼ Show 20 Lines
#else		#else
cl::init(false)		cl::init(false)
#endif		#endif
);		);

/// Convert the instruction operands from referencing the current values into		/// Convert the instruction operands from referencing the current values into
/// those specified by VMap.		/// those specified by VMap.
void llvm::remapInstruction(Instruction *I, ValueToValueMapTy &VMap) {		void llvm::remapInstruction(Instruction *I, ValueToValueMapTy &VMap) {
for (unsigned op = 0, E = I->getNumOperands(); op != E; ++op) {		for (unsigned OpNo = 0, E = I->getNumOperands(); OpNo != E; ++OpNo) {
Value *Op = I->getOperand(op);		Value *Op = I->getOperand(OpNo);

		// If we have a BlockAddress operand where the BasicBlock of the
		// BlockAddress was remapped and the BlockAddress is the operand of a
		// CallBrInst, then we need to update the CallBrInst's operand to be a
		// BlockAddress of the remapped BasicBlock, not the original BasicBlock.
		if (auto *BA = dyn_cast<BlockAddress>(Op)) {
		ValueToValueMapTy::iterator It = VMap.find(BA->getBasicBlock());
		if (It != VMap.end() && isa<CallBrInst>(*I))
		if (auto *NewBB = dyn_cast<BasicBlock>(It->second)) {
		I->setOperand(OpNo, BlockAddress::get(NewBB));
		continue;
		}
		}

// Unwrap arguments of dbg.value intrinsics.		// Unwrap arguments of dbg.value intrinsics.
bool Wrapped = false;		bool Wrapped = false;
if (auto *V = dyn_cast<MetadataAsValue>(Op))		if (auto *V = dyn_cast<MetadataAsValue>(Op))
if (auto *Unwrapped = dyn_cast<ValueAsMetadata>(V->getMetadata())) {		if (auto *Unwrapped = dyn_cast<ValueAsMetadata>(V->getMetadata())) {
Op = Unwrapped->getValue();		Op = Unwrapped->getValue();
Wrapped = true;		Wrapped = true;
}		}

auto wrap = [&](Value *V) {		auto wrap = [&](Value *V) {
auto &C = I->getContext();		auto &C = I->getContext();
return Wrapped ? MetadataAsValue::get(C, ValueAsMetadata::get(V)) : V;		return Wrapped ? MetadataAsValue::get(C, ValueAsMetadata::get(V)) : V;
};		};

ValueToValueMapTy::iterator It = VMap.find(Op);		ValueToValueMapTy::iterator It = VMap.find(Op);
if (It != VMap.end())		if (It != VMap.end())
I->setOperand(op, wrap(It->second));		I->setOperand(OpNo, wrap(It->second));
}		}

		fhahnUnsubmitted Done Reply Inline Actions IIUC, for block addresses, we now set the operand first by the code above and then also by this block here. It might be slightly better to first check for the special case and `continue` in case we hit it. Also, a brief comment why we need special handling here would be helpful IMO. fhahn: IIUC, for block addresses, we now set the operand first by the code above and then also by this…
if (PHINode *PN = dyn_cast<PHINode>(I)) {		if (PHINode *PN = dyn_cast<PHINode>(I)) {
for (unsigned i = 0, e = PN->getNumIncomingValues(); i != e; ++i) {		for (unsigned i = 0, e = PN->getNumIncomingValues(); i != e; ++i) {
ValueToValueMapTy::iterator It = VMap.find(PN->getIncomingBlock(i));		ValueToValueMapTy::iterator It = VMap.find(PN->getIncomingBlock(i));
		nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions Notes to reviewers: let me know if the comments are excessive and I'll remove them. Also note that I manually peeled this loop out from the above in this function. I could place the `isa<CallBrInst>` and rest of this block in there. Just let me know WDYT? nickdesaulniers: Notes to reviewers: let me know if the comments are excessive and I'll remove them. Also note…
		fhahnUnsubmitted Done Reply Inline Actions Is it possible for It->second to not be a BB here? I think if the original operand was a BB, the new one also should be a BB. So this could be a regular cast? fhahn: Is it possible for It->second to not be a BB here? I think if the original operand was a BB…
		nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions Is it possible for It->second to not be a BB here? I'm pretty sure `It->second` can ONLY be a `BasicBlock` here, since the values stored in `VMap` were created by `llvm::CloneBasicBlock` (see `VMap` there). That said, I'm not super confident in my understanding of `dyn_cast` vs `static_cast`. The `VMap` is an instance of `ValueToValueMapTy`, which simply maps `Value` to `Value`. As `Value` is the base class of a lot of stuff in LLVM, we end up storing all kinds of pointers to derived classes in `VMap`. My understanding of when to use `dyn_cast` (or even `dynamic_cast`) vs `static_cast` is that it's always ok to `static_cast` from derived class to base class, but not vice versa. That's why `dyn_cast` or `dynamic_cast` might result in `nullptr` at runtime, and thus need to be checked. Since `It->second` should ALWAYS (IIUC) be a `BasicBlock` in this case, I would have thought we still need a checked call to `dyn_cast`. Maybe I'm misunderstanding and that it's ok to `static_cast` from base to derived if you're certain that base is always derived? nickdesaulniers:* > Is it possible for It->second to not be a BB here? I'm pretty sure `It->second` can ONLY be…
		nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions Ah, a quick reread of http://llvm.org/docs/ProgrammersManual.html#the-isa-cast-and-dyn-cast-templates makes it seem like `cast<BasicBlock>` should match with what I expect. Brb, enabling assertions in my build... nickdesaulniers: Ah, a quick reread of http://llvm.org/docs/ProgrammersManual.html#the-isa-cast-and-dyn-cast…
		fhahnUnsubmitted Done Reply Inline Actions Yep, `cast<BasicBlock>` will assert that it actually is a basic block with assertions enabled. That should be sufficient cover. fhahn: Yep, `cast<BasicBlock>` will assert that it actually is a basic block with assertions enabled.
if (It != VMap.end())		if (It != VMap.end())
PN->setIncomingBlock(i, cast<BasicBlock>(It->second));		PN->setIncomingBlock(i, cast<BasicBlock>(It->second));
		srhinesUnsubmitted Done Reply Inline Actions Fold this line into the if statement for LLVM style. srhines: Fold this line into the if statement for LLVM style.
}		}
}		}
}		}

/// Check if unrolling created a situation where we need to insert phi nodes to		/// Check if unrolling created a situation where we need to insert phi nodes to
/// preserve LCSSA form.		/// preserve LCSSA form.
		nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions Sorry, I think this should be a `dyn_cast` (we're "downcasting" from a Value* to a BasicBlock, which may fail) nickdesaulniers:* Sorry, I think this should be a `dyn_cast` (we're "downcasting" from a Value* to a BasicBlock*…
/// \param Blocks is a vector of basic blocks representing unrolled loop.		/// \param Blocks is a vector of basic blocks representing unrolled loop.
/// \param L is the outer loop.		/// \param L is the outer loop.
/// It's possible that some of the blocks are in L, and some are not. In this		/// It's possible that some of the blocks are in L, and some are not. In this
/// case, if there is a use is outside L, and definition is inside L, we need to		/// case, if there is a use is outside L, and definition is inside L, we need to
/// insert a phi-node, otherwise LCSSA will be broken.		/// insert a phi-node, otherwise LCSSA will be broken.
/// The function is just a helper function for llvm::UnrollLoop that returns		/// The function is just a helper function for llvm::UnrollLoop that returns
/// true if this situation occurs, indicating that LCSSA needs to be fixed.		/// true if this situation occurs, indicating that LCSSA needs to be fixed.
static bool needToInsertPhisForLCSSA(Loop L, std::vector<BasicBlock > Blocks,		static bool needToInsertPhisForLCSSA(Loop L, std::vector<BasicBlock > Blocks,
▲ Show 20 Lines • Show All 424 Lines • ▼ Show 20 Lines	LoopUnrollResult llvm::UnrollLoop(Loop L, UnrollLoopOptions ULO, LoopInfo LI,
} else {		} else {
NumUnrolledWithHeader++;		NumUnrolledWithHeader++;
ContinueOnTrue = L->contains(HeaderBI->getSuccessor(0));		ContinueOnTrue = L->contains(HeaderBI->getSuccessor(0));
LoopExit = HeaderBI->getSuccessor(ContinueOnTrue);		LoopExit = HeaderBI->getSuccessor(ContinueOnTrue);
}		}

// For the first iteration of the loop, we should use the precloned values for		// For the first iteration of the loop, we should use the precloned values for
// PHI nodes. Insert associations now.		// PHI nodes. Insert associations now.
ValueToValueMapTy LastValueMap;
std::vector<PHINode*> OrigPHINode;		std::vector<PHINode*> OrigPHINode;
for (BasicBlock::iterator I = Header->begin(); isa<PHINode>(I); ++I) {		for (BasicBlock::iterator I = Header->begin(); isa<PHINode>(I); ++I) {
OrigPHINode.push_back(cast<PHINode>(I));		OrigPHINode.push_back(cast<PHINode>(I));
}		}

std::vector<BasicBlock *> Headers;		std::vector<BasicBlock *> Headers;
std::vector<BasicBlock *> HeaderSucc;		std::vector<BasicBlock *> HeaderSucc;
std::vector<BasicBlock *> Latches;		std::vector<BasicBlock *> Latches;
Show All 40 Lines	for (BasicBlock *BB : L->getBlocks())
if (NewDIL)		if (NewDIL)
I.setDebugLoc(NewDIL.getValue());		I.setDebugLoc(NewDIL.getValue());
else		else
LLVM_DEBUG(dbgs()		LLVM_DEBUG(dbgs()
<< "Failed to create new discriminator: "		<< "Failed to create new discriminator: "
<< DIL->getFilename() << " Line: " << DIL->getLine());		<< DIL->getFilename() << " Line: " << DIL->getLine());
}		}

		ValueToValueMapTy LastValueMap;
		fhahnUnsubmitted Done Reply Inline Actions nit: unrelated change? fhahn: nit: unrelated change?
		nickdesaulniersAuthorUnsubmitted Done Reply Inline Actions Intentional; helps readability to declare as late as possible. nickdesaulniers: Intentional; helps readability to declare as late as possible.
for (unsigned It = 1; It != ULO.Count; ++It) {		for (unsigned It = 1; It != ULO.Count; ++It) {
std::vector<BasicBlock*> NewBlocks;		std::vector<BasicBlock*> NewBlocks;
SmallDenseMap<const Loop , Loop , 4> NewLoops;		SmallDenseMap<const Loop , Loop , 4> NewLoops;
NewLoops[L] = L;		NewLoops[L] = L;

for (LoopBlocksDFS::RPOIterator BB = BlockBegin; BB != BlockEnd; ++BB) {		for (LoopBlocksDFS::RPOIterator BB = BlockBegin; BB != BlockEnd; ++BB) {
ValueToValueMapTy VMap;		ValueToValueMapTy VMap;
BasicBlock New = CloneBasicBlock(BB, VMap, "." + Twine(It));		BasicBlock New = CloneBasicBlock(BB, VMap, "." + Twine(It));
▲ Show 20 Lines • Show All 368 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopUnroll/callbr.ll

This file was added.

				; RUN: opt -loop-unroll -S -o - %s \| FileCheck %s

				fhahnUnsubmitted Done Reply Inline Actions The test only checks the emitted IR and no debug output. So the 2>&1 should not be needed, right? fhahn: The test only checks the emitted IR and no debug output. So the 2>&1 should not be needed…
				; CHECK-LABEL: if.then:
				; CHECK-NEXT: callbr void asm sideeffect "1: nop\0A\09.quad b, ${0:l}, $$5\0A\09", "X,~{dirflag},~{fpsr},~{flags}"(i8* blockaddress(@d, %l_yes))
				; CHECK-NEXT: to label %asm.fallthrough [label %l_yes]
				; CHECK-LABEL: l_yes:

				; CHECK-LABEL: if.then.1:
				; CHECK-NEXT: callbr void asm sideeffect "1: nop\0A\09.quad b, ${0:l}, $$5\0A\09", "X,~{dirflag},~{fpsr},~{flags}"(i8* blockaddress(@d, %l_yes.1))
				; CHECK-NEXT: to label %asm.fallthrough.1 [label %l_yes.1]
				; CHECK-LABLE: l_yes.1:

				; CHECK-LABEL: if.then.2:
				; CHECK-NEXT: callbr void asm sideeffect "1: nop\0A\09.quad b, ${0:l}, $$5\0A\09", "X,~{dirflag},~{fpsr},~{flags}"(i8* blockaddress(@d, %l_yes.2))
				; CHECK-NEXT: to label %asm.fallthrough.2 [label %l_yes.2]
				; CHECK-LABEL: l_yes.2:

				define dso_local void @d() {
				entry:
				br label %for.body

				for.cond.cleanup: ; preds = %for.inc
				ret void

				fhahnUnsubmitted Done Reply Inline Actions nit: could just be a void function. fhahn: nit: could just be a void function.
				for.body: ; preds = %for.inc, %entry
				%e.04 = phi i32 [ 0, %entry ], [ %inc, %for.inc ]
				%tobool = icmp eq i32 %e.04, 0
				br i1 %tobool, label %for.inc, label %if.then

				if.then: ; preds = %for.body
				callbr void asm sideeffect "1: nop\0A\09.quad b, ${0:l}, $$5\0A\09", "X,~{dirflag},~{fpsr},~{flags}"(i8* blockaddress(@d, %l_yes))
				to label %asm.fallthrough [label %l_yes]

				asm.fallthrough: ; preds = %if.then
				br label %l_yes

				l_yes: ; preds = %asm.fallthrough, %if.then
				%call = tail call i32 (...) @g()
				br label %for.inc

				for.inc: ; preds = %for.body, %l_yes
				%inc = add nuw nsw i32 %e.04, 1
				%exitcond = icmp eq i32 %inc, 3
				br i1 %exitcond, label %for.cond.cleanup, label %for.body
				}

				declare dso_local i32 @g(...) local_unnamed_addr

This is an archive of the discontinued LLVM Phabricator instance.

[LoopUnroll] fix cloning callbrAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 207842

llvm/lib/Transforms/Utils/LoopUnroll.cpp

llvm/test/Transforms/LoopUnroll/callbr.ll

[LoopUnroll] fix cloning callbr
AbandonedPublic