This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/Transforms/Utils/
-
Transforms/
-
Utils/
3/11
CloneFunction.cpp

Differential D33850

Inlining: Don't re-map simplified cloned instructions.
ClosedPublic

Authored by iteratee on Jun 2 2017, 1:53 PM.

Download Raw Diff

Details

Reviewers

rnk
eraman
chandlerc

Commits

rGf73c8a06a977: Inlining: Don't re-map simplified cloned instructions.
rL306495: Inlining: Don't re-map simplified cloned instructions.

Summary

When simplifying an instruction that has been re-mapped, it should never
simplify to an instruction in the original function. In the edge case
where we are inlining a function into itself, the existing code led to
incorrect behavior. It would double-map an instruction: map-simplify-map.
The last map was incorrect, because the mapped instruction simplified to an instruction in the original portion of the function. An incoming value.
mapping this instruction would select the mapped version of this instruction in the newly inlined portion of the function. This wasn't correct, and would create uses not dominated by defs.

Replace the incorrect code with an assert verifying
that we never expect simplification to produce an instruction in the old
function, unless the functions are the same.

Diff Detail

Event Timeline

iteratee created this revision.Jun 2 2017, 1:53 PM

iteratee added a reviewer: eraman.Jun 5 2017, 4:15 PM

From the summary:

When simplifying an instruction that has been re-mapped, it should never

simplify to an instruction in the original function.
This looks reasonable to me

In the edge case

where we are inlining a function into itself, the existing code led to
incorrect behavior.

The SimpleInliner doesn't do recursive inlining AFAIK. I thought there was a check in InlineFunction as well to disallow this, but I don't see it. So I guess a recursive call may get inlined. It is not obvious to me what the incorrect behavior is in that case. Could you please explain what causes the incorrect behavior?

Replace the incorrect code with an assert verifying

that we never expect simplification to produce an instruction in the old
function, unless the functions are the same.

The asserts look reasonable to me. I don't understand the intention of the original code though.

lib/Transforms/Utils/CloneFunction.cpp
457	Unrelated change?

I wrote this, but also have no memory of the intent. I'm pretty sure I was worried about something that could not in fact happen. I'm happy moving to an assert to check that in fact it does not happen. =]

LGTM with the suggestion from Easwaran and myself applied (and without the unrelated change).

lib/Transforms/Utils/CloneFunction.cpp
313	Just use isa<...> here...

This revision is now accepted and ready to land.Jun 6 2017, 5:08 PM

iteratee edited the summary of this revision. (Show Details)Jun 7 2017, 10:51 AM

iteratee added inline comments.

lib/Transforms/Utils/CloneFunction.cpp
457	Not unrelated. If you're inlining a function into itself, end() changes. I can make it a separate patch if you'd like, but I put it in this patch on purpose.

I forgot to mention this: this needs a test case that would fail/crash before the patch. =]

lib/Transforms/Utils/CloneFunction.cpp
457	I'd make it a separate patch (or update the patch description to talk about it). But I also don't think this is quite correct -- even the new version computes the end iterator once and caches it, so I'm not sure how this fixes the issue cited.

This revision now requires changes to proceed.Jun 7 2017, 2:58 PM

iteratee added inline comments.Jun 7 2017, 3:11 PM

lib/Transforms/Utils/CloneFunction.cpp
457	This isn't necessary for correctness, so I'll submit it separately. It removes unnecessary work. What happens is that the blocks are batch cloned and then inserted. None of the new blocks are in the map, so nothing bad happens, but it's easier to reason about that if you don't visit the new blocks in the first place.

No longer needed because of https://reviews.llvm.org/D34017

In D33850#776482, @iteratee wrote:

No longer needed because of https://reviews.llvm.org/D34017

I mean, it may not do *bad* things any more, but it still seems like an improvement? We shouldn't be doing the thing we're doing. Moving to an assert seems better?

https://reviews.llvm.org/D34017 prevents recursive inline from happening. But even with recursive inlining, it should not trigger compiler error. I think this patch is still needed to fix the underlying issue?

Chandler, After r305934 A test for this isn't possible. We should still commit the fix.

This revision now requires changes to proceed.Jun 21 2017, 11:18 AM

Sure, comments inline...

lib/Transforms/Utils/CloneFunction.cpp
313	Still outstanding.
457	I'll note that as currently written, this still walks from begin to end of OldFunc, which I'm pretty sure is what the old code did as well. Maybe I'm missing something, but this looks like a no-op change...

iteratee updated this revision to Diff 103437.Jun 21 2017, 12:02 PM

iteratee edited edge metadata.

iteratee marked 2 inline comments as done.

Herald added a subscriber: sanjoy. · View Herald TranscriptJun 21 2017, 12:02 PM

iteratee added inline comments.Jun 21 2017, 12:02 PM

lib/Transforms/Utils/CloneFunction.cpp
457	I'll pull it out separately, but explain it here.
458	This line makes it a No-Op when we encounter the blocks added below. The goal of the change was to make it so you don't have to find this line and do the reasoning yourself.
463	You're missing this line here. When OldFunc == NewFunc, this line changes End.

sanjoy added inline comments.Jun 21 2017, 11:24 PM

lib/Transforms/Utils/CloneFunction.cpp
314	You should be able to use `cast<>` instead of `dyn_cast<>`.

Use cast instead of dyn_cast

iteratee marked an inline comment as done.Jun 26 2017, 6:58 PM

LGTM, thanks for cleaning this up even after the proximate issue was fixed!

This revision is now accepted and ready to land.Jun 27 2017, 1:36 AM

Closed by commit rL306495: Inlining: Don't re-map simplified cloned instructions. (authored by iteratee). · Explain WhyJun 27 2017, 6:41 PM

This revision was automatically updated to reflect the committed changes.

This actually seem to happen, i.e. we hit this assertion. We have an end-to-end (i.e. C test) that reduces to:

define void @patatino() {
for.cond:
  br label %for.body

for.body:
  %tobool = icmp eq i32 5, 0
  %sel = select i1 %tobool, i32 0, i32 2
  br i1 undef, label %cleanup1.thread, label %cleanup1

cleanup1.thread:
  ret void

cleanup1:
  %cleanup.dest2 = phi i32 [ %sel, %for.body ]
  %switch = icmp ult i32 %cleanup.dest2, 1
  ret void
}

define void @main() {
entry:
  call void @patatino()
  ret void
}

crashing with opt -inline as when we call SimplifyInstruction

%switch = icmp ult i32 %cleanup.dest2, 1

simplifies to

%tobool = icmp eq i32 5, 0

This simplification, if something, is a little peculiar, as I expected that to just be simplified to false.

I looked at this more closely, and I personally don't see anything wrong with the code as-is.
We happen to have an instruction that simplifies to a different instruction in the original function (mainly because instsimplify does some hazy threading over select/phis), so maybe this assertion is too strict?
Kyle, what do you think?

If this got dropped I'm sorry.

This can be reverted.

Revision Contents

Path

Size

lib/

Transforms/

Utils/

CloneFunction.cpp

11 lines

Diff 101273

lib/Transforms/Utils/CloneFunction.cpp

Show First 20 Lines • Show All 304 Lines • ▼ Show 20 Lines	if (!isa<PHINode>(NewInst)) {
RemapInstruction(NewInst, VMap,		RemapInstruction(NewInst, VMap,
ModuleLevelChanges ? RF_None : RF_NoModuleLevelChanges);		ModuleLevelChanges ? RF_None : RF_NoModuleLevelChanges);

// If we can simplify this instruction to some other value, simply add		// If we can simplify this instruction to some other value, simply add
// a mapping to that value rather than inserting a new instruction into		// a mapping to that value rather than inserting a new instruction into
// the basic block.		// the basic block.
if (Value *V =		if (Value *V =
SimplifyInstruction(NewInst, BB->getModule()->getDataLayout())) {		SimplifyInstruction(NewInst, BB->getModule()->getDataLayout())) {
// On the off-chance that this simplifies to an instruction in the old		assert((dyn_cast<Instruction>(V) == nullptr \|\|
		chandlercUnsubmitted Done Reply Inline Actions Just use isa<...> here... chandlerc: Just use isa<...> here...
		chandlercUnsubmitted Done Reply Inline Actions Still outstanding. chandlerc: Still outstanding.
// function, map it back into the new function.		dyn_cast<Instruction>(V)->getParent() == nullptr \|\|
		sanjoyUnsubmitted Done Reply Inline Actions You should be able to use `cast<>` instead of `dyn_cast<>`. sanjoy: You should be able to use `cast<>` instead of `dyn_cast<>`.
if (Value *MappedV = VMap.lookup(V))		dyn_cast<Instruction>(V)->getFunction() != OldFunc \|\|
V = MappedV;		OldFunc == NewFunc) &&
		"Simplified Instruction should not be in the old function.");

if (!NewInst->mayHaveSideEffects()) {		if (!NewInst->mayHaveSideEffects()) {
VMap[&*II] = V;		VMap[&*II] = V;
NewInst->deleteValue();		NewInst->deleteValue();
continue;		continue;
}		}
}		}
}		}
▲ Show 20 Lines • Show All 123 Lines • ▼ Show 20 Lines	#endif
}		}

// Loop over all of the basic blocks in the old function. If the block was		// Loop over all of the basic blocks in the old function. If the block was
// reachable, we have cloned it and the old block is now in the value map:		// reachable, we have cloned it and the old block is now in the value map:
// insert it into the new function in the right order. If not, ignore it.		// insert it into the new function in the right order. If not, ignore it.
//		//
// Defer PHI resolution until rest of function is resolved.		// Defer PHI resolution until rest of function is resolved.
SmallVector<const PHINode*, 16> PHIToResolve;		SmallVector<const PHINode*, 16> PHIToResolve;
for (const BasicBlock &BI : *OldFunc) {		for (const BasicBlock &BI : make_range(OldFunc->begin(), OldFunc->end())) {
		eramanUnsubmitted Not Done Reply Inline Actions Unrelated change? eraman: Unrelated change?
		iterateeAuthorUnsubmitted Not Done Reply Inline Actions Not unrelated. If you're inlining a function into itself, end() changes. I can make it a separate patch if you'd like, but I put it in this patch on purpose. iteratee: Not unrelated. If you're inlining a function into itself, end() changes. I can make it a…
		chandlercUnsubmitted Not Done Reply Inline Actions I'd make it a separate patch (or update the patch description to talk about it). But I also don't think this is quite correct -- even the new version computes the end iterator once and caches it, so I'm not sure how this fixes the issue cited. chandlerc: I'd make it a separate patch (or update the patch description to talk about it). But I also…
		iterateeAuthorUnsubmitted Not Done Reply Inline Actions This isn't necessary for correctness, so I'll submit it separately. It removes unnecessary work. What happens is that the blocks are batch cloned and then inserted. None of the new blocks are in the map, so nothing bad happens, but it's easier to reason about that if you don't visit the new blocks in the first place. iteratee: This isn't necessary for correctness, so I'll submit it separately. It removes unnecessary work.
		chandlercUnsubmitted Not Done Reply Inline Actions I'll note that as currently written, this still walks from begin to end of OldFunc, which I'm pretty sure is what the old code did as well. Maybe I'm missing something, but this looks like a no-op change... chandlerc: I'll note that as currently written, this still walks from begin to end of OldFunc, which I'm…
		iterateeAuthorUnsubmitted Not Done Reply Inline Actions I'll pull it out separately, but explain it here. iteratee: I'll pull it out separately, but explain it here.
Value *V = VMap.lookup(&BI);		Value *V = VMap.lookup(&BI);
		iterateeAuthorUnsubmitted Not Done Reply Inline Actions This line makes it a No-Op when we encounter the blocks added below. The goal of the change was to make it so you don't have to find this line and do the reasoning yourself. iteratee: This line makes it a No-Op when we encounter the blocks added below. The goal of the change…
BasicBlock *NewBB = cast_or_null<BasicBlock>(V);		BasicBlock *NewBB = cast_or_null<BasicBlock>(V);
if (!NewBB) continue; // Dead block.		if (!NewBB) continue; // Dead block.

// Add the new block to the new function.		// Add the new block to the new function.
NewFunc->getBasicBlockList().push_back(NewBB);		NewFunc->getBasicBlockList().push_back(NewBB);
		iterateeAuthorUnsubmitted Not Done Reply Inline Actions You're missing this line here. When OldFunc == NewFunc, this line changes End. iteratee: You're missing this line here. When OldFunc == NewFunc, this line changes End.

// Handle PHI nodes specially, as we have to remove references to dead		// Handle PHI nodes specially, as we have to remove references to dead
// blocks.		// blocks.
for (BasicBlock::const_iterator I = BI.begin(), E = BI.end(); I != E; ++I) {		for (BasicBlock::const_iterator I = BI.begin(), E = BI.end(); I != E; ++I) {
// PHI nodes may have been remapped to non-PHI nodes by the caller or		// PHI nodes may have been remapped to non-PHI nodes by the caller or
// during the cloning process.		// during the cloning process.
if (const PHINode *PN = dyn_cast<PHINode>(I)) {		if (const PHINode *PN = dyn_cast<PHINode>(I)) {
if (isa<PHINode>(VMap[PN]))		if (isa<PHINode>(VMap[PN]))
▲ Show 20 Lines • Show All 334 Lines • Show Last 20 Lines