This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/llvm/IR/
-
llvm/
-
IR/
-
DebugInfoMetadata.h
-
lib/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
-
InstCombineInternal.h
-
InstCombinePHI.cpp
-
test/DebugInfo/Generic/
-
DebugInfo/
-
Generic/
-
instcombine-phi.ll

Differential D26256

[InstCombine] Don't set debug location when folding through a phi node
ClosedPublic

Authored by rob.lougher on Nov 2 2016, 12:29 PM.

Download Raw Diff

Details

Reviewers

dblaikie
danielcdh
probinson
aprantl
wolfgangp

Commits

rGcfd719869848: [InstCombine] Folding of a compare with RHS const should merge debug locations
rGc9f735477695: [InstCombine] Folding of a binop with RHS const should merge the debug locations
rGf02d9b8325b8: [InstCombine] When folding casts through a phi node merge the debug locations
rG373e36a4104e: [InstCombine] Folding loads through a phi node should merge the debug locations
rG8fc1e89bbbdc: [InstCombine] When folding GEP through a phi node merge the debug locations
rG4b0790d488db: [InstCombine] Merge debug locations when folding through a phi node
rG2428a4050f06: [InstCombine] Merge debug locations when folding through a phi node
rL289704: [InstCombine] Folding of a compare with RHS const should merge debug locations
rL289699: [InstCombine] Folding of a binop with RHS const should merge the debug locations
rL289693: [InstCombine] When folding casts through a phi node merge the debug locations
rL289688: [InstCombine] Folding loads through a phi node should merge the debug locations
rL289684: [InstCombine] When folding GEP through a phi node merge the debug locations
rL289681: [InstCombine] Merge debug locations when folding through a phi node
rL289679: [InstCombine] Merge debug locations when folding through a phi node

Summary

Consider the following simple if-then-else:

define i32 @test(i32 %a, i32 %b) !dbg !6 {
entry:
  %tobool = icmp ne i32 %a, 0, !dbg !8
  br i1 %tobool, label %if.then, label %if.else, !dbg !8

if.then:                                          ; preds = %entry
  %call = call i32 @foo(), !dbg !9
  %sub = sub nsw i32 %b, %call, !dbg !10
  br label %if.end, !dbg !11

if.else:                                          ; preds = %entry
  %call1 = call i32 @bar(), !dbg !12
  %sub2 = sub nsw i32 %b, %call1, !dbg !13
  br label %if.end

if.end:                                           ; preds = %if.else, %if.then
  %b.addr.0 = phi i32 [ %sub, %if.then ], [ %sub2, %if.else ]
  ret i32 %b.addr.0, !dbg !14
}

With the source location of the sub instructions described by:

!10 = !DILocation(line: 10, column: 7, scope: !6)
!13 = !DILocation(line: 12, column: 7, scope: !6)

If this is passed to InstCombine, the two sub instructions feeding into the phi node will be combined into a single sub with an incoming phi node:

if.end:                                           ; preds = %if.else, %if.then
  %call.pn = phi i32 [ %call, %if.then ], [ %call1, %if.else ]
  %b.addr.0 = sub nsw i32 %b, %call.pn, !dbg !11
  ret i32 %b.addr.0, !dbg !12

However, the combined sub has been given the debug location of the first instruction:

!11 = !DILocation(line: 10, column: 7, scope: !5)

This is a problem for sample-based PGO as hits on the sub instruction will be counted towards the if-part of the if-then-else even when the else-part was executed (which may lead to incorrect decisions to re-order blocks). It will also affect the optimized debugging experience leading to odd stepping in the debugger.

This patch fixes the issue by removing the line that sets the common instruction debug location (as the location is now ambiguous).

In addition to binary operations, instcombine will also fold various others such as casts, loads and getelementptr through a phi node (in total, InstCombinePHI.cpp contains 7 calls to set the debug location on the combined instruction). The patch removes all of these. A test is included which contains 7 tests that check each of these cases.

OK to commit? Although the situation is the same for each case, I can split the patch and commit each change separately if preferred.

Thanks,
Rob.

Diff Detail

Event Timeline

rob.lougher updated this revision to Diff 76764.Nov 2 2016, 12:29 PM

rob.lougher retitled this revision from to [InstCombine] Don't set debug location when folding through a phi node.

rob.lougher updated this object.

rob.lougher added reviewers: dblaikie, aprantl, danielcdh, wolfgangp, probinson.

rob.lougher added subscribers: llvm-commits, andreadb, gbedwell.

Shouldn't it only drop the location if the two locations are distinct (and perhaps add a discriminator)?

I mentioned this previously: I think it would be great to add an API that goes something like:
DILocation mergeDebugLoc(DILocation A, DILocation B)
that is invoked in cases like this and then decides what to do depending on the two incoming locations.

adrian

In D26256#586214, @aprantl wrote:

Shouldn't it only drop the location if the two locations are distinct (and perhaps add a discriminator)?

Yes, I thought about that. As the two instructions feed into a phi node they are in different basic-blocks, so we're back to talking about the situation where we have an if-then-else all on a single line. In this case we could create a new debug location with a new discriminator (different scope). I don't know the debug info code well enough to do this at the moment...

I mentioned this previously: I think it would be great to add an API that goes something like:
DILocation mergeDebugLoc(DILocation A, DILocation B)
that is invoked in cases like this and then decides what to do depending on the two incoming locations.

That would be nice.

adrian

In D26256#586214, @aprantl wrote:

Shouldn't it only drop the location if the two locations are distinct (and perhaps add a discriminator)?

Sorry, didn't make myself clear in the last comment. As the two instructions feed into a phi node they are in different basic-blocks, so the two locations must be distinct (they are in different scopes and will have different discriminators). But in the case where we have an if-then-else all on the same line we could create a new debug location with a different scope/discriminator.

In D26256#586228, @rob.lougher wrote:

In D26256#586214, @aprantl wrote:

Shouldn't it only drop the location if the two locations are distinct (and perhaps add a discriminator)?

Sorry, didn't make myself clear in the last comment. As the two instructions feed into a phi node they are in different basic-blocks, so the two locations must be distinct (they are in different scopes and will have different discriminators). But in the case where we have an if-then-else all on the same line we could create a new debug location with a different scope/discriminator.

Yes, that sounds reasonable.

I think the single-line situation is not just an edge case and we should handle it correctly. In C++11 it is becoming quite common to have a lot of control flow on a single line (think anything from <algorithm> with lambdas or the ternary operator).

I'm not sure if I understand why you want to add a discriminator here. Basically we should only add a discriminator in the AddDiscriminators.cpp, or when there is code duplication that we want to record.

Dehao

In D26256#586247, @danielcdh wrote:

I'm not sure if I understand why you want to add a discriminator here. Basically we should only add a discriminator in the AddDiscriminators.cpp, or when there is code duplication that we want to record.

Dehao

DWARF specifies the discriminator as a way to distinguish instructions in different blocks that are attributed to the same source location. We are hypothetically starting with two instructions in different blocks with the same source location, so those two instructions ought to have two different discriminators already. We are then creating a new instruction in a third block, and if that instruction preserves the original source location, it ought to have a third discriminator value.

Normally, the two original instructions will have different source locations, in which case erasing the source attribution on the new instruction is the only really viable solution (because we can't attribute the source correctly).

From the implementation point of view, the map from source location to the maximum discriminator that source location has, is only maintained in AddDiscriminators.cpp. So at later optimizations, you cannot find the maximum discriminator for that location, and assign max_discriminator+1 to the moved instruction.

Any updates on this patch?

We actually hit a point that we need this patch to unlock some performance improvements in sample pgo.

Thanks,
Dehao

No update. I looked into handling the single line if-then-else case and concluded it was not easy (as Dehao says at this stage you do not know the maximum discriminator value, without redoing the work of AddDiscriminators). As others want the patch could I add a TODO for now?

Thanks,
Rob.

Looks fairly reasonable to me.

The changes seem independent though, so if you're inclined - you could split it up into one change and one test per commit for easier isolation/archaeology/etc.

(presumably we could do the simple thing of "if they're actually the same location, keep it" - so in a build with discriminators we wouldn't keep it (because they'd have distinct discriminators) but otherwise/in a normal debug build we could keep them)

This revision is now accepted and ready to land.Nov 9 2016, 10:14 AM

SGTM

As Adrian mentioned, for lambda, there is nothing that can be done (or very difficult) here to address the lexical scope issue.

Assuming this only applies to ternary case, it should not affect the debugging for the following case:

#1 v = cond ? stmt1: stmt2;

Because though the debug info for stmt1&2 are removed, as they are in the same line with cond, debugger will still stop at this line as it need to execute "cond".

The debugging experience for the following code will be affected:

#1 v = cond ?
#2 stmt1: stmt2;

Without the patch, the debugger will first stop at line #1, and then step to line #2. With the patch, #2 does not exist any more in debug info, so it will be skipped after line #1. But this does not seem important as stopping at line #2 does not help much during debugging process.

But for the following case:

#1 v = cond ?
#2 stmt1:
#3 stmt2;

This patch will definitely improve debugging experience by avoiding stopping at incorrect location.

Comments?

I would still prefer adding an "mergeDebugLoc" API, even if it only returns an empty DebugLoc, so we:

make it explicit that the debug location is being intentionally dropped and that this code has been audited for correctly updating debug locations (otherwise the code looks no different from a pass that is just not handling debug info at all)
can improve on it later by, e.g., keeping identical DebugLocs

Thanks for all the comments. I'm away for the next couple of days - I'll check back on Monday.

Just a quick note to say I haven't forgotten about this.

My understanding is if we had

#1 v = cond ?
#2 stmt1: stmt2;

Then if an instruction was commoned, we should still have at least one instruction left in the blocks for stmt1 and stmt2? So debugging experience wouldn't be affected as we'll still stop at line 2.

However, from Adrian's last comment, we're now moving away from a purely functional argument, to aesthetics of the code where we want something to say "yes, I've thought about debug info here", even if we simply do nothing.

I'm not against doing this. However, before I submit a revised patch I'd like to run a few things past people.

First, where should a merge API live? I can see two possible places, the DebugLoc or the DILocation class. Get/SetDebugLoc uses DebugLoc, but in DebugLoc.h there's FIXMEs to avoid using it.

Second, assuming we want to merge two locations, the function could be static or non-static:

DILocation mergeDebugLoc(DILocation A, DILocation B)
DILocation merge(DILocation Other)

If we need to merge more than two locations, this can be done iteratively, merging the next one onto the current merged result (hence a non-static function may be better).

As far as revising the patch to use the new API, there is an added complexity. The optimisations are generalised, and the phi node may have any number of input values (not just two as in the example). In this case we will need to iterate over all the incoming values, merging the debug locations together into a single location. In some cases there is already a loop iterating over the values, in other cases a loop will need to be added.

My initial implementation of mergeDebugLoc() would simply return DebugLoc().

Thanks,
Rob.

In D26256#601321, @rob.lougher wrote:

Just a quick note to say I haven't forgotten about this.

My understanding is if we had

#1 v = cond ?
#2 stmt1: stmt2;

Then if an instruction was commoned, we should still have at least one instruction left in the blocks for stmt1 and stmt2? So debugging experience wouldn't be affected as we'll still stop at line 2.

However, from Adrian's last comment, we're now moving away from a purely functional argument, to aesthetics of the code where we want something to say "yes, I've thought about debug info here", even if we simply do nothing.

I'm not against doing this. However, before I submit a revised patch I'd like to run a few things past people.

First, where should a merge API live? I can see two possible places, the DebugLoc or the DILocation class. Get/SetDebugLoc uses DebugLoc, but in DebugLoc.h there's FIXMEs to avoid using it.

For symmetry with DILocation *DILocation::cloneWithDiscriminator(unsigned Discriminator) const DILocation seems like a natural place.

Second, assuming we want to merge two locations, the function could be static or non-static:
DILocation mergeDebugLoc(DILocation A, DILocation B)
DILocation merge(DILocation Other)
If we need to merge more than two locations, this can be done iteratively, merging the next one onto the current merged result (hence a non-static function may be better).

I assume that both versions would be const? If so I think that the verb shouldn't just be merge as it implies that it modifies the object.

At the call site this would look like this:

I.setDebugLoc(DILocation::mergeDebugLoc(LocA, LocB));
I.setDebugLoc(LocA->merge(LocB));
// More variants:
I.setDebugLoc(DILocation::getMergedLocation(LocA, LocB));
I.setDebugLoc(LocA->mergeWith(LocB));
I.setDebugLoc(LocA->cloneMergedLocation(LocB));

Subjectively I like DILocation::getMergedLocation(LocA, LocB) best, but I'm open to other suggestions.

As far as revising the patch to use the new API, there is an added complexity. The optimisations are generalised, and the phi node may have any number of input values (not just two as in the example). In this case we will need to iterate over all the incoming values, merging the debug locations together into a single location. In some cases there is already a loop iterating over the values, in other cases a loop will need to be added.

Thanks for pointing this out I wasn't aware of that. Let me know if it grows impractical.

My initial implementation of mergeDebugLoc() would simply return DebugLoc().

That's fine. Thanks for looking into this!

adrian

Thanks,
Rob.

I have updated the patch to add a new API for merging debug locations. As explained in the comments, the API is currently a stub which simply uses an empy location.

The obvious case to consider is when the two debug locations are the same (e.g. instructions on the same line without discriminators). This could be handled with something like:

static DILocation *getMergedLocation(const DILocation *LocA,
                                      const DILocation *LocB) {
  return LocA.canDiscriminate(LocB) ? nullptr : LocA;
 }

However, canDiscriminate is incomplete as it only checks filename and line, so I decided to leave it as a pure stub to keep the patch simple. A later fix can address this.

Also to keep the patch simple I have avoided refactoring the FoldPHIArgXIntoPHI() functions, opting instead to simply replacing the calls to FirstInst->getDebugLoc() with a call to a helper that returns the merged location for the PHI node.

Please let me know if this revision is OK to commit. As suggested in a previous comment, alhough this is shown as a single patch I'll probably submit it as a series of commits.

Thanks,
Rob.

aprantl mentioned this in D27468: When GVN removes a redundant load, it should not modify the debug location of the dominating load..Dec 6 2016, 12:38 PM

rob.lougher added a child revision: D27590: [SimplifyCFG] In sinkLastInstruction correctly set the debug location of the "common" instruction.Dec 8 2016, 1:41 PM

rob.lougher mentioned this in D27590: [SimplifyCFG] In sinkLastInstruction correctly set the debug location of the "common" instruction.Dec 8 2016, 1:43 PM

Ping. Adrian, are you happy with this revision?

Yes, thank you!

mkuper mentioned this in D27734: [LoopUnroll] Enable PGO-based loop peeling by default.Dec 13 2016, 2:31 PM

Diffusion mentioned this in rL289661: New API for merging debug locations. NFC..Dec 14 2016, 8:24 AM

Closed by commit rL289679: [InstCombine] Merge debug locations when folding through a phi node (authored by rlougher). · Explain WhyDec 14 2016, 9:59 AM

This revision was automatically updated to reflect the committed changes.

Diffusion mentioned this in rL289679: [InstCombine] Merge debug locations when folding through a phi node.

Diffusion mentioned this in rL289681: [InstCombine] Merge debug locations when folding through a phi node.Dec 14 2016, 10:25 AM

Diffusion mentioned this in rL289684: [InstCombine] When folding GEP through a phi node merge the debug locations.Dec 14 2016, 10:48 AM

Diffusion mentioned this in rL289688: [InstCombine] Folding loads through a phi node should merge the debug locations.Dec 14 2016, 11:12 AM

Diffusion mentioned this in rL289693: [InstCombine] When folding casts through a phi node merge the debug locations.Dec 14 2016, 11:34 AM

Diffusion mentioned this in rL289699: [InstCombine] Folding of a binop with RHS const should merge the debug locations.Dec 14 2016, 12:18 PM

Diffusion mentioned this in rL289704: [InstCombine] Folding of a compare with RHS const should merge debug locations.Dec 14 2016, 12:38 PM

andreadb mentioned this in D27804: [SimplifyCFG] Merge debug locations when hoisting an instruction from a then/else branch. NFC..Dec 15 2016, 6:30 AM

Diffusion mentioned this in rL289862: [SimplifyCFG] Merge debug locations when hoisting an instruction from a….Dec 15 2016, 12:11 PM

I should point out that r289661, which is part of the 8 patches, does not show up in the review commit list.

Revision Contents

Path

Size

include/

llvm/

IR/

DebugInfoMetadata.h

18 lines

lib/

Transforms/

InstCombine/

InstCombineInternal.h

4 lines

InstCombinePHI.cpp

30 lines

test/

DebugInfo/

Generic/

instcombine-phi.ll

370 lines

Diff 80445

include/llvm/IR/DebugInfoMetadata.h

Show First 20 Lines • Show All 1,285 Lines • ▼ Show 20 Lines	public:
///		///
/// DWARF discriminators distinguish identical file locations between		/// DWARF discriminators distinguish identical file locations between
/// instructions that are on different basic blocks.		/// instructions that are on different basic blocks.
inline unsigned getDiscriminator() const;		inline unsigned getDiscriminator() const;

/// Returns a new DILocation with updated \p Discriminator.		/// Returns a new DILocation with updated \p Discriminator.
inline DILocation *cloneWithDiscriminator(unsigned Discriminator) const;		inline DILocation *cloneWithDiscriminator(unsigned Discriminator) const;

		/// When two instructions are combined into a single instruction we also
		/// need to combine the original locations into a single location.
		///
		/// When the locations are the same we can use either location. When they
		/// differ, we need a third location which is distinct from either. If
		/// they have the same file/line but have a different discriminator we
		/// could create a location with a new discriminator. If they are from
		/// different files/lines the location is ambiguous and can't be
		/// represented in a single line entry. In this case, no location
		/// should be set.
		///
		/// Currently this function is simply a stub, and no location will be
		/// used for all cases.
		static DILocation getMergedLocation(const DILocation LocA,
		const DILocation *LocB) {
		return nullptr;
		}

Metadata *getRawScope() const { return getOperand(0); }		Metadata *getRawScope() const { return getOperand(0); }
Metadata *getRawInlinedAt() const {		Metadata *getRawInlinedAt() const {
if (getNumOperands() == 2)		if (getNumOperands() == 2)
return getOperand(1);		return getOperand(1);
return nullptr;		return nullptr;
}		}

static bool classof(const Metadata *MD) {		static bool classof(const Metadata *MD) {
▲ Show 20 Lines • Show All 1,240 Lines • Show Last 20 Lines

lib/Transforms/InstCombine/InstCombineInternal.h

Show First 20 Lines • Show All 547 Lines • ▼ Show 20 Lines	private:
/// \brief Try to rotate an operation below a PHI node, using PHI nodes for		/// \brief Try to rotate an operation below a PHI node, using PHI nodes for
/// its operands.		/// its operands.
Instruction *FoldPHIArgOpIntoPHI(PHINode &PN);		Instruction *FoldPHIArgOpIntoPHI(PHINode &PN);
Instruction *FoldPHIArgBinOpIntoPHI(PHINode &PN);		Instruction *FoldPHIArgBinOpIntoPHI(PHINode &PN);
Instruction *FoldPHIArgGEPIntoPHI(PHINode &PN);		Instruction *FoldPHIArgGEPIntoPHI(PHINode &PN);
Instruction *FoldPHIArgLoadIntoPHI(PHINode &PN);		Instruction *FoldPHIArgLoadIntoPHI(PHINode &PN);
Instruction *FoldPHIArgZextsIntoPHI(PHINode &PN);		Instruction *FoldPHIArgZextsIntoPHI(PHINode &PN);

		/// Helper function for FoldPHIArgXIntoPHI() to get debug location for the
		/// folded operation.
		DebugLoc PHIArgMergedDebugLoc(PHINode &PN);

Instruction foldGEPICmp(GEPOperator GEPLHS, Value *RHS,		Instruction foldGEPICmp(GEPOperator GEPLHS, Value *RHS,
ICmpInst::Predicate Cond, Instruction &I);		ICmpInst::Predicate Cond, Instruction &I);
Instruction foldAllocaCmp(ICmpInst &ICI, const AllocaInst Alloca,		Instruction foldAllocaCmp(ICmpInst &ICI, const AllocaInst Alloca,
const Value *Other);		const Value *Other);
Instruction foldCmpLoadFromIndexedGlobal(GetElementPtrInst GEP,		Instruction foldCmpLoadFromIndexedGlobal(GetElementPtrInst GEP,
GlobalVariable *GV, CmpInst &ICI,		GlobalVariable *GV, CmpInst &ICI,
ConstantInt *AndCst = nullptr);		ConstantInt *AndCst = nullptr);
Instruction foldFCmpIntToFPConst(FCmpInst &I, Instruction LHSI,		Instruction foldFCmpIntToFPConst(FCmpInst &I, Instruction LHSI,
▲ Show 20 Lines • Show All 83 Lines • Show Last 20 Lines

lib/Transforms/InstCombine/InstCombinePHI.cpp

Show All 12 Lines

#include "InstCombineInternal.h"		#include "InstCombineInternal.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/Analysis/InstructionSimplify.h"		#include "llvm/Analysis/InstructionSimplify.h"
#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/IR/PatternMatch.h"		#include "llvm/IR/PatternMatch.h"
#include "llvm/Transforms/Utils/Local.h"		#include "llvm/Transforms/Utils/Local.h"
		#include "llvm/IR/DebugInfo.h"
using namespace llvm;		using namespace llvm;
using namespace llvm::PatternMatch;		using namespace llvm::PatternMatch;

#define DEBUG_TYPE "instcombine"		#define DEBUG_TYPE "instcombine"

		/// The PHI arguments will be folded into a single operation with a PHI node
		/// as input. The debug location of the single operation will be the merged
		/// locations of the original PHI node arguments.
		DebugLoc InstCombiner::PHIArgMergedDebugLoc(PHINode &PN) {
		auto *FirstInst = cast<Instruction>(PN.getIncomingValue(0));
		DILocation *Loc = FirstInst->getDebugLoc();

		for (unsigned i = 1; i != PN.getNumIncomingValues(); ++i) {
		auto *I = cast<Instruction>(PN.getIncomingValue(i));
		Loc = DILocation::getMergedLocation(Loc, I->getDebugLoc());
		}

		return Loc;
		}

/// If we have something like phi [add (a,b), add(a,c)] and if a/b/c and the		/// If we have something like phi [add (a,b), add(a,c)] and if a/b/c and the
/// adds all have a single use, turn this into a phi and a single binop.		/// adds all have a single use, turn this into a phi and a single binop.
Instruction *InstCombiner::FoldPHIArgBinOpIntoPHI(PHINode &PN) {		Instruction *InstCombiner::FoldPHIArgBinOpIntoPHI(PHINode &PN) {
Instruction *FirstInst = cast<Instruction>(PN.getIncomingValue(0));		Instruction *FirstInst = cast<Instruction>(PN.getIncomingValue(0));
assert(isa<BinaryOperator>(FirstInst) \|\| isa<CmpInst>(FirstInst));		assert(isa<BinaryOperator>(FirstInst) \|\| isa<CmpInst>(FirstInst));
unsigned Opc = FirstInst->getOpcode();		unsigned Opc = FirstInst->getOpcode();
Value *LHSVal = FirstInst->getOperand(0);		Value *LHSVal = FirstInst->getOperand(0);
Value *RHSVal = FirstInst->getOperand(1);		Value *RHSVal = FirstInst->getOperand(1);
▲ Show 20 Lines • Show All 62 Lines • ▼ Show 20 Lines	for (unsigned i = 1, e = PN.getNumIncomingValues(); i != e; ++i) {
NewRHS->addIncoming(NewInRHS, PN.getIncomingBlock(i));		NewRHS->addIncoming(NewInRHS, PN.getIncomingBlock(i));
}		}
}		}
}		}

if (CmpInst *CIOp = dyn_cast<CmpInst>(FirstInst)) {		if (CmpInst *CIOp = dyn_cast<CmpInst>(FirstInst)) {
CmpInst *NewCI = CmpInst::Create(CIOp->getOpcode(), CIOp->getPredicate(),		CmpInst *NewCI = CmpInst::Create(CIOp->getOpcode(), CIOp->getPredicate(),
LHSVal, RHSVal);		LHSVal, RHSVal);
NewCI->setDebugLoc(FirstInst->getDebugLoc());		NewCI->setDebugLoc(PHIArgMergedDebugLoc(PN));
return NewCI;		return NewCI;
}		}

BinaryOperator *BinOp = cast<BinaryOperator>(FirstInst);		BinaryOperator *BinOp = cast<BinaryOperator>(FirstInst);
BinaryOperator *NewBinOp =		BinaryOperator *NewBinOp =
BinaryOperator::Create(BinOp->getOpcode(), LHSVal, RHSVal);		BinaryOperator::Create(BinOp->getOpcode(), LHSVal, RHSVal);

NewBinOp->copyIRFlags(PN.getIncomingValue(0));		NewBinOp->copyIRFlags(PN.getIncomingValue(0));

for (unsigned i = 1, e = PN.getNumIncomingValues(); i != e; ++i)		for (unsigned i = 1, e = PN.getNumIncomingValues(); i != e; ++i)
NewBinOp->andIRFlags(PN.getIncomingValue(i));		NewBinOp->andIRFlags(PN.getIncomingValue(i));

NewBinOp->setDebugLoc(FirstInst->getDebugLoc());		NewBinOp->setDebugLoc(PHIArgMergedDebugLoc(PN));
return NewBinOp;		return NewBinOp;
}		}

Instruction *InstCombiner::FoldPHIArgGEPIntoPHI(PHINode &PN) {		Instruction *InstCombiner::FoldPHIArgGEPIntoPHI(PHINode &PN) {
GetElementPtrInst *FirstInst =cast<GetElementPtrInst>(PN.getIncomingValue(0));		GetElementPtrInst *FirstInst =cast<GetElementPtrInst>(PN.getIncomingValue(0));

SmallVector<Value*, 16> FixedOperands(FirstInst->op_begin(),		SmallVector<Value*, 16> FixedOperands(FirstInst->op_begin(),
FirstInst->op_end());		FirstInst->op_end());
▲ Show 20 Lines • Show All 92 Lines • ▼ Show 20 Lines	if (HasAnyPHIs) {
}		}
}		}

Value *Base = FixedOperands[0];		Value *Base = FixedOperands[0];
GetElementPtrInst *NewGEP =		GetElementPtrInst *NewGEP =
GetElementPtrInst::Create(FirstInst->getSourceElementType(), Base,		GetElementPtrInst::Create(FirstInst->getSourceElementType(), Base,
makeArrayRef(FixedOperands).slice(1));		makeArrayRef(FixedOperands).slice(1));
if (AllInBounds) NewGEP->setIsInBounds();		if (AllInBounds) NewGEP->setIsInBounds();
NewGEP->setDebugLoc(FirstInst->getDebugLoc());		NewGEP->setDebugLoc(PHIArgMergedDebugLoc(PN));
return NewGEP;		return NewGEP;
}		}


/// Return true if we know that it is safe to sink the load out of the block		/// Return true if we know that it is safe to sink the load out of the block
/// that defines it. This means that it must be obvious the value of the load is		/// that defines it. This means that it must be obvious the value of the load is
/// not changed from the point of the load to the end of the block it is in.		/// not changed from the point of the load to the end of the block it is in.
///		///
▲ Show 20 Lines • Show All 143 Lines • ▼ Show 20 Lines	Instruction *InstCombiner::FoldPHIArgLoadIntoPHI(PHINode &PN) {

// If this was a volatile load that we are merging, make sure to loop through		// If this was a volatile load that we are merging, make sure to loop through
// and mark all the input loads as non-volatile. If we don't do this, we will		// and mark all the input loads as non-volatile. If we don't do this, we will
// insert a new volatile load and the old ones will not be deletable.		// insert a new volatile load and the old ones will not be deletable.
if (isVolatile)		if (isVolatile)
for (Value *IncValue : PN.incoming_values())		for (Value *IncValue : PN.incoming_values())
cast<LoadInst>(IncValue)->setVolatile(false);		cast<LoadInst>(IncValue)->setVolatile(false);

NewLI->setDebugLoc(FirstLI->getDebugLoc());		NewLI->setDebugLoc(PHIArgMergedDebugLoc(PN));
return NewLI;		return NewLI;
}		}

/// TODO: This function could handle other cast types, but then it might		/// TODO: This function could handle other cast types, but then it might
/// require special-casing a cast from the 'i1' type. See the comment in		/// require special-casing a cast from the 'i1' type. See the comment in
/// FoldPHIArgOpIntoPHI() about pessimizing illegal integer types.		/// FoldPHIArgOpIntoPHI() about pessimizing illegal integer types.
Instruction *InstCombiner::FoldPHIArgZextsIntoPHI(PHINode &Phi) {		Instruction *InstCombiner::FoldPHIArgZextsIntoPHI(PHINode &Phi) {
// We cannot create a new instruction after the PHI if the terminator is an		// We cannot create a new instruction after the PHI if the terminator is an
▲ Show 20 Lines • Show All 149 Lines • ▼ Show 20 Lines	if (InVal) {
InsertNewInstBefore(NewPN, PN);		InsertNewInstBefore(NewPN, PN);
PhiVal = NewPN;		PhiVal = NewPN;
}		}

// Insert and return the new operation.		// Insert and return the new operation.
if (CastInst *FirstCI = dyn_cast<CastInst>(FirstInst)) {		if (CastInst *FirstCI = dyn_cast<CastInst>(FirstInst)) {
CastInst *NewCI = CastInst::Create(FirstCI->getOpcode(), PhiVal,		CastInst *NewCI = CastInst::Create(FirstCI->getOpcode(), PhiVal,
PN.getType());		PN.getType());
NewCI->setDebugLoc(FirstInst->getDebugLoc());		NewCI->setDebugLoc(PHIArgMergedDebugLoc(PN));
return NewCI;		return NewCI;
}		}

if (BinaryOperator *BinOp = dyn_cast<BinaryOperator>(FirstInst)) {		if (BinaryOperator *BinOp = dyn_cast<BinaryOperator>(FirstInst)) {
BinOp = BinaryOperator::Create(BinOp->getOpcode(), PhiVal, ConstantOp);		BinOp = BinaryOperator::Create(BinOp->getOpcode(), PhiVal, ConstantOp);
BinOp->copyIRFlags(PN.getIncomingValue(0));		BinOp->copyIRFlags(PN.getIncomingValue(0));

for (unsigned i = 1, e = PN.getNumIncomingValues(); i != e; ++i)		for (unsigned i = 1, e = PN.getNumIncomingValues(); i != e; ++i)
BinOp->andIRFlags(PN.getIncomingValue(i));		BinOp->andIRFlags(PN.getIncomingValue(i));

BinOp->setDebugLoc(FirstInst->getDebugLoc());		BinOp->setDebugLoc(PHIArgMergedDebugLoc(PN));
return BinOp;		return BinOp;
}		}

CmpInst *CIOp = cast<CmpInst>(FirstInst);		CmpInst *CIOp = cast<CmpInst>(FirstInst);
CmpInst *NewCI = CmpInst::Create(CIOp->getOpcode(), CIOp->getPredicate(),		CmpInst *NewCI = CmpInst::Create(CIOp->getOpcode(), CIOp->getPredicate(),
PhiVal, ConstantOp);		PhiVal, ConstantOp);
NewCI->setDebugLoc(FirstInst->getDebugLoc());		NewCI->setDebugLoc(PHIArgMergedDebugLoc(PN));
return NewCI;		return NewCI;
}		}

/// Return true if this PHI node is only used by a PHI node cycle that is dead.		/// Return true if this PHI node is only used by a PHI node cycle that is dead.
static bool DeadPHICycle(PHINode *PN,		static bool DeadPHICycle(PHINode *PN,
SmallPtrSetImpl<PHINode*> &PotentiallyDeadPHIs) {		SmallPtrSetImpl<PHINode*> &PotentiallyDeadPHIs) {
if (PN->use_empty()) return true;		if (PN->use_empty()) return true;
if (!PN->hasOneUse()) return false;		if (!PN->hasOneUse()) return false;
▲ Show 20 Lines • Show All 424 Lines • Show Last 20 Lines

test/DebugInfo/Generic/instcombine-phi.ll

				; RUN: opt -instcombine -S < %s \| FileCheck %s

				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				; If all the operands to a phi node are of the same operation, instcombine
				; will try to pull them through the phi node, combining them into a single
				; operation. Check that when it does this the combined operation does not
				; have a debug location set.

				; Test folding of a binary operation. Generated from source:

				; extern int foo(void);
				; extern int bar(void);
				;
				; int binop(int a, int b) {
				; if(a)
				; b -= foo();
				; else
				; b -= bar();
				; return b;
				; }

				; CHECK: define i32 @binop
				; CHECK-LABEL: if.end:
				; CHECK: %[[PHI:.*]] = phi i32 [ %call, %if.then ], [ %call1, %if.else ]
				; CHECK: sub nsw i32 %b, %[[PHI]]
				; CHECK-NOT: !dbg
				; CHECK: ret i32

				define i32 @binop(i32 %a, i32 %b) !dbg !6 {
				entry:
				%tobool = icmp ne i32 %a, 0, !dbg !8
				br i1 %tobool, label %if.then, label %if.else, !dbg !8

				if.then: ; preds = %entry
				%call = call i32 @foo(), !dbg !9
				%sub = sub nsw i32 %b, %call, !dbg !10
				br label %if.end, !dbg !11

				if.else: ; preds = %entry
				%call1 = call i32 @bar(), !dbg !12
				%sub2 = sub nsw i32 %b, %call1, !dbg !13
				br label %if.end

				if.end: ; preds = %if.else, %if.then
				%b.addr.0 = phi i32 [ %sub, %if.then ], [ %sub2, %if.else ]
				ret i32 %b.addr.0, !dbg !14
				}

				; Test folding of a compare. Generated from source (with editing to
				; common the zext):

				; extern int foo(void);
				; extern int bar(void);
				;
				; int cmp(int a, int b) {
				; int r;
				; if(a)
				; r = foo() < b;
				; else
				; r = bar() < b;
				; return r;
				; }

				; CHECK: define i32 @cmp
				; CHECK-LABEL: if.end:
				; CHECK: %[[PHI:.*]] = phi i32 [ %call, %if.then ], [ %call1, %if.else ]
				; CHECK: icmp slt i32 %[[PHI]], %b
				; CHECK-NOT: !dbg
				; CHECK: ret i32

				define i32 @cmp(i32 %a, i32 %b) !dbg !15 {
				entry:
				%tobool = icmp ne i32 %a, 0, !dbg !16
				br i1 %tobool, label %if.then, label %if.else, !dbg !16

				if.then: ; preds = %entry
				%call = call i32 @foo(), !dbg !17
				%cmp = icmp slt i32 %call, %b, !dbg !18
				br label %if.end, !dbg !19

				if.else: ; preds = %entry
				%call1 = call i32 @bar(), !dbg !20
				%cmp2 = icmp slt i32 %call1, %b, !dbg !21
				br label %if.end

				if.end: ; preds = %if.else, %if.then
				%r.0 = phi i1 [ %cmp, %if.then ], [ %cmp2, %if.else ]
				%conv = zext i1 %r.0 to i32
				ret i32 %conv, !dbg !22
				}

				; Test folding of getelementptr. Generated from source:

				; extern long long foo2(void);
				; extern long long bar2(void);
				;
				; int gep(int a, int b) {
				; int *r;
				; if(a)
				; r = &b[foo2()];
				; else
				; r = &b[bar2()];
				; return p;
				; }

				; CHECK: define i32* @gep
				; CHECK-LABEL: if.end:
				; CHECK: %[[PHI:.*]] = phi i64 [ %call, %if.then ], [ %call1, %if.else ]
				; CHECK: getelementptr inbounds i32, i32* %b, i64 %[[PHI]]
				; CHECK-NOT: !dbg
				; CHECK: ret i32*

				define i32* @gep(i32 %a, i32* %b) !dbg !23 {
				entry:
				%tobool = icmp ne i32 %a, 0, !dbg !24
				br i1 %tobool, label %if.then, label %if.else, !dbg !24

				if.then: ; preds = %entry
				%call = call i64 @foo2(), !dbg !25
				%arrayidx = getelementptr inbounds i32, i32* %b, i64 %call, !dbg !26
				br label %if.end, !dbg !27

				if.else: ; preds = %entry
				%call1 = call i64 @bar2(), !dbg !28
				%arrayidx2 = getelementptr inbounds i32, i32* %b, i64 %call1, !dbg !29
				br label %if.end

				if.end: ; preds = %if.else, %if.then
				%r.0 = phi i32* [ %arrayidx, %if.then ], [ %arrayidx2, %if.else ]
				ret i32* %r.0, !dbg !30
				}

				; Test folding of load. Generated from source:

				; extern int *foo3(void);
				; extern int *bar3(void);
				;
				; int load(int a) {
				; int r;
				; if(a)
				; r = *foo3();
				; else
				; r = *bar3();
				; return r;
				; }

				; CHECK: define i32 @load
				; CHECK-LABEL: if.end:
				; CHECK: %[[PHI:.]] = phi i32 [ %call, %if.then ], [ %call1, %if.else ]
				; CHECK: load i32, i32* %[[PHI]]
				; CHECK-NOT: !dbg
				; CHECK: ret i32

				define i32 @load(i32 %a) !dbg !31 {
				entry:
				%tobool = icmp ne i32 %a, 0, !dbg !32
				br i1 %tobool, label %if.then, label %if.else, !dbg !32

				if.then: ; preds = %entry
				%call = call i32* @foo3(), !dbg !33
				%0 = load i32, i32* %call, align 4, !dbg !34
				br label %if.end, !dbg !35

				if.else: ; preds = %entry
				%call1 = call i32* @bar3(), !dbg !36
				%1 = load i32, i32* %call1, align 4, !dbg !37
				br label %if.end

				if.end: ; preds = %if.else, %if.then
				%r.0 = phi i32 [ %0, %if.then ], [ %1, %if.else ]
				ret i32 %r.0, !dbg !38
				}

				; Test folding of a cast. Generated from source:

				; extern int foo(void);
				; extern int bar(void);
				;
				; long long cast(int a) {
				; long long r;
				; if(a)
				; r = foo();
				; else
				; r = bar();
				; return r;
				; }

				; CHECK: define i64 @cast
				; CHECK-LABEL: if.end:
				; CHECK: %[[PHI:.*]] = phi i32 [ %call, %if.then ], [ %call1, %if.else ]
				; CHECK: sext i32 %[[PHI]] to i64
				; CHECK-NOT: !dbg
				; CHECK: ret i64

				define i64 @cast(i32 %a) !dbg !39 {
				entry:
				%tobool = icmp ne i32 %a, 0, !dbg !40
				br i1 %tobool, label %if.then, label %if.else, !dbg !40

				if.then: ; preds = %entry
				%call = call i32 @foo(), !dbg !41
				%conv = sext i32 %call to i64, !dbg !41
				br label %if.end, !dbg !42

				if.else: ; preds = %entry
				%call1 = call i32 @bar(), !dbg !43
				%conv2 = sext i32 %call1 to i64, !dbg !43
				br label %if.end

				if.end: ; preds = %if.else, %if.then
				%r.0 = phi i64 [ %conv, %if.then ], [ %conv2, %if.else ]
				ret i64 %r.0, !dbg !44
				}

				; Test folding of a binary op with a RHS constant. Generated from source:

				; extern int foo(void);
				; extern int bar(void);
				;
				; int binop_const(int a) {
				; int r;
				; if(a)
				; r = foo() - 5;
				; else
				; r = bar() - 5;
				; return r;
				; }

				; CHECK: define i32 @binop_const
				; CHECK-LABEL: if.end:
				; CHECK: %[[PHI:.*]] = phi i32 [ %call, %if.then ], [ %call1, %if.else ]
				; CHECK: add nsw i32 %[[PHI]], -5
				; CHECK-NOT: !dbg
				; CHECK: ret i32

				define i32 @binop_const(i32 %a) !dbg !45 {
				entry:
				%tobool = icmp ne i32 %a, 0, !dbg !46
				br i1 %tobool, label %if.then, label %if.else, !dbg !46

				if.then: ; preds = %entry
				%call = call i32 @foo(), !dbg !47
				%sub = sub nsw i32 %call, 5, !dbg !48
				br label %if.end, !dbg !49

				if.else: ; preds = %entry
				%call1 = call i32 @bar(), !dbg !50
				%sub2 = sub nsw i32 %call1, 5, !dbg !51
				br label %if.end

				if.end: ; preds = %if.else, %if.then
				%r.0 = phi i32 [ %sub, %if.then ], [ %sub2, %if.else ]
				ret i32 %r.0, !dbg !52
				}

				; Test folding of a compare with RHS constant. Generated from source (with
				; editing to common the zext):

				; extern int foo(void);
				; extern int bar(void);
				;
				; int cmp_const(int a) {
				; int r;
				; if(a)
				; r = foo() < 10;
				; else
				; r = bar() < 10;
				; return r;
				; }

				; CHECK: define i32 @cmp_const
				; CHECK-LABEL: if.end:
				; CHECK: %[[PHI:.*]] = phi i32 [ %call, %if.then ], [ %call1, %if.else ]
				; CHECK: icmp slt i32 %[[PHI]], 10
				; CHECK-NOT: !dbg
				; CHECK: ret i32

				define i32 @cmp_const(i32 %a) !dbg !53 {
				entry:
				%tobool = icmp ne i32 %a, 0, !dbg !54
				br i1 %tobool, label %if.then, label %if.else, !dbg !54

				if.then: ; preds = %entry
				%call = call i32 @foo(), !dbg !55
				%cmp = icmp slt i32 %call, 10, !dbg !56
				br label %if.end, !dbg !57

				if.else: ; preds = %entry
				%call1 = call i32 @bar(), !dbg !58
				%cmp2 = icmp slt i32 %call1, 10, !dbg !59
				br label %if.end

				if.end: ; preds = %if.else, %if.then
				%r.0 = phi i1 [ %cmp, %if.then ], [ %cmp2, %if.else ]
				%conv = zext i1 %r.0 to i32
				ret i32 %conv, !dbg !60
				}

				declare i32 @foo()
				declare i32 @bar()
				declare i64 @foo2()
				declare i64 @bar2()
				declare i32* @foo3()
				declare i32* @bar3()

				!llvm.dbg.cu = !{!0}
				!llvm.module.flags = !{!3, !4}

				!0 = distinct !DICompileUnit(language: DW_LANG_C99, file: !1, producer: "", isOptimized: false, runtimeVersion: 0, emissionKind: LineTablesOnly, enums: !2)
				!1 = !DIFile(filename: "test.c", directory: "")
				!2 = !{}
				!3 = !{i32 2, !"Dwarf Version", i32 4}
				!4 = !{i32 2, !"Debug Info Version", i32 3}
				!6 = distinct !DISubprogram(name: "binop", scope: !1, file: !1, line: 8, type: !7, isLocal: false, isDefinition: true, scopeLine: 8, flags: DIFlagPrototyped, isOptimized: false, unit: !0, variables: !2)
				!7 = !DISubroutineType(types: !2)
				!8 = !DILocation(line: 9, column: 6, scope: !6)
				!9 = !DILocation(line: 10, column: 10, scope: !6)
				!10 = !DILocation(line: 10, column: 7, scope: !6)
				!11 = !DILocation(line: 10, column: 5, scope: !6)
				!12 = !DILocation(line: 12, column: 10, scope: !6)
				!13 = !DILocation(line: 12, column: 7, scope: !6)
				!14 = !DILocation(line: 13, column: 3, scope: !6)
				!15 = distinct !DISubprogram(name: "binop_cmp", scope: !1, file: !1, line: 16, type: !7, isLocal: false, isDefinition: true, scopeLine: 16, flags: DIFlagPrototyped, isOptimized: false, unit: !0, variables: !2)
				!16 = !DILocation(line: 18, column: 6, scope: !15)
				!17 = !DILocation(line: 19, column: 9, scope: !15)
				!18 = !DILocation(line: 19, column: 15, scope: !15)
				!19 = !DILocation(line: 19, column: 5, scope: !15)
				!20 = !DILocation(line: 21, column: 9, scope: !15)
				!21 = !DILocation(line: 21, column: 15, scope: !15)
				!22 = !DILocation(line: 22, column: 3, scope: !15)
				!23 = distinct !DISubprogram(name: "gep", scope: !1, file: !1, line: 25, type: !7, isLocal: false, isDefinition: true, scopeLine: 25, flags: DIFlagPrototyped, isOptimized: false, unit: !0, variables: !2)
				!24 = !DILocation(line: 27, column: 6, scope: !23)
				!25 = !DILocation(line: 28, column: 12, scope: !23)
				!26 = !DILocation(line: 28, column: 10, scope: !23)
				!27 = !DILocation(line: 28, column: 5, scope: !23)
				!28 = !DILocation(line: 30, column: 12, scope: !23)
				!29 = !DILocation(line: 30, column: 10, scope: !23)
				!30 = !DILocation(line: 31, column: 3, scope: !23)
				!31 = distinct !DISubprogram(name: "load", scope: !1, file: !1, line: 34, type: !7, isLocal: false, isDefinition: true, scopeLine: 34, flags: DIFlagPrototyped, isOptimized: false, unit: !0, variables: !2)
				!32 = !DILocation(line: 36, column: 6, scope: !31)
				!33 = !DILocation(line: 37, column: 10, scope: !31)
				!34 = !DILocation(line: 37, column: 9, scope: !31)
				!35 = !DILocation(line: 37, column: 5, scope: !31)
				!36 = !DILocation(line: 39, column: 10, scope: !31)
				!37 = !DILocation(line: 39, column: 9, scope: !31)
				!38 = !DILocation(line: 40, column: 3, scope: !31)
				!39 = distinct !DISubprogram(name: "op_cast", scope: !1, file: !1, line: 43, type: !7, isLocal: false, isDefinition: true, scopeLine: 43, flags: DIFlagPrototyped, isOptimized: false, unit: !0, variables: !2)
				!40 = !DILocation(line: 45, column: 6, scope: !39)
				!41 = !DILocation(line: 46, column: 9, scope: !39)
				!42 = !DILocation(line: 46, column: 5, scope: !39)
				!43 = !DILocation(line: 48, column: 9, scope: !39)
				!44 = !DILocation(line: 49, column: 3, scope: !39)
				!45 = distinct !DISubprogram(name: "op", scope: !1, file: !1, line: 52, type: !7, isLocal: false, isDefinition: true, scopeLine: 52, flags: DIFlagPrototyped, isOptimized: false, unit: !0, variables: !2)
				!46 = !DILocation(line: 54, column: 6, scope: !45)
				!47 = !DILocation(line: 55, column: 9, scope: !45)
				!48 = !DILocation(line: 55, column: 15, scope: !45)
				!49 = !DILocation(line: 55, column: 5, scope: !45)
				!50 = !DILocation(line: 57, column: 9, scope: !45)
				!51 = !DILocation(line: 57, column: 15, scope: !45)
				!52 = !DILocation(line: 58, column: 3, scope: !45)
				!53 = distinct !DISubprogram(name: "op_cmp", scope: !1, file: !1, line: 61, type: !7, isLocal: false, isDefinition: true, scopeLine: 61, flags: DIFlagPrototyped, isOptimized: false, unit: !0, variables: !2)
				!54 = !DILocation(line: 63, column: 6, scope: !53)
				!55 = !DILocation(line: 64, column: 9, scope: !53)
				!56 = !DILocation(line: 64, column: 15, scope: !53)
				!57 = !DILocation(line: 64, column: 5, scope: !53)
				!58 = !DILocation(line: 66, column: 9, scope: !53)
				!59 = !DILocation(line: 66, column: 15, scope: !53)
				!60 = !DILocation(line: 67, column: 3, scope: !53)

This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] Don't set debug location when folding through a phi nodeClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 80445

include/llvm/IR/DebugInfoMetadata.h

lib/Transforms/InstCombine/InstCombineInternal.h

lib/Transforms/InstCombine/InstCombinePHI.cpp

test/DebugInfo/Generic/instcombine-phi.ll

[InstCombine] Don't set debug location when folding through a phi node
ClosedPublic