This is an archive of the discontinued LLVM Phabricator instance.

[DebugInfo][CGP] Update dbg.values when updating memory address computations
ClosedPublic

Authored by jmorse on Feb 19 2019, 1:15 PM.

Download Raw Diff

Details

Reviewers

aprantl
bjope
vsk

Commits

rGc93a9b15ce88: [DebugInfo][CGP] Update dbg.values when sinking address computations

Summary

One of CodeGenPrepare's optimisations is to duplicate address calculations into basic blocks, so that as much information as possible can be folded into memory addressing operands. This is great -- but the dbg.value variable location intrinsics are not updated in the same way. This can lead to dbg.values referring to address computations in other blocks that will never be encoded into the DAG, while duplicate address computations are performed locally that could be used by the dbg.value. Some of these (such as non-constant-offset GEPs) can't be salvaged past.

Fix this by, whenever we duplicate an address computation into a block, looking for dbg.value users of the original memory address in the same block, and redirecting those to the local computation.

Diff Detail

Event Timeline

jmorse created this revision.Feb 19 2019, 1:15 PM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 19 2019, 1:15 PM

Herald added subscribers: llvm-commits, jdoerfert. · View Herald Transcript

aprantl accepted this revision.Feb 19 2019, 1:45 PM

This revision is now accepted and ready to land.Feb 19 2019, 1:45 PM

bjope added inline comments.Feb 20 2019, 12:17 AM

lib/CodeGen/CodeGenPrepare.cpp
4946	What happens if there is an earlier dbg-use of Repl (before MemoryInst) in this BB. Is there a risk that we introduce a dbguse-before-def scenario here? Maybe we need to check if MemoryInst dominates User here? Or is such input IR unlikely in reality? There seems to be lots of cases above (for example reusing an earlier computation for SunkAddr), so maybe the important check is that SunkAddr dominates User. But as you understand from the question marks above, I haven't figured out if that always is guaranteed here. Notice that I do not think about scenarios when the input IR already is "broken" (dgb-use before def) here. Just some worrying that we might introduce dbg-use-before-def here. Actually the test case that you added has the dbg.value before the load in the BB that we sink the address computation into (the scenario I'm talking about). Isn't the address computation added just before the load? Is perhaps the test case working due to placeDbgValues moving the dbg.value?

jmorse added a child revision: D58453: [DebugInfo][CGP] Limit placeDbgValues movement of dbg.value intrinsics.Feb 20 2019, 8:23 AM

It turns out the validity of this change relies on placeDbgValues reordering (curses), a small amount of extra juggling will be required.

lib/CodeGen/CodeGenPrepare.cpp
4946	Actually the test case that you added has the dbg.value before the load in the BB that we sink the address computation into (the scenario I'm talking about). Isn't the address computation added just before the load? Is perhaps the test case working due to placeDbgValues moving the dbg.value? Aha, you've got it in one there, I was inadvertently relying on placeDbgValues. I'd placed the dbg.value at the top of the block and assumed optimizeMemoryInst did-the-right-thing because the duplicated memory insts appeared above the dbg.value. (Should be easily fixable).

Avoid relying on placeDbgValues for this change. We already walk (forwards) through all instructions in a block looking to optimise them, add a dbg.value visitor that rewrites the dbg.value operand if it refers to a sunk address computation.

This works because CGP already keeps a cache of sunk addresses for each block, and we walk forwards through the block (which optimizeMemoryInst relies on anyway). Thus, we will only rewrite dbg.value operands for instructions after the sunk memory computation has been produced.

The test case will need to be updated when placeDbgValues gets limited: the first dbg.value will no longer be hoisted up a block, and the second will no longer go up one inst.

This revision is now accepted and ready to land.Mar 1 2019, 7:09 AM

Closed by commit rGc93a9b15ce88: [DebugInfo][CGP] Update dbg.values when sinking address computations (authored by jmorse). · Explain WhyDec 6 2019, 3:34 AM

This revision was automatically updated to reflect the committed changes.

Herald added a subscriber: hiraditya. · View Herald TranscriptDec 6 2019, 3:34 AM

Revision Contents

Path

Size

lib/

CodeGen/

CodeGenPrepare.cpp

25 lines

test/

DebugInfo/

Generic/

codegenprep-addrsink.ll

70 lines

Diff 188909

lib/CodeGen/CodeGenPrepare.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 353 Lines • ▼ Show 20 Lines	private:
bool optimizeExt(Instruction *&I);		bool optimizeExt(Instruction *&I);
bool optimizeExtUses(Instruction *I);		bool optimizeExtUses(Instruction *I);
bool optimizeLoadExt(LoadInst *Load);		bool optimizeLoadExt(LoadInst *Load);
bool optimizeSelectInst(SelectInst *SI);		bool optimizeSelectInst(SelectInst *SI);
bool optimizeShuffleVectorInst(ShuffleVectorInst *SVI);		bool optimizeShuffleVectorInst(ShuffleVectorInst *SVI);
bool optimizeSwitchInst(SwitchInst *SI);		bool optimizeSwitchInst(SwitchInst *SI);
bool optimizeExtractElementInst(Instruction *Inst);		bool optimizeExtractElementInst(Instruction *Inst);
bool dupRetToEnableTailCallOpts(BasicBlock *BB);		bool dupRetToEnableTailCallOpts(BasicBlock *BB);
		bool fixupDbgValue(Instruction *I);
bool placeDbgValues(Function &F);		bool placeDbgValues(Function &F);
bool canFormExtLd(const SmallVectorImpl<Instruction *> &MovedExts,		bool canFormExtLd(const SmallVectorImpl<Instruction *> &MovedExts,
LoadInst &LI, Instruction &Inst, bool HasPromoted);		LoadInst &LI, Instruction &Inst, bool HasPromoted);
bool tryToPromoteExts(TypePromotionTransaction &TPT,		bool tryToPromoteExts(TypePromotionTransaction &TPT,
const SmallVectorImpl<Instruction *> &Exts,		const SmallVectorImpl<Instruction *> &Exts,
SmallVectorImpl<Instruction *> &ProfitablyMovedExts,		SmallVectorImpl<Instruction *> &ProfitablyMovedExts,
unsigned CreatedInstsCost = 0);		unsigned CreatedInstsCost = 0);
bool mergeSExts(Function &F);		bool mergeSExts(Function &F);
▲ Show 20 Lines • Show All 1,481 Lines • ▼ Show 20 Lines	case Intrinsic::strip_invariant_group: {
II->replaceAllUsesWith(ArgVal);		II->replaceAllUsesWith(ArgVal);
II->eraseFromParent();		II->eraseFromParent();
return true;		return true;
}		}
case Intrinsic::cttz:		case Intrinsic::cttz:
case Intrinsic::ctlz:		case Intrinsic::ctlz:
// If counting zeros is expensive, try to avoid it.		// If counting zeros is expensive, try to avoid it.
return despeculateCountZeros(II, TLI, DL, ModifiedDT);		return despeculateCountZeros(II, TLI, DL, ModifiedDT);
		case Intrinsic::dbg_value:
		return fixupDbgValue(II);
}		}

if (TLI) {		if (TLI) {
SmallVector<Value*, 2> PtrOps;		SmallVector<Value*, 2> PtrOps;
Type *AccessTy;		Type *AccessTy;
if (TLI->getAddrModeArguments(II, PtrOps, AccessTy))		if (TLI->getAddrModeArguments(II, PtrOps, AccessTy))
while (!PtrOps.empty()) {		while (!PtrOps.empty()) {
Value *PtrVal = PtrOps.pop_back_val();		Value *PtrVal = PtrOps.pop_back_val();
▲ Show 20 Lines • Show All 3,068 Lines • ▼ Show 20 Lines	bool CodeGenPrepare::optimizeMemoryInst(Instruction MemoryInst, Value Addr,
SunkAddrs[Addr] = WeakTrackingVH(SunkAddr);		SunkAddrs[Addr] = WeakTrackingVH(SunkAddr);

// If we have no uses, recursively delete the value and all dead instructions		// If we have no uses, recursively delete the value and all dead instructions
// using it.		// using it.
if (Repl->use_empty()) {		if (Repl->use_empty()) {
// This can cause recursive deletion, which can invalidate our iterator.		// This can cause recursive deletion, which can invalidate our iterator.
// Use a WeakTrackingVH to hold onto it in case this happens.		// Use a WeakTrackingVH to hold onto it in case this happens.
Value CurValue = &CurInstIterator;		Value CurValue = &CurInstIterator;
WeakTrackingVH IterHandle(CurValue);		WeakTrackingVH IterHandle(CurValue);
		bjopeUnsubmitted Not Done Reply Inline Actions What happens if there is an earlier dbg-use of Repl (before MemoryInst) in this BB. Is there a risk that we introduce a dbguse-before-def scenario here? Maybe we need to check if MemoryInst dominates User here? Or is such input IR unlikely in reality? There seems to be lots of cases above (for example reusing an earlier computation for SunkAddr), so maybe the important check is that SunkAddr dominates User. But as you understand from the question marks above, I haven't figured out if that always is guaranteed here. Notice that I do not think about scenarios when the input IR already is "broken" (dgb-use before def) here. Just some worrying that we might introduce dbg-use-before-def here. Actually the test case that you added has the dbg.value before the load in the BB that we sink the address computation into (the scenario I'm talking about). Isn't the address computation added just before the load? Is perhaps the test case working due to placeDbgValues moving the dbg.value? bjope: What happens if there is an earlier dbg-use of Repl (before MemoryInst) in this BB. Is there a…
		jmorseAuthorUnsubmitted Done Reply Inline Actions Actually the test case that you added has the dbg.value before the load in the BB that we sink the address computation into (the scenario I'm talking about). Isn't the address computation added just before the load? Is perhaps the test case working due to placeDbgValues moving the dbg.value? Aha, you've got it in one there, I was inadvertently relying on placeDbgValues. I'd placed the dbg.value at the top of the block and assumed optimizeMemoryInst did-the-right-thing because the duplicated memory insts appeared above the dbg.value. (Should be easily fixable). jmorse: > Actually the test case that you added has the dbg.value before the load in the BB that we…
BasicBlock *BB = CurInstIterator->getParent();		BasicBlock *BB = CurInstIterator->getParent();

RecursivelyDeleteTriviallyDeadInstructions(Repl, TLInfo);		RecursivelyDeleteTriviallyDeadInstructions(Repl, TLInfo);

if (IterHandle != CurValue) {		if (IterHandle != CurValue) {
// If the iterator instruction was recursively deleted, start over at the		// If the iterator instruction was recursively deleted, start over at the
// start of the block.		// start of the block.
CurInstIterator = BB->begin();		CurInstIterator = BB->begin();
▲ Show 20 Lines • Show All 2,024 Lines • ▼ Show 20 Lines	for (auto &I : reverse(BB)) {
}		}
}		}
}		}
MadeChange \|= dupRetToEnableTailCallOpts(&BB);		MadeChange \|= dupRetToEnableTailCallOpts(&BB);

return MadeChange;		return MadeChange;
}		}

		// Some CGP optimizations may move or alter what's computed in a block. Check
		// whether a dbg.value intrinsic could be pointed at a more appropriate operand.
		bool CodeGenPrepare::fixupDbgValue(Instruction *I) {
		assert(isa<DbgValueInst>(I));
		DbgValueInst &DVI = *cast<DbgValueInst>(I);

		// Does this dbg.value refer to a sunk address calculation?
		Value *Location = DVI.getVariableLocation();
		WeakTrackingVH SunkAddrVH = SunkAddrs[Location];
		Value *SunkAddr = SunkAddrVH.pointsToAliveValue() ? SunkAddrVH : nullptr;
		if (SunkAddr) {
		// Point dbg.value at locally computed address, which should give the best
		// opportunity to be accurately lowered. This update may change the type of
		// pointer being referred to; however this makes no difference to debugging
		// information, and we can't generate bitcasts that may affect codegen.
		DVI.setOperand(0, MetadataAsValue::get(DVI.getContext(),
		ValueAsMetadata::get(SunkAddr)));
		return true;
		}
		return false;
		}

// llvm.dbg.value is far away from the value then iSel may not be able		// llvm.dbg.value is far away from the value then iSel may not be able
// handle it properly. iSel will drop llvm.dbg.value if it can not		// handle it properly. iSel will drop llvm.dbg.value if it can not
// find a node corresponding to the value.		// find a node corresponding to the value.
bool CodeGenPrepare::placeDbgValues(Function &F) {		bool CodeGenPrepare::placeDbgValues(Function &F) {
bool MadeChange = false;		bool MadeChange = false;
for (BasicBlock &BB : F) {		for (BasicBlock &BB : F) {
Instruction *PrevNonDbgInst = nullptr;		Instruction *PrevNonDbgInst = nullptr;
for (BasicBlock::iterator BI = BB.begin(), BE = BB.end(); BI != BE;) {		for (BasicBlock::iterator BI = BB.begin(), BE = BB.end(); BI != BE;) {
▲ Show 20 Lines • Show All 231 Lines • Show Last 20 Lines

test/DebugInfo/Generic/codegenprep-addrsink.ll

This file was added.

				; RUN: llc -start-before=codegenprepare -stop-after=codegenprepare %s -o - \| FileCheck %s
				;
				; CGP duplicates address calculation into each basic block that contains loads
				; or stores, so that they can be folded into instruction memory operands for
				; example. dbg.value's should be redirected to identify such local address
				; computations, to give the best opportunity for variable locations to be
				; preserved.
				; This test has two dbg.values in it, one before and one after the relevant
				; memory instruction. Test that the one before does _not_ get updated (as that
				; would either make it use-before-def or shift when the variable appears), and
				; that the dbg.value after the memory instruction does get updated.
				;
				; Due to placeDbgValues, the dbg.values currently get shifted up a few
				; instructions.

				define dso_local i8 @foo(i32 *%p, i32 %cond) !dbg !7 {
				entry:
				; The first dbg.value of %arith, in the 'next' block, will be moved up here
				; by placeDbgValues,
				; CHECK-LABEL: entry:
				; CHECK: dbg.value(metadata i8* %arith, metadata ![[DIVAR:[0-9]+]],
				; CHECK-SAME: metadata !DIExpression()
				%casted = bitcast i32 %p to i8
				%arith = getelementptr i8, i8 *%casted, i32 3
				%load1 = load i8, i8 *%arith
				%cmpresult = icmp eq i32 %cond, 0
				br i1 %cmpresult, label %next, label %ret

				next:
				; Address calcs should be duplicated into this block. One dbg.value should be
				; updated, and the other should not.
				; CHECK-LABEL: next:
				; CHECK: %[[CASTVAR:[0-9a-zA-Z]+]] = bitcast i32* %p to i8*
				; CHECK-NEXT: %[[GEPVAR:[0-9a-zA-Z]+]] = getelementptr i8, i8* %[[CASTVAR]],
				; CHECK-SAME: i64 3
				; CHECK-NEXT: call void @llvm.dbg.value(metadata i8* %[[GEPVAR]],
				; CHECK-SAME: metadata ![[DIVAR]],
				; CHECK-NEXT: %loaded = load i8, i8* %[[GEPVAR]]
				call void @llvm.dbg.value(metadata i8 *%arith, metadata !12, metadata !DIExpression()), !dbg !14
				%loaded = load i8, i8 *%arith
				call void @llvm.dbg.value(metadata i8 *%arith, metadata !12, metadata !DIExpression()), !dbg !14
				ret i8 %loaded

				ret:
				ret i8 0
				}

				; CHECK: ![[DIVAR]] = !DILocalVariable(name: "p",

				declare void @llvm.dbg.value(metadata, metadata, metadata)

				!llvm.dbg.cu = !{!0}
				!llvm.module.flags = !{!3, !4, !5}
				!llvm.ident = !{!6}

				!0 = distinct !DICompileUnit(language: DW_LANG_C_plus_plus, file: !1, producer: "clang", isOptimized: true, runtimeVersion: 0, emissionKind: FullDebug, enums: !2, nameTableKind: None)
				!1 = !DIFile(filename: "test.cpp", directory: ".")
				!2 = !{}
				!3 = !{i32 2, !"Dwarf Version", i32 4}
				!4 = !{i32 2, !"Debug Info Version", i32 3}
				!5 = !{i32 1, !"wchar_size", i32 4}
				!6 = !{!"clang version 8.0.0 (trunk 348209)"}
				!7 = distinct !DISubprogram(name: "foo", linkageName: "foo", scope: !1, file: !1, line: 4, type: !8, scopeLine: 5, flags: DIFlagPrototyped, spFlags: DISPFlagDefinition \| DISPFlagOptimized, unit: !0, retainedNodes: !11)
				!8 = !DISubroutineType(types: !9)
				!9 = !{null, !10}
				!10 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)
				!11 = !{!12}
				!12 = !DILocalVariable(name: "p", arg: 1, scope: !7, file: !1, line: 4, type: !10)
				!14 = !DILocation(line: 4, column: 15, scope: !7)
				!20 = distinct !DILexicalBlock(scope: !7, file: !1, line: 8, column: 7)