This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/llvm/IR/
-
llvm/
-
IR/
2
IRBuilder.h
-
test/CodeGen/AMDGPU/
-
CodeGen/
-
AMDGPU/
-
llvm.dbg.value.ll
-
unittests/IR/
-
IR/
-
IRBuilderTest.cpp

Differential D61198

[IRBuilder][DebugInfo] Don't pick DebugLocs for new instructions from debug intrinsics
AbandonedPublic

Authored by jmorse on Apr 26 2019, 9:46 AM.

Download Raw Diff

Details

Reviewers

aprantl
vsk
bjope
dblaikie

Summary

As discussed in D59272, LLVM sometimes uses the DebugLoc of a debug intrinsic as the DebugLoc for newly created instructions. This is undesirable, as the line number information for debug intrinsics is at best meaningless and worst misleading -- D59272 has a couple of examples of variable declaration line numbers gratuitously appearing in unrelated code.

This patch has the IRBuilder insertion-point-setting methods skip over debug intrinsics, and find a "Real" instruction to set the DebugLoc from. As this isn't related to a particular pass or transformation, I've added a unit test to check that this behaviour is preserved.

Advice on additional reviewers would be appreciated, IRBuilder seems pretty central, it's not clear who else should review, aside from "Everyone".

Diff Detail

Event Timeline

jmorse created this revision.Apr 26 2019, 9:46 AM

Herald added a project: Restricted Project. · View Herald TranscriptApr 26 2019, 9:46 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

jmorse mentioned this in D59272: [DebugInfo] Select debug intrinsic line-numbers more carefully when promoting dbg.declare.Apr 26 2019, 9:48 AM

Add the test change I forgot to stage (whoops) -- the AMDGPU test here doesn't have any non-meta instructions with line numbers. Prior to this patch the debug intrinsic line numbers leak in, which the test depends on. Fix this by giving the real instructions line numbers. (The test is for placement of dbg.values rather than line numbers anyway).

Herald added subscribers: nhaehnle, jvesely. · View Herald TranscriptApr 26 2019, 9:57 AM

bjope added inline comments.Apr 26 2019, 10:27 AM

include/llvm/IR/IRBuilder.h
139	This could end up returning the end() iterator (so the assert above is now out-of-play).
150	You probably want to do this before checking for end() (to avoid reading debug loc from end()). Another interesting thing here is that when setting the insertion point to the end of the BB (or rather after the last non-debug-intrinsic in the BB) the current debug location won't be updated. Is some user of this method relying on that. Should perhaps the current debug location be invalidated instead? My thoughts are: Should we simply skip setting debug location when insertion point is a debug intrinsic? Or should we invalidate the debug location when insertion point is a debug intrinsic? Should perhaps these SetInsertPoint methods take a second argument, providing the debug location to set (only ~800 call sites to update...)? Or should we never do the side effect of updating debug location (just as many call sites to potentially update with explicit settings)? Notice that the `SetInsertPoint(BasicBlock *TheBB)` version does not set the current debug location. So at the moment it isn't obvious that SetInsertPoint always updates the current debug location. Haven't really thought about what I'd prefer myself.

There's a tension here between finding a narrow solution to the invalid-location bug and having SetInsertPoint() do something principled with debug locations. See D39982 for a more detailed explanation of this. That previous attempt to change the IRBuilder API stalled because we couldn't find an automated way to fix up in-tree/out-of-tree uses (imho this is a hard problem).

I think it'd make sense to opt for a narrow solution here, but am not sure building on top of SetInsertPoint() is the way to go. What do folks think about this (possibly naive) alternative: if locations on debug intrinsics aren't meaningful, can we get rid of them (or use line 0 locations which just preserve scope info)? This would prevent SetInsertPoint from propagating misleading debug locations.

Edit: I think this alternative amounts to a version of D59272 that doesn't special-case stores.

For some reason I hadn't clocked that the IRBuilder deals with blocks in a state of partial construction/correctness (duh), and had assumed there would always be a terminator in each block, whoops.

In D61198#1480767, @vsk wrote:

There's a tension here between finding a narrow solution to the invalid-location bug and having SetInsertPoint() do something principled with debug locations. See D39982 for a more detailed explanation of this.

Ouch, I have now been enlightened,

I think it'd make sense to opt for a narrow solution here, but am not sure building on top of SetInsertPoint() is the way to go. What do folks think about this (possibly naive) alternative: if locations on debug intrinsics aren't meaningful, can we get rid of them (or use line 0 locations which just preserve scope info)? This would prevent SetInsertPoint from propagating misleading debug locations.

That sounds like a plan -- this patch was supposed to just reduce the circumstances where D59272 was necessary, but as it seems to be a harder problem to solve than expected, we'll just have to rely on zero-line-nos-for-debug-intrinsics more. I've updated D59272 accordingly.

Revision Contents

Path

Size

include/

llvm/

IR/

IRBuilder.h

14 lines

test/

CodeGen/

AMDGPU/

llvm.dbg.value.ll

4 lines

unittests/

IR/

IRBuilderTest.cpp

46 lines

Diff 196875

include/llvm/IR/IRBuilder.h

Show All 26 Lines
#include "llvm/IR/DebugLoc.h"		#include "llvm/IR/DebugLoc.h"
#include "llvm/IR/DerivedTypes.h"		#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
#include "llvm/IR/GlobalVariable.h"		#include "llvm/IR/GlobalVariable.h"
#include "llvm/IR/InstrTypes.h"		#include "llvm/IR/InstrTypes.h"
#include "llvm/IR/Instruction.h"		#include "llvm/IR/Instruction.h"
#include "llvm/IR/Instructions.h"		#include "llvm/IR/Instructions.h"
#include "llvm/IR/Intrinsics.h"		#include "llvm/IR/Intrinsics.h"
		#include "llvm/IR/IntrinsicInst.h"
#include "llvm/IR/LLVMContext.h"		#include "llvm/IR/LLVMContext.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/IR/Operator.h"		#include "llvm/IR/Operator.h"
#include "llvm/IR/Type.h"		#include "llvm/IR/Type.h"
#include "llvm/IR/Value.h"		#include "llvm/IR/Value.h"
#include "llvm/IR/ValueHandle.h"		#include "llvm/IR/ValueHandle.h"
#include "llvm/Support/AtomicOrdering.h"		#include "llvm/Support/AtomicOrdering.h"
#include "llvm/Support/CBindingWrapping.h"		#include "llvm/Support/CBindingWrapping.h"
▲ Show 20 Lines • Show All 85 Lines • ▼ Show 20 Lines	void SetInsertPoint(BasicBlock *TheBB) {
InsertPt = BB->end();		InsertPt = BB->end();
}		}

/// This specifies that created instructions should be inserted before		/// This specifies that created instructions should be inserted before
/// the specified instruction.		/// the specified instruction.
void SetInsertPoint(Instruction *I) {		void SetInsertPoint(Instruction *I) {
BB = I->getParent();		BB = I->getParent();
InsertPt = I->getIterator();		InsertPt = I->getIterator();
assert(InsertPt != BB->end() && "Can't read debug loc from end()");		assert(I->getIterator() != BB->end() && "Can't read debug loc from end()");
SetCurrentDebugLocation(I->getDebugLoc());		// Avoid taking DebugLocs from debug intrinsics
		auto DbgSrcInst = skipDebugIntrinsics(InsertPt);
		bjopeUnsubmitted Not Done Reply Inline Actions This could end up returning the end() iterator (so the assert above is now out-of-play). bjope: This could end up returning the end() iterator (so the assert above is now out-of-play).
		SetCurrentDebugLocation(DbgSrcInst->getDebugLoc());
}		}

/// This specifies that created instructions should be inserted at the		/// This specifies that created instructions should be inserted at the
/// specified point.		/// specified point.
void SetInsertPoint(BasicBlock *TheBB, BasicBlock::iterator IP) {		void SetInsertPoint(BasicBlock *TheBB, BasicBlock::iterator IP) {
BB = TheBB;		BB = TheBB;
InsertPt = IP;		InsertPt = IP;
if (IP != TheBB->end())		if (IP != TheBB->end()) {
SetCurrentDebugLocation(IP->getDebugLoc());		// Avoid taking DebugLocs from debug intrinsics
		auto DbgSrcInst = skipDebugIntrinsics(InsertPt);
		bjopeUnsubmitted Not Done Reply Inline Actions You probably want to do this before checking for end() (to avoid reading debug loc from end()). Another interesting thing here is that when setting the insertion point to the end of the BB (or rather after the last non-debug-intrinsic in the BB) the current debug location won't be updated. Is some user of this method relying on that. Should perhaps the current debug location be invalidated instead? My thoughts are: Should we simply skip setting debug location when insertion point is a debug intrinsic? Or should we invalidate the debug location when insertion point is a debug intrinsic? Should perhaps these SetInsertPoint methods take a second argument, providing the debug location to set (only ~800 call sites to update...)? Or should we never do the side effect of updating debug location (just as many call sites to potentially update with explicit settings)? Notice that the `SetInsertPoint(BasicBlock TheBB)` version does not set the current debug location. So at the moment it isn't obvious that SetInsertPoint always updates the current debug location. Haven't really thought about what I'd prefer myself. bjope:* You probably want to do this before checking for end() (to avoid reading debug loc from end()).
		SetCurrentDebugLocation(DbgSrcInst->getDebugLoc());
		}
}		}

/// Set location information used by debugging information.		/// Set location information used by debugging information.
void SetCurrentDebugLocation(DebugLoc L) { CurDbgLocation = std::move(L); }		void SetCurrentDebugLocation(DebugLoc L) { CurDbgLocation = std::move(L); }

/// Get location information used by debugging information.		/// Get location information used by debugging information.
const DebugLoc &getCurrentDebugLocation() const { return CurDbgLocation; }		const DebugLoc &getCurrentDebugLocation() const { return CurDbgLocation; }

▲ Show 20 Lines • Show All 2,211 Lines • Show Last 20 Lines

test/CodeGen/AMDGPU/llvm.dbg.value.ll

	; RUN: llc -O0 -march=amdgcn -mtriple=amdgcn-unknown-amdhsa -verify-machineinstrs < %s \| FileCheck -check-prefixes=GCN,NOOPT %s			; RUN: llc -O0 -march=amdgcn -mtriple=amdgcn-unknown-amdhsa -verify-machineinstrs < %s \| FileCheck -check-prefixes=GCN,NOOPT %s
	; RUN: llc -march=amdgcn -mtriple=amdgcn-unknown-amdhsa -verify-machineinstrs < %s \| FileCheck -check-prefixes=GCN,OPT %s			; RUN: llc -march=amdgcn -mtriple=amdgcn-unknown-amdhsa -verify-machineinstrs < %s \| FileCheck -check-prefixes=GCN,OPT %s

	; GCN-LABEL: {{^}}test_debug_value:			; GCN-LABEL: {{^}}test_debug_value:
	; NOOPT: .loc 1 1 42 prologue_end ; /tmp/test_debug_value.cl:1:42			; NOOPT: .loc 1 1 42 prologue_end ; /tmp/test_debug_value.cl:1:42
	; NOOPT-NEXT: s_load_dwordx2 s[4:5], s[4:5], 0x0			; NOOPT-NEXT: s_load_dwordx2 s[4:5], s[4:5], 0x0
	; NOOPT-NEXT: .Ltmp			; NOOPT-NEXT: .Ltmp
	; NOOPT-NEXT: ;DEBUG_VALUE: test_debug_value:globalptr_arg <- $sgpr4_sgpr5			; NOOPT-NEXT: ;DEBUG_VALUE: test_debug_value:globalptr_arg <- $sgpr4_sgpr5

	; GCN: flat_store_dword			; GCN: flat_store_dword
	; GCN: s_endpgm			; GCN: s_endpgm
	define amdgpu_kernel void @test_debug_value(i32 addrspace(1)* nocapture %globalptr_arg) #0 !dbg !4 {			define amdgpu_kernel void @test_debug_value(i32 addrspace(1)* nocapture %globalptr_arg) #0 !dbg !4 {
	entry:			entry:
	tail call void @llvm.dbg.value(metadata i32 addrspace(1)* %globalptr_arg, metadata !10, metadata !13), !dbg !14			tail call void @llvm.dbg.value(metadata i32 addrspace(1)* %globalptr_arg, metadata !10, metadata !13), !dbg !14
	store i32 123, i32 addrspace(1)* %globalptr_arg, align 4			store i32 123, i32 addrspace(1)* %globalptr_arg, align 4, !dbg !14
	ret void			ret void
	}			}

	; Check for infinite loop in some cases with dbg_value in			; Check for infinite loop in some cases with dbg_value in
	; SIOptimizeExecMaskingPreRA (somehow related to undef argument).			; SIOptimizeExecMaskingPreRA (somehow related to undef argument).

	; GCN-LABEL: {{^}}only_undef_dbg_value:			; GCN-LABEL: {{^}}only_undef_dbg_value:
	; NOOPT: ;DEBUG_VALUE: test_debug_value:globalptr_arg <- [DW_OP_constu 1, DW_OP_swap, DW_OP_xderef] undef			; NOOPT: ;DEBUG_VALUE: test_debug_value:globalptr_arg <- [DW_OP_constu 1, DW_OP_swap, DW_OP_xderef] undef
	; NOOPT-NEXT: s_endpgm			; NOOPT-NEXT: s_endpgm

	; OPT: s_endpgm			; OPT: s_endpgm
	define amdgpu_kernel void @only_undef_dbg_value() #1 {			define amdgpu_kernel void @only_undef_dbg_value() #1 {
	bb:			bb:
	call void @llvm.dbg.value(metadata <4 x float> undef, metadata !10, metadata !DIExpression(DW_OP_constu, 1, DW_OP_swap, DW_OP_xderef)) #2, !dbg !14			call void @llvm.dbg.value(metadata <4 x float> undef, metadata !10, metadata !DIExpression(DW_OP_constu, 1, DW_OP_swap, DW_OP_xderef)) #2, !dbg !14
	ret void			ret void, !dbg !14
	}			}

	declare void @llvm.dbg.value(metadata, metadata, metadata) #1			declare void @llvm.dbg.value(metadata, metadata, metadata) #1

	attributes #0 = { nounwind }			attributes #0 = { nounwind }
	attributes #1 = { nounwind readnone }			attributes #1 = { nounwind readnone }

	!llvm.dbg.cu = !{!0}			!llvm.dbg.cu = !{!0}
	Show All 16 Lines

unittests/IR/IRBuilderTest.cpp

Show First 20 Lines • Show All 716 Lines • ▼ Show 20 Lines	TEST_F(IRBuilderTest, DIBuilderMacro) {
EXPECT_EQ(MN1, MF1->getRawElements());		EXPECT_EQ(MN1, MF1->getRawElements());

Elements.clear();		Elements.clear();
Elements.push_back(MDef2);		Elements.push_back(MDef2);
auto MN2 = MDTuple::get(Ctx, Elements);		auto MN2 = MDTuple::get(Ctx, Elements);
EXPECT_EQ(MN2, MF2->getRawElements());		EXPECT_EQ(MN2, MF2->getRawElements());
EXPECT_TRUE(verifyModule(*M));		EXPECT_TRUE(verifyModule(*M));
}		}

		// Test that IRBuilder won't read DebugLocs from debug intrinsics at the
		// insertion location.
		TEST_F(IRBuilderTest, InsertionDebugLoc) {
		IRBuilder<> Builder(BB);
		Value *V;
		Instruction *I;

		// Create debug environment for test.
		DIBuilder DIB(*M);
		auto File = DIB.createFile("bees", "/");
		auto CU = DIB.createCompileUnit(dwarf::DW_LANG_C, File, "hands", true, "", 0);
		auto Type = DIB.createSubroutineType(DIB.getOrCreateTypeArray(None));
		auto Func = DIB.createFunction(
		CU, "shoe", "", File, 1, Type, 1, DINode::FlagZero,
		DISubprogram::SPFlagDefinition);
		auto Lex = DIB.createLexicalBlock(Func, File, 1, 1);
		auto DIFloat = DIB.createBasicType("float", 32, dwarf::DW_ATE_float);
		auto DIExpr = DIB.createExpression();
		auto LocalVar = DIB.createAutoVariable(Lex, "sandal", File, 2, DIFloat);

		// Create our "Real code" (TM) debug location, on line 10.
		DebugLoc NormalDebugLoc = DebugLoc::get(10, 1, Lex);
		// Create a debug location specially for a dbg.value we'l create, on line 2.
		auto DbgValLoc = DILocation::get(Ctx, 2, 3, Lex);

		// Create some floating point operations with a dbg.value in the middle.
		Builder.SetCurrentDebugLocation(NormalDebugLoc);
		V = Builder.CreateLoad(GV->getValueType(), GV);
		I = cast<Instruction>(Builder.CreateFAdd(V, V));
		auto DbgVal = DIB.insertDbgValueIntrinsic(I, LocalVar, DIExpr, DbgValLoc, BB);
		Builder.CreateStore(I, GV);
		Builder.CreateRetVoid();

		// Now insert an extra instruction at the location of the dbg.value.
		// IRBuilder should seek and find the "Real" debug location, and ignore the
		// location of the intervening dbg.value.
		Builder.SetInsertPoint(DbgVal);
		Instruction *I2 = cast<Instruction>(Builder.CreateFAdd(I, V));

		// Additionally created FAdd should have the normal debug location.
		EXPECT_FALSE(I2->getDebugLoc().get() == DbgValLoc);
		EXPECT_TRUE(I2->getDebugLoc() == NormalDebugLoc);

		DIB.finalize();
		}
}		}