This is an archive of the discontinued LLVM Phabricator instance.

Differential D87357

[SelectionDAG][DebugInfo] Use entry-values to recover variables values
Accepted · Public

Authored by djtodoro on Sep 9 2020, 2:55 AM.

Details

Reviewers

dstenb
aprantl
vsk
jmorse
StephenTozer
ecnelises

Summary

Use entry values to salvage the dbg.values of some parameters. This patch is based on D87233.

Event Timeline

djtodoro created this revision. Sep 9 2020, 2:55 AM

Herald added a project: Restricted Project. Sep 9 2020, 2:55 AM

Herald added subscribers: llvm-commits, ecnelises, ormris and 2 others.

djtodoro requested review of this revision. Sep 9 2020, 2:55 AM

djtodoro added a parent revision: D87233: [POC][DebugInfo] Use entry values within IR.

Harbormaster completed remote builds in B71058: Diff 290681. Sep 9 2020, 2:56 AM

I think this looks mostly good.

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
710–727	Can you add a comment that explains what's happening here?
718	Would ValueMap.lookup() work here?

@aprantl thanks for your comments.

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
710–727	Sure.
718	It'd work, thanks.

addressing comments
refactoring
fixing tests
adding RegState::Debug to the Reg from the dbg_value

Rebase

aprantl accepted this revision. Sep 22 2020, 9:15 AM

aprantl added inline comments.

llvm/include/llvm/CodeGen/FunctionLoweringInfo.h
80	Nit: remove `ArgValueMap -`. This used to be part of the comment style, but is completely redundant in non-ancient versions of Doxygen.

This revision is now accepted and ready to land. Sep 22 2020, 9:15 AM

Orlando added a subscriber: Orlando. Sep 22 2020, 9:38 AM

Orlando added inline comments.

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	I think we should only be doing this for immutable parameters (and mutable parameters which are never assigned to), right? I.e. the entry_value of a parameter register is only a valid location for a parameter variable if that variable is never assigned another value.

djtodoro added inline comments. Sep 22 2020, 10:18 AM

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	We are expressing the modification in terms of its entry value here.

StephenTozer added a subscriber: StephenTozer. Sep 23 2020, 2:20 AM

StephenTozer added inline comments.

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	As far as I understand, we want to recover variables by describing them in relation to the initial values of the parameters. In that case, it only makes sense to consider variables whose values can be expressed in relation to that; this may include variables that aren't parameters themselves, and may exclude dbg.values for parameters. For example: `void foo(int param) { int x = param + 2; param = SomeInt(); ... }`. In this example it should be possible to describe `x` in terms of `param`'s initial value, while `param` itself could not be described by an entry value after its assignment within the function. In that case, I believe it would make more sense to look for dbg.values that use a parameter Value, or can be expressed in terms of one, rather than using `DILocalVariable::isParameter`. Does that seem correct, or have I misunderstood the work or the purpose of this block?
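
The distinction drawn in this comment can be made concrete with a small standalone sketch. It is plain C++, not LLVM code: `SomeInt()` is given an arbitrary body, and `FooState`, `describeXFromEntry` and `checkFooExample` are hypothetical names introduced only for illustration. It records the value `param` had on entry and shows that `x` can be rebuilt from it, while the reassigned `param` cannot.

```cpp
#include <cassert>

// Hypothetical stand-in for the SomeInt() call in the example.
int SomeInt() { return 42; }

// Snapshot of what a debugger would like to report at the "..." point.
struct FooState {
  int EntryParam; // value `param` had on entry (what an entry value denotes)
  int X;          // current value of the local `x`
  int Param;      // current value of `param`, after the reassignment
};

FooState foo(int param) {
  const int EntryParam = param;
  int x = param + 2;
  param = SomeInt();
  return {EntryParam, x, param};
}

// `x` can be rebuilt from the entry value of `param` alone.
int describeXFromEntry(int EntryParam) { return EntryParam + 2; }

// Small self-check of the two claims made in the comment above.
void checkFooExample() {
  FooState S = foo(5);
  assert(describeXFromEntry(S.EntryParam) == S.X); // x = entry(param) + 2
  assert(S.Param != S.EntryParam); // entry value no longer describes param
}
```

Calling `checkFooExample()` exercises both claims: the non-parameter `x` is describable from the entry value, and the parameter variable itself is not once it has been assigned.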

djtodoro added inline comments. Sep 23 2020, 2:56 AM

llvm/include/llvm/CodeGen/FunctionLoweringInfo.h
80	Thanks, sure.
llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	Hi @StephenTozer. Yes, but that is a more complex use case, so we do not cover it (yet). There is a `TODO` marker within `findEntryValue()` for that. The debug entry values feature is complex and relies on various things, so we initially implemented support only for simple register locations as entry values of unmodified parameters, during the LiveDebugValues phase. We are currently extending the support to the IR level for some (simple) cases, but there is certainly room for improvement, since the potential goes beyond the current usage.
718	> I think we should only be doing this for immutable parameters (and mutable parameters which are never assigned to), right? I.e. the entry_value of a parameter register is only a valid location for a parameter variable if that variable is never assigned another value.
	Hi @Orlando. I should have added more context here. You are right, and all of that is already checked in the main place where debug entry values are produced, LiveDebugValues. We perform the analysis there and, at the moment, use entry values only for unmodified params (since that is the basic case). There is room for improvement there, by using entry values to express a modification in terms of the parameter's entry value, but we do not support that (yet). On the IR level, we are currently trying to extend this support to unused arguments, since that seems to be a safe/simple use case. The first place for that is the `DeadArgElimination` pass, and we are working on sorting out all the pieces for that purpose (but it takes some additional magic to get it working on both the callee and caller sides). The next spot where this could be used is here, where we also have unused parameters that are not copied into any register, so `SelectionDAG` marks them as invalid/unable to track their location any more. Please comment on this, since I might have missed something.

addressing comment

StephenTozer added inline comments. Sep 23 2020, 3:38 AM

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	Not handling the `x` case is fine then, since it will be much easier to get the basic change completed and then extend it to cover more cases later. I'm still not sure how this new extension avoids the second case however. Because we only use the variable to determine what entry value (if any) to produce, this code would still produce entry value DBG_VALUEs for a parameter even after an assignment to that parameter. Is there anything I've missed that would prevent us producing invalid entry values right here, such as for `param`'s post-assignment value in the example code above?

djtodoro added inline comments. Sep 23 2020, 3:54 AM

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	Probably we need some additional/stronger statements here. Can you please share the test case, so I can reproduce it?

djtodoro added inline comments. Sep 23 2020, 3:56 AM

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	statements -- in the code/extension implementation terms

Orlando added inline comments. Sep 23 2020, 4:12 AM

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	> Hi @Orlando. I should have added more context here. You are right, and all of that is already checked in the main place where debug entry values are produced, LiveDebugValues. We perform the analysis there and, at the moment, use entry values only for unmodified params (since that is the basic case). There is room for improvement there, by using entry values to express a modification in terms of the parameter's entry value, but we do not support that (yet). On the IR level, we are currently trying to extend this support to unused arguments, since that seems to be a safe/simple use case. The first place for that is the `DeadArgElimination` pass, and we are working on sorting out all the pieces for that purpose (but it takes some additional magic to get it working on both the callee and caller sides). The next spot where this could be used is here, where we also have unused parameters that are not copied into any register, so `SelectionDAG` marks them as invalid/unable to track their location any more. Please comment on this, since I might have missed something.
	Thank you for this explanation, it has helped me understand. The significance of ArgValueMap (in filtering the invalidated SDDbgValues for parameter variables down to just those using argument values) hadn't quite hit me until now, but looking again this all makes sense to me. I'm not really familiar with how LiveDebugValues currently handles entry_values. I'll take a close look as soon as I can though, as I'm interested in keeping up with this work!

djtodoro added inline comments. Sep 23 2020, 4:44 AM

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	No problem, thanks for your comments! I've described the intention of this patch, but I might have missed some pieces, so I think we'll cover that with more tests (e.g. with the one @StephenTozer will provide).

dstenb added inline comments. Sep 23 2020, 6:25 AM

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	I don't want to derail this discussion, but I just wanted to add that when I played around with this patch I encountered cases where we incorrectly emit entry values, even after the parameter has been modified, on trunk without this patch. I wrote a PR for that: https://bugs.llvm.org/show_bug.cgi?id=47628.

StephenTozer added inline comments. Sep 23 2020, 7:51 AM

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	Attachment: incorrect_entry-value_isel.ll (2 KB)
	Here is a test case which demonstrates the behaviour I'm talking about. I think the easiest way to summarize the underlying issue is that you're using the DILocalVariable to determine what entry value we should emit, if any. This is incorrect: it makes no fundamental difference whether or not the variable we're describing is a parameter; what matters is whether we use one of the SSA parameters. It might make more sense to track this information in the SDDbgValue directly, but in any case we can't rely on the variable to give us that information. Full disclosure, I've also discussed this with Orlando directly, and he retracts his comment about ArgValueMap solving the issue, since EntryValue is equal to the parameter value for the variable rather than the current dbg.value.

djtodoro added inline comments. Sep 23 2020, 8:34 AM

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	Oh... it makes sense to me. Thanks! I've tried a few tests, and it all seemed to be working fine... I'll try to propose a solution for this. I think it would be nice if we could make it work outside of `SelectionDAG` (as a general solution for IR), and that is why I thought the variable would be a sufficient/nice way to implement it, but it may not be enough...

djtodoro planned changes to this revision. Sep 23 2020, 8:35 AM

StephenTozer added inline comments. Sep 23 2020, 9:50 AM

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	My suggestion would be to use the SSA values. The properties of SSA form ensure that whenever we have a reference to an SSA value `%param` that is a parameter to a function, that value will be available as an entry value. This information should be readily available throughout IR. Once we get to ISel it gets a bit more complicated, but since each dbg.value is associated with a SDDbgValue it should be possible to store some identifying information in the SDDbgValue that can be referred back to here. My naive thought is that you could simply store a `Value*` or Value Handle pointing to the original instruction, and use that instead of `findEntryValue`. If this doesn't work, it may be possible to store some information externally, as with `ArgValueMap`, that can be combined with the identifier to find the corresponding Entry Value (if any).
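
A minimal sketch of the value-keyed lookup suggested here, assuming each debug-value record keeps a pointer to the IR value it originally referred to. It uses standard containers rather than LLVM's, and `Value`, `DbgValueRecord`, `ArgMap`, `entryValueRegFor` and `checkLookup` are simplified, hypothetical stand-ins rather than the real classes or the code that landed. The point is only that the decision depends on the SSA value, not on whether the variable is a parameter.

```cpp
#include <cassert>
#include <unordered_map>

// Simplified stand-ins for illustration; not the LLVM classes.
struct Value {
  bool IsArgument; // true if this SSA value is a function parameter
};
using Register = unsigned; // 0 means "no register", as in the patch

// Hypothetical record: a debug value that remembers its original SSA value.
struct DbgValueRecord {
  const Value *Original;
  bool Invalidated;
};

using ArgMap = std::unordered_map<const Value *, Register>;

// Returns the argument register to describe with an entry value, or 0 if
// the debug value must stay undef.
Register entryValueRegFor(const DbgValueRecord &R, const ArgMap &ArgValueMap) {
  if (!R.Invalidated || R.Original == nullptr || !R.Original->IsArgument)
    return 0;
  auto It = ArgValueMap.find(R.Original);
  return It == ArgValueMap.end() ? 0 : It->second;
}

void checkLookup() {
  Value Param{true};       // models %param
  Value Reassigned{false}; // models the SomeInt() result later bound to `param`
  ArgMap Map{{&Param, 5}};
  assert(entryValueRegFor({&Param, true}, Map) == 5);
  assert(entryValueRegFor({&Reassigned, true}, Map) == 0);
}
```

With this shape, a dbg.value emitted for `param` after its reassignment refers to a non-argument value and therefore never receives an entry-value location, which is the case the variable-keyed check got wrong.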

djtodoro added inline comments. Sep 24 2020, 1:41 AM

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
718	At first, I thought about extending the instructions with some kind of pointer to the `Value`, but it seemed to be too much work (since such info would have to be handled throughout the pipeline, etc.). Let me think about it; some combination of this approach and the ideas shared here could potentially solve the problem.

djtodoro mentioned this in D87233: [POC][DebugInfo] Use entry values within IR. Oct 5 2020, 5:31 AM

Use 2 extra arguments from llvm.dbg.value()
Handle modified params

This revision is now accepted and ready to land. Oct 5 2020, 5:36 AM

Herald added a reviewer: ecnelises. Oct 5 2020, 5:36 AM

This revision now requires review to proceed. Oct 5 2020, 5:36 AM

This needs more testing, so it is not ready for the final commit (yet). I believe it triggers some asserts, but (at least) we are covering the cases we want.

ecnelises resigned from this revision. Oct 16 2020, 12:01 AM

This revision is now accepted and ready to land. Oct 16 2020, 12:01 AM

djtodoro requested review of this revision. Oct 16 2020, 12:02 AM

LGTM, this new approach looks solid. As per usual, wait for further approval from someone with more authority in this area.

llvm/include/llvm/CodeGen/SelectionDAG.h
1477 ↗	(On Diff #296163)	Nit, I think the "v" should be uppercase.
llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp
711	Double semi-colon here.
728–730	Minor: Comment could do with a small rewrite, something like "must generate an undef" -> "must generate a DBG_VALUE that is either undef, or an entry value if one is available"
llvm/lib/CodeGen/SelectionDAG/SDNodeDbgValue.h
68–69 ↗	(On Diff #296163)	Nit, this can be put in the initializer list.
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
5475–5477	Perhaps instead of simply making this a condition, there should be an assert here that `V` doesn't already exist in `ArgValueMap`? I can't think of any case where it would already be there that wouldn't be an error, since I don't think we would ever call this twice for the same Value.
llvm/test/DebugInfo/X86/entry-val-invalidated-node-modifed-param-with-add.ll
1–3 ↗	(On Diff #296163)	Personally I think it'd probably be best to specify the DWARF version in this, so that this test doesn't break if the default becomes 5 (or already is in a clone repo). It might be better to simply update the test if/when that happens though, I don't have especially strong opinions about it. Alternatively you could simply regex it as `DW_OP{{(_GNU)?}}_entry_value`; also not sure whether or not this approach would be preferred.
llvm/test/DebugInfo/X86/entry-values-for-isel-invalidated-nodes.ll
2–5	Repeat of above comment about DWARF version.
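
The regex alternative mentioned in the DWARF-version comments can be checked in isolation. In a FileCheck line, the text between `{{` and `}}` is a regular expression, so `DW_OP{{(_GNU)?}}_entry_value` corresponds to the pattern `DW_OP(_GNU)?_entry_value`. The sketch below uses `std::regex` only to show which spellings that pattern accepts; the sample strings are illustrative and are not output taken from these tests, and `mentionsEntryValue` and `checkSpellings` are hypothetical helper names.

```cpp
#include <cassert>
#include <regex>
#include <string>

// True if Line mentions either spelling of the entry-value operation:
// DW_OP_entry_value (DWARF 5) or DW_OP_GNU_entry_value (GNU extension).
bool mentionsEntryValue(const std::string &Line) {
  static const std::regex Pattern("DW_OP(_GNU)?_entry_value");
  return std::regex_search(Line, Pattern);
}

void checkSpellings() {
  assert(mentionsEntryValue("DW_OP_entry_value(DW_OP_reg5 RDI)"));
  assert(mentionsEntryValue("DW_OP_GNU_entry_value(DW_OP_reg5 RDI)"));
  assert(!mentionsEntryValue("DW_OP_reg5 RDI"));
}
```

Pinning the DWARF version in the RUN line and matching one exact spelling is the stricter option; the optional group trades that strictness for a test that survives a change of default version.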

This revision is now accepted and ready to land. Oct 19 2020, 6:09 AM

@StephenTozer Thanks for your comments. I'll address them and refactor the patch a bit, as soon as I find some time to address some issues with the first patch from the stack.

Use the entry values for local vars as well

djtodoro mentioned this in D96559: Support emitting complex expressions that include entry values. Feb 12 2021, 4:25 AM

Revision Contents

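Path	Size
llvm/include/llvm/CodeGen/FunctionLoweringInfo.h	3 lines
llvm/lib/CodeGen/SelectionDAG/InstrEmitter.h	5 lines
llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp	23 lines
llvm/lib/CodeGen/SelectionDAG/ScheduleDAGFast.cpp	8 lines
llvm/lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.h	5 lines
llvm/lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp	5 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp	5 lines
llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp	3 lines
llvm/test/DebugInfo/X86/entry-values-for-isel-invalidated-nodes.ll	61 lines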

Diff 293681

llvm/include/llvm/CodeGen/FunctionLoweringInfo.h

[...]
 public:
   /// MBBMap - A mapping from LLVM basic blocks to their machine code entry.
   DenseMap<const BasicBlock*, MachineBasicBlock *> MBBMap;

   /// ValueMap - Since we emit code for the function a basic block at a time,
   /// we must remember which virtual registers hold the values for
   /// cross-basic-block values.
   DenseMap<const Value *, Register> ValueMap;

+  /// A mapping from the argument values to their virtual registers.
+  DenseMap<const Value *, Register> ArgValueMap;
+
   /// VirtReg2Value map is needed by the Divergence Analysis driven
   /// instruction selection. It is reverted ValueMap. It is computed
   /// in lazy style - on demand. It is used to get the Value corresponding
   /// to the live in virtual register and is called from the
   /// TargetLowerinInfo::isSDNodeSourceOfDivergence.
   DenseMap<Register, const Value*> VirtReg2Value;

   /// This method is called from TargetLowerinInfo::isSDNodeSourceOfDivergence
[...]

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.h

[...]
 public:
   /// CountResults - The results of target nodes have register or immediate
   /// operands first, then an optional chain, and optional flag operands
   /// (which do not go into the machine instrs.)
   static unsigned CountResults(SDNode *Node);

   /// EmitDbgValue - Generate machine instruction for a dbg_value node.
   ///
-  MachineInstr *EmitDbgValue(SDDbgValue *SD,
-                             DenseMap<SDValue, Register> &VRBaseMap);
+  MachineInstr *
+  EmitDbgValue(SDDbgValue *SD, DenseMap<SDValue, Register> &VRBaseMap,
+               DenseMap<const Value *, Register> *ArgValueMap = nullptr);

   /// Generate machine instruction for a dbg_label node.
   MachineInstr *EmitDbgLabel(SDDbgLabel *SD);

   /// EmitNode - Generate machine code for a node and needed dependencies.
   ///
   void EmitNode(SDNode *Node, bool IsClone, bool IsCloned,
                 DenseMap<SDValue, Register> &VRBaseMap) {
[...]

llvm/lib/CodeGen/SelectionDAG/InstrEmitter.cpp

Show All 24 Lines
#include "llvm/CodeGen/TargetLowering.h"		#include "llvm/CodeGen/TargetLowering.h"
#include "llvm/CodeGen/TargetSubtargetInfo.h"		#include "llvm/CodeGen/TargetSubtargetInfo.h"
#include "llvm/IR/DataLayout.h"		#include "llvm/IR/DataLayout.h"
#include "llvm/IR/DebugInfo.h"		#include "llvm/IR/DebugInfo.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
#include "llvm/Target/TargetMachine.h"		#include "llvm/Target/TargetMachine.h"
		#include "llvm/Transforms/Utils/Local.h"
using namespace llvm;		using namespace llvm;

#define DEBUG_TYPE "instr-emitter"		#define DEBUG_TYPE "instr-emitter"

/// MinRCSize - Smallest register class we allow when constraining virtual		/// MinRCSize - Smallest register class we allow when constraining virtual
/// registers. If satisfying all register class constraints would require		/// registers. If satisfying all register class constraints would require
/// using a smaller register class, emit a COPY to a new virtual register		/// using a smaller register class, emit a COPY to a new virtual register
/// instead.		/// instead.
▲ Show 20 Lines • Show All 650 Lines • ▼ Show 20 Lines	void InstrEmitter::EmitRegSequence(SDNode *Node,
(void)isNew; // Silence compiler warning.		(void)isNew; // Silence compiler warning.
assert(isNew && "Node emitted out of order - early");		assert(isNew && "Node emitted out of order - early");
}		}

/// EmitDbgValue - Generate machine instruction for a dbg_value node.		/// EmitDbgValue - Generate machine instruction for a dbg_value node.
///		///
MachineInstr *		MachineInstr *
InstrEmitter::EmitDbgValue(SDDbgValue *SD,		InstrEmitter::EmitDbgValue(SDDbgValue *SD,
DenseMap<SDValue, Register> &VRBaseMap) {		DenseMap<SDValue, Register> &VRBaseMap,
		DenseMap<const Value , Register> ArgValueMap) {
MDNode *Var = SD->getVariable();		MDNode *Var = SD->getVariable();
MDNode *Expr = SD->getExpression();		MDNode *Expr = SD->getExpression();
DebugLoc DL = SD->getDebugLoc();		DebugLoc DL = SD->getDebugLoc();
assert(cast<DILocalVariable>(Var)->isValidLocationForIntrinsic(DL) &&		assert(cast<DILocalVariable>(Var)->isValidLocationForIntrinsic(DL) &&
"Expected inlined-at fields to agree");		"Expected inlined-at fields to agree");

SD->setIsEmitted();		SD->setIsEmitted();

if (SD->isInvalidated()) {		if (SD->isInvalidated()) {
		Register Reg = 0;
		StephenTozerUnsubmitted Not Done Reply Inline Actions Double semi-colon here. StephenTozer: Double semi-colon here.
		Value *EntryValue = nullptr;

		// It the node has been invalidated, try to salvage the value
		// if an entry value is available for the variable. Entry values
		// act as backups here, and it will end up in an expression with
		// a DW_OP_entry_value.
		if (cast<DILocalVariable>(Var)->isParameter())
		aprantlUnsubmitted Not Done Reply Inline Actions Would ValueMap.lookup() work here? aprantl: Would ValueMap.lookup() work here?
		djtodoroAuthorUnsubmitted Done Reply Inline Actions It'd work, thanks. djtodoro: It'd work, thanks.
		OrlandoUnsubmitted Not Done Reply Inline Actions I think this we should only be doing this for immutable parameters (and mutable parameters which are never assigned to), right? I.e. the entry_value of a parameter register is only a valid location for a parameter variable if that variable is never assigned another value. Orlando: I think this we should only be doing this for immutable parameters (and mutable parameters…
		djtodoroAuthorUnsubmitted Done Reply Inline Actions We are expressing the modification in terms of its entry value here. djtodoro: We are expressing the modification in terms of its entry value here.
		StephenTozerUnsubmitted Not Done Reply Inline Actions As far as I understand, we want to recover variables by describing them in relation to the initial values of the parameters. In that case, it only makes sense to consider variables whose values can be expressed in relation to that; this may include variables that aren't parameters themselves, and may exclude dbg.values for parameters. For example: void foo(int param) { int x = param + 2; param = SomeInt(); ... } In this example it should be possible to describe `x` in terms of `param`'s initial value, while `param` itself could not be described by an entry value after its assignment within the function. In that case, I believe it would make more sense to look for dbg.values that use a parameter Value, or can be expressed in terms of one, rather than using `DILocalVariable::isParameter`. Does that seem correct, or have I misunderstood the work or the purpose of this block? StephenTozer: As far as I understand, we want to recover variables by describing them in relation to the…
		djtodoroAuthorUnsubmitted Done Reply Inline Actions Hi @StephenTozer. Yes, but is a more complex use case, so we do not cover it (yet). There is a `TODO` marker within the `findEntryValue()` for that. The debug entry values feature is complex, relying on various things, so we've initially implemented the support for simple registers locations as entry values of unmodified parameters only, during the LiveDebugValues phase. Currently, we are extending the support onto IR level, for some (simple) cases, but there is space for improvements for sure, since the potential goes beyond the current usage. djtodoro: Hi @StephenTozer. Yes, but is a more complex use case, so we do not cover it (yet). There is a…
		StephenTozerUnsubmitted Not Done Reply Inline Actions Not handling the `x` case is fine then, since it will be much easier to get the basic change completed and then extend it to cover more cases later. I'm still not sure how this new extension avoids the second case however. Because we only use the variable to determine what entry value (if any) to produce, this code would still produce entry value DBG_VALUEs for a parameter even after an assignment to that parameter. Is there anything I've missed that would prevent us producing invalid entry values right here, such as for `param`'s post-assignment value in the example code above? StephenTozer: Not handling the `x` case is fine then, since it will be much easier to get the basic change…
		djtodoroAuthorUnsubmitted Done Reply Inline Actions Probably we need some additional/stronger statements here. Can you please share the test case, so I can reproduce it? djtodoro: Probably we need some additional/stronger statements here. Can you please share the test case…
		djtodoroAuthorUnsubmitted Done Reply Inline Actions statements -- in the code/extension implementation terms djtodoro: //statements -- in the code/extension implementation terms//
		StephenTozerUnsubmitted Not Done Reply Inline Actions incorrect_entry-value_isel.ll2 KBDownload Here is a test case which demonstrates the behaviour I'm talking about. I think the easiest way to summarize the underlying issue is that you're using the DILocalVariable to determine what entry value we should emit, if any. This is incorrect: it makes no fundamental difference whether or not the variable we're describing is a parameter, what matters is whether we use one of the SSA parameters. It might make more sense to track this information in the SDDbgValue directly, but in any case we can't rely on the variable to give us that information. Full disclosure, I've also discussed this with Orlando directly, and he retracts his comment about ArgValueMap solving the issue; since EntryValue is equal to the parameter value for the variable rather than the current dbg.value. StephenTozer: {F13044274} Here is a test case which demonstrates the behaviour I'm talking about. I think…
		djtodoroAuthorUnsubmitted Done Reply Inline Actions Oh... it makes sense to me. Thanks! I've tried a few tests, and it all seemed to be working fine... I'll try to propose a solution for this. I think it would be nice if we can make it working outside of the `SelectionDAG` (like a general solution for IR), and that is why I thought the variable would be enough/nice way to implement it, but it may not be sufficient... djtodoro: Oh... it makes sense to me. Thanks! I've tried a few tests, and it all seemed to be working…
		StephenTozerUnsubmitted Not Done Reply Inline Actions My suggestion would be to use the SSA values. The properties of SSA form ensure that whenever we have a reference to an SSA value `%param` that is a parameter to a function, that value will be available as an entry value. This information should be readily available throughout IR. Once we get to ISel it gets a bit more complicated, but since each dbg.value is associated with a SDDbgValue it should be possible to store some identifying information in the SDDbgValue that can be referred back to here. My naive thought is that you could simply store a `Value` or Value Handle pointing to the original instruction, and use that instead of `findEntryValue`. If this doesn't work, it may be possible to store some information externally, as with `ArgValueMap`, that can be combined with the identifier to find the corresponding Entry Value (if any). StephenTozer:* My suggestion would be to use the SSA values. The properties of SSA form ensure that whenever…
		djtodoroAuthorUnsubmitted Done Reply Inline Actions At first, I proposed/thought extending the instructions with some kind of a pointer to the `Value`, but it seemed to be too much work (since there would be handling of such info throughout the pipeline, etc.). Let me think about it, and there might be some combination of this approach and the ideas shared that potentially could solve the problem. djtodoro: At first, I proposed/thought extending the instructions with some kind of a pointer to the…
		djtodoroAuthorUnsubmitted Done Reply Inline Actions I think this we should only be doing this for immutable parameters (and mutable parameters which are never assigned to), right? I.e. the entry_value of a parameter register is only a valid location for a parameter variable if that variable is never assigned another value. Hi @Orlando. I might have needed to add some more context here. You are right, and all of that has been already checked within the main debug-entry-values place of the production, within LiveDebugValues. We perform analysis there, and at the moment, use entry values for unmodified params (since it is the basic case). There is space for improvements, by using the entry values for describing/expressing the modification in terms of its entry value, but we do not support it (yet), there. On the IR level, at the moment, we are trying to extend this support for unused arguments, since it seems to be safe/simple use case. First place for that is the `DeadArgElimination` pass, and we are working on sorting out all the pieces for that purpose (but it takes some additional magic in order to get it working for both callee and caller sides). The next spot this could be used is here, where we also have unused parameters, that are not being copied into any register, so the `SelectionDAG` marks them as invalid/unable to track it location anymore. Please comment on this, since I might have missed something. djtodoro: > I think this we should only be doing this for immutable parameters (and mutable parameters…
		OrlandoUnsubmitted Not Done Reply Inline Actions Hi @Orlando. I might have needed to add some more context here. You are right, and all of that has been already checked within the main debug-entry-values place of the production, within LiveDebugValues. We perform analysis there, and at the moment, use entry values for unmodified params (since it is the basic case). There is space for improvements, by using the entry values for describing/expressing the modification in terms of its entry value, but we do not support it (yet), there. On the IR level, at the moment, we are trying to extend this support for unused arguments, since it seems to be safe/simple use case. First place for that is the `DeadArgElimination` pass, and we are working on sorting out all the pieces for that purpose (but it takes some additional magic in order to get it working for both callee and caller sides). The next spot this could be used is here, where we also have unused parameters, that are not being copied into any register, so the `SelectionDAG` marks them as invalid/unable to track it location anymore. Please comment on this, since I might have missed something. Thank you for this explanation, it has helped me understand. The significance of ArgValueMap - in filtering the invalidated SDDbgValues for parameter variables down to just those using argument values - hadn't quite hit me until now, but looking again this all makes sense to me now. I'm not really familiar with how LiveDebugValues handles entry_values currently. I'll take a close look soon as I can though as I'm interested in keeping up with this work! Orlando: > Hi @Orlando. > I might have needed to add some more context here. You are right, and all of…
		djtodoro (Author): No problem, thanks for your comments! I've described the intention of this patch, but I might have missed some pieces, so I think we'll cover that with more tests (e.g. with the one @StephenTozer will provide).
		dstenb: I don't want to derail this discussion, but I just wanted to add that when I played around with this patch I encountered cases where we incorrectly emit entry values, even after the parameter has been modified, on trunk without this patch. I wrote a PR for that: https://bugs.llvm.org/show_bug.cgi?id=47628.
		EntryValue =
		findEntryValue(cast<DILocalVariable>(Var), MF->getFunction());

		if (EntryValue && ArgValueMap) {
		if ((Reg = ArgValueMap->lookup(EntryValue)))
		Expr = DIExpression::prepend(cast<DIExpression>(Expr),
		DIExpression::EntryValue);
		}

		aprantl: Can you add a comment that explains what's happening here?
		djtodoro (Author): Sure.
// An invalidated SDNode must generate an undef DBG_VALUE: although the		// An invalidated SDNode must generate an undef DBG_VALUE: although the
// original value is no longer computed, earlier DBG_VALUEs live ranges		// original value is no longer computed, earlier DBG_VALUEs live ranges
// must not leak into later code.		// must not leak into later code.
		StephenTozer: Minor: Comment could do with a small rewrite, something like "must generate an undef" -> "must generate a DBG_VALUE that is either undef, or an entry value if one is available"
auto MIB = BuildMI(*MF, DL, TII->get(TargetOpcode::DBG_VALUE));		auto MIB = BuildMI(*MF, DL, TII->get(TargetOpcode::DBG_VALUE));
MIB.addReg(0U);		MIB.addReg(Reg, RegState::Debug);
MIB.addReg(0U, RegState::Debug);		MIB.addReg(0U, RegState::Debug);
MIB.addMetadata(Var);		MIB.addMetadata(Var);
MIB.addMetadata(Expr);		MIB.addMetadata(Expr);
return &*MIB;		return &*MIB;
}		}
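
The fallback discussed in the inline comments above (use the argument's register as an entry value when one was recorded, otherwise emit an undef DBG_VALUE) can be sketched without any LLVM dependencies. This is only an illustrative model, not the patch's code: `ArgId`, `Reg`, `DbgLoc`, and `locationForInvalidatedNode` are hypothetical names, and a `std::unordered_map` stands in for the `DenseMap<const Value *, Register>` passed as `ArgValueMap`.

```cpp
#include <cassert>
#include <optional>
#include <string>
#include <unordered_map>

// Hypothetical stand-ins for `const llvm::Value *` and `llvm::Register`.
using ArgId = std::string;
using Reg = unsigned; // 0 models "no register" ($noreg)

// Simplified picture of what the emitted DBG_VALUE would describe.
struct DbgLoc {
  Reg reg;         // 0 means an undef location
  bool entryValue; // true if the expression would get the entry-value prefix
};

// Choose a location for a debug value whose SDNode was invalidated.
// `entryArg` models the result of looking up the parameter behind the
// variable; `argValueMap` may be null, as in the patch's pointer parameter.
DbgLoc locationForInvalidatedNode(
    const std::optional<ArgId> &entryArg,
    const std::unordered_map<ArgId, Reg> *argValueMap) {
  if (entryArg && argValueMap) {
    auto it = argValueMap->find(*entryArg);
    // A missing entry or a zero register both mean "nothing recorded".
    if (it != argValueMap->end() && it->second != 0)
      return {it->second, true};
  }
  // Fall back to undef so earlier live ranges do not leak into later code.
  return {0, false};
}
```

In this model a variable that is not a parameter, a parameter with no recorded register, and a missing map all take the same undef path, which mirrors the nested conditions in the diff.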

if (SD->getKind() == SDDbgValue::FRAMEIX) {		if (SD->getKind() == SDDbgValue::FRAMEIX) {
// Stack address; this needs to be lowered in target-dependent fashion.		// Stack address; this needs to be lowered in target-dependent fashion.

llvm/lib/CodeGen/SelectionDAG/ScheduleDAGFast.cpp

//		//
class ScheduleDAGLinearize : public ScheduleDAGSDNodes {		class ScheduleDAGLinearize : public ScheduleDAGSDNodes {
public:		public:
ScheduleDAGLinearize(MachineFunction &mf) : ScheduleDAGSDNodes(mf) {}		ScheduleDAGLinearize(MachineFunction &mf) : ScheduleDAGSDNodes(mf) {}

void Schedule() override;		void Schedule() override;

MachineBasicBlock *		MachineBasicBlock *
EmitSchedule(MachineBasicBlock::iterator &InsertPos) override;		EmitSchedule(MachineBasicBlock::iterator &InsertPos,
		DenseMap<const Value *, Register> & /*ArgValueMap*/) override;

private:		private:
std::vector<SDNode*> Sequence;		std::vector<SDNode*> Sequence;
DenseMap<SDNode*, SDNode*> GluedMap; // Cache glue to its user		DenseMap<SDNode*, SDNode*> GluedMap; // Cache glue to its user

void ScheduleNode(SDNode *N);		void ScheduleNode(SDNode *N);
};		};
} // end anonymous namespace		} // end anonymous namespace
	for (unsigned i = 0, e = Glues.size(); i != e; ++i) {
GUser->setNodeId(UDegree + Degree);		GUser->setNodeId(UDegree + Degree);
Glue->setNodeId(1);		Glue->setNodeId(1);
}		}

Sequence.reserve(DAGSize);		Sequence.reserve(DAGSize);
ScheduleNode(DAG->getRoot().getNode());		ScheduleNode(DAG->getRoot().getNode());
}		}

MachineBasicBlock*		MachineBasicBlock *ScheduleDAGLinearize::EmitSchedule(
ScheduleDAGLinearize::EmitSchedule(MachineBasicBlock::iterator &InsertPos) {		MachineBasicBlock::iterator &InsertPos,
		DenseMap<const Value *, Register> & /*ArgValueMap*/) {
InstrEmitter Emitter(BB, InsertPos);		InstrEmitter Emitter(BB, InsertPos);
DenseMap<SDValue, Register> VRBaseMap;		DenseMap<SDValue, Register> VRBaseMap;

LLVM_DEBUG({ dbgs() << "\n*** Final schedule ***\n"; });		LLVM_DEBUG({ dbgs() << "\n*** Final schedule ***\n"; });

unsigned NumNodes = Sequence.size();		unsigned NumNodes = Sequence.size();
MachineBasicBlock *BB = Emitter.getBlock();		MachineBasicBlock *BB = Emitter.getBlock();
for (unsigned i = 0; i != NumNodes; ++i) {		for (unsigned i = 0; i != NumNodes; ++i) {

llvm/lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.h

	public:

/// VerifyScheduledSequence - Verify that all SUnits are scheduled and		/// VerifyScheduledSequence - Verify that all SUnits are scheduled and
/// consistent with the Sequence of scheduled instructions.		/// consistent with the Sequence of scheduled instructions.
void VerifyScheduledSequence(bool isBottomUp);		void VerifyScheduledSequence(bool isBottomUp);

/// EmitSchedule - Insert MachineInstrs into the MachineBasicBlock		/// EmitSchedule - Insert MachineInstrs into the MachineBasicBlock
/// according to the order specified in Sequence.		/// according to the order specified in Sequence.
///		///
virtual MachineBasicBlock*		virtual MachineBasicBlock *
EmitSchedule(MachineBasicBlock::iterator &InsertPos);		EmitSchedule(MachineBasicBlock::iterator &InsertPos,
		DenseMap<const Value *, Register> &ArgValueMap);

void dumpNode(const SUnit &SU) const override;		void dumpNode(const SUnit &SU) const override;
void dump() const override;		void dump() const override;
void dumpSchedule() const;		void dumpSchedule() const;

std::string getGraphNodeLabel(const SUnit *SU) const override;		std::string getGraphNodeLabel(const SUnit *SU) const override;

std::string getDAGName() const override;		std::string getDAGName() const override;

llvm/lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp

	EmitPhysRegCopy(SUnit *SU, DenseMap<SUnit*, Register> &VRBaseMap,
}		}
}		}

/// EmitSchedule - Emit the machine code in scheduled order. Return the new		/// EmitSchedule - Emit the machine code in scheduled order. Return the new
/// InsertPos and MachineBasicBlock that contains this insertion		/// InsertPos and MachineBasicBlock that contains this insertion
/// point. ScheduleDAGSDNodes holds a BB pointer for convenience, but this does		/// point. ScheduleDAGSDNodes holds a BB pointer for convenience, but this does
/// not necessarily refer to returned BB. The emitter may split blocks.		/// not necessarily refer to returned BB. The emitter may split blocks.
MachineBasicBlock *ScheduleDAGSDNodes::		MachineBasicBlock *ScheduleDAGSDNodes::
EmitSchedule(MachineBasicBlock::iterator &InsertPos) {		EmitSchedule(MachineBasicBlock::iterator &InsertPos,
		DenseMap<const Value *, Register> &ValueMap) {
InstrEmitter Emitter(BB, InsertPos);		InstrEmitter Emitter(BB, InsertPos);
DenseMap<SDValue, Register> VRBaseMap;		DenseMap<SDValue, Register> VRBaseMap;
DenseMap<SUnit*, Register> CopyVRBaseMap;		DenseMap<SUnit*, Register> CopyVRBaseMap;
SmallVector<std::pair<unsigned, MachineInstr*>, 32> Orders;		SmallVector<std::pair<unsigned, MachineInstr*>, 32> Orders;
SmallSet<Register, 8> Seen;		SmallSet<Register, 8> Seen;
bool HasDbg = DAG->hasDebugValues();		bool HasDbg = DAG->hasDebugValues();

// Emit a node, and determine where its first instruction is for debuginfo.		// Emit a node, and determine where its first instruction is for debuginfo.
	for (unsigned i = 0, e = Orders.size(); i != e && DI != DE; ++i) {
// Insert all SDDbgValue's whose order(s) are before "Order".		// Insert all SDDbgValue's whose order(s) are before "Order".
assert(MI);		assert(MI);
for (; DI != DE; ++DI) {		for (; DI != DE; ++DI) {
if ((*DI)->getOrder() < LastOrder || (*DI)->getOrder() >= Order)		if ((*DI)->getOrder() < LastOrder || (*DI)->getOrder() >= Order)
break;		break;
if ((*DI)->isEmitted())		if ((*DI)->isEmitted())
continue;		continue;

MachineInstr *DbgMI = Emitter.EmitDbgValue(*DI, VRBaseMap);		MachineInstr *DbgMI = Emitter.EmitDbgValue(*DI, VRBaseMap, &ValueMap);
if (DbgMI) {		if (DbgMI) {
if (!LastOrder)		if (!LastOrder)
// Insert to start of the BB (after PHIs).		// Insert to start of the BB (after PHIs).
BB->insert(BBBegin, DbgMI);		BB->insert(BBBegin, DbgMI);
else {		else {
// Insert at the instruction, which may be in a different		// Insert at the instruction, which may be in a different
// block, if the block was split by a custom inserter.		// block, if the block was split by a custom inserter.
MachineBasicBlock::iterator Pos = MI;		MachineBasicBlock::iterator Pos = MI;

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

	if (Reg && Reg.isVirtual()) {
Register PR = RegInfo.getLiveInPhysReg(Reg);		Register PR = RegInfo.getLiveInPhysReg(Reg);
if (PR)		if (PR)
Reg = PR;		Reg = PR;
}		}
if (Reg) {		if (Reg) {
Op = MachineOperand::CreateReg(Reg, false);		Op = MachineOperand::CreateReg(Reg, false);
IsIndirect = IsDbgDeclare;		IsIndirect = IsDbgDeclare;
}		}

		// Map the register into the Value, so it can be used for debug
		// info recovering.
		if (FuncInfo.ArgValueMap.find(V) == FuncInfo.ArgValueMap.end())
		FuncInfo.ArgValueMap[V] = Reg;
		StephenTozer: Perhaps instead of simply making this a condition, there should be an assert here that `V` doesn't already exist in `ArgValueMap`? I can't think of any case where it would already be there that wouldn't be an error, since I don't think we would ever call this twice for the same Value.
}		}
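
The two options raised in the comment above (silently keep the first mapping, as the patch does, versus asserting that the value was never recorded before) can be contrasted in a small standalone sketch. The names `ArgId`, `Reg`, `recordArgRegister`, and `recordArgRegisterStrict` are hypothetical, and `std::unordered_map` is used in place of `DenseMap` so the example compiles on its own.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>

// Hypothetical stand-ins for `const llvm::Value *` and `llvm::Register`.
using ArgId = std::string;
using Reg = unsigned;

// Behaviour matching the condition in the diff: the first mapping wins and a
// repeated call for the same value is ignored. Returns true if the mapping
// was newly recorded.
bool recordArgRegister(std::unordered_map<ArgId, Reg> &map, const ArgId &v,
                       Reg r) {
  // emplace() does not overwrite an existing key.
  return map.emplace(v, r).second;
}

// Variant along the lines of the review suggestion: treat a repeated value
// as a programming error in builds with assertions enabled.
void recordArgRegisterStrict(std::unordered_map<ArgId, Reg> &map,
                             const ArgId &v, Reg r) {
  bool inserted = map.emplace(v, r).second;
  assert(inserted && "value recorded twice in the argument map");
  (void)inserted; // avoid an unused-variable warning when NDEBUG is defined
}
```

The strict variant surfaces an unexpected second call during development, while the lenient variant hides it; which is preferable depends on whether a second call for the same value can ever be legitimate, which the review thread leaves open.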

if (!Op && N.getNode()) {		if (!Op && N.getNode()) {
// Check if frame index is available.		// Check if frame index is available.
SDValue LCandidate = peekThroughBitcasts(N);		SDValue LCandidate = peekThroughBitcasts(N);
if (LoadSDNode *LNode = dyn_cast<LoadSDNode>(LCandidate.getNode()))		if (LoadSDNode *LNode = dyn_cast<LoadSDNode>(LCandidate.getNode()))
if (FrameIndexSDNode *FINode =		if (FrameIndexSDNode *FINode =
dyn_cast<FrameIndexSDNode>(LNode->getBasePtr().getNode()))		dyn_cast<FrameIndexSDNode>(LNode->getBasePtr().getNode()))

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

	#endif
// inserted into.		// inserted into.
MachineBasicBlock *FirstMBB = FuncInfo->MBB, *LastMBB;		MachineBasicBlock *FirstMBB = FuncInfo->MBB, *LastMBB;
{		{
NamedRegionTimer T("emit", "Instruction Creation", GroupName,		NamedRegionTimer T("emit", "Instruction Creation", GroupName,
GroupDescription, TimePassesIsEnabled);		GroupDescription, TimePassesIsEnabled);

// FuncInfo->InsertPt is passed by reference and set to the end of the		// FuncInfo->InsertPt is passed by reference and set to the end of the
// scheduled instructions.		// scheduled instructions.
LastMBB = FuncInfo->MBB = Scheduler->EmitSchedule(FuncInfo->InsertPt);		LastMBB = FuncInfo->MBB =
		Scheduler->EmitSchedule(FuncInfo->InsertPt, FuncInfo->ArgValueMap);
}		}

// If the block was split, make sure we update any references that are used to		// If the block was split, make sure we update any references that are used to
// update PHI nodes later on.		// update PHI nodes later on.
if (FirstMBB != LastMBB)		if (FirstMBB != LastMBB)
SDB->UpdateSplitBlock(FirstMBB, LastMBB);		SDB->UpdateSplitBlock(FirstMBB, LastMBB);

// Free the scheduler state.		// Free the scheduler state.

llvm/test/DebugInfo/X86/entry-values-for-isel-invalidated-nodes.ll

This file was added.

				; RUN: llc < %s -O2 -stop-before=finalize-isel | FileCheck %s
				; RUN: llc -O2 %s -o %t -filetype=obj
				; RUN: llvm-dwarfdump %t | FileCheck %s --check-prefix=CHECK-DWARFDUMP

				; C producer:
				StephenTozer: Repeat of above comment about DWARF version.
				; void f1(int);
				; void f2(int i) {
				; f1(1);
				; i = i + 5;
				; f1(3);
				; }
				; $ clang -g -O2 test.c -S -emit-llvm

				; CHECK: DBG_VALUE $edi, $noreg, !{{.*}}, !DIExpression()
				; CHECK: DBG_VALUE $edi, $noreg, !{{.*}}, !DIExpression(DW_OP_LLVM_entry_value, 1, DW_OP_plus_uconst, 5, DW_OP_stack_value)

				; CHECK-DWARFDUMP: DW_OP_GNU_entry_value(DW_OP_reg5 RDI), DW_OP_stack_value
				; CHECK-DWARFDUMP: DW_OP_GNU_entry_value(DW_OP_reg5 RDI), DW_OP_constu 0xffffffff, DW_OP_and, DW_OP_plus_uconst 0x5, DW_OP_stack_value

				; ModuleID = 'test.c'
				source_filename = "test.c"
				target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				; Function Attrs: nounwind uwtable
				define dso_local void @f2(i32 %i) local_unnamed_addr !dbg !7 {
				entry:
				call void @llvm.dbg.value(metadata i32 %i, metadata !12, metadata !DIExpression()), !dbg !13
				tail call void @f1(i32 1), !dbg !14
				call void @llvm.dbg.value(metadata i32 %i, metadata !12, metadata !DIExpression(DW_OP_plus_uconst, 5, DW_OP_stack_value)), !dbg !13
				tail call void @f1(i32 3), !dbg !15
				ret void, !dbg !16
				}

				declare !dbg !17 dso_local void @f1(i32) local_unnamed_addr

				; Function Attrs: nounwind readnone speculatable willreturn
				declare void @llvm.dbg.value(metadata, metadata, metadata)

				!llvm.dbg.cu = !{!0}
				!llvm.module.flags = !{!3, !4, !5}
				!llvm.ident = !{!6}

				!0 = distinct !DICompileUnit(language: DW_LANG_C99, file: !1, producer: "clang version 12.0.0", isOptimized: true, runtimeVersion: 0, emissionKind: FullDebug, enums: !2, splitDebugInlining: false, nameTableKind: None)
				!1 = !DIFile(filename: "test.c", directory: "/dir")
				!2 = !{}
				!3 = !{i32 7, !"Dwarf Version", i32 4}
				!4 = !{i32 2, !"Debug Info Version", i32 3}
				!5 = !{i32 1, !"wchar_size", i32 4}
				!6 = !{!"clang version 12.0.0"}
				!7 = distinct !DISubprogram(name: "f2", scope: !1, file: !1, line: 2, type: !8, scopeLine: 2, flags: DIFlagPrototyped | DIFlagAllCallsDescribed, spFlags: DISPFlagDefinition | DISPFlagOptimized, unit: !0, retainedNodes: !11)
				!8 = !DISubroutineType(types: !9)
				!9 = !{null, !10}
				!10 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)
				!11 = !{!12}
				!12 = !DILocalVariable(name: "i", arg: 1, scope: !7, file: !1, line: 2, type: !10)
				!13 = !DILocation(line: 0, scope: !7)
				!14 = !DILocation(line: 3, column: 3, scope: !7)
				!15 = !DILocation(line: 5, column: 3, scope: !7)
				!16 = !DILocation(line: 6, column: 1, scope: !7)
				!17 = !DISubprogram(name: "f1", scope: !1, file: !1, line: 1, type: !8, flags: DIFlagPrototyped, spFlags: DISPFlagOptimized, retainedNodes: !2)

Revision Contents

Diff 293681
