This is an archive of the discontinued LLVM Phabricator instance.

Differential D85749

[DebugInstrRef][4/9] Support recording of instruction reference substitutions
ClosedPublic

Authored by jmorse on Aug 11 2020, 9:32 AM.

Download Raw Diff

Details

Reviewers

aprantl

Group Reviewers

debug-info

Commits

rGc521e44defb5: [DebugInstrRef] Support recording of instruction reference substitutions

Summary

At various points in the post-isel optimisation passes, instructions get rewritten and replaced. One way to handle this is just to transfer the instruction number from the old to the new instruction -- however that doesn't account for passes like TwoAddressInstruction, which change the positions of operands. Instead:

We never re-use instruction numbers: when a new instruction is created, it gets a new number,
This patch adds a table of "substitutions": a mapping from old <inst,operand> pairs to the new ones.

This stems from patch 1 in this series: there's no connection between instruction numbers and DBG_INSTR_REFs preserved.

The downside of this is that we end up preserving in memory a mapping table, which more or less contains the history of optimisations that have happened to the function; and we have to apply the substitutions later, when LiveDebugValues runs. IMO this is a worthy price to pay, one table isn't highly expensive, and it models the way I'd like variable location tracking to work, doing as little work as possible during compilation and resolving variables back to locations at the end.

An additional benefit of changing instruction numbers when the instruction is recreated / modified is that changes which destroy the creator of a value can be expressed, by not creating a substitution for the old <inst,operand> pair. An example would be, if there were a divide instruction that generated the quotient and remainder, and it were replaced by one that only generated the quotient:

$rax, $rcx = div-and-remainder $rdx, $rsi, debug-instr-num 1
DBG_INSTR_REF 1, 0
DBG_INSTR_REF 1, 1

Would become

$rax = div $rdx, $rsi, debug-instr-num 2
DBG_INSTR_REF 1, 0
DBG_INSTR_REF 1, 1

With a substitution entered from <1, 0> to <2, 0>, and no substitution created for <1, 1> as it's no longer generated.

This patch only adds the data structure and MIR format, plus round-trip test.

Diff Detail

Event Timeline

jmorse created this revision.Aug 11 2020, 9:32 AM

Herald added a project: Restricted Project. · View Herald TranscriptAug 11 2020, 9:32 AM

Herald added subscribers: llvm-commits, hiraditya. · View Herald Transcript

jmorse requested review of this revision.Aug 11 2020, 9:32 AM

jmorse added a parent revision: D85747: [DebugInstrRef][3/9] Create DBG_INSTR_REFs in SelectionDAG.

jmorse added a child revision: D85756: [DebugInstrRef][5/9] Substitute debug value numbers to handle optimisations.Aug 11 2020, 10:11 AM

Harbormaster completed remote builds in B67943: Diff 284787.Aug 11 2020, 11:14 AM

aprantl accepted this revision.Aug 11 2020, 11:37 AM

This revision is now accepted and ready to land.Aug 11 2020, 11:37 AM

This revision was landed with ongoing or failed builds.Oct 15 2020, 3:31 AM

Closed by commit rGc521e44defb5: [DebugInstrRef] Support recording of instruction reference substitutions (authored by jmorse). · Explain Why

This revision was automatically updated to reflect the committed changes.

jmorse added a commit: rGc521e44defb5: [DebugInstrRef] Support recording of instruction reference substitutions.

After landing this, bkramer pointed out one of the assertions in substituteDebugValuesForInst was ineffective. Fixing that shows that some callers (see patch 5 in this series) do so with different instruction signatures (i.e., different types and different numbers of operands).

I've revised the method in the patch I'll upload in a few seconds to optionally allow a "number of operands to substitute" to be specified. This makes use of the LLVM convention that register defs are usually the first operands: we can say for the arithmetic-to-LEA code paths (patch 5) that only the first operand needs to be substituted.

I don't think this changes the fundamental idea of this patch, only the way it's expressed. I'll leave this up for a few days and then land.

This revision is now accepted and ready to land.Oct 15 2020, 5:37 AM

Updated patch, changes signature of substituteDebugValuesForInst to take a "number of operands to look at" argument, defaults to "all of them".

jmorse mentioned this in rGc521e44defb5: [DebugInstrRef] Support recording of instruction reference substitutions.Oct 15 2020, 6:03 AM

Closed by c521e44defb53d38a46f39e29870c628f25d124a

jmorse mentioned this in rG537f0fbe8204: [DebugInfo] Follow up c521e44defb5 with an API improvement.Oct 21 2020, 6:46 AM

jmorse mentioned this in D105820: [DebugInfo][InstrRef] Fix a broken substitution method, add test coverage.Jul 12 2021, 8:32 AM

jmorse mentioned this in rG241f3e386cd2: [DebugInfo][InstrRef] Fix a broken substitution method, add test coverage.Jul 20 2021, 3:45 AM

Revision Contents

Path

Size

llvm/

include/

llvm/

CodeGen/

MIRYamlMapping.h

33 lines

MachineFunction.h

24 lines

lib/

CodeGen/

MIRParser/

MIRParser.cpp

15 lines

MIRPrinter.cpp

4 lines

MachineFunction.cpp

36 lines

test/

DebugInfo/

MIR/

InstrRef/

substitusions-roundtrip.mir

26 lines

Diff 298355

llvm/include/llvm/CodeGen/MIRYamlMapping.h

Show First 20 Lines • Show All 435 Lines • ▼ Show 20 Lines	static void mapping(IO &YamlIO, CallSiteInfo &CSInfo) {
YamlIO.mapRequired("offset", CSInfo.CallLocation.Offset);		YamlIO.mapRequired("offset", CSInfo.CallLocation.Offset);
YamlIO.mapOptional("fwdArgRegs", CSInfo.ArgForwardingRegs,		YamlIO.mapOptional("fwdArgRegs", CSInfo.ArgForwardingRegs,
std::vector<CallSiteInfo::ArgRegPair>());		std::vector<CallSiteInfo::ArgRegPair>());
}		}

static const bool flow = true;		static const bool flow = true;
};		};

		/// Serializable representation of debug value substitutions.
		struct DebugValueSubstitution {
		unsigned SrcInst;
		unsigned SrcOp;
		unsigned DstInst;
		unsigned DstOp;

		bool operator==(const DebugValueSubstitution &Other) const {
		return std::tie(SrcInst, SrcOp, DstInst, DstOp) ==
		std::tie(Other.SrcInst, Other.SrcOp, Other.DstInst, Other.DstOp);
		}
		};

		template <> struct MappingTraits<DebugValueSubstitution> {
		static void mapping(IO &YamlIO, DebugValueSubstitution &Sub) {
		YamlIO.mapRequired("srcinst", Sub.SrcInst);
		YamlIO.mapRequired("srcop", Sub.SrcOp);
		YamlIO.mapRequired("dstinst", Sub.DstInst);
		YamlIO.mapRequired("dstop", Sub.DstOp);
		}

		static const bool flow = true;
		};
		} // namespace yaml
		} // namespace llvm

		LLVM_YAML_IS_SEQUENCE_VECTOR(llvm::yaml::DebugValueSubstitution)

		namespace llvm {
		namespace yaml {
struct MachineConstantPoolValue {		struct MachineConstantPoolValue {
UnsignedValue ID;		UnsignedValue ID;
StringValue Value;		StringValue Value;
MaybeAlign Alignment = None;		MaybeAlign Alignment = None;
bool IsTargetSpecific = false;		bool IsTargetSpecific = false;

bool operator==(const MachineConstantPoolValue &Other) const {		bool operator==(const MachineConstantPoolValue &Other) const {
return ID == Other.ID && Value == Other.Value &&		return ID == Other.ID && Value == Other.Value &&
▲ Show 20 Lines • Show All 168 Lines • ▼ Show 20 Lines	struct MachineFunction {
// TODO: Serialize the various register masks.		// TODO: Serialize the various register masks.
// Frame information		// Frame information
MachineFrameInfo FrameInfo;		MachineFrameInfo FrameInfo;
std::vector<FixedMachineStackObject> FixedStackObjects;		std::vector<FixedMachineStackObject> FixedStackObjects;
std::vector<MachineStackObject> StackObjects;		std::vector<MachineStackObject> StackObjects;
std::vector<MachineConstantPoolValue> Constants; /// Constant pool.		std::vector<MachineConstantPoolValue> Constants; /// Constant pool.
std::unique_ptr<MachineFunctionInfo> MachineFuncInfo;		std::unique_ptr<MachineFunctionInfo> MachineFuncInfo;
std::vector<CallSiteInfo> CallSitesInfo;		std::vector<CallSiteInfo> CallSitesInfo;
		std::vector<DebugValueSubstitution> DebugValueSubstitutions;
MachineJumpTable JumpTableInfo;		MachineJumpTable JumpTableInfo;
BlockStringValue Body;		BlockStringValue Body;
};		};

template <> struct MappingTraits<MachineFunction> {		template <> struct MappingTraits<MachineFunction> {
static void mapping(IO &YamlIO, MachineFunction &MF) {		static void mapping(IO &YamlIO, MachineFunction &MF) {
YamlIO.mapRequired("name", MF.Name);		YamlIO.mapRequired("name", MF.Name);
YamlIO.mapOptional("alignment", MF.Alignment, None);		YamlIO.mapOptional("alignment", MF.Alignment, None);
Show All 12 Lines	YamlIO.mapOptional("calleeSavedRegisters", MF.CalleeSavedRegisters,
Optional<std::vector<FlowStringValue>>());		Optional<std::vector<FlowStringValue>>());
YamlIO.mapOptional("frameInfo", MF.FrameInfo, MachineFrameInfo());		YamlIO.mapOptional("frameInfo", MF.FrameInfo, MachineFrameInfo());
YamlIO.mapOptional("fixedStack", MF.FixedStackObjects,		YamlIO.mapOptional("fixedStack", MF.FixedStackObjects,
std::vector<FixedMachineStackObject>());		std::vector<FixedMachineStackObject>());
YamlIO.mapOptional("stack", MF.StackObjects,		YamlIO.mapOptional("stack", MF.StackObjects,
std::vector<MachineStackObject>());		std::vector<MachineStackObject>());
YamlIO.mapOptional("callSites", MF.CallSitesInfo,		YamlIO.mapOptional("callSites", MF.CallSitesInfo,
std::vector<CallSiteInfo>());		std::vector<CallSiteInfo>());
		YamlIO.mapOptional("debugValueSubstitutions", MF.DebugValueSubstitutions,
		std::vector<DebugValueSubstitution>());
YamlIO.mapOptional("constants", MF.Constants,		YamlIO.mapOptional("constants", MF.Constants,
std::vector<MachineConstantPoolValue>());		std::vector<MachineConstantPoolValue>());
YamlIO.mapOptional("machineFunctionInfo", MF.MachineFuncInfo);		YamlIO.mapOptional("machineFunctionInfo", MF.MachineFuncInfo);
if (!YamlIO.outputting() \|\| !MF.JumpTableInfo.Entries.empty())		if (!YamlIO.outputting() \|\| !MF.JumpTableInfo.Entries.empty())
YamlIO.mapOptional("jumpTable", MF.JumpTableInfo, MachineJumpTable());		YamlIO.mapOptional("jumpTable", MF.JumpTableInfo, MachineJumpTable());
YamlIO.mapOptional("body", MF.Body, BlockStringValue());		YamlIO.mapOptional("body", MF.Body, BlockStringValue());
}		}
};		};

} // end namespace yaml		} // end namespace yaml
} // end namespace llvm		} // end namespace llvm

#endif // LLVM_CODEGEN_MIRYAMLMAPPING_H		#endif // LLVM_CODEGEN_MIRYAMLMAPPING_H

llvm/include/llvm/CodeGen/MachineFunction.h

Show First 20 Lines • Show All 434 Lines • ▼ Show 20 Lines	public:
/// assigned to them. Used for debug value tracking, to determine the		/// assigned to them. Used for debug value tracking, to determine the
/// next instruction number.		/// next instruction number.
unsigned DebugInstrNumberingCount = 0;		unsigned DebugInstrNumberingCount = 0;

/// Set value of DebugInstrNumberingCount field. Avoid using this unless		/// Set value of DebugInstrNumberingCount field. Avoid using this unless
/// you're deserializing this data.		/// you're deserializing this data.
void setDebugInstrNumberingCount(unsigned Num);		void setDebugInstrNumberingCount(unsigned Num);

		/// Pair of instruction number and operand number.
		using DebugInstrOperandPair = std::pair<unsigned, unsigned>;

		/// Substitution map: from one <inst,operand> pair to another. Used to
		/// record changes in where a value is defined, so that debug variable
		/// locations can find it later.
		std::map<DebugInstrOperandPair, DebugInstrOperandPair>
		DebugValueSubstitutions;

		/// Create a substitution between one <instr,operand> value to a different,
		/// new value.
		void makeDebugValueSubstitution(DebugInstrOperandPair, DebugInstrOperandPair);

		/// Create substitutions for any tracked values in \p Old, to point at
		/// \p New. Needed when we re-create an instruction during optimization,
		/// which has the same signature (i.e., def operands in the same place) but
		/// a modified instruction type, flags, or otherwise. An example: X86 moves
		/// are sometimes transformed into equivalent LEAs.
		/// If the two instructions are not the same opcode, limit which operands to
		/// examine for substitutions to the first N operands by setting
		/// \p MaxOperand.
		void substituteDebugValuesForInst(const MachineInstr &Old, MachineInstr &New,
		unsigned MaxOperand = UINT_MAX);

MachineFunction(Function &F, const LLVMTargetMachine &Target,		MachineFunction(Function &F, const LLVMTargetMachine &Target,
const TargetSubtargetInfo &STI, unsigned FunctionNum,		const TargetSubtargetInfo &STI, unsigned FunctionNum,
MachineModuleInfo &MMI);		MachineModuleInfo &MMI);
MachineFunction(const MachineFunction &) = delete;		MachineFunction(const MachineFunction &) = delete;
MachineFunction &operator=(const MachineFunction &) = delete;		MachineFunction &operator=(const MachineFunction &) = delete;
~MachineFunction();		~MachineFunction();

/// Reset the instance as if it was just created.		/// Reset the instance as if it was just created.
▲ Show 20 Lines • Show All 712 Lines • Show Last 20 Lines

llvm/lib/CodeGen/MIRParser/MIRParser.cpp

Show First 20 Lines • Show All 399 Lines • ▼ Show 20 Lines	if (TM.Options.EmitCallSiteInfo)
MF.addCallArgsForwardingRegs(&*CallI, std::move(CSInfo));		MF.addCallArgsForwardingRegs(&*CallI, std::move(CSInfo));
}		}

if (YamlMF.CallSitesInfo.size() && !TM.Options.EmitCallSiteInfo)		if (YamlMF.CallSitesInfo.size() && !TM.Options.EmitCallSiteInfo)
return error(Twine("Call site info provided but not used"));		return error(Twine("Call site info provided but not used"));
return false;		return false;
}		}

void MIRParserImpl::setupDebugValueTracking(MachineFunction &MF,		void MIRParserImpl::setupDebugValueTracking(
PerFunctionMIParsingState &PFS, const yaml::MachineFunction &YamlMF) {		MachineFunction &MF, PerFunctionMIParsingState &PFS,
// For now, we only compute the value of the "next instruction number"		const yaml::MachineFunction &YamlMF) {
// field.		// Compute the value of the "next instruction number" field.
unsigned MaxInstrNum = 0;		unsigned MaxInstrNum = 0;
for (auto &MBB : MF)		for (auto &MBB : MF)
for (auto &MI : MBB)		for (auto &MI : MBB)
MaxInstrNum = std::max((unsigned)MI.peekDebugInstrNum(), MaxInstrNum);		MaxInstrNum = std::max((unsigned)MI.peekDebugInstrNum(), MaxInstrNum);
MF.setDebugInstrNumberingCount(MaxInstrNum);		MF.setDebugInstrNumberingCount(MaxInstrNum);
}

		// Load any substitutions.
		for (auto &Sub : YamlMF.DebugValueSubstitutions) {
		MF.makeDebugValueSubstitution(std::make_pair(Sub.SrcInst, Sub.SrcOp),
		std::make_pair(Sub.DstInst, Sub.DstOp));
		}
		}

bool		bool
MIRParserImpl::initializeMachineFunction(const yaml::MachineFunction &YamlMF,		MIRParserImpl::initializeMachineFunction(const yaml::MachineFunction &YamlMF,
MachineFunction &MF) {		MachineFunction &MF) {
// TODO: Recreate the machine function.		// TODO: Recreate the machine function.
if (Target) {		if (Target) {
// Avoid clearing state if we're using the same subtarget again.		// Avoid clearing state if we're using the same subtarget again.
Target->setTarget(MF.getSubtarget());		Target->setTarget(MF.getSubtarget());
▲ Show 20 Lines • Show All 580 Lines • Show Last 20 Lines

llvm/lib/CodeGen/MIRPrinter.cpp

Show First 20 Lines • Show All 214 Lines • ▼ Show 20 Lines	YamlMF.FailedISel = MF.getProperties().hasProperty(
MachineFunctionProperties::Property::FailedISel);		MachineFunctionProperties::Property::FailedISel);

convert(YamlMF, MF.getRegInfo(), MF.getSubtarget().getRegisterInfo());		convert(YamlMF, MF.getRegInfo(), MF.getSubtarget().getRegisterInfo());
ModuleSlotTracker MST(MF.getFunction().getParent());		ModuleSlotTracker MST(MF.getFunction().getParent());
MST.incorporateFunction(MF.getFunction());		MST.incorporateFunction(MF.getFunction());
convert(MST, YamlMF.FrameInfo, MF.getFrameInfo());		convert(MST, YamlMF.FrameInfo, MF.getFrameInfo());
convertStackObjects(YamlMF, MF, MST);		convertStackObjects(YamlMF, MF, MST);
convertCallSiteObjects(YamlMF, MF, MST);		convertCallSiteObjects(YamlMF, MF, MST);
		for (auto &Sub : MF.DebugValueSubstitutions)
		YamlMF.DebugValueSubstitutions.push_back({Sub.first.first, Sub.first.second,
		Sub.second.first,
		Sub.second.second});
if (const auto *ConstantPool = MF.getConstantPool())		if (const auto *ConstantPool = MF.getConstantPool())
convert(YamlMF, *ConstantPool);		convert(YamlMF, *ConstantPool);
if (const auto *JumpTableInfo = MF.getJumpTableInfo())		if (const auto *JumpTableInfo = MF.getJumpTableInfo())
convert(MST, YamlMF.JumpTableInfo, *JumpTableInfo);		convert(MST, YamlMF.JumpTableInfo, *JumpTableInfo);

const TargetMachine &TM = MF.getTarget();		const TargetMachine &TM = MF.getTarget();
YamlMF.MachineFuncInfo =		YamlMF.MachineFuncInfo =
std::unique_ptr<yaml::MachineFunctionInfo>(TM.convertFuncInfoToYAML(MF));		std::unique_ptr<yaml::MachineFunctionInfo>(TM.convertFuncInfoToYAML(MF));
▲ Show 20 Lines • Show All 674 Lines • Show Last 20 Lines

llvm/lib/CodeGen/MachineFunction.cpp

Show First 20 Lines • Show All 941 Lines • ▼ Show 20 Lines	void MachineFunction::moveCallSiteInfo(const MachineInstr *Old,
CallSitesInfo.erase(CSIt);		CallSitesInfo.erase(CSIt);
CallSitesInfo[New] = CSInfo;		CallSitesInfo[New] = CSInfo;
}		}

void MachineFunction::setDebugInstrNumberingCount(unsigned Num) {		void MachineFunction::setDebugInstrNumberingCount(unsigned Num) {
DebugInstrNumberingCount = Num;		DebugInstrNumberingCount = Num;
}		}

		void MachineFunction::makeDebugValueSubstitution(DebugInstrOperandPair A,
		DebugInstrOperandPair B) {
		auto Result = DebugValueSubstitutions.insert(std::make_pair(A, B));
		(void)Result;
		assert(Result.second && "Substitution for an already substituted value?");
		}

		void MachineFunction::substituteDebugValuesForInst(const MachineInstr &Old,
		MachineInstr &New,
		unsigned MaxOperand) {
		// If the Old instruction wasn't tracked at all, there is no work to do.
		unsigned OldInstrNum = Old.peekDebugInstrNum();
		if (!OldInstrNum)
		return;

		// Iterate over all operands looking for defs to create substitutions for.
		// Avoid creating new instr numbers unless we create a new substitution.
		// While this has no functional effect, it risks confusing someone reading
		// MIR output.
		// Examine all the operands, or the first N specified by the caller.
		MaxOperand = std::min(MaxOperand, Old.getNumOperands());
		for (unsigned int I = 0; I < MaxOperand; ++I) {
		const auto &OldMO = Old.getOperand(I);
		auto &NewMO = New.getOperand(I);
		(void)NewMO;

		if (!OldMO.isReg() \|\| !OldMO.isDef())
		continue;
		assert(NewMO.isDef());

		unsigned NewInstrNum = New.getDebugInstrNum();
		makeDebugValueSubstitution(std::make_pair(OldInstrNum, I),
		std::make_pair(NewInstrNum, I));
		}
		}

/// \}		/// \}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// MachineJumpTableInfo implementation		// MachineJumpTableInfo implementation
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// Return the size of each entry in the jump table.		/// Return the size of each entry in the jump table.
unsigned MachineJumpTableInfo::getEntrySize(const DataLayout &TD) const {		unsigned MachineJumpTableInfo::getEntrySize(const DataLayout &TD) const {
▲ Show 20 Lines • Show All 248 Lines • Show Last 20 Lines

llvm/test/DebugInfo/MIR/InstrRef/substitusions-roundtrip.mir

This file was added.

				# RUN: llc %s -march=x86-64 -run-pass=machineverifier \
				# RUN: -experimental-debug-variable-locations -o - 2>&1 \| FileCheck %s
				#
				# REQUIRES: x86-registered-target
				#
				# CHECK: debugValueSubstitutions:
				# CHECK-NEXT: - { srcinst: 1, srcop: 0, dstinst: 2, dstop: 0 }
				#
				# CHECK: MOV64rr $rdi, debug-instr-number 2
				# CHECK-NEXT: DBG_INSTR_REF 1, 0
				---
				name: test
				tracksRegLiveness: true
				liveins:
				- { reg: '$rdi', virtual-reg: '' }
				debugValueSubstitutions:
				- { srcinst: 1, srcop: 0, dstinst: 2, dstop: 0 }
				body: \|
				bb.0:
				liveins: $rdi, $rax
				$rbp = MOV64rr $rdi, debug-instr-number 2
				DBG_INSTR_REF 1, 0
				dead $rcx = MOV64ri 0
				CMP64ri8 renamable $rax, 1, implicit-def $eflags
				RETQ $rax
				...