This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/
-
CodeGen/AsmPrinter/
-
AsmPrinter/
-
AsmPrinter.cpp
-
Target/
-
AArch64/
-
AArch64AsmPrinter.cpp
-
RISCV/
-
RISCVMCInstLower.cpp
-
X86/
1
X86MCInstLower.cpp
-
test/CodeGen/
-
CodeGen/
-
Hexagon/
-
patchable-function-entry.ll
-
X86/
-
patchable-function-entry.ll

Differential D143802

[XRay] Add generic patchable-function-entry NOP sled implementation
AcceptedPublic

Authored by duck-37 on Feb 10 2023, 7:33 PM.

Download Raw Diff

Details

Reviewers

MaskRay

Summary

This patch generalizes the logic for patchable-function-entry=n NOP sleds, meaning that this attribute now has consistent behavior across all architectures that support XRay. The differences in codegen was noticed while porting a separate test-case over to ARM32.

Closes https://github.com/llvm/llvm-project/issues/60672

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

duck-37 created this revision.Feb 10 2023, 7:33 PM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 10 2023, 7:33 PM

Herald added subscribers: luke, frasercrmck, luismarques and 23 others. · View Herald Transcript

duck-37 requested review of this revision.Feb 10 2023, 7:33 PM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 10 2023, 7:33 PM

Herald added subscribers: llvm-commits, • pcwang-thead. · View Herald Transcript

duck-37 retitled this revision from [XRay] Make patchable-function-entry NOP sled implementation target-independent This patch generalizes the logic for patchable-function-entry NOP sleds, which extends support for them to all architectures that support XRay. A consequence of this... to [XRay] Make patchable-function-entry NOP sled implementation target-independent.Feb 10 2023, 7:34 PM

duck-37 edited the summary of this revision. (Show Details)

The increased instruction count for x86 is a slight performance regression. Can it be fixed?

Fixes https://github.com/llvm/llvm-project/issues/60672

It's not a bug so I'd use Close. Note: without runtime support, the compiler codegen change isn't really useful

In D143802#4119979, @MaskRay wrote:

Fixes https://github.com/llvm/llvm-project/issues/60672

It's not a bug so I'd use Close. Note: without runtime support, the compiler codegen change isn't really useful

Sure, I'll adjust that for the next revision. The main idea here was consistency across architectures; I came across this while moving a test-case over to ARM32 and noticing a difference in the behavior.

duck-37 planned changes to this revision.Feb 11 2023, 9:44 AM

Changed some wording in the review, used a virtual function instead of hardcoded emitNops call so that X86 (and potentially other targets) can use their own implementations if necessary.

Herald added a subscriber: kristof.beyls. · View Herald TranscriptFeb 11 2023, 9:50 AM

Harbormaster completed remote builds in B213223: Diff 496694.Feb 11 2023, 10:37 AM

Sorry for the delay. This looks good, but it needs rebase:)

llvm/lib/Target/X86/X86MCInstLower.cpp
2752	"No newline at end of file"

This revision is now accepted and ready to land.Aug 30 2023, 11:15 PM

Herald added subscribers: wangpc, sunshaoce. · View Herald TranscriptAug 30 2023, 11:15 PM

Revision Contents

Path

Size

llvm/

lib/

CodeGen/

AsmPrinter/

AsmPrinter.cpp

19 lines

Target/

AArch64/

AArch64AsmPrinter.cpp

11 lines

RISCV/

RISCVMCInstLower.cpp

13 lines

X86/

X86MCInstLower.cpp

10 lines

test/

CodeGen/

Hexagon/

patchable-function-entry.ll

25 lines

X86/

patchable-function-entry.ll

20 lines

Diff 496652

llvm/lib/CodeGen/AsmPrinter/AsmPrinter.cpp

Show First 20 Lines • Show All 1,705 Lines • ▼ Show 20 Lines	for (auto &MI : MBB) {
break;		break;
case TargetOpcode::ARITH_FENCE:		case TargetOpcode::ARITH_FENCE:
if (isVerbose())		if (isVerbose())
OutStreamer->emitRawComment("ARITH_FENCE");		OutStreamer->emitRawComment("ARITH_FENCE");
break;		break;
case TargetOpcode::MEMBARRIER:		case TargetOpcode::MEMBARRIER:
OutStreamer->emitRawComment("MEMBARRIER");		OutStreamer->emitRawComment("MEMBARRIER");
break;		break;

		// patchable-function-entry=N is a target-independent NOP sled.
		case TargetOpcode::PATCHABLE_FUNCTION_ENTER: {
		const Function &F = MF->getFunction();
		if (F.hasFnAttribute("patchable-function-entry")) {
		unsigned Num;
		if (F.getFnAttribute("patchable-function-entry")
		.getValueAsString()
		.getAsInteger(10, Num)) {
		// This is garbage, do nothing.
		break;
		}
		emitNops(Num);
		} else {
		emitInstruction(&MI);
		}
		break;
		}

default:		default:
emitInstruction(&MI);		emitInstruction(&MI);
if (CanDoExtraAnalysis) {		if (CanDoExtraAnalysis) {
MCInst MCI;		MCInst MCI;
MCI.setOpcode(MI.getOpcode());		MCI.setOpcode(MI.getOpcode());
auto Name = OutStreamer->getMnemonic(MCI);		auto Name = OutStreamer->getMnemonic(MCI);
auto I = MnemonicCounts.insert({Name, 0u});		auto I = MnemonicCounts.insert({Name, 0u});
I.first->second++;		I.first->second++;
▲ Show 20 Lines • Show All 2,412 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64AsmPrinter.cpp

Show First 20 Lines • Show All 256 Lines • ▼ Show 20 Lines	void AArch64AsmPrinter::emitFunctionHeaderComment() {
const AArch64FunctionInfo *FI = MF->getInfo<AArch64FunctionInfo>();		const AArch64FunctionInfo *FI = MF->getInfo<AArch64FunctionInfo>();
std::optional<std::string> OutlinerString = FI->getOutliningStyle();		std::optional<std::string> OutlinerString = FI->getOutliningStyle();
if (OutlinerString != std::nullopt)		if (OutlinerString != std::nullopt)
OutStreamer->getCommentOS() << ' ' << OutlinerString;		OutStreamer->getCommentOS() << ' ' << OutlinerString;
}		}

void AArch64AsmPrinter::LowerPATCHABLE_FUNCTION_ENTER(const MachineInstr &MI)		void AArch64AsmPrinter::LowerPATCHABLE_FUNCTION_ENTER(const MachineInstr &MI)
{		{
const Function &F = MF->getFunction();
if (F.hasFnAttribute("patchable-function-entry")) {
unsigned Num;
if (F.getFnAttribute("patchable-function-entry")
.getValueAsString()
.getAsInteger(10, Num))
return;
emitNops(Num);
return;
}

emitSled(MI, SledKind::FUNCTION_ENTER);		emitSled(MI, SledKind::FUNCTION_ENTER);
}		}

void AArch64AsmPrinter::LowerPATCHABLE_FUNCTION_EXIT(const MachineInstr &MI) {		void AArch64AsmPrinter::LowerPATCHABLE_FUNCTION_EXIT(const MachineInstr &MI) {
emitSled(MI, SledKind::FUNCTION_EXIT);		emitSled(MI, SledKind::FUNCTION_EXIT);
}		}

void AArch64AsmPrinter::LowerPATCHABLE_TAIL_CALL(const MachineInstr &MI) {		void AArch64AsmPrinter::LowerPATCHABLE_TAIL_CALL(const MachineInstr &MI) {
▲ Show 20 Lines • Show All 1,400 Lines • Show Last 20 Lines

llvm/lib/Target/RISCV/RISCVMCInstLower.cpp

Show First 20 Lines • Show All 221 Lines • ▼ Show 20 Lines	bool llvm::lowerRISCVMachineInstrToMCInst(const MachineInstr *MI, MCInst &OutMI,

for (const MachineOperand &MO : MI->operands()) {		for (const MachineOperand &MO : MI->operands()) {
MCOperand MCOp;		MCOperand MCOp;
if (lowerRISCVMachineOperandToMCOperand(MO, MCOp, AP))		if (lowerRISCVMachineOperandToMCOperand(MO, MCOp, AP))
OutMI.addOperand(MCOp);		OutMI.addOperand(MCOp);
}		}

switch (OutMI.getOpcode()) {		switch (OutMI.getOpcode()) {
case TargetOpcode::PATCHABLE_FUNCTION_ENTER: {
const Function &F = MI->getParent()->getParent()->getFunction();
if (F.hasFnAttribute("patchable-function-entry")) {
unsigned Num;
if (F.getFnAttribute("patchable-function-entry")
.getValueAsString()
.getAsInteger(10, Num))
return false;
AP.emitNops(Num);
return true;
}
break;
}
case RISCV::PseudoReadVLENB:		case RISCV::PseudoReadVLENB:
OutMI.setOpcode(RISCV::CSRRS);		OutMI.setOpcode(RISCV::CSRRS);
OutMI.addOperand(MCOperand::createImm(		OutMI.addOperand(MCOperand::createImm(
RISCVSysReg::lookupSysRegByName("VLENB")->Encoding));		RISCVSysReg::lookupSysRegByName("VLENB")->Encoding));
OutMI.addOperand(MCOperand::createReg(RISCV::X0));		OutMI.addOperand(MCOperand::createReg(RISCV::X0));
break;		break;
case RISCV::PseudoReadVL:		case RISCV::PseudoReadVL:
OutMI.setOpcode(RISCV::CSRRS);		OutMI.setOpcode(RISCV::CSRRS);
OutMI.addOperand(		OutMI.addOperand(
MCOperand::createImm(RISCVSysReg::lookupSysRegByName("VL")->Encoding));		MCOperand::createImm(RISCVSysReg::lookupSysRegByName("VL")->Encoding));
OutMI.addOperand(MCOperand::createReg(RISCV::X0));		OutMI.addOperand(MCOperand::createReg(RISCV::X0));
break;		break;
}		}
return false;		return false;
}		}

llvm/lib/Target/X86/X86MCInstLower.cpp

Show First 20 Lines • Show All 1,754 Lines • ▼ Show 20 Lines	void X86AsmPrinter::LowerPATCHABLE_TYPED_EVENT_CALL(const MachineInstr &MI,
recordSled(CurSled, MI, SledKind::TYPED_EVENT, 2);		recordSled(CurSled, MI, SledKind::TYPED_EVENT, 2);
}		}

void X86AsmPrinter::LowerPATCHABLE_FUNCTION_ENTER(const MachineInstr &MI,		void X86AsmPrinter::LowerPATCHABLE_FUNCTION_ENTER(const MachineInstr &MI,
X86MCInstLower &MCIL) {		X86MCInstLower &MCIL) {

NoAutoPaddingScope NoPadScope(*OutStreamer);		NoAutoPaddingScope NoPadScope(*OutStreamer);

const Function &F = MF->getFunction();
if (F.hasFnAttribute("patchable-function-entry")) {
unsigned Num;
if (F.getFnAttribute("patchable-function-entry")
.getValueAsString()
.getAsInteger(10, Num))
return;
emitX86Nops(*OutStreamer, Num, Subtarget);
return;
}
// We want to emit the following pattern:		// We want to emit the following pattern:
//		//
// .p2align 1, ...		// .p2align 1, ...
// .Lxray_sled_N:		// .Lxray_sled_N:
// jmp .tmpN		// jmp .tmpN
// # 9 bytes worth of noops		// # 9 bytes worth of noops
//		//
// We need the 9 bytes because at runtime, we'd be patching over the full 11		// We need the 9 bytes because at runtime, we'd be patching over the full 11
▲ Show 20 Lines • Show All 972 Lines • ▼ Show 20 Lines	if (MI->isCall()) {
// after it.		// after it.
SMShadowTracker.emitShadowPadding(*OutStreamer, getSubtargetInfo());		SMShadowTracker.emitShadowPadding(*OutStreamer, getSubtargetInfo());
// Then emit the call		// Then emit the call
OutStreamer->emitInstruction(TmpInst, getSubtargetInfo());		OutStreamer->emitInstruction(TmpInst, getSubtargetInfo());
return;		return;
}		}

EmitAndCountInstruction(TmpInst);		EmitAndCountInstruction(TmpInst);
}		}
		MaskRayUnsubmitted Not Done Reply Inline Actions "No newline at end of file" MaskRay: "No newline at end of file"

llvm/test/CodeGen/Hexagon/patchable-function-entry.ll

This file was added.

				; RUN: llc -march=hexagon -verify-machineinstrs < %s \| FileCheck %s

				define void @f0() "patchable-function-entry"="0" {
				; CHECK-LABEL: f0:
				; CHECK-NOT: nop
				; CHECK: jumpr r31
				ret void
				}

				define void @f1() "patchable-function-entry"="1" {
				; CHECK-LABEL: f1:
				; CHECK: nop
				; CHECK-NOT: nop
				; CHECK: jumpr r31
				ret void
				}

				define void @f2() "patchable-function-entry"="2" {
				; CHECK-LABEL: f2:
				; CHECK: nop
				; CHECK: nop
				; CHECK-NOT: nop
				; CHECK: jumpr r31
				ret void
				}
				No newline at end of file

llvm/test/CodeGen/X86/patchable-function-entry.ll

	Show All 25 Lines

	;; Without -function-sections, f2 is in the same text section as f1.			;; Without -function-sections, f2 is in the same text section as f1.
	;; They share the __patchable_function_entries section.			;; They share the __patchable_function_entries section.
	;; With -function-sections, f1 and f2 are in different text sections.			;; With -function-sections, f1 and f2 are in different text sections.
	;; Use separate __patchable_function_entries.			;; Use separate __patchable_function_entries.
	define void @f2() "patchable-function-entry"="2" {			define void @f2() "patchable-function-entry"="2" {
	; CHECK-LABEL: f2:			; CHECK-LABEL: f2:
	; CHECK-NEXT: .Lfunc_begin2:			; CHECK-NEXT: .Lfunc_begin2:
	; 32: xchgw %ax, %ax			; CHECK: nop
	; 64: xchgw %ax, %ax			; CHECK-NEXT: nop
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	; CHECK: .section __patchable_function_entries,"awo",@progbits,f2{{$}}			; CHECK: .section __patchable_function_entries,"awo",@progbits,f2{{$}}
	; 32: .p2align 2			; 32: .p2align 2
	; 32-NEXT: .long .Lfunc_begin2			; 32-NEXT: .long .Lfunc_begin2
	; 64: .p2align 3			; 64: .p2align 3
	; 64-NEXT: .quad .Lfunc_begin2			; 64-NEXT: .quad .Lfunc_begin2
	ret void			ret void
	}			}

	$f3 = comdat any			$f3 = comdat any
	define void @f3() "patchable-function-entry"="3" comdat {			define void @f3() "patchable-function-entry"="3" comdat {
	; CHECK-LABEL: f3:			; CHECK-LABEL: f3:
	; CHECK-NEXT: .Lfunc_begin3:			; CHECK-NEXT: .Lfunc_begin3:
	; 32: xchgw %ax, %ax			; CHECK: nop
	; 32-NEXT: nop			; CHECK-NEXT: nop
	; 64: nopl (%rax)			; CHECK-NEXT: nop
	; CHECK: ret			; CHECK: ret
	; CHECK: .section __patchable_function_entries,"aGwo",@progbits,f3,comdat,f3{{$}}			; CHECK: .section __patchable_function_entries,"aGwo",@progbits,f3,comdat,f3{{$}}
	; 32: .p2align 2			; 32: .p2align 2
	; 32-NEXT: .long .Lfunc_begin3			; 32-NEXT: .long .Lfunc_begin3
	; 64: .p2align 3			; 64: .p2align 3
	; 64-NEXT: .quad .Lfunc_begin3			; 64-NEXT: .quad .Lfunc_begin3
	ret void			ret void
	}			}

	$f5 = comdat any			$f5 = comdat any
	define void @f5() "patchable-function-entry"="5" comdat {			define void @f5() "patchable-function-entry"="5" comdat {
	; CHECK-LABEL: f5:			; CHECK-LABEL: f5:
	; CHECK-NEXT: .Lfunc_begin4:			; CHECK-NEXT: .Lfunc_begin4:
	; 32-COUNT-2: xchgw %ax, %ax			; CHECK: nop
	; 32-NEXT: nop			; CHECK-NEXT: nop
	; 64: nopl 8(%rax,%rax)			; CHECK-NEXT: nop
				; CHECK-NEXT: nop
				; CHECK-NEXT: nop
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	; CHECK: .section __patchable_function_entries,"aGwo",@progbits,f5,comdat,f5{{$}}			; CHECK: .section __patchable_function_entries,"aGwo",@progbits,f5,comdat,f5{{$}}
	; 32: .p2align 2			; 32: .p2align 2
	; 32-NEXT: .long .Lfunc_begin4			; 32-NEXT: .long .Lfunc_begin4
	; 64: .p2align 3			; 64: .p2align 3
	; 64-NEXT: .quad .Lfunc_begin4			; 64-NEXT: .quad .Lfunc_begin4
	ret void			ret void
	}			}

	;; -fpatchable-function-entry=3,2			;; -fpatchable-function-entry=3,2
	;; "patchable-function-prefix" emits data before the function entry label.			;; "patchable-function-prefix" emits data before the function entry label.
	;; We emit 1-byte NOPs before the function entry, so that with a partial patch,			;; We emit 1-byte NOPs before the function entry, so that with a partial patch,
	;; the remaining instructions do not need to be modified.			;; the remaining instructions do not need to be modified.
	define void @f3_2() "patchable-function-entry"="1" "patchable-function-prefix"="2" {			define void @f3_2() "patchable-function-entry"="1" "patchable-function-prefix"="2" {
	; CHECK-LABEL: .type f3_2,@function			; CHECK-LABEL: .type f3_2,@function
	; CHECK-NEXT: .Ltmp0: # @f3_2			; CHECK-NEXT: .Ltmp0: # @f3_2
	; CHECK-NEXT: nop			; CHECK: nop
	; CHECK-NEXT: nop			; CHECK-NEXT: nop
	; CHECK-NEXT: f3_2:			; CHECK-NEXT: f3_2:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: nop			; CHECK-NEXT: nop
	; CHECK-NEXT: ret			; CHECK-NEXT: ret
	;; .size does not include the prefix.			;; .size does not include the prefix.
	; CHECK: .Lfunc_end5:			; CHECK: .Lfunc_end5:
	; CHECK-NEXT: .size f3_2, .Lfunc_end5-f3_2			; CHECK-NEXT: .size f3_2, .Lfunc_end5-f3_2
	; CHECK: .section __patchable_function_entries,"awo",@progbits,f3_2{{$}}			; CHECK: .section __patchable_function_entries,"awo",@progbits,f3_2{{$}}
	; 32: .p2align 2			; 32: .p2align 2
	; 32-NEXT: .long .Ltmp0			; 32-NEXT: .long .Ltmp0
	; 64: .p2align 3			; 64: .p2align 3
	; 64-NEXT: .quad .Ltmp0			; 64-NEXT: .quad .Ltmp0
	%frame = alloca i8, i32 16			%frame = alloca i8, i32 16
	ret void			ret void
	}			}