This is an archive of the discontinued LLVM Phabricator instance.

Differential D147982

Account for PATCHABLE instrs in Branch Relaxation
Closed, Public

Authored by dhoekwater on Apr 10 2023, 5:25 PM.

Details

Reviewers

paquette
t.p.northover

Commits

rG6b62166b4cf8: Account for PATCHABLE instrs in Branch Relaxation

Summary

PATCHABLE_* instructions expand to up to 36-byte
sleds. Updating the size of PATCHABLE instructions
causes them to be outlined, so we need to add a
check to prevent the outliner from considering
basic blocks that contain PATCHABLE instructions.
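
To make the size argument concrete, here is a minimal standalone sketch. It does not use LLVM's classes: `Opcode`, `instSizeInBytes`, and `isBranchOffsetInRange` are simplified stand-ins introduced only for illustration. The 36-byte bound comes from this patch (4 bytes of alignment plus a 32-byte block), and the 4-bit width mirrors the `-aarch64-tbz-offset-bits=4` flag used in the new test. The range check assumes the displacement is a signed count of 4-byte instructions.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Simplified stand-ins for a few opcodes; this is not LLVM's real enum.
enum class Opcode { Ordinary, PatchableFunctionEnter, PatchableFunctionExit };

// Conservative size estimate: an XRay sled may need 4 bytes of alignment
// padding plus a 32-byte block, so report the 36-byte upper bound.
inline std::int64_t instSizeInBytes(Opcode Op) {
  switch (Op) {
  case Opcode::PatchableFunctionEnter:
  case Opcode::PatchableFunctionExit:
    return 4 + 32;
  case Opcode::Ordinary:
    break;
  }
  return 4; // ordinary AArch64 instructions are 4 bytes wide
}

// Assumption: the branch displacement is a signed, Bits-wide count of
// 4-byte instructions.
inline bool isBranchOffsetInRange(unsigned Bits, std::int64_t ByteOffset) {
  assert(Bits >= 1 && Bits < 64 && "displacement width out of range");
  const std::int64_t Words = ByteOffset / 4;
  const std::int64_t Min = -(std::int64_t{1} << (Bits - 1));
  const std::int64_t Max = (std::int64_t{1} << (Bits - 1)) - 1;
  return Words >= Min && Words <= Max;
}

// Sum the estimated sizes of the instructions a forward branch must skip.
inline std::int64_t distanceInBytes(const std::vector<Opcode> &Skipped) {
  std::int64_t Bytes = 0;
  for (Opcode Op : Skipped)
    Bytes += instSizeInBytes(Op);
  return Bytes;
}
```

Under these assumptions a 4-bit displacement reaches from -32 to +28 bytes, so a branch that has to skip a single 36-byte sled is already out of range and has to be relaxed.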

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dhoekwater created this revision. Apr 10 2023, 5:25 PM

Herald added a project: Restricted Project. Apr 10 2023, 5:25 PM

Herald added a subscriber: hiraditya.

dhoekwater requested review of this revision. Apr 10 2023, 5:25 PM

Herald added a project: Restricted Project. Apr 10 2023, 5:25 PM

Harbormaster completed remote builds in B224670: Diff 512291. Apr 10 2023, 6:01 PM

paquette added inline comments. Apr 11 2023, 10:50 AM

llvm/lib/CodeGen/TargetInstrInfo.cpp

1653–1654

I think this could be a llvm::find_if?

return find_if(MBB.getFirstNonDebugInstr(), MBB.getLastNonDebugInstr(), [](const MachineInstr &MI) {
   unsigned Opc = MI.getOpcode();
   switch(Opc) {
      default:
          return false;
      case TargetOpcode::FENTRY_CALL:
      ...
         return true;
   }
}) != MBB.end();

That'd make it easy to add new opcodes/remove opcodes.
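
A full-block scan of this kind can be sketched outside LLVM as follows. This is a standalone model, not the code under review: `Opcode`, `mustStayInPlace`, and `isSafeByFullScan` are hypothetical names, and the block is modelled as a plain `std::vector`. Because a match means the block is not safe, the sketch uses `std::none_of` rather than comparing a `find_if` result against the end iterator, and it scans every instruction including the last one.

```cpp
#include <algorithm>
#include <vector>

// Hypothetical stand-ins for the opcodes discussed in this review.
enum class Opcode {
  Ordinary,
  FentryCall,
  PatchableFunctionEnter,
  PatchableFunctionExit,
  PatchableRet,
  PatchableTailCall,
};

// True for opcodes that expand to code sequences which must stay in place.
// Adding or removing an opcode is a one-line change to this switch.
inline bool mustStayInPlace(Opcode Op) {
  switch (Op) {
  default:
    return false;
  case Opcode::FentryCall:
  case Opcode::PatchableFunctionEnter:
  case Opcode::PatchableFunctionExit:
  case Opcode::PatchableRet:
  case Opcode::PatchableTailCall:
    return true;
  }
}

// Linear scan: the block is safe only if no instruction must stay in place.
inline bool isSafeByFullScan(const std::vector<Opcode> &Block) {
  return std::none_of(Block.begin(), Block.end(), mustStayInPlace);
}
```

The trade-off is that this visits every instruction in the block, whereas a check that only looks at the ends of the block does a constant amount of work.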

dhoekwater added inline comments. Apr 11 2023, 11:15 AM

llvm/lib/CodeGen/TargetInstrInfo.cpp

1653–1654

It definitely could be, and it makes the function visually a lot cleaner. Because PATCHABLE instructions are always inserted at the start or at/just before the end, the previous and current implementations are O(1) in the number of instructions. Being fairly unfamiliar with the performance characteristics of the codebase, I was hesitant to scan through the whole basic block. If the small performance difference isn't a concern, I'd be happy to switch to `llvm::find_if`.

paquette added inline comments. Apr 11 2023, 11:17 AM

llvm/lib/CodeGen/TargetInstrInfo.cpp

1653–1654

Ah I see. Yeah let's keep it like this.
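
For comparison, the constant-time approach that was kept can be modelled in a few lines. This is a hedged, standalone sketch rather than the LLVM implementation: `Opcode` and `isSafeByEndpointCheck` are hypothetical names, the block is a plain `std::vector`, debug instructions are ignored, and a single `Return` opcode stands in for whatever `isReturn()` would accept.

```cpp
#include <vector>

// Hypothetical stand-ins for the opcodes discussed in this review.
enum class Opcode {
  Ordinary,
  Return,
  FentryCall,
  PatchableFunctionEnter,
  PatchableFunctionExit,
  PatchableRet,
  PatchableTailCall,
};

// Inspects at most three instructions regardless of block length: the first,
// the last, and the one just before a trailing return.
inline bool isSafeByEndpointCheck(const std::vector<Opcode> &Block) {
  if (Block.empty())
    return true;

  // Instrumentation that must stay at the start of the block.
  const Opcode First = Block.front();
  if (First == Opcode::FentryCall || First == Opcode::PatchableFunctionEnter)
    return false;

  // Instrumentation that replaces the terminator itself.
  const Opcode Last = Block.back();
  if (Last == Opcode::PatchableRet || Last == Opcode::PatchableTailCall)
    return false;

  // Instrumentation that sits just before a trailing return.
  if (Block.size() > 1 && Last == Opcode::Return) {
    const Opcode BeforeReturn = Block[Block.size() - 2];
    if (BeforeReturn == Opcode::PatchableFunctionExit ||
        BeforeReturn == Opcode::PatchableTailCall)
      return false;
  }
  return true;
}
```

This relies on the placement guarantee stated in the thread: a patchable pseudo-instruction in the middle of a block would go unnoticed by this check.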

I think it would be good to have a MIR testcase for this.

Other than that, I think this looks good.

This revision is now accepted and ready to land. Apr 11 2023, 11:17 AM

In D147982#4258836, @paquette wrote:

I think it would be good to have a MIR testcase for this.

branch-relax-xray.ll tests that Branch Relaxation relaxes branches that are out-of-range because of XRay sleds, and machine-outliner-patchable.ll tests that XRay sleds aren't outlined. Why the MIR test case?

In D147982#4258991, @dhoekwater wrote:

In D147982#4258836, @paquette wrote:

I think it would be good to have a MIR testcase for this.

branch-relax-xray.ll tests that Branch Relaxation relaxes branches that are out-of-range because of XRay sleds, and machine-outliner-patchable.ll tests that XRay sleds aren't outlined. Why the MIR test case?

Because we can see clearly which MIR opcodes should not be outlined, and it defends us against other optimizer changes in earlier passes. Like, if the MIR test starts failing, we know 100% that it's an outliner issue and not some other pass dropping the X-ray ops or something.

In D147982#4259151, @paquette wrote:

Because we can see clearly which MIR opcodes should not be outlined, and it defends us against other optimizer changes in earlier passes. Like, if the MIR test starts failing, we know 100% that it's an outliner issue and not some other pass dropping the X-ray ops or something.

Got it, thanks! I'll write that up real quick.

Added a test to make sure XRay patchable instructions aren't outlined. I think technically I could break this into two separate patches now that I have two independent tests. I don't think that's necessary, but please let me know if it's preferred.

Harbormaster completed remote builds in B224922: Diff 512646. Apr 11 2023, 9:37 PM

I think it's okay to keep all of this in one patch.

I put the MIR test through Compiler Explorer to see what is outlined (and removed some of the stuff that isn't necessary in MIR):

https://godbolt.org/z/e85n9Wh8r

It looks like, without this patch, the outliner does outline something containing PATCHABLE_FUNCTION_EXIT, but we don't get any outlining behaviour for PATCHABLE_FUNCTION_ENTER.

Would it be possible to add a case where PATCHABLE_FUNCTION_ENTER is outlined without this patch?

Because isMBBSafeToOutlineFrom already accounted for TargetOpcode::FENTRY_CALL and TargetOpcode::PATCHABLE_FUNCTION_ENTER, I won't be able to write a test that makes PATCHABLE_FUNCTION_ENTER outlined without this patch.

I could probably write a test that makes sure TargetOpcode::PATCHABLE_RET and TargetOpcode::PATCHABLE_TAIL_CALL aren't outlined, but neither of those instructions are inserted on AArch64 (they're only inserted on ppc64le and x86_64, respectively). AFAIK no test exists to make sure they aren't outlined.

Closed by commit rG6b62166b4cf8: Account for PATCHABLE instrs in Branch Relaxation (authored by dhoekwater). Apr 14 2023, 4:15 PM

This revision was automatically updated to reflect the committed changes.

dhoekwater added a commit: rG6b62166b4cf8: Account for PATCHABLE instrs in Branch Relaxation.

duck-37 mentioned this in D147415: [MachineOutliner] Avoid outlining XRay pseudo-instructions. Jun 26 2023, 11:15 AM

duck-37 mentioned this in D124707: [AArch64] Make sure XRay pseudo-instruction sizes are reported correctly.

Revision Contents

Path                                                        Size

llvm/lib/CodeGen/TargetInstrInfo.cpp                        21 lines
llvm/lib/Target/AArch64/AArch64InstrInfo.cpp                6 lines
llvm/test/CodeGen/AArch64/branch-relax-xray.ll              42 lines
llvm/test/CodeGen/AArch64/machine-outliner-patchable.ll     21 lines
llvm/test/CodeGen/AArch64/machine-outliner-patchable.mir    162 lines

Diff 513790

llvm/lib/CodeGen/TargetInstrInfo.cpp

@@ outliner::InstrType TargetInstrInfo::getOutliningType(
   // If we don't know, delegate to the target-specific hook.
   return getOutliningTypeImpl(MIT, Flags);
 }

 bool TargetInstrInfo::isMBBSafeToOutlineFrom(MachineBasicBlock &MBB,
                                              unsigned &Flags) const {
   // Some instrumentations create special TargetOpcode at the start which
   // expands to special code sequences which must be present.
   auto First = MBB.getFirstNonDebugInstr();
-  if (First != MBB.end() &&
-      (First->getOpcode() == TargetOpcode::FENTRY_CALL ||
-       First->getOpcode() == TargetOpcode::PATCHABLE_FUNCTION_ENTER))
+  if (First == MBB.end())
+    return true;
+
+  if (First->getOpcode() == TargetOpcode::FENTRY_CALL ||
+      First->getOpcode() == TargetOpcode::PATCHABLE_FUNCTION_ENTER)
+    return false;
+
+  // Some instrumentations create special pseudo-instructions at or just before
+  // the end that must be present.
+  auto Last = MBB.getLastNonDebugInstr();
+  if (Last->getOpcode() == TargetOpcode::PATCHABLE_RET ||
+      Last->getOpcode() == TargetOpcode::PATCHABLE_TAIL_CALL)
     return false;

+  if (Last != First && Last->isReturn()) {
+    --Last;
+    if (Last->getOpcode() == TargetOpcode::PATCHABLE_FUNCTION_EXIT ||
+        Last->getOpcode() == TargetOpcode::PATCHABLE_TAIL_CALL)
+      return false;
+  }
   return true;
 }

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp

@@ case TargetOpcode::PATCHPOINT:
     break;
   case TargetOpcode::STATEPOINT:
     NumBytes = StatepointOpers(&MI).getNumPatchBytes();
     assert(NumBytes % 4 == 0 && "Invalid number of NOP bytes requested!");
     // No patch bytes means a normal call inst is emitted
     if (NumBytes == 0)
       NumBytes = 4;
     break;
+  case TargetOpcode::PATCHABLE_FUNCTION_ENTER:
+  case TargetOpcode::PATCHABLE_FUNCTION_EXIT:
+    // An XRay sled can be 4 bytes of alignment plus a 32-byte block.
+    NumBytes = 36;
+    break;

   case AArch64::SPACE:
     NumBytes = MI.getOperand(1).getImm();
     break;
   case TargetOpcode::BUNDLE:
     NumBytes = getInstBundleLength(MI);
     break;
   }

llvm/test/CodeGen/AArch64/branch-relax-xray.ll

This file was added.

				; RUN: llc -mtriple=aarch64-unknown-linux-gnu -aarch64-tbz-offset-bits=4 -aarch64-cbz-offset-bits=4 < %s | FileCheck %s

				;; Check that branch relaxation accounts for the size of xray EXIT sleds
				;; Note that TAIL_CALL sleds don't exist on AArch64 and don't need a test.
				define void @exit(i1 zeroext %0) nounwind "function-instrument"="xray-always" {
				; CHECK-LABEL: exit:
				; CHECK-NEXT: .Lfunc_begin0:
				; CHECK-NEXT: // %bb.0:
				; CHECK-NEXT: .p2align 2
				; CHECK-NEXT: .Lxray_sled_0:
				; CHECK-NEXT: b #32
				; CHECK-COUNT-7: nop
				; CHECK-NOT: nop
				; CHECK: tbnz
				; CHECK-SAME: [[FALLTHROUGH:.LBB[0-9_]+]]
				; CHECK-NEXT: b
				; CHECK-SAME: [[OUT_OF_RANGE:.LBB[0-9_]+]]
				; CHECK-NEXT: [[FALLTHROUGH]]:
				; CHECK-NEXT: bl bar
				; CHECK: .p2align 2
				; CHECK-NEXT: .Lxray_sled_1:
				; CHECK-NEXT: b #32
				; CHECK-COUNT-7: nop
				; CHECK-NOT: nop
				; CHECK-NEXT: .Ltmp1:
				; CHECK-NEXT: ret
				; CHECK-NEXT: [[OUT_OF_RANGE]]:
				; CHECK-SAME: // %end2
				; CHECK-NEXT: bl baz
				br i1 %0, label %end1, label %end2

				end1:
				%2 = call i32 @bar()
				ret void

				end2:
				%3 = call i32 @baz()
				ret void
				}

				declare i32 @bar()
				declare i32 @baz()
				No newline at end of file

llvm/test/CodeGen/AArch64/machine-outliner-patchable.ll

 ; RUN: llc < %s -verify-machineinstrs -enable-machine-outliner | FileCheck %s

 target triple = "aarch64-unknown-linux-gnu"

 declare void @foo(i32, i32, i32, i32) minsize

 ;; TargetOpcode::FENTRY_CALL at the start of the function expands to a __fentry__
 ;; call which must be present. Don't outline it.
 define void @fentry0(i1 %a) nounwind "fentry-call"="true" {
 ; CHECK-LABEL: fentry0:
 ; CHECK-NEXT: // %bb.0:
 ; CHECK-NEXT: # FEntry call
 ; CHECK: // %bb.1:
-; CHECK-NEXT: bl OUTLINED_FUNCTION_1
+; CHECK-NEXT: bl [[OUTLINED_FUNCTION:OUTLINED_FUNCTION_[0-9]+]]
 entry:
 br i1 %a, label %if.then, label %if.end
 if.then:
 call void @foo(i32 1, i32 2, i32 3, i32 4)
 br label %if.end
 if.end:
 call void @foo(i32 5, i32 6, i32 7, i32 8)
 ret void
 }

 define void @fentry1(i1 %a) nounwind "fentry-call"="true" {
 ; CHECK-LABEL: fentry1:
 ; CHECK-NEXT: // %bb.0:
 ; CHECK-NEXT: # FEntry call
 ; CHECK: // %bb.1:
-; CHECK-NEXT: bl OUTLINED_FUNCTION_1
+; CHECK-NEXT: bl [[OUTLINED_FUNCTION]]
 entry:
 br i1 %a, label %if.then, label %if.end
 if.then:
 call void @foo(i32 1, i32 2, i32 3, i32 4)
 br label %if.end
 if.end:
 call void @foo(i32 5, i32 6, i32 7, i32 8)
 ret void
 }

 ;; TargetOpcode::PATCHABLE_FUNCTION_ENTER at the start of the function expands to
 ;; NOPs which must be present. Don't outline them.
 define void @patchable0(i1 %a) nounwind "patchable-function-entry"="2" {
 ; CHECK-LABEL: patchable0:
 ; CHECK-NEXT: .Lfunc_begin0:
 ; CHECK-NEXT: // %bb.0:
 ; CHECK-NEXT: nop
 ; CHECK-NEXT: nop
 ; CHECK: // %bb.1:
-; CHECK-NEXT: bl OUTLINED_FUNCTION_1
+; CHECK-NEXT: bl [[OUTLINED_FUNCTION]]
 entry:
 br i1 %a, label %if.then, label %if.end
 if.then:
 call void @foo(i32 1, i32 2, i32 3, i32 4)
 br label %if.end
 if.end:
 call void @foo(i32 5, i32 6, i32 7, i32 8)
 ret void
 }

 define void @patchable1(i1 %a) nounwind "patchable-function-entry"="2" {
 ; CHECK-LABEL: patchable1:
 ; CHECK-NEXT: .Lfunc_begin1:
 ; CHECK-NEXT: // %bb.0:
 ; CHECK-NEXT: nop
 ; CHECK-NEXT: nop
 ; CHECK: // %bb.1:
-; CHECK-NEXT: bl OUTLINED_FUNCTION_1
+; CHECK-NEXT: bl [[OUTLINED_FUNCTION]]
 entry:
 br i1 %a, label %if.then, label %if.end
 if.then:
 call void @foo(i32 1, i32 2, i32 3, i32 4)
 br label %if.end
 if.end:
 call void @foo(i32 5, i32 6, i32 7, i32 8)
 ret void
 }

 ;; Similar to "patchable-function-entry".
 define void @xray0(i1 %a) nounwind "function-instrument"="xray-always" {
 ; CHECK-LABEL: xray0:
 ; CHECK-NEXT: .Lfunc_begin2:
 ; CHECK-NEXT: // %bb.0:
 ; CHECK-NEXT: .p2align 2
 ; CHECK-NEXT: .Lxray_sled_0:
 ; CHECK: // %bb.1:
-; CHECK-NEXT: bl OUTLINED_FUNCTION_1
+; CHECK-NEXT: bl [[OUTLINED_FUNCTION]]
 entry:
 br i1 %a, label %if.then, label %if.end
 if.then:
 call void @foo(i32 1, i32 2, i32 3, i32 4)
 br label %if.end
 if.end:
 call void @foo(i32 5, i32 6, i32 7, i32 8)
 ret void
 }

 define void @xray1(i1 %a) nounwind "function-instrument"="xray-always" {
 ; CHECK-LABEL: xray1:
 ; CHECK-NEXT: .Lfunc_begin3:
 ; CHECK-NEXT: // %bb.0:
 ; CHECK-NEXT: .p2align 2
 ; CHECK-NEXT: .Lxray_sled_2:
 ; CHECK: // %bb.1:
-; CHECK-NEXT: bl OUTLINED_FUNCTION_1
+; CHECK-NEXT: bl [[OUTLINED_FUNCTION]]
 entry:
 br i1 %a, label %if.then, label %if.end
 if.then:
 call void @foo(i32 1, i32 2, i32 3, i32 4)
 br label %if.end
 if.end:
 call void @foo(i32 5, i32 6, i32 7, i32 8)
 ret void
 }

+;; Make sure that OUTLINED_FUNCTION contains the right instructions
+; CHECK: [[OUTLINED_FUNCTION]]:
+; CHECK-NEXT: // %bb.0:
+; CHECK-NEXT: mov w0, #1
+; CHECK-NEXT: mov w1, #2
+; CHECK-NEXT: mov w2, #3
+; CHECK-NEXT: mov w3, #4
+; CHECK-NEXT: b foo

llvm/test/CodeGen/AArch64/machine-outliner-patchable.mir

This file was added.

				# RUN: llc -mtriple=aarch64-none-linux-gnu -run-pass machine-outliner -verify-machineinstrs -enable-machine-outliner %s -o - | FileCheck %s
				--- |
				; Function Attrs: minsize
				declare void @foo(i32, i32, i32, i32) #0

				; Function Attrs: nounwind
				define void @xray0(i1 %a) #1 {
				entry:
				br i1 %a, label %if.then, label %if.end

				if.then: ; preds = %entry
				call void @foo(i32 1, i32 2, i32 3, i32 4)
				br label %if.end

				if.end: ; preds = %if.then, %entry
				call void @foo(i32 5, i32 6, i32 7, i32 8)
				ret void
				}

				; Function Attrs: nounwind
				define void @xray1(i1 %a) #1 {
				entry:
				br i1 %a, label %if.then, label %if.end

				if.then: ; preds = %entry
				call void @foo(i32 1, i32 2, i32 3, i32 4)
				br label %if.end

				if.end: ; preds = %if.then, %entry
				call void @foo(i32 5, i32 6, i32 7, i32 8)
				ret void
				}

				attributes #0 = { minsize }
				attributes #1 = { nounwind "function-instrument"="xray-always" }

				...
				---
				name: xray0
				tracksRegLiveness: true
				liveins:
				- { reg: '$w0', virtual-reg: '' }
				stack:
				- { id: 0, name: '', type: spill-slot, offset: -16, size: 8, alignment: 16,
				stack-id: default, callee-saved-register: '$lr', callee-saved-restored: true,
				debug-info-variable: '', debug-info-expression: '', debug-info-location: '' }
				machineFunctionInfo:
				hasRedZone: false
				body: |
				; CHECK-LABEL: name: xray0
				; CHECK: bb.0.entry:
				; CHECK: PATCHABLE_FUNCTION_ENTER
				; CHECK: bb.1.if.then:
				; CHECK: BL @[[OUTLINED_FUNCTION:OUTLINED_FUNCTION_[0-9]]]
				; CHECK: bb.2.if.end:
				; CHECK-NEXT: $w0 = MOVZWi 5, 0
				; CHECK-NEXT: $w1 = MOVZWi 6, 0
				; CHECK-NEXT: $w2 = MOVZWi 7, 0
				; CHECK-NEXT: $w3 = MOVZWi 8, 0
				; CHECK-NEXT: BL @foo, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit killed $w0, implicit killed $w1, implicit killed $w2, implicit killed $w3, implicit-def $sp
				; CHECK: $w0 = MOVZWi 5, 0
				; CHECK-NEXT: $w1 = MOVZWi 6, 0
				; CHECK-NEXT: PATCHABLE_FUNCTION_EXIT
				; CHECK-NEXT: RET undef $lr

				bb.0.entry:
				successors: %bb.1(0x40000000), %bb.2(0x40000000)
				liveins: $w0, $lr

				PATCHABLE_FUNCTION_ENTER
				early-clobber $sp = frame-setup STRXpre killed $lr, $sp, -16 :: (store (s64) into %stack.0)
				TBZW killed renamable $w0, 0, %bb.2

				bb.1.if.then:
				successors: %bb.2(0x80000000)

				$w0 = MOVZWi 1, 0
				$w1 = MOVZWi 2, 0
				$w2 = MOVZWi 3, 0
				$w3 = MOVZWi 4, 0
				BL @foo, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit killed $w0, implicit killed $w1, implicit killed $w2, implicit killed $w3, implicit-def $sp

				bb.2.if.end:
				$w0 = MOVZWi 5, 0
				$w1 = MOVZWi 6, 0
				$w2 = MOVZWi 7, 0
				$w3 = MOVZWi 8, 0
				BL @foo, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit killed $w0, implicit killed $w1, implicit killed $w2, implicit killed $w3, implicit-def $sp
				early-clobber $sp, $lr = frame-destroy LDRXpost $sp, 16 :: (load (s64) from %stack.0)
				$w0 = MOVZWi 5, 0
				$w1 = MOVZWi 6, 0
				PATCHABLE_FUNCTION_EXIT
				RET undef $lr

				...
				---
				name: xray1
				tracksRegLiveness: true
				liveins:
				- { reg: '$w0', virtual-reg: '' }
				stack:
				- { id: 0, name: '', type: spill-slot, offset: -16, size: 8, alignment: 16,
				stack-id: default, callee-saved-register: '$lr', callee-saved-restored: true,
				debug-info-variable: '', debug-info-expression: '', debug-info-location: '' }
				machineFunctionInfo:
				hasRedZone: false
				body: |
				; CHECK-LABEL: name: xray1
				; CHECK: bb.0.entry:
				; CHECK: PATCHABLE_FUNCTION_ENTER
				; CHECK: bb.1.if.then:
				; CHECK: BL @[[OUTLINED_FUNCTION]]
				; CHECK: bb.2.if.end:
				; CHECK-NEXT: $w0 = MOVZWi 5, 0
				; CHECK-NEXT: $w1 = MOVZWi 6, 0
				; CHECK-NEXT: $w2 = MOVZWi 7, 0
				; CHECK-NEXT: $w3 = MOVZWi 8, 0
				; CHECK-NEXT: BL @foo, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit killed $w0, implicit killed $w1, implicit killed $w2, implicit killed $w3, implicit-def $sp
				; CHECK: $w0 = MOVZWi 5, 0
				; CHECK-NEXT: $w1 = MOVZWi 6, 0
				; CHECK-NEXT: PATCHABLE_FUNCTION_EXIT
				; CHECK-NEXT: RET undef $lr

				bb.0.entry:
				successors: %bb.1(0x40000000), %bb.2(0x40000000)
				liveins: $w0, $lr

				PATCHABLE_FUNCTION_ENTER
				early-clobber $sp = frame-setup STRXpre killed $lr, $sp, -16 :: (store (s64) into %stack.0)
				TBZW killed renamable $w0, 0, %bb.2

				bb.1.if.then:
				successors: %bb.2(0x80000000)

				$w0 = MOVZWi 1, 0
				$w1 = MOVZWi 2, 0
				$w2 = MOVZWi 3, 0
				$w3 = MOVZWi 4, 0
				BL @foo, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit killed $w0, implicit killed $w1, implicit killed $w2, implicit killed $w3, implicit-def $sp

				bb.2.if.end:
				$w0 = MOVZWi 5, 0
				$w1 = MOVZWi 6, 0
				$w2 = MOVZWi 7, 0
				$w3 = MOVZWi 8, 0
				BL @foo, csr_aarch64_aapcs, implicit-def dead $lr, implicit $sp, implicit killed $w0, implicit killed $w1, implicit killed $w2, implicit killed $w3, implicit-def $sp
				early-clobber $sp, $lr = frame-destroy LDRXpost $sp, 16 :: (load (s64) from %stack.0)
				$w0 = MOVZWi 5, 0
				$w1 = MOVZWi 6, 0
				PATCHABLE_FUNCTION_EXIT
				RET undef $lr

				; CHECK: name: [[OUTLINED_FUNCTION]]
				; CHECK: bb.0:
				; CHECK: $w0 = MOVZWi 1, 0
				; CHECK-NEXT: $w1 = MOVZWi 2, 0
				; CHECK-NEXT: $w2 = MOVZWi 3, 0
				; CHECK-NEXT: $w3 = MOVZWi 4, 0
				; CHECK-NEXT: TCRETURNdi @foo, 0, implicit $sp

				...
