This is an archive of the discontinued LLVM Phabricator instance.

[AArch64] Move SLS later in pass pipeline
ClosedPublic

Authored by olista01 on Aug 22 2023, 6:46 AM.

Download Raw Diff

Details

Reviewers

kristof.beyls
paquette
stuij

Commits

rG7e8eccd990d3: [AArch64] Move SLS later in pass pipeline

Summary

Currently, the SLS hardening pass is run before the machine outliner,
which means that the outliner creates new functions and calls which do
not have the SLS hardening applied.

The fix for this is to move the SLS passes to after the outliner, as has
recently been done for the return address signing pass.

This also avoids a bug where the SLS outliner emits code with
instructions after a return, which the outliner doesn't correctly
handle.

This results in the heuristics used by the outliner being wrong, since
it doesn't know about the extra instructions the SLS pass will add to
every call and return. That's not a correctness problem, so I'll update
them in a separate patch.

Previous summary:

The SLS hardening pass inserts barrier instructions after return
instructions, so it was causing the machine outliner to fail to notice
return blocks, so it could outline at places where LR is live. The fix
for this is to check all terminator instructions when deciding if a
block ends in a return, not just the least one.

I think there's a deeper issue that we don't correctly track the
liveness of LR around function returns. This appears to be deliberate,
according to the comment on RET_ReallyLR in AArch64InstrInfo.td. I tried
changing this to make LR an implicit operand of all returns, but haven't
found a way to do that which doesn't cause a large number of MIR
verifier failures.

I think there's also an issue here that the outliner can create code
which does not have the SLS barrier instructions after a tail-call.
@Kristof, do you know if the SLS pass could be moved to after the
outliner to fix this, or does it need to happen as early as it does?

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

olista01 created this revision.Aug 22 2023, 6:46 AM

Herald added a project: Restricted Project. · View Herald TranscriptAug 22 2023, 6:46 AM

Herald added a subscriber: kristof.beyls. · View Herald Transcript

olista01 requested review of this revision.Aug 22 2023, 6:46 AM

Herald added a project: Restricted Project. · View Herald TranscriptAug 22 2023, 6:46 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B254078: Diff 552336.Aug 22 2023, 6:46 AM

olista01 added a parent revision: D158512: [AArch64] Add test showing incorrect code-gen.Aug 22 2023, 6:48 AM

olista01 added reviewers: kristof.beyls, paquette, stuij.

I don't remember any reason why the SLS pass needs to run before the outliner.
It seems to me that indeed it may be easier to move to SLS pass to run after the outliner - i.e. run after any pass that may need to interpret the semantics of return instructions.

Move SLS pass later in the pass pipeline

Herald added a subscriber: hiraditya. · View Herald TranscriptOct 19 2023, 1:59 AM

Harbormaster completed remote builds in B257866: Diff 557769.Oct 19 2023, 3:02 AM

kristof.beyls added inline comments.Oct 25 2023, 1:04 AM

llvm/lib/Target/AArch64/AArch64SLSHardening.cpp
225–227	I guess this change is a side-effect of moving the AArch64IndirectThunks pass later in the pipeline. But I cannot easily guess what exactly triggers this change in behavior. I wonder if you happen to know?

olista01 added inline comments.Oct 25 2023, 1:33 AM

llvm/lib/Target/AArch64/AArch64SLSHardening.cpp
225–227	IndirectThunks.h:84 has a comment explaining that it does not create the entry block, but I've not worked out where it was being created before.

LGTM, thanks!

llvm/lib/Target/AArch64/AArch64SLSHardening.cpp
225–227	Thanks. I guess it doesn't really matter much.

This revision is now accepted and ready to land.Oct 25 2023, 1:36 AM

Closed by commit rG7e8eccd990d3: [AArch64] Move SLS later in pass pipeline (authored by olista01). · Explain WhyOct 25 2023, 2:45 AM

This revision was automatically updated to reflect the committed changes.

olista01 added a commit: rG7e8eccd990d3: [AArch64] Move SLS later in pass pipeline.

olista01 added a reverting change: rG339faffd053b: Revert "[AArch64] Move SLS later in pass pipeline".Oct 26 2023, 1:51 AM

Revision Contents

Path

Size

llvm/

lib/

Target/

AArch64/

AArch64SLSHardening.cpp

3 lines

AArch64TargetMachine.cpp

5 lines

test/

CodeGen/

AArch64/

O0-pipeline.ll

4 lines

O3-pipeline.ll

4 lines

arm64-opt-remarks-lazy-bfi.ll

24 lines

sls-stackprotector-outliner.ll

12 lines

Diff 557875

llvm/lib/Target/AArch64/AArch64SLSHardening.cpp

Show First 20 Lines • Show All 216 Lines • ▼ Show 20 Lines	void SLSBLRThunkInserter::populateThunk(MachineFunction &MF) {
assert(MF.getName().startswith(getThunkPrefix()));		assert(MF.getName().startswith(getThunkPrefix()));
auto ThunkIt = llvm::find_if(		auto ThunkIt = llvm::find_if(
SLSBLRThunks, [&MF](auto T) { return T.Name == MF.getName(); });		SLSBLRThunks, [&MF](auto T) { return T.Name == MF.getName(); });
assert(ThunkIt != std::end(SLSBLRThunks));		assert(ThunkIt != std::end(SLSBLRThunks));
Register ThunkReg = ThunkIt->Reg;		Register ThunkReg = ThunkIt->Reg;

const TargetInstrInfo *TII =		const TargetInstrInfo *TII =
MF.getSubtarget<AArch64Subtarget>().getInstrInfo();		MF.getSubtarget<AArch64Subtarget>().getInstrInfo();
assert (MF.size() == 1);		assert (MF.size() == 0);
		MF.push_back(MF.CreateMachineBasicBlock());
MachineBasicBlock *Entry = &MF.front();		MachineBasicBlock *Entry = &MF.front();
		kristof.beylsUnsubmitted Not Done Reply Inline Actions I guess this change is a side-effect of moving the AArch64IndirectThunks pass later in the pipeline. But I cannot easily guess what exactly triggers this change in behavior. I wonder if you happen to know? kristof.beyls: I guess this change is a side-effect of moving the AArch64IndirectThunks pass later in the…
		olista01AuthorUnsubmitted Done Reply Inline Actions IndirectThunks.h:84 has a comment explaining that it does not create the entry block, but I've not worked out where it was being created before. olista01: IndirectThunks.h:84 has a comment explaining that it does not create the entry block, but I've…
		kristof.beylsUnsubmitted Not Done Reply Inline Actions Thanks. I guess it doesn't really matter much. kristof.beyls: Thanks. I guess it doesn't really matter much.
Entry->clear();		Entry->clear();

// These thunks need to consist of the following instructions:		// These thunks need to consist of the following instructions:
// __llvm_slsblr_thunk_xN:		// __llvm_slsblr_thunk_xN:
// BR xN		// BR xN
// barrierInsts		// barrierInsts
Entry->addLiveIn(ThunkReg);		Entry->addLiveIn(ThunkReg);
// MOV X16, ThunkReg == ORR X16, XZR, ThunkReg, LSL #0		// MOV X16, ThunkReg == ORR X16, XZR, ThunkReg, LSL #0
▲ Show 20 Lines • Show All 217 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64TargetMachine.cpp

Show First 20 Lines • Show All 790 Lines • ▼ Show 20 Lines	void AArch64PassConfig::addPreSched2() {

// The AArch64SpeculationHardeningPass destroys dominator tree and natural		// The AArch64SpeculationHardeningPass destroys dominator tree and natural
// loop info, which is needed for the FalkorHWPFFixPass and also later on.		// loop info, which is needed for the FalkorHWPFFixPass and also later on.
// Therefore, run the AArch64SpeculationHardeningPass before the		// Therefore, run the AArch64SpeculationHardeningPass before the
// FalkorHWPFFixPass to avoid recomputing dominator tree and natural loop		// FalkorHWPFFixPass to avoid recomputing dominator tree and natural loop
// info.		// info.
addPass(createAArch64SpeculationHardeningPass());		addPass(createAArch64SpeculationHardeningPass());

addPass(createAArch64IndirectThunks());
addPass(createAArch64SLSHardeningPass());

if (TM->getOptLevel() != CodeGenOptLevel::None) {		if (TM->getOptLevel() != CodeGenOptLevel::None) {
if (EnableFalkorHWPFFix)		if (EnableFalkorHWPFFix)
addPass(createFalkorHWPFFixPass());		addPass(createFalkorHWPFFixPass());
}		}
}		}

void AArch64PassConfig::addPreEmitPass() {		void AArch64PassConfig::addPreEmitPass() {
// Machine Block Placement might have created new opportunities when run		// Machine Block Placement might have created new opportunities when run
Show All 16 Lines	void AArch64PassConfig::addPreEmitPass() {
}		}

if (TM->getOptLevel() != CodeGenOptLevel::None && EnableCollectLOH &&		if (TM->getOptLevel() != CodeGenOptLevel::None && EnableCollectLOH &&
TM->getTargetTriple().isOSBinFormatMachO())		TM->getTargetTriple().isOSBinFormatMachO())
addPass(createAArch64CollectLOHPass());		addPass(createAArch64CollectLOHPass());
}		}

void AArch64PassConfig::addPostBBSections() {		void AArch64PassConfig::addPostBBSections() {
		addPass(createAArch64IndirectThunks());
		addPass(createAArch64SLSHardeningPass());
addPass(createAArch64PointerAuthPass());		addPass(createAArch64PointerAuthPass());
if (EnableBranchTargets)		if (EnableBranchTargets)
addPass(createAArch64BranchTargetsPass());		addPass(createAArch64BranchTargetsPass());
// Relax conditional branch instructions if they're otherwise out of		// Relax conditional branch instructions if they're otherwise out of
// range of their destination.		// range of their destination.
if (BranchRelaxation)		if (BranchRelaxation)
addPass(&BranchRelaxationPassID);		addPass(&BranchRelaxationPassID);

Show All 36 Lines

llvm/test/CodeGen/AArch64/O0-pipeline.ll

	Show First 20 Lines • Show All 58 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: Fixup Statepoint Caller Saved			; CHECK-NEXT: Fixup Statepoint Caller Saved
	; CHECK-NEXT: Lazy Machine Block Frequency Analysis			; CHECK-NEXT: Lazy Machine Block Frequency Analysis
	; CHECK-NEXT: Machine Optimization Remark Emitter			; CHECK-NEXT: Machine Optimization Remark Emitter
	; CHECK-NEXT: Prologue/Epilogue Insertion & Frame Finalization			; CHECK-NEXT: Prologue/Epilogue Insertion & Frame Finalization
	; CHECK-NEXT: Post-RA pseudo instruction expansion pass			; CHECK-NEXT: Post-RA pseudo instruction expansion pass
	; CHECK-NEXT: AArch64 pseudo instruction expansion pass			; CHECK-NEXT: AArch64 pseudo instruction expansion pass
	; CHECK-NEXT: Insert KCFI indirect call checks			; CHECK-NEXT: Insert KCFI indirect call checks
	; CHECK-NEXT: AArch64 speculation hardening pass			; CHECK-NEXT: AArch64 speculation hardening pass
	; CHECK-NEXT: AArch64 Indirect Thunks
	; CHECK-NEXT: AArch64 sls hardening pass
	; CHECK-NEXT: Analyze Machine Code For Garbage Collection			; CHECK-NEXT: Analyze Machine Code For Garbage Collection
	; CHECK-NEXT: Insert fentry calls			; CHECK-NEXT: Insert fentry calls
	; CHECK-NEXT: Insert XRay ops			; CHECK-NEXT: Insert XRay ops
	; CHECK-NEXT: Implement the 'patchable-function' attribute			; CHECK-NEXT: Implement the 'patchable-function' attribute
	; CHECK-NEXT: Workaround A53 erratum 835769 pass			; CHECK-NEXT: Workaround A53 erratum 835769 pass
	; CHECK-NEXT: Contiguously Lay Out Funclets			; CHECK-NEXT: Contiguously Lay Out Funclets
	; CHECK-NEXT: StackMap Liveness Analysis			; CHECK-NEXT: StackMap Liveness Analysis
	; CHECK-NEXT: Live DEBUG_VALUE analysis			; CHECK-NEXT: Live DEBUG_VALUE analysis
	; CHECK-NEXT: Machine Sanitizer Binary Metadata			; CHECK-NEXT: Machine Sanitizer Binary Metadata
				; CHECK-NEXT: AArch64 Indirect Thunks
				; CHECK-NEXT: AArch64 sls hardening pass
	; CHECK-NEXT: AArch64 Pointer Authentication			; CHECK-NEXT: AArch64 Pointer Authentication
	; CHECK-NEXT: AArch64 Branch Targets			; CHECK-NEXT: AArch64 Branch Targets
	; CHECK-NEXT: Branch relaxation pass			; CHECK-NEXT: Branch relaxation pass
	; CHECK-NEXT: Insert CFI remember/restore state instructions			; CHECK-NEXT: Insert CFI remember/restore state instructions
	; CHECK-NEXT: Lazy Machine Block Frequency Analysis			; CHECK-NEXT: Lazy Machine Block Frequency Analysis
	; CHECK-NEXT: Machine Optimization Remark Emitter			; CHECK-NEXT: Machine Optimization Remark Emitter
	; CHECK-NEXT: Stack Frame Layout Analysis			; CHECK-NEXT: Stack Frame Layout Analysis
	; CHECK-NEXT: Unpack machine instruction bundles			; CHECK-NEXT: Unpack machine instruction bundles
	; CHECK-NEXT: Lazy Machine Block Frequency Analysis			; CHECK-NEXT: Lazy Machine Block Frequency Analysis
	; CHECK-NEXT: Machine Optimization Remark Emitter			; CHECK-NEXT: Machine Optimization Remark Emitter
	; CHECK-NEXT: AArch64 Assembly Printer			; CHECK-NEXT: AArch64 Assembly Printer
	; CHECK-NEXT: Free MachineFunction			; CHECK-NEXT: Free MachineFunction

	define void @f() {			define void @f() {
	ret void			ret void
	}			}

llvm/test/CodeGen/AArch64/O3-pipeline.ll

	Show First 20 Lines • Show All 195 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: Lazy Machine Block Frequency Analysis			; CHECK-NEXT: Lazy Machine Block Frequency Analysis
	; CHECK-NEXT: Tail Duplication			; CHECK-NEXT: Tail Duplication
	; CHECK-NEXT: Machine Copy Propagation Pass			; CHECK-NEXT: Machine Copy Propagation Pass
	; CHECK-NEXT: Post-RA pseudo instruction expansion pass			; CHECK-NEXT: Post-RA pseudo instruction expansion pass
	; CHECK-NEXT: AArch64 pseudo instruction expansion pass			; CHECK-NEXT: AArch64 pseudo instruction expansion pass
	; CHECK-NEXT: AArch64 load / store optimization pass			; CHECK-NEXT: AArch64 load / store optimization pass
	; CHECK-NEXT: Insert KCFI indirect call checks			; CHECK-NEXT: Insert KCFI indirect call checks
	; CHECK-NEXT: AArch64 speculation hardening pass			; CHECK-NEXT: AArch64 speculation hardening pass
	; CHECK-NEXT: AArch64 Indirect Thunks
	; CHECK-NEXT: AArch64 sls hardening pass
	; CHECK-NEXT: MachineDominator Tree Construction			; CHECK-NEXT: MachineDominator Tree Construction
	; CHECK-NEXT: Machine Natural Loop Construction			; CHECK-NEXT: Machine Natural Loop Construction
	; CHECK-NEXT: Falkor HW Prefetch Fix Late Phase			; CHECK-NEXT: Falkor HW Prefetch Fix Late Phase
	; CHECK-NEXT: PostRA Machine Instruction Scheduler			; CHECK-NEXT: PostRA Machine Instruction Scheduler
	; CHECK-NEXT: Analyze Machine Code For Garbage Collection			; CHECK-NEXT: Analyze Machine Code For Garbage Collection
	; CHECK-NEXT: Machine Block Frequency Analysis			; CHECK-NEXT: Machine Block Frequency Analysis
	; CHECK-NEXT: MachinePostDominator Tree Construction			; CHECK-NEXT: MachinePostDominator Tree Construction
	; CHECK-NEXT: Branch Probability Basic Block Placement			; CHECK-NEXT: Branch Probability Basic Block Placement
	; CHECK-NEXT: Insert fentry calls			; CHECK-NEXT: Insert fentry calls
	; CHECK-NEXT: Insert XRay ops			; CHECK-NEXT: Insert XRay ops
	; CHECK-NEXT: Implement the 'patchable-function' attribute			; CHECK-NEXT: Implement the 'patchable-function' attribute
	; CHECK-NEXT: AArch64 load / store optimization pass			; CHECK-NEXT: AArch64 load / store optimization pass
	; CHECK-NEXT: Machine Copy Propagation Pass			; CHECK-NEXT: Machine Copy Propagation Pass
	; CHECK-NEXT: Workaround A53 erratum 835769 pass			; CHECK-NEXT: Workaround A53 erratum 835769 pass
	; CHECK-NEXT: Contiguously Lay Out Funclets			; CHECK-NEXT: Contiguously Lay Out Funclets
	; CHECK-NEXT: StackMap Liveness Analysis			; CHECK-NEXT: StackMap Liveness Analysis
	; CHECK-NEXT: Live DEBUG_VALUE analysis			; CHECK-NEXT: Live DEBUG_VALUE analysis
	; CHECK-NEXT: Machine Sanitizer Binary Metadata			; CHECK-NEXT: Machine Sanitizer Binary Metadata
	; CHECK-NEXT: Machine Outliner			; CHECK-NEXT: Machine Outliner
	; CHECK-NEXT: FunctionPass Manager			; CHECK-NEXT: FunctionPass Manager
				; CHECK-NEXT: AArch64 Indirect Thunks
				; CHECK-NEXT: AArch64 sls hardening pass
	; CHECK-NEXT: AArch64 Pointer Authentication			; CHECK-NEXT: AArch64 Pointer Authentication
	; CHECK-NEXT: AArch64 Branch Targets			; CHECK-NEXT: AArch64 Branch Targets
	; CHECK-NEXT: Branch relaxation pass			; CHECK-NEXT: Branch relaxation pass
	; CHECK-NEXT: AArch64 Compress Jump Tables			; CHECK-NEXT: AArch64 Compress Jump Tables
	; CHECK-NEXT: Insert CFI remember/restore state instructions			; CHECK-NEXT: Insert CFI remember/restore state instructions
	; CHECK-NEXT: Lazy Machine Block Frequency Analysis			; CHECK-NEXT: Lazy Machine Block Frequency Analysis
	; CHECK-NEXT: Machine Optimization Remark Emitter			; CHECK-NEXT: Machine Optimization Remark Emitter
	; CHECK-NEXT: Stack Frame Layout Analysis			; CHECK-NEXT: Stack Frame Layout Analysis
	Show All 23 Lines

llvm/test/CodeGen/AArch64/arm64-opt-remarks-lazy-bfi.ll

	Show All 26 Lines


	; Verify that we only populate MachineBFI on behalf of ORE when hotness is			; Verify that we only populate MachineBFI on behalf of ORE when hotness is
	; requested. (This hard-codes the previous pass to the Assembly Printer,			; requested. (This hard-codes the previous pass to the Assembly Printer,
	; please adjust accordingly.)			; please adjust accordingly.)

	; HOTNESS: Freeing Pass 'Machine Outliner'			; HOTNESS: Freeing Pass 'Machine Outliner'
	; HOTNESS-NEXT: Executing Pass 'Function Pass Manager'			; HOTNESS-NEXT: Executing Pass 'Function Pass Manager'
	; HOTNESS-NEXT: Executing Pass 'Verify generated machine code'			; HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...
	; HOTNESS-NEXT: Freeing Pass 'Verify generated machine code'			; HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...
				; HOTNESS-NEXT: Executing Pass 'AArch64 Indirect Thunks' on Function 'empty_func'...
				; HOTNESS-NEXT: Freeing Pass 'AArch64 Indirect Thunks' on Function 'empty_func'...
				; HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...
				; HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...
				; HOTNESS-NEXT: Executing Pass 'AArch64 sls hardening pass' on Function 'empty_func'...
				; HOTNESS-NEXT: Freeing Pass 'AArch64 sls hardening pass' on Function 'empty_func'...
				; HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...
				; HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...
	; HOTNESS-NEXT: Executing Pass 'AArch64 Pointer Authentication' on Function 'empty_func'...			; HOTNESS-NEXT: Executing Pass 'AArch64 Pointer Authentication' on Function 'empty_func'...
	; HOTNESS-NEXT: Freeing Pass 'AArch64 Pointer Authentication' on Function 'empty_func'...			; HOTNESS-NEXT: Freeing Pass 'AArch64 Pointer Authentication' on Function 'empty_func'...
	; HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...			; HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...
	; HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...			; HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...
	; HOTNESS-NEXT: Executing Pass 'AArch64 Branch Targets' on Function 'empty_func'...			; HOTNESS-NEXT: Executing Pass 'AArch64 Branch Targets' on Function 'empty_func'...
	; HOTNESS-NEXT: Freeing Pass 'AArch64 Branch Targets' on Function 'empty_func'...			; HOTNESS-NEXT: Freeing Pass 'AArch64 Branch Targets' on Function 'empty_func'...
	; HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...			; HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...
	; HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...			; HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...
	Show All 23 Lines
	; HOTNESS-NOT: Executing Pass			; HOTNESS-NOT: Executing Pass
	; HOTNESS: Executing Pass 'AArch64 Assembly Printer'			; HOTNESS: Executing Pass 'AArch64 Assembly Printer'

	; HOTNESS: arm64-summary-remarks.ll:5:0: 1 instructions in function (hotness: 33)			; HOTNESS: arm64-summary-remarks.ll:5:0: 1 instructions in function (hotness: 33)


	; NO_HOTNESS: Freeing Pass 'Machine Outliner'			; NO_HOTNESS: Freeing Pass 'Machine Outliner'
	; NO_HOTNESS-NEXT: Executing Pass 'Function Pass Manager'			; NO_HOTNESS-NEXT: Executing Pass 'Function Pass Manager'
	; NO_HOTNESS-NEXT: Executing Pass 'Verify generated machine code'			; NO_HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...
	; NO_HOTNESS-NEXT: Freeing Pass 'Verify generated machine code'			; NO_HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...
				; NO_HOTNESS-NEXT: Executing Pass 'AArch64 Indirect Thunks' on Function 'empty_func'...
				; NO_HOTNESS-NEXT: Freeing Pass 'AArch64 Indirect Thunks' on Function 'empty_func'...
				; NO_HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...
				; NO_HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...
				; NO_HOTNESS-NEXT: Executing Pass 'AArch64 sls hardening pass' on Function 'empty_func'...
				; NO_HOTNESS-NEXT: Freeing Pass 'AArch64 sls hardening pass' on Function 'empty_func'...
				; NO_HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...
				; NO_HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...
	; NO_HOTNESS-NEXT: Executing Pass 'AArch64 Pointer Authentication' on Function 'empty_func'...			; NO_HOTNESS-NEXT: Executing Pass 'AArch64 Pointer Authentication' on Function 'empty_func'...
	; NO_HOTNESS-NEXT: Freeing Pass 'AArch64 Pointer Authentication' on Function 'empty_func'...			; NO_HOTNESS-NEXT: Freeing Pass 'AArch64 Pointer Authentication' on Function 'empty_func'...
	; NO_HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...			; NO_HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...
	; NO_HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...			; NO_HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...
	; NO_HOTNESS-NEXT: Executing Pass 'AArch64 Branch Targets' on Function 'empty_func'...			; NO_HOTNESS-NEXT: Executing Pass 'AArch64 Branch Targets' on Function 'empty_func'...
	; NO_HOTNESS-NEXT: Freeing Pass 'AArch64 Branch Targets' on Function 'empty_func'...			; NO_HOTNESS-NEXT: Freeing Pass 'AArch64 Branch Targets' on Function 'empty_func'...
	; NO_HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...			; NO_HOTNESS-NEXT: Executing Pass 'Verify generated machine code' on Function 'empty_func'...
	; NO_HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...			; NO_HOTNESS-NEXT: Freeing Pass 'Verify generated machine code' on Function 'empty_func'...
	Show All 34 Lines

llvm/test/CodeGen/AArch64/sls-stackprotector-outliner.ll

	Show All 12 Lines
	; CHECK-NEXT: sub sp, sp, #32			; CHECK-NEXT: sub sp, sp, #32
	; CHECK-NEXT: str x30, [sp, #16] // 8-byte Folded Spill			; CHECK-NEXT: str x30, [sp, #16] // 8-byte Folded Spill
	; CHECK-NEXT: .cfi_def_cfa_offset 32			; CHECK-NEXT: .cfi_def_cfa_offset 32
	; CHECK-NEXT: .cfi_offset w30, -16			; CHECK-NEXT: .cfi_offset w30, -16
	; CHECK-NEXT: bl OUTLINED_FUNCTION_0			; CHECK-NEXT: bl OUTLINED_FUNCTION_0
	; CHECK-NEXT: b.ne .LBB0_2			; CHECK-NEXT: b.ne .LBB0_2
	; CHECK-NEXT: // %bb.1: // %entry			; CHECK-NEXT: // %bb.1: // %entry
	; CHECK-NEXT: ldr x30, [sp, #16] // 8-byte Folded Reload			; CHECK-NEXT: ldr x30, [sp, #16] // 8-byte Folded Reload
	; CHECK-NEXT: bl OUTLINED_FUNCTION_1			; CHECK-NEXT: add x0, x0, x8
				; CHECK-NEXT: add sp, sp, #32
	; CHECK-NEXT: b _ZN2C6D1Ev			; CHECK-NEXT: b _ZN2C6D1Ev
	; CHECK-NEXT: dsb sy			; CHECK-NEXT: dsb sy
	; CHECK-NEXT: isb			; CHECK-NEXT: isb
	; CHECK-NEXT: .LBB0_2: // %entry			; CHECK-NEXT: .LBB0_2: // %entry
	; CHECK-NEXT: bl __stack_chk_fail			; CHECK-NEXT: bl __stack_chk_fail
	entry:			entry:
	%0 = load ptr, ptr %this, align 8			%0 = load ptr, ptr %this, align 8
	%1 = getelementptr inbounds i8, ptr %0, i64 -24			%1 = getelementptr inbounds i8, ptr %0, i64 -24
	Show All 10 Lines
	; CHECK-NEXT: sub sp, sp, #32			; CHECK-NEXT: sub sp, sp, #32
	; CHECK-NEXT: str x30, [sp, #16] // 8-byte Folded Spill			; CHECK-NEXT: str x30, [sp, #16] // 8-byte Folded Spill
	; CHECK-NEXT: .cfi_def_cfa_offset 32			; CHECK-NEXT: .cfi_def_cfa_offset 32
	; CHECK-NEXT: .cfi_offset w30, -16			; CHECK-NEXT: .cfi_offset w30, -16
	; CHECK-NEXT: bl OUTLINED_FUNCTION_0			; CHECK-NEXT: bl OUTLINED_FUNCTION_0
	; CHECK-NEXT: b.ne .LBB1_2			; CHECK-NEXT: b.ne .LBB1_2
	; CHECK-NEXT: // %bb.1: // %entry			; CHECK-NEXT: // %bb.1: // %entry
	; CHECK-NEXT: ldr x30, [sp, #16] // 8-byte Folded Reload			; CHECK-NEXT: ldr x30, [sp, #16] // 8-byte Folded Reload
	; CHECK-NEXT: bl OUTLINED_FUNCTION_1			; CHECK-NEXT: add x0, x0, x8
				; CHECK-NEXT: add sp, sp, #32
	; CHECK-NEXT: b _ZN2C6D0Ev			; CHECK-NEXT: b _ZN2C6D0Ev
	; CHECK-NEXT: dsb sy			; CHECK-NEXT: dsb sy
	; CHECK-NEXT: isb			; CHECK-NEXT: isb
	; CHECK-NEXT: .LBB1_2: // %entry			; CHECK-NEXT: .LBB1_2: // %entry
	; CHECK-NEXT: bl __stack_chk_fail			; CHECK-NEXT: bl __stack_chk_fail
	entry:			entry:
	%0 = load ptr, ptr %this, align 8			%0 = load ptr, ptr %this, align 8
	%1 = getelementptr inbounds i8, ptr %0, i64 -24			%1 = getelementptr inbounds i8, ptr %0, i64 -24
	Show All 9 Lines
	; CHECK-NEXT: sub sp, sp, #32			; CHECK-NEXT: sub sp, sp, #32
	; CHECK-NEXT: str x30, [sp, #16] // 8-byte Folded Spill			; CHECK-NEXT: str x30, [sp, #16] // 8-byte Folded Spill
	; CHECK-NEXT: .cfi_def_cfa_offset 32			; CHECK-NEXT: .cfi_def_cfa_offset 32
	; CHECK-NEXT: .cfi_offset w30, -16			; CHECK-NEXT: .cfi_offset w30, -16
	; CHECK-NEXT: bl OUTLINED_FUNCTION_0			; CHECK-NEXT: bl OUTLINED_FUNCTION_0
	; CHECK-NEXT: b.ne .LBB2_2			; CHECK-NEXT: b.ne .LBB2_2
	; CHECK-NEXT: // %bb.1: // %entry			; CHECK-NEXT: // %bb.1: // %entry
	; CHECK-NEXT: ldr x30, [sp, #16] // 8-byte Folded Reload			; CHECK-NEXT: ldr x30, [sp, #16] // 8-byte Folded Reload
	; CHECK-NEXT: bl OUTLINED_FUNCTION_1			; CHECK-NEXT: add x0, x0, x8
				; CHECK-NEXT: add sp, sp, #32
	; CHECK-NEXT: b _ZN3C10D1Ev			; CHECK-NEXT: b _ZN3C10D1Ev
	; CHECK-NEXT: dsb sy			; CHECK-NEXT: dsb sy
	; CHECK-NEXT: isb			; CHECK-NEXT: isb
	; CHECK-NEXT: .LBB2_2: // %entry			; CHECK-NEXT: .LBB2_2: // %entry
	; CHECK-NEXT: bl __stack_chk_fail			; CHECK-NEXT: bl __stack_chk_fail
	entry:			entry:
	%0 = load ptr, ptr %this, align 8			%0 = load ptr, ptr %this, align 8
	%1 = getelementptr inbounds i8, ptr %0, i64 -24			%1 = getelementptr inbounds i8, ptr %0, i64 -24
	Show All 9 Lines
	; CHECK-NEXT: sub sp, sp, #32			; CHECK-NEXT: sub sp, sp, #32
	; CHECK-NEXT: str x30, [sp, #16] // 8-byte Folded Spill			; CHECK-NEXT: str x30, [sp, #16] // 8-byte Folded Spill
	; CHECK-NEXT: .cfi_def_cfa_offset 32			; CHECK-NEXT: .cfi_def_cfa_offset 32
	; CHECK-NEXT: .cfi_offset w30, -16			; CHECK-NEXT: .cfi_offset w30, -16
	; CHECK-NEXT: bl OUTLINED_FUNCTION_0			; CHECK-NEXT: bl OUTLINED_FUNCTION_0
	; CHECK-NEXT: b.ne .LBB3_2			; CHECK-NEXT: b.ne .LBB3_2
	; CHECK-NEXT: // %bb.1: // %entry			; CHECK-NEXT: // %bb.1: // %entry
	; CHECK-NEXT: ldr x30, [sp, #16] // 8-byte Folded Reload			; CHECK-NEXT: ldr x30, [sp, #16] // 8-byte Folded Reload
	; CHECK-NEXT: bl OUTLINED_FUNCTION_1			; CHECK-NEXT: add x0, x0, x8
				; CHECK-NEXT: add sp, sp, #32
	; CHECK-NEXT: b _ZN3C10D0Ev			; CHECK-NEXT: b _ZN3C10D0Ev
	; CHECK-NEXT: dsb sy			; CHECK-NEXT: dsb sy
	; CHECK-NEXT: isb			; CHECK-NEXT: isb
	; CHECK-NEXT: .LBB3_2: // %entry			; CHECK-NEXT: .LBB3_2: // %entry
	; CHECK-NEXT: bl __stack_chk_fail			; CHECK-NEXT: bl __stack_chk_fail
	entry:			entry:
	%0 = load ptr, ptr %this, align 8			%0 = load ptr, ptr %this, align 8
	%1 = getelementptr inbounds i8, ptr %0, i64 -24			%1 = getelementptr inbounds i8, ptr %0, i64 -24
	Show All 10 Lines