This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
include/llvm/
-
llvm/
-
CodeGen/
-
SelectionDAGTargetInfo.h
-
IR/
-
IntrinsicsAArch64.td
-
lib/
-
Analysis/
-
ValueTracking.cpp
-
CodeGen/SelectionDAG/
-
SelectionDAG/
-
SelectionDAGBuilder.cpp
-
Target/AArch64/
-
AArch64/
-
AArch64ExpandPseudoInsts.cpp
-
AArch64FrameLowering.cpp
-
AArch64ISelDAGToDAG.cpp
-
AArch64ISelLowering.h
-
AArch64ISelLowering.cpp
-
AArch64InstrFormats.td
-
AArch64InstrInfo.cpp
-
AArch64InstrInfo.td
-
AArch64MachineFunctionInfo.h
-
AArch64RegisterInfo.cpp
-
AArch64SelectionDAGInfo.h
-
AArch64SelectionDAGInfo.cpp
-
test/
-
Analysis/ValueTracking/
-
ValueTracking/
-
aarch64.irg.ll
-
CodeGen/AArch64/
-
AArch64/
-
irg.ll
-
irg_sp_tagp.ll
-
settag.ll
-
stgp.ll
-
tagp.ll

Differential D64172

Basic codegen for MTE stack tagging.
ClosedPublic

Authored by eugenis on Jul 3 2019, 4:42 PM.

Download Raw Diff

Details

Reviewers

pcc
vitalybuka
hctim
ostannard

Commits

rGd752f5e95309: Basic codegen for MTE stack tagging.
rL366360: Basic codegen for MTE stack tagging.

Summary

Implement IR intrinsics for stack tagging. Generated code is very
unoptimized for now.

Two special intrinsics, llvm.aarch64.irg.sp and llvm.aarch64.tagp are
used to implement a tagged stack frame pointer in a virtual register.

Diff Detail

Repository: rL LLVM

Event Timeline

eugenis created this revision.Jul 3 2019, 4:42 PM

Herald added a project: Restricted Project. · View Herald TranscriptJul 3 2019, 4:42 PM

Herald added subscribers: jsji, hiraditya, kristof.beyls, javed.absar. · View Herald Transcript

Harbormaster completed remote builds in B34332: Diff 207922.Jul 3 2019, 4:43 PM

eugenis added a child revision: D64173: Basic MTE stack tagging instrumentation..Jul 3 2019, 4:44 PM

eugenis added a parent revision: D64171: Factor out resolveFrameOffsetReference..

I think this could do with some more tests covering different stack layouts. In particular, we should check that we do the correct thing for functions which use frame or base pointers, and when large stack frames cause the immediates in STG and ADDG instructions go out of range.

llvm/include/llvm/CodeGen/SelectionDAGTargetInfo.h
151 ↗	(On Diff #207922)	Could Op1 and Op2 be given more descriptive names? It looks like they are always the pointer and length of the memory region.
llvm/lib/Target/AArch64/AArch64ExpandPseudoInsts.cpp
368 ↗	(On Diff #207922)	Could we make the size a register input to these pseudo-instructions, so that this gets done by normal code-generation?

Herald added a subscriber: • wuzish. · View Herald TranscriptJul 5 2019, 8:27 AM

eugenis marked 2 inline comments as done.Jul 8 2019, 2:51 PM

eugenis added inline comments.

llvm/lib/Target/AArch64/AArch64ExpandPseudoInsts.cpp
368 ↗	(On Diff #207922)	That would require moving the previous conditional block (Size % (16 * 2) != 0) to SDAG because this pass will no longer now if the size if 32-byte aligned or not. If I do that, I run into a different problem: regalloc refuses to generate STG Xn, [Xn], #offset even if Xn is dead after this, because STGPostIndex has @earlyclobber on the writeback register. Is this correct? I don't see anything in the spec where the same register can not be used for both, and the pseudo-code suggests that registers are read first and updated later. This adds an extra register copy in all of the settag.ll test cases, ex. mov x8, x0 stg x0, [x8], #16 mov w9, #256 .LBB0_1: // %entry // =>This Inner Loop Header: Depth=1 st2g x8, [x8], #32 sub x9, x9, #32 // =32 cbnz x9, .LBB0_1 (the second st2g is emitted in expand-pseudos)

address review comments & remove @earlyclobber from STGPreIndex / STGPostIndex

Herald added subscribers: mgorny, srhines. · View Herald TranscriptJul 8 2019, 3:10 PM

Harbormaster completed remote builds in B34536: Diff 208517.Jul 8 2019, 3:10 PM

eugenis marked an inline comment as done.Jul 8 2019, 3:11 PM

What about the tests for large stack frames?

llvm/lib/Target/AArch64/AArch64ExpandPseudoInsts.cpp
368 ↗	(On Diff #207922)	I think earlier versions of the architecture did not allow Xt and Xn to be the same in STG with writeback, but that restriction has been removed, so removing the `@earlyclobber` is correct.
llvm/lib/Target/AArch64/AArch64StackTagging.cpp
1 ↗	(On Diff #208517)	It looks like you accidentally merged D64173 into this patch,

Made aarch64.irg.sp IntrInaccessibleMemOnly cause it has side effects.
Simplified SDAG a little by matching intrinsic directly w/o going through an SDag node for IRGstack.
Added a bunch of tests.

Harbormaster completed remote builds in B34833: Diff 209393.Jul 11 2019, 6:22 PM

PTAL. I've added tests for various frame layouts. Stg offset overflow is covered by existing tests (settag.ll).

llvm/include/llvm/IR/IntrinsicsAArch64.td
708 ↗	(On Diff #209393)	Ideally this should be [IntrNoMem, IntrHasSideEffects], but it depends on the discussion in https://reviews.llvm.org/D64414. Keeping this the same as int_aarch64_irg for now.

LGTM, thanks

This revision is now accepted and ready to land.Jul 16 2019, 1:59 AM

Closed by commit rL366360: Basic codegen for MTE stack tagging. (authored by eugenis). · Explain WhyJul 17 2019, 12:28 PM

This revision was automatically updated to reflect the committed changes.

evandro added a subscriber: evandro.Aug 7 2019, 12:32 PM

Revision Contents

Path

Size

llvm/

trunk/

include/

llvm/

CodeGen/

SelectionDAGTargetInfo.h

8 lines

IR/

IntrinsicsAArch64.td

30 lines

lib/

Analysis/

ValueTracking.cpp

3 lines

CodeGen/

SelectionDAG/

SelectionDAGBuilder.cpp

13 lines

Target/

AArch64/

AArch64ExpandPseudoInsts.cpp

102 lines

AArch64FrameLowering.cpp

4 lines

AArch64ISelDAGToDAG.cpp

60 lines

AArch64ISelLowering.h

8 lines

AArch64ISelLowering.cpp

4 lines

AArch64InstrFormats.td

4 lines

AArch64InstrInfo.cpp

17 lines

AArch64InstrInfo.td

45 lines

AArch64MachineFunctionInfo.h

13 lines

AArch64RegisterInfo.cpp

15 lines

AArch64SelectionDAGInfo.h

4 lines

AArch64SelectionDAGInfo.cpp

88 lines

test/

Analysis/

ValueTracking/

aarch64.irg.ll

16 lines

CodeGen/

AArch64/

42 lines

93 lines

138 lines

78 lines

41 lines

Diff 210390

llvm/trunk/include/llvm/CodeGen/SelectionDAGTargetInfo.h

Show First 20 Lines • Show All 141 Lines • ▼ Show 20 Lines	public:

virtual std::pair<SDValue, SDValue>		virtual std::pair<SDValue, SDValue>
EmitTargetCodeForStrnlen(SelectionDAG &DAG, const SDLoc &DL, SDValue Chain,		EmitTargetCodeForStrnlen(SelectionDAG &DAG, const SDLoc &DL, SDValue Chain,
SDValue Src, SDValue MaxLength,		SDValue Src, SDValue MaxLength,
MachinePointerInfo SrcPtrInfo) const {		MachinePointerInfo SrcPtrInfo) const {
return std::make_pair(SDValue(), SDValue());		return std::make_pair(SDValue(), SDValue());
}		}

		virtual SDValue EmitTargetCodeForSetTag(SelectionDAG &DAG, const SDLoc &dl,
		SDValue Chain, SDValue Addr,
		SDValue Size,
		MachinePointerInfo DstPtrInfo,
		bool ZeroData) const {
		return SDValue();
		}

// Return true when the decision to generate FMA's (or FMS, FMLA etc) rather		// Return true when the decision to generate FMA's (or FMS, FMLA etc) rather
// than FMUL and ADD is delegated to the machine combiner.		// than FMUL and ADD is delegated to the machine combiner.
virtual bool generateFMAsInMachineCombiner(CodeGenOpt::Level OptLevel) const {		virtual bool generateFMAsInMachineCombiner(CodeGenOpt::Level OptLevel) const {
return false;		return false;
}		}
};		};

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_CODEGEN_SELECTIONDAGTARGETINFO_H		#endif // LLVM_CODEGEN_SELECTIONDAGTARGETINFO_H

llvm/trunk/include/llvm/IR/IntrinsicsAArch64.td

	Show First 20 Lines • Show All 696 Lines • ▼ Show 20 Lines
	def int_aarch64_gmi : Intrinsic<[llvm_i64_ty], [llvm_ptr_ty, llvm_i64_ty],			def int_aarch64_gmi : Intrinsic<[llvm_i64_ty], [llvm_ptr_ty, llvm_i64_ty],
	[IntrNoMem]>;			[IntrNoMem]>;
	def int_aarch64_ldg : Intrinsic<[llvm_ptr_ty], [llvm_ptr_ty, llvm_ptr_ty],			def int_aarch64_ldg : Intrinsic<[llvm_ptr_ty], [llvm_ptr_ty, llvm_ptr_ty],
	[IntrReadMem]>;			[IntrReadMem]>;
	def int_aarch64_stg : Intrinsic<[], [llvm_ptr_ty, llvm_ptr_ty],			def int_aarch64_stg : Intrinsic<[], [llvm_ptr_ty, llvm_ptr_ty],
	[IntrWriteMem]>;			[IntrWriteMem]>;
	def int_aarch64_subp : Intrinsic<[llvm_i64_ty], [llvm_ptr_ty, llvm_ptr_ty],			def int_aarch64_subp : Intrinsic<[llvm_i64_ty], [llvm_ptr_ty, llvm_ptr_ty],
	[IntrNoMem]>;			[IntrNoMem]>;

				// The following are codegen-only intrinsics for stack instrumentation.

				// Generate a randomly tagged stack base pointer.
				def int_aarch64_irg_sp : Intrinsic<[llvm_ptr_ty], [llvm_i64_ty],
				[IntrInaccessibleMemOnly]>;

				// Transfer pointer tag with offset.
				// ptr1 = tagp(ptr0, baseptr, tag_offset) returns a pointer where
				// * address is the address in ptr0
				// * tag is a function of (tag in baseptr, tag_offset).
				// Address bits in baseptr and tag bits in ptr0 are ignored.
				// When offset between ptr0 and baseptr is a compile time constant, this can be emitted as
				// ADDG ptr1, baseptr, (ptr0 - baseptr), tag_offset
				// It is intended that ptr0 is an alloca address, and baseptr is the direct output of llvm.aarch64.irg.sp.
				def int_aarch64_tagp : Intrinsic<[llvm_anyptr_ty], [LLVMMatchType<0>, llvm_ptr_ty, llvm_i64_ty],
				[IntrNoMem, ImmArg<2>]>;

				// Update allocation tags for the memory range to match the tag in the pointer argument.
				def int_aarch64_settag : Intrinsic<[], [llvm_ptr_ty, llvm_i64_ty],
				[IntrWriteMem, IntrArgMemOnly, NoCapture<0>, WriteOnly<0>]>;

				// Update allocation tags for the memory range to match the tag in the pointer argument,
				// and set memory contents to zero.
				def int_aarch64_settag_zero : Intrinsic<[], [llvm_ptr_ty, llvm_i64_ty],
				[IntrWriteMem, IntrArgMemOnly, NoCapture<0>, WriteOnly<0>]>;

				// Update allocation tags for 16-aligned, 16-sized memory region, and store a pair 8-byte values.
				def int_aarch64_stgp : Intrinsic<[], [llvm_ptr_ty, llvm_i64_ty, llvm_i64_ty],
				[IntrWriteMem, IntrArgMemOnly, NoCapture<0>, WriteOnly<0>]>;
	}			}

llvm/trunk/lib/Analysis/ValueTracking.cpp

Show First 20 Lines • Show All 3,660 Lines • ▼ Show 20 Lines	if (isIntrinsicReturningPointerAliasingArgumentWithoutCapturing(Call))
return Call->getArgOperand(0);		return Call->getArgOperand(0);
return nullptr;		return nullptr;
}		}

bool llvm::isIntrinsicReturningPointerAliasingArgumentWithoutCapturing(		bool llvm::isIntrinsicReturningPointerAliasingArgumentWithoutCapturing(
const CallBase *Call) {		const CallBase *Call) {
return Call->getIntrinsicID() == Intrinsic::launder_invariant_group \|\|		return Call->getIntrinsicID() == Intrinsic::launder_invariant_group \|\|
Call->getIntrinsicID() == Intrinsic::strip_invariant_group \|\|		Call->getIntrinsicID() == Intrinsic::strip_invariant_group \|\|
Call->getIntrinsicID() == Intrinsic::aarch64_irg;		Call->getIntrinsicID() == Intrinsic::aarch64_irg \|\|
		Call->getIntrinsicID() == Intrinsic::aarch64_tagp;
}		}

/// \p PN defines a loop-variant pointer to an object. Check if the		/// \p PN defines a loop-variant pointer to an object. Check if the
/// previous iteration of the loop was referring to the same object as \p PN.		/// previous iteration of the loop was referring to the same object as \p PN.
static bool isSameUnderlyingObjectInLoop(const PHINode *PN,		static bool isSameUnderlyingObjectInLoop(const PHINode *PN,
const LoopInfo *LI) {		const LoopInfo *LI) {
// Find the loop-defined value.		// Find the loop-defined value.
Loop *L = LI->getLoopFor(PN->getParent());		Loop *L = LI->getLoopFor(PN->getParent());
▲ Show 20 Lines • Show All 2,028 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 6,799 Lines • ▼ Show 20 Lines	case Intrinsic::icall_branch_funnel: {
return;		return;
}		}

case Intrinsic::wasm_landingpad_index:		case Intrinsic::wasm_landingpad_index:
// Information this intrinsic contained has been transferred to		// Information this intrinsic contained has been transferred to
// MachineFunction in SelectionDAGISel::PrepareEHLandingPad. We can safely		// MachineFunction in SelectionDAGISel::PrepareEHLandingPad. We can safely
// delete it now.		// delete it now.
return;		return;

		case Intrinsic::aarch64_settag:
		case Intrinsic::aarch64_settag_zero: {
		const SelectionDAGTargetInfo &TSI = DAG.getSelectionDAGInfo();
		bool ZeroMemory = Intrinsic == Intrinsic::aarch64_settag_zero;
		SDValue Val = TSI.EmitTargetCodeForSetTag(
		DAG, getCurSDLoc(), getRoot(), getValue(I.getArgOperand(0)),
		getValue(I.getArgOperand(1)), MachinePointerInfo(I.getArgOperand(0)),
		ZeroMemory);
		DAG.setRoot(Val);
		setValue(&I, Val);
		return;
		}
}		}
}		}

void SelectionDAGBuilder::visitConstrainedFPIntrinsic(		void SelectionDAGBuilder::visitConstrainedFPIntrinsic(
const ConstrainedFPIntrinsic &FPI) {		const ConstrainedFPIntrinsic &FPI) {
SDLoc sdl = getCurSDLoc();		SDLoc sdl = getCurSDLoc();
unsigned Opcode;		unsigned Opcode;
switch (FPI.getIntrinsicID()) {		switch (FPI.getIntrinsicID()) {
▲ Show 20 Lines • Show All 3,630 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AArch64/AArch64ExpandPseudoInsts.cpp

Show All 9 Lines
// instructions to allow proper scheduling and other late optimizations. This		// instructions to allow proper scheduling and other late optimizations. This
// pass should be run after register allocation but before the post-regalloc		// pass should be run after register allocation but before the post-regalloc
// scheduling pass.		// scheduling pass.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "AArch64ExpandImm.h"		#include "AArch64ExpandImm.h"
#include "AArch64InstrInfo.h"		#include "AArch64InstrInfo.h"
		#include "AArch64MachineFunctionInfo.h"
#include "AArch64Subtarget.h"		#include "AArch64Subtarget.h"
#include "MCTargetDesc/AArch64AddressingModes.h"		#include "MCTargetDesc/AArch64AddressingModes.h"
#include "Utils/AArch64BaseInfo.h"		#include "Utils/AArch64BaseInfo.h"
#include "llvm/ADT/DenseMap.h"		#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/Triple.h"		#include "llvm/ADT/Triple.h"
#include "llvm/CodeGen/LivePhysRegs.h"		#include "llvm/CodeGen/LivePhysRegs.h"
#include "llvm/CodeGen/MachineBasicBlock.h"		#include "llvm/CodeGen/MachineBasicBlock.h"
#include "llvm/CodeGen/MachineFunction.h"		#include "llvm/CodeGen/MachineFunction.h"
▲ Show 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	private:

bool expandCMP_SWAP(MachineBasicBlock &MBB, MachineBasicBlock::iterator MBBI,		bool expandCMP_SWAP(MachineBasicBlock &MBB, MachineBasicBlock::iterator MBBI,
unsigned LdarOp, unsigned StlrOp, unsigned CmpOp,		unsigned LdarOp, unsigned StlrOp, unsigned CmpOp,
unsigned ExtendImm, unsigned ZeroReg,		unsigned ExtendImm, unsigned ZeroReg,
MachineBasicBlock::iterator &NextMBBI);		MachineBasicBlock::iterator &NextMBBI);
bool expandCMP_SWAP_128(MachineBasicBlock &MBB,		bool expandCMP_SWAP_128(MachineBasicBlock &MBB,
MachineBasicBlock::iterator MBBI,		MachineBasicBlock::iterator MBBI,
MachineBasicBlock::iterator &NextMBBI);		MachineBasicBlock::iterator &NextMBBI);
		bool expandSetTagLoop(MachineBasicBlock &MBB,
		MachineBasicBlock::iterator MBBI,
		MachineBasicBlock::iterator &NextMBBI);
};		};

} // end anonymous namespace		} // end anonymous namespace

char AArch64ExpandPseudo::ID = 0;		char AArch64ExpandPseudo::ID = 0;

INITIALIZE_PASS(AArch64ExpandPseudo, "aarch64-expand-pseudo",		INITIALIZE_PASS(AArch64ExpandPseudo, "aarch64-expand-pseudo",
AARCH64_EXPAND_PSEUDO_NAME, false, false)		AARCH64_EXPAND_PSEUDO_NAME, false, false)
▲ Show 20 Lines • Show All 246 Lines • ▼ Show 20 Lines	bool AArch64ExpandPseudo::expandCMP_SWAP_128(
StoreBB->clearLiveIns();		StoreBB->clearLiveIns();
computeAndAddLiveIns(LiveRegs, *StoreBB);		computeAndAddLiveIns(LiveRegs, *StoreBB);
LoadCmpBB->clearLiveIns();		LoadCmpBB->clearLiveIns();
computeAndAddLiveIns(LiveRegs, *LoadCmpBB);		computeAndAddLiveIns(LiveRegs, *LoadCmpBB);

return true;		return true;
}		}

		bool AArch64ExpandPseudo::expandSetTagLoop(
		MachineBasicBlock &MBB, MachineBasicBlock::iterator MBBI,
		MachineBasicBlock::iterator &NextMBBI) {
		MachineInstr &MI = *MBBI;
		DebugLoc DL = MI.getDebugLoc();
		Register SizeReg = MI.getOperand(2).getReg();
		Register AddressReg = MI.getOperand(3).getReg();

		MachineFunction *MF = MBB.getParent();

		bool ZeroData = MI.getOpcode() == AArch64::STZGloop;
		const unsigned OpCode =
		ZeroData ? AArch64::STZ2GPostIndex : AArch64::ST2GPostIndex;

		auto LoopBB = MF->CreateMachineBasicBlock(MBB.getBasicBlock());
		auto DoneBB = MF->CreateMachineBasicBlock(MBB.getBasicBlock());

		MF->insert(++MBB.getIterator(), LoopBB);
		MF->insert(++LoopBB->getIterator(), DoneBB);

		BuildMI(LoopBB, DL, TII->get(OpCode))
		.addDef(AddressReg)
		.addReg(AddressReg)
		.addReg(AddressReg)
		.addImm(2)
		.cloneMemRefs(MI)
		.setMIFlags(MI.getFlags());
		BuildMI(LoopBB, DL, TII->get(AArch64::SUBXri))
		.addDef(SizeReg)
		.addReg(SizeReg)
		.addImm(16 * 2)
		.addImm(0);
		BuildMI(LoopBB, DL, TII->get(AArch64::CBNZX)).addUse(SizeReg).addMBB(LoopBB);

		LoopBB->addSuccessor(LoopBB);
		LoopBB->addSuccessor(DoneBB);

		DoneBB->splice(DoneBB->end(), &MBB, MI, MBB.end());
		DoneBB->transferSuccessors(&MBB);

		MBB.addSuccessor(LoopBB);

		NextMBBI = MBB.end();
		MI.eraseFromParent();
		// Recompute liveness bottom up.
		LivePhysRegs LiveRegs;
		computeAndAddLiveIns(LiveRegs, *DoneBB);
		computeAndAddLiveIns(LiveRegs, *LoopBB);
		// Do an extra pass in the loop to get the loop carried dependencies right.
		// FIXME: is this necessary?
		LoopBB->clearLiveIns();
		computeAndAddLiveIns(LiveRegs, *LoopBB);
		DoneBB->clearLiveIns();
		computeAndAddLiveIns(LiveRegs, *DoneBB);

		return true;
		}

/// If MBBI references a pseudo instruction that should be expanded here,		/// If MBBI references a pseudo instruction that should be expanded here,
/// do the expansion and return true. Otherwise return false.		/// do the expansion and return true. Otherwise return false.
bool AArch64ExpandPseudo::expandMI(MachineBasicBlock &MBB,		bool AArch64ExpandPseudo::expandMI(MachineBasicBlock &MBB,
MachineBasicBlock::iterator MBBI,		MachineBasicBlock::iterator MBBI,
MachineBasicBlock::iterator &NextMBBI) {		MachineBasicBlock::iterator &NextMBBI) {
MachineInstr &MI = *MBBI;		MachineInstr &MI = *MBBI;
unsigned Opcode = MI.getOpcode();		unsigned Opcode = MI.getOpcode();
switch (Opcode) {		switch (Opcode) {
▲ Show 20 Lines • Show All 217 Lines • ▼ Show 20 Lines	BuildMI(MBB, MBBI, MI.getDebugLoc(),
TII->get(Opcode == AArch64::AESMCrrTied ? AArch64::AESMCrr :		TII->get(Opcode == AArch64::AESMCrrTied ? AArch64::AESMCrr :
AArch64::AESIMCrr))		AArch64::AESIMCrr))
.add(MI.getOperand(0))		.add(MI.getOperand(0))
.add(MI.getOperand(1));		.add(MI.getOperand(1));
transferImpOps(MI, MIB, MIB);		transferImpOps(MI, MIB, MIB);
MI.eraseFromParent();		MI.eraseFromParent();
return true;		return true;
}		}
		case AArch64::IRGstack: {
		MachineFunction &MF = *MBB.getParent();
		const AArch64FunctionInfo *AFI = MF.getInfo<AArch64FunctionInfo>();
		const AArch64FrameLowering *TFI =
		MF.getSubtarget<AArch64Subtarget>().getFrameLowering();

		// IRG does not allow immediate offset. getTaggedBasePointerOffset should
		// almost always point to SP-after-prologue; if not, emit a longer
		// instruction sequence.
		int BaseOffset = -AFI->getTaggedBasePointerOffset();
		unsigned FrameReg;
		int FrameRegOffset = TFI->resolveFrameOffsetReference(
		MF, BaseOffset, false /isFixed/, FrameReg, /PreferFP=/false,
		/ForSimm=/true);
		Register SrcReg = FrameReg;
		if (FrameRegOffset != 0) {
		// Use output register as temporary.
		SrcReg = MI.getOperand(0).getReg();
		emitFrameOffset(MBB, &MI, MI.getDebugLoc(), SrcReg, FrameReg,
		FrameRegOffset, TII);
		}
		BuildMI(MBB, MBBI, MI.getDebugLoc(), TII->get(AArch64::IRG))
		.add(MI.getOperand(0))
		.addUse(SrcReg)
		.add(MI.getOperand(2));
		MI.eraseFromParent();
		return true;
		}
		case AArch64::TAGPstack: {
		BuildMI(MBB, MBBI, MI.getDebugLoc(), TII->get(AArch64::ADDG))
		.add(MI.getOperand(0))
		.add(MI.getOperand(1))
		.add(MI.getOperand(2))
		.add(MI.getOperand(4));
		MI.eraseFromParent();
		return true;
		}
		case AArch64::STGloop:
		case AArch64::STZGloop:
		return expandSetTagLoop(MBB, MBBI, NextMBBI);
}		}
return false;		return false;
}		}

/// Iterate over the instructions in basic block MBB and expand any		/// Iterate over the instructions in basic block MBB and expand any
/// pseudo instructions. Return true if anything was modified.		/// pseudo instructions. Return true if anything was modified.
bool AArch64ExpandPseudo::expandMBB(MachineBasicBlock &MBB) {		bool AArch64ExpandPseudo::expandMBB(MachineBasicBlock &MBB) {
bool Modified = false;		bool Modified = false;
Show All 24 Lines

llvm/trunk/lib/Target/AArch64/AArch64FrameLowering.cpp

Show First 20 Lines • Show All 836 Lines • ▼ Show 20 Lines	BuildMI(MBB, MBBI, DL, TII->get(TargetOpcode::CFI_INSTRUCTION))
.setMIFlags(MachineInstr::FrameSetup);		.setMIFlags(MachineInstr::FrameSetup);
}		}

// All calls are tail calls in GHC calling conv, and functions have no		// All calls are tail calls in GHC calling conv, and functions have no
// prologue/epilogue.		// prologue/epilogue.
if (MF.getFunction().getCallingConv() == CallingConv::GHC)		if (MF.getFunction().getCallingConv() == CallingConv::GHC)
return;		return;

		// Set tagged base pointer to the bottom of the stack frame.
		// Ideally it should match SP value after prologue.
		AFI->setTaggedBasePointerOffset(MFI.getStackSize());

// getStackSize() includes all the locals in its size calculation. We don't		// getStackSize() includes all the locals in its size calculation. We don't
// include these locals when computing the stack size of a funclet, as they		// include these locals when computing the stack size of a funclet, as they
// are allocated in the parent's stack frame and accessed via the frame		// are allocated in the parent's stack frame and accessed via the frame
// pointer from the funclet. We only save the callee saved registers in the		// pointer from the funclet. We only save the callee saved registers in the
// funclet, which are really the callee saved registers of the parent		// funclet, which are really the callee saved registers of the parent
// function, including the funclet.		// function, including the funclet.
int NumBytes = IsFunclet ? (int)getWinEHFuncletFrameSize(MF)		int NumBytes = IsFunclet ? (int)getWinEHFuncletFrameSize(MF)
: (int)MFI.getStackSize();		: (int)MFI.getStackSize();
▲ Show 20 Lines • Show All 1,391 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AArch64/AArch64ISelDAGToDAG.cpp

Show First 20 Lines • Show All 151 Lines • ▼ Show 20 Lines	public:
/// functions. Those should almost always be called instead.		/// functions. Those should almost always be called instead.
SDValue createTuple(ArrayRef<SDValue> Vecs, const unsigned RegClassIDs[],		SDValue createTuple(ArrayRef<SDValue> Vecs, const unsigned RegClassIDs[],
const unsigned SubRegs[]);		const unsigned SubRegs[]);

void SelectTable(SDNode *N, unsigned NumVecs, unsigned Opc, bool isExt);		void SelectTable(SDNode *N, unsigned NumVecs, unsigned Opc, bool isExt);

bool tryIndexedLoad(SDNode *N);		bool tryIndexedLoad(SDNode *N);

		bool trySelectStackSlotTagP(SDNode *N);
		void SelectTagP(SDNode *N);

void SelectLoad(SDNode *N, unsigned NumVecs, unsigned Opc,		void SelectLoad(SDNode *N, unsigned NumVecs, unsigned Opc,
unsigned SubRegIdx);		unsigned SubRegIdx);
void SelectPostLoad(SDNode *N, unsigned NumVecs, unsigned Opc,		void SelectPostLoad(SDNode *N, unsigned NumVecs, unsigned Opc,
unsigned SubRegIdx);		unsigned SubRegIdx);
void SelectLoadLane(SDNode *N, unsigned NumVecs, unsigned Opc);		void SelectLoadLane(SDNode *N, unsigned NumVecs, unsigned Opc);
void SelectPostLoadLane(SDNode *N, unsigned NumVecs, unsigned Opc);		void SelectPostLoadLane(SDNode *N, unsigned NumVecs, unsigned Opc);

void SelectStore(SDNode *N, unsigned NumVecs, unsigned Opc);		void SelectStore(SDNode *N, unsigned NumVecs, unsigned Opc);
▲ Show 20 Lines • Show All 530 Lines • ▼ Show 20 Lines	bool AArch64DAGToDAGISel::SelectAddrModeIndexedBitWidth(SDValue N, bool IsSignedImm,
const TargetLowering *TLI = getTargetLowering();		const TargetLowering *TLI = getTargetLowering();
if (N.getOpcode() == ISD::FrameIndex) {		if (N.getOpcode() == ISD::FrameIndex) {
int FI = cast<FrameIndexSDNode>(N)->getIndex();		int FI = cast<FrameIndexSDNode>(N)->getIndex();
Base = CurDAG->getTargetFrameIndex(FI, TLI->getPointerTy(DL));		Base = CurDAG->getTargetFrameIndex(FI, TLI->getPointerTy(DL));
OffImm = CurDAG->getTargetConstant(0, dl, MVT::i64);		OffImm = CurDAG->getTargetConstant(0, dl, MVT::i64);
return true;		return true;
}		}

// As opposed to the (12-bit) Indexed addressing mode below, the 7-bit signed		// As opposed to the (12-bit) Indexed addressing mode below, the 7/9-bit signed
// selected here doesn't support labels/immediates, only base+offset.		// selected here doesn't support labels/immediates, only base+offset.
if (CurDAG->isBaseWithConstantOffset(N)) {		if (CurDAG->isBaseWithConstantOffset(N)) {
if (ConstantSDNode *RHS = dyn_cast<ConstantSDNode>(N.getOperand(1))) {		if (ConstantSDNode *RHS = dyn_cast<ConstantSDNode>(N.getOperand(1))) {
if (IsSignedImm) {		if (IsSignedImm) {
int64_t RHSC = RHS->getSExtValue();		int64_t RHSC = RHS->getSExtValue();
unsigned Scale = Log2_32(Size);		unsigned Scale = Log2_32(Size);
int64_t Range = 0x1LL << (BW - 1);		int64_t Range = 0x1LL << (BW - 1);

▲ Show 20 Lines • Show All 2,070 Lines • ▼ Show 20 Lines	bool AArch64DAGToDAGISel::SelectCMP_SWAP(SDNode *N) {

ReplaceUses(SDValue(N, 0), SDValue(CmpSwap, 0));		ReplaceUses(SDValue(N, 0), SDValue(CmpSwap, 0));
ReplaceUses(SDValue(N, 1), SDValue(CmpSwap, 2));		ReplaceUses(SDValue(N, 1), SDValue(CmpSwap, 2));
CurDAG->RemoveDeadNode(N);		CurDAG->RemoveDeadNode(N);

return true;		return true;
}		}

		bool AArch64DAGToDAGISel::trySelectStackSlotTagP(SDNode *N) {
		// tagp(FrameIndex, IRGstack, tag_offset):
		// since the offset between FrameIndex and IRGstack is a compile-time
		// constant, this can be lowered to a single ADDG instruction.
		if (!(isa<FrameIndexSDNode>(N->getOperand(1)))) {
		return false;
		}

		SDValue IRG_SP = N->getOperand(2);
		if (IRG_SP->getOpcode() != ISD::INTRINSIC_W_CHAIN \|\|
		cast<ConstantSDNode>(IRG_SP->getOperand(1))->getZExtValue() !=
		Intrinsic::aarch64_irg_sp) {
		return false;
		}

		const TargetLowering *TLI = getTargetLowering();
		SDLoc DL(N);
		int FI = cast<FrameIndexSDNode>(N->getOperand(1))->getIndex();
		SDValue FiOp = CurDAG->getTargetFrameIndex(
		FI, TLI->getPointerTy(CurDAG->getDataLayout()));
		int TagOffset = cast<ConstantSDNode>(N->getOperand(3))->getZExtValue();

		SDNode *Out = CurDAG->getMachineNode(
		AArch64::TAGPstack, DL, MVT::i64,
		{FiOp, CurDAG->getTargetConstant(0, DL, MVT::i64), N->getOperand(2),
		CurDAG->getTargetConstant(TagOffset, DL, MVT::i64)});
		ReplaceNode(N, Out);
		return true;
		}

		void AArch64DAGToDAGISel::SelectTagP(SDNode *N) {
		assert(isa<ConstantSDNode>(N->getOperand(3)) &&
		"llvm.aarch64.tagp third argument must be an immediate");
		if (trySelectStackSlotTagP(N))
		return;
		// FIXME: above applies in any case when offset between Op1 and Op2 is a
		// compile-time constant, not just for stack allocations.

		// General case for unrelated pointers in Op1 and Op2.
		SDLoc DL(N);
		int TagOffset = cast<ConstantSDNode>(N->getOperand(3))->getZExtValue();
		SDNode *N1 = CurDAG->getMachineNode(AArch64::SUBP, DL, MVT::i64,
		{N->getOperand(1), N->getOperand(2)});
		SDNode *N2 = CurDAG->getMachineNode(AArch64::ADDXrr, DL, MVT::i64,
		{SDValue(N1, 0), N->getOperand(2)});
		SDNode *N3 = CurDAG->getMachineNode(
		AArch64::ADDG, DL, MVT::i64,
		{SDValue(N2, 0), CurDAG->getTargetConstant(0, DL, MVT::i64),
		CurDAG->getTargetConstant(TagOffset, DL, MVT::i64)});
		ReplaceNode(N, N3);
		}

void AArch64DAGToDAGISel::Select(SDNode *Node) {		void AArch64DAGToDAGISel::Select(SDNode *Node) {
// If we have a custom node, we already have selected!		// If we have a custom node, we already have selected!
if (Node->isMachineOpcode()) {		if (Node->isMachineOpcode()) {
LLVM_DEBUG(errs() << "== "; Node->dump(CurDAG); errs() << "\n");		LLVM_DEBUG(errs() << "== "; Node->dump(CurDAG); errs() << "\n");
Node->setNodeId(-1);		Node->setNodeId(-1);
return;		return;
}		}

▲ Show 20 Lines • Show All 477 Lines • ▼ Show 20 Lines	case Intrinsic::aarch64_neon_ld4lane:
break;		break;
}		}
} break;		} break;
case ISD::INTRINSIC_WO_CHAIN: {		case ISD::INTRINSIC_WO_CHAIN: {
unsigned IntNo = cast<ConstantSDNode>(Node->getOperand(0))->getZExtValue();		unsigned IntNo = cast<ConstantSDNode>(Node->getOperand(0))->getZExtValue();
switch (IntNo) {		switch (IntNo) {
default:		default:
break;		break;
		case Intrinsic::aarch64_tagp:
		SelectTagP(Node);
		return;
case Intrinsic::aarch64_neon_tbl2:		case Intrinsic::aarch64_neon_tbl2:
SelectTable(Node, 2,		SelectTable(Node, 2,
VT == MVT::v8i8 ? AArch64::TBLv8i8Two : AArch64::TBLv16i8Two,		VT == MVT::v8i8 ? AArch64::TBLv8i8Two : AArch64::TBLv16i8Two,
false);		false);
return;		return;
case Intrinsic::aarch64_neon_tbl3:		case Intrinsic::aarch64_neon_tbl3:
SelectTable(Node, 3, VT == MVT::v8i8 ? AArch64::TBLv8i8Three		SelectTable(Node, 3, VT == MVT::v8i8 ? AArch64::TBLv8i8Three
: AArch64::TBLv16i8Three,		: AArch64::TBLv16i8Three,
▲ Show 20 Lines • Show All 867 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AArch64/AArch64ISelLowering.h

Show First 20 Lines • Show All 208 Lines • ▼ Show 20 Lines	enum NodeType : unsigned {
LD3DUPpost,		LD3DUPpost,
LD4DUPpost,		LD4DUPpost,
LD1LANEpost,		LD1LANEpost,
LD2LANEpost,		LD2LANEpost,
LD3LANEpost,		LD3LANEpost,
LD4LANEpost,		LD4LANEpost,
ST2LANEpost,		ST2LANEpost,
ST3LANEpost,		ST3LANEpost,
ST4LANEpost		ST4LANEpost,

		STG,
		STZG,
		ST2G,
		STZ2G

};		};

} // end namespace AArch64ISD		} // end namespace AArch64ISD

namespace {		namespace {

// Any instruction that defines a 32-bit result zeros out the high half of the		// Any instruction that defines a 32-bit result zeros out the high half of the
// register. Truncate can be lowered to EXTRACT_SUBREG. CopyFromReg may		// register. Truncate can be lowered to EXTRACT_SUBREG. CopyFromReg may
▲ Show 20 Lines • Show All 517 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AArch64/AArch64ISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,228 Lines • ▼ Show 20 Lines	const char *AArch64TargetLowering::getTargetNodeName(unsigned Opcode) const {
case AArch64ISD::ST3LANEpost: return "AArch64ISD::ST3LANEpost";		case AArch64ISD::ST3LANEpost: return "AArch64ISD::ST3LANEpost";
case AArch64ISD::ST4LANEpost: return "AArch64ISD::ST4LANEpost";		case AArch64ISD::ST4LANEpost: return "AArch64ISD::ST4LANEpost";
case AArch64ISD::SMULL: return "AArch64ISD::SMULL";		case AArch64ISD::SMULL: return "AArch64ISD::SMULL";
case AArch64ISD::UMULL: return "AArch64ISD::UMULL";		case AArch64ISD::UMULL: return "AArch64ISD::UMULL";
case AArch64ISD::FRECPE: return "AArch64ISD::FRECPE";		case AArch64ISD::FRECPE: return "AArch64ISD::FRECPE";
case AArch64ISD::FRECPS: return "AArch64ISD::FRECPS";		case AArch64ISD::FRECPS: return "AArch64ISD::FRECPS";
case AArch64ISD::FRSQRTE: return "AArch64ISD::FRSQRTE";		case AArch64ISD::FRSQRTE: return "AArch64ISD::FRSQRTE";
case AArch64ISD::FRSQRTS: return "AArch64ISD::FRSQRTS";		case AArch64ISD::FRSQRTS: return "AArch64ISD::FRSQRTS";
		case AArch64ISD::STG: return "AArch64ISD::STG";
		case AArch64ISD::STZG: return "AArch64ISD::STZG";
		case AArch64ISD::ST2G: return "AArch64ISD::ST2G";
		case AArch64ISD::STZ2G: return "AArch64ISD::STZ2G";
}		}
return nullptr;		return nullptr;
}		}

MachineBasicBlock *		MachineBasicBlock *
AArch64TargetLowering::EmitF128CSEL(MachineInstr &MI,		AArch64TargetLowering::EmitF128CSEL(MachineInstr &MI,
MachineBasicBlock *MBB) const {		MachineBasicBlock *MBB) const {
// We materialise the F128CSEL pseudo-instruction as some control flow and a		// We materialise the F128CSEL pseudo-instruction as some control flow and a
▲ Show 20 Lines • Show All 10,823 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AArch64/AArch64InstrFormats.td

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 4,061 Lines • ▼ Show 20 Lines
	}			}

	multiclass MemTagStore<bits<2> opc1, string insn> {			multiclass MemTagStore<bits<2> opc1, string insn> {
	def Offset :			def Offset :
	BaseMemTagStore<opc1, 0b10, insn, "\t$Rt, [$Rn, $offset]", "",			BaseMemTagStore<opc1, 0b10, insn, "\t$Rt, [$Rn, $offset]", "",
	(outs), (ins GPR64sp:$Rt, GPR64sp:$Rn, simm9s16:$offset)>;			(outs), (ins GPR64sp:$Rt, GPR64sp:$Rn, simm9s16:$offset)>;
	def PreIndex :			def PreIndex :
	BaseMemTagStore<opc1, 0b11, insn, "\t$Rt, [$Rn, $offset]!",			BaseMemTagStore<opc1, 0b11, insn, "\t$Rt, [$Rn, $offset]!",
	"$Rn = $wback,@earlyclobber $wback",			"$Rn = $wback",
	(outs GPR64sp:$wback),			(outs GPR64sp:$wback),
	(ins GPR64sp:$Rt, GPR64sp:$Rn, simm9s16:$offset)>;			(ins GPR64sp:$Rt, GPR64sp:$Rn, simm9s16:$offset)>;
	def PostIndex :			def PostIndex :
	BaseMemTagStore<opc1, 0b01, insn, "\t$Rt, [$Rn], $offset",			BaseMemTagStore<opc1, 0b01, insn, "\t$Rt, [$Rn], $offset",
	"$Rn = $wback,@earlyclobber $wback",			"$Rn = $wback",
	(outs GPR64sp:$wback),			(outs GPR64sp:$wback),
	(ins GPR64sp:$Rt, GPR64sp:$Rn, simm9s16:$offset)>;			(ins GPR64sp:$Rt, GPR64sp:$Rn, simm9s16:$offset)>;

	def : InstAlias<insn # "\t$Rt, [$Rn]",			def : InstAlias<insn # "\t$Rt, [$Rn]",
	(!cast<Instruction>(NAME # "Offset") GPR64sp:$Rt, GPR64sp:$Rn, 0)>;			(!cast<Instruction>(NAME # "Offset") GPR64sp:$Rt, GPR64sp:$Rn, 0)>;
	}			}

	//---			//---
	▲ Show 20 Lines • Show All 6,571 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AArch64/AArch64InstrInfo.cpp

Show First 20 Lines • Show All 1,766 Lines • ▼ Show 20 Lines	unsigned AArch64InstrInfo::getLoadStoreImmIdx(unsigned Opc) {
case AArch64::LDPSi:		case AArch64::LDPSi:
case AArch64::STPWi:		case AArch64::STPWi:
case AArch64::STPSi:		case AArch64::STPSi:
case AArch64::LDNPWi:		case AArch64::LDNPWi:
case AArch64::LDNPSi:		case AArch64::LDNPSi:
case AArch64::STNPWi:		case AArch64::STNPWi:
case AArch64::STNPSi:		case AArch64::STNPSi:
case AArch64::LDG:		case AArch64::LDG:
		case AArch64::STGPi:
return 3;		return 3;
case AArch64::ADDG:		case AArch64::ADDG:
case AArch64::STGOffset:		case AArch64::STGOffset:
return 2;		return 2;
}		}
}		}

bool AArch64InstrInfo::isPairableLdStInst(const MachineInstr &MI) {		bool AArch64InstrInfo::isPairableLdStInst(const MachineInstr &MI) {
▲ Show 20 Lines • Show All 363 Lines • ▼ Show 20 Lines	bool AArch64InstrInfo::getMemOpInfo(unsigned Opcode, unsigned &Scale,
case AArch64::LDRSBXui:		case AArch64::LDRSBXui:
case AArch64::STRBui:		case AArch64::STRBui:
case AArch64::STRBBui:		case AArch64::STRBBui:
Scale = Width = 1;		Scale = Width = 1;
MinOffset = 0;		MinOffset = 0;
MaxOffset = 4095;		MaxOffset = 4095;
break;		break;
case AArch64::ADDG:		case AArch64::ADDG:
		case AArch64::TAGPstack:
Scale = 16;		Scale = 16;
Width = 0;		Width = 0;
MinOffset = 0;		MinOffset = 0;
MaxOffset = 63;		MaxOffset = 63;
break;		break;
case AArch64::LDG:		case AArch64::LDG:
case AArch64::STGOffset:		case AArch64::STGOffset:
		case AArch64::STZGOffset:
Scale = Width = 16;		Scale = Width = 16;
MinOffset = -256;		MinOffset = -256;
MaxOffset = 255;		MaxOffset = 255;
break;		break;
		case AArch64::ST2GOffset:
		case AArch64::STZ2GOffset:
		Scale = 16;
		Width = 32;
		MinOffset = -256;
		MaxOffset = 255;
		break;
		case AArch64::STGPi:
		Scale = Width = 16;
		MinOffset = -64;
		MaxOffset = 63;
		break;
}		}

return true;		return true;
}		}

static unsigned getOffsetStride(unsigned Opc) {		static unsigned getOffsetStride(unsigned Opc) {
switch (Opc) {		switch (Opc) {
default:		default:
▲ Show 20 Lines • Show All 1,079 Lines • ▼ Show 20 Lines	int llvm::isAArch64FrameOffsetLegal(const MachineInstr &MI, int &Offset,
case AArch64::LD1Threev1d:		case AArch64::LD1Threev1d:
case AArch64::LD1Fourv1d:		case AArch64::LD1Fourv1d:
case AArch64::ST1Twov2d:		case AArch64::ST1Twov2d:
case AArch64::ST1Threev2d:		case AArch64::ST1Threev2d:
case AArch64::ST1Fourv2d:		case AArch64::ST1Fourv2d:
case AArch64::ST1Twov1d:		case AArch64::ST1Twov1d:
case AArch64::ST1Threev1d:		case AArch64::ST1Threev1d:
case AArch64::ST1Fourv1d:		case AArch64::ST1Fourv1d:
		case AArch64::IRG:
		case AArch64::IRGstack:
return AArch64FrameOffsetCannotUpdate;		return AArch64FrameOffsetCannotUpdate;
}		}

// Get the min/max offset and the scale.		// Get the min/max offset and the scale.
unsigned Scale, Width;		unsigned Scale, Width;
int64_t MinOff, MaxOff;		int64_t MinOff, MaxOff;
if (!AArch64InstrInfo::getMemOpInfo(MI.getOpcode(), Scale, Width, MinOff,		if (!AArch64InstrInfo::getMemOpInfo(MI.getOpcode(), Scale, Width, MinOff,
MaxOff))		MaxOff))
▲ Show 20 Lines • Show All 2,277 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AArch64/AArch64InstrInfo.td

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 403 Lines • ▼ Show 20 Lines

def AArch64saddv : SDNode<"AArch64ISD::SADDV", SDT_AArch64UnaryVec>;		def AArch64saddv : SDNode<"AArch64ISD::SADDV", SDT_AArch64UnaryVec>;
def AArch64uaddv : SDNode<"AArch64ISD::UADDV", SDT_AArch64UnaryVec>;		def AArch64uaddv : SDNode<"AArch64ISD::UADDV", SDT_AArch64UnaryVec>;
def AArch64sminv : SDNode<"AArch64ISD::SMINV", SDT_AArch64UnaryVec>;		def AArch64sminv : SDNode<"AArch64ISD::SMINV", SDT_AArch64UnaryVec>;
def AArch64uminv : SDNode<"AArch64ISD::UMINV", SDT_AArch64UnaryVec>;		def AArch64uminv : SDNode<"AArch64ISD::UMINV", SDT_AArch64UnaryVec>;
def AArch64smaxv : SDNode<"AArch64ISD::SMAXV", SDT_AArch64UnaryVec>;		def AArch64smaxv : SDNode<"AArch64ISD::SMAXV", SDT_AArch64UnaryVec>;
def AArch64umaxv : SDNode<"AArch64ISD::UMAXV", SDT_AArch64UnaryVec>;		def AArch64umaxv : SDNode<"AArch64ISD::UMAXV", SDT_AArch64UnaryVec>;

		def SDT_AArch64SETTAG : SDTypeProfile<0, 2, [SDTCisPtrTy<0>, SDTCisPtrTy<1>]>;
		def AArch64stg : SDNode<"AArch64ISD::STG", SDT_AArch64SETTAG, [SDNPHasChain, SDNPMayStore, SDNPMemOperand]>;
		def AArch64stzg : SDNode<"AArch64ISD::STZG", SDT_AArch64SETTAG, [SDNPHasChain, SDNPMayStore, SDNPMemOperand]>;
		def AArch64st2g : SDNode<"AArch64ISD::ST2G", SDT_AArch64SETTAG, [SDNPHasChain, SDNPMayStore, SDNPMemOperand]>;
		def AArch64stz2g : SDNode<"AArch64ISD::STZ2G", SDT_AArch64SETTAG, [SDNPHasChain, SDNPMayStore, SDNPMemOperand]>;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

// AArch64 Instruction Predicate Definitions.		// AArch64 Instruction Predicate Definitions.
// We could compute these on a per-module basis but doing so requires accessing		// We could compute these on a per-module basis but doing so requires accessing
// the Function object through the <Target>Subtarget and objections were raised		// the Function object through the <Target>Subtarget and objections were raised
// to that (see post-commit review comments for r301750).		// to that (see post-commit review comments for r301750).
▲ Show 20 Lines • Show All 864 Lines • ▼ Show 20 Lines	def STZGM : MemTagVector<0, "stzgm", "\t$Rt, [$Rn]",
let Inst{23} = 0;		let Inst{23} = 0;
}		}

defm STG : MemTagStore<0b00, "stg">;		defm STG : MemTagStore<0b00, "stg">;
defm STZG : MemTagStore<0b01, "stzg">;		defm STZG : MemTagStore<0b01, "stzg">;
defm ST2G : MemTagStore<0b10, "st2g">;		defm ST2G : MemTagStore<0b10, "st2g">;
defm STZ2G : MemTagStore<0b11, "stz2g">;		defm STZ2G : MemTagStore<0b11, "stz2g">;

		def : Pat<(AArch64stg GPR64sp:$Rn, (am_indexeds9s128 GPR64sp:$Rm, simm9s16:$imm)),
		(STGOffset $Rn, $Rm, $imm)>;
		def : Pat<(AArch64stzg GPR64sp:$Rn, (am_indexeds9s128 GPR64sp:$Rm, simm9s16:$imm)),
		(STZGOffset $Rn, $Rm, $imm)>;
		def : Pat<(AArch64st2g GPR64sp:$Rn, (am_indexeds9s128 GPR64sp:$Rm, simm9s16:$imm)),
		(ST2GOffset $Rn, $Rm, $imm)>;
		def : Pat<(AArch64stz2g GPR64sp:$Rn, (am_indexeds9s128 GPR64sp:$Rm, simm9s16:$imm)),
		(STZ2GOffset $Rn, $Rm, $imm)>;

defm STGP : StorePairOffset <0b01, 0, GPR64z, simm7s16, "stgp">;		defm STGP : StorePairOffset <0b01, 0, GPR64z, simm7s16, "stgp">;
def STGPpre : StorePairPreIdx <0b01, 0, GPR64z, simm7s16, "stgp">;		def STGPpre : StorePairPreIdx <0b01, 0, GPR64z, simm7s16, "stgp">;
def STGPpost : StorePairPostIdx<0b01, 0, GPR64z, simm7s16, "stgp">;		def STGPpost : StorePairPostIdx<0b01, 0, GPR64z, simm7s16, "stgp">;

def : Pat<(int_aarch64_stg GPR64:$Rt, (am_indexeds9s128 GPR64sp:$Rn, simm9s16:$offset)),		def : Pat<(int_aarch64_stg GPR64:$Rt, (am_indexeds9s128 GPR64sp:$Rn, simm9s16:$offset)),
(STGOffset GPR64:$Rt, GPR64sp:$Rn, simm9s16:$offset)>;		(STGOffset GPR64:$Rt, GPR64sp:$Rn, simm9s16:$offset)>;

		def : Pat<(int_aarch64_stgp (am_indexed7s128 GPR64sp:$Rn, simm7s16:$imm), GPR64:$Rt, GPR64:$Rt2),
		(STGPi $Rt, $Rt2, $Rn, $imm)>;

		def IRGstack
		: Pseudo<(outs GPR64sp:$Rd), (ins GPR64sp:$Rsp, GPR64:$Rm), []>,
		Sched<[]>;
		def TAGPstack
		: Pseudo<(outs GPR64sp:$Rd), (ins GPR64sp:$Rn, uimm6s16:$imm6, GPR64sp:$Rm, imm0_15:$imm4), []>,
		Sched<[]>;

		// Explicit SP in the first operand prevents ShrinkWrap optimization
		// from leaving this instruction out of the stack frame. When IRGstack
		// is transformed into IRG, this operand is replaced with the actual
		// register / expression for the tagged base pointer of the current function.
		def : Pat<(int_aarch64_irg_sp i64:$Rm), (IRGstack SP, i64:$Rm)>;

		// Large STG to be expanded into a loop. $Rm is the size, $Rn is start address.
		// $Rn_wback is one past the end of the range.
		let isCodeGenOnly=1, mayStore=1 in {
		def STGloop
		: Pseudo<(outs GPR64common:$Rm_wback, GPR64sp:$Rn_wback), (ins GPR64common:$Rm, GPR64sp:$Rn),
		[], "$Rn = $Rn_wback,@earlyclobber $Rn_wback,$Rm = $Rm_wback,@earlyclobber $Rm_wback" >,
		Sched<[WriteAdr, WriteST]>;

		def STZGloop
		: Pseudo<(outs GPR64common:$Rm_wback, GPR64sp:$Rn_wback), (ins GPR64common:$Rm, GPR64sp:$Rn),
		[], "$Rn = $Rn_wback,@earlyclobber $Rn_wback,$Rm = $Rm_wback,@earlyclobber $Rm_wback" >,
		Sched<[WriteAdr, WriteST]>;
		}

} // Predicates = [HasMTE]		} // Predicates = [HasMTE]

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Logical instructions.		// Logical instructions.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

// (immediate)		// (immediate)
defm ANDS : LogicalImmS<0b11, "ands", AArch64and_flag, "bics">;		defm ANDS : LogicalImmS<0b11, "ands", AArch64and_flag, "bics">;
▲ Show 20 Lines • Show All 5,592 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AArch64/AArch64MachineFunctionInfo.h

Show First 20 Lines • Show All 99 Lines • ▼ Show 20 Lines	class AArch64FunctionInfo final : public MachineFunctionInfo {
/// redzone, and no value otherwise.		/// redzone, and no value otherwise.
/// Initialized during frame lowering, unless the function has the noredzone		/// Initialized during frame lowering, unless the function has the noredzone
/// attribute, in which case it is set to false at construction.		/// attribute, in which case it is set to false at construction.
Optional<bool> HasRedZone;		Optional<bool> HasRedZone;

/// ForwardedMustTailRegParms - A list of virtual and physical registers		/// ForwardedMustTailRegParms - A list of virtual and physical registers
/// that must be forwarded to every musttail call.		/// that must be forwarded to every musttail call.
SmallVector<ForwardedRegister, 1> ForwardedMustTailRegParms;		SmallVector<ForwardedRegister, 1> ForwardedMustTailRegParms;

		// Offset from SP-at-entry to the tagged base pointer.
		// Tagged base pointer is set up to point to the first (lowest address) tagged
		// stack slot.
		unsigned TaggedBasePointerOffset;

public:		public:
AArch64FunctionInfo() = default;		AArch64FunctionInfo() = default;

explicit AArch64FunctionInfo(MachineFunction &MF) {		explicit AArch64FunctionInfo(MachineFunction &MF) {
(void)MF;		(void)MF;

// If we already know that the function doesn't have a redzone, set		// If we already know that the function doesn't have a redzone, set
// HasRedZone here.		// HasRedZone here.
▲ Show 20 Lines • Show All 103 Lines • ▼ Show 20 Lines	void addLOHDirective(MCLOHType Kind, MILOHArgs Args) {
LOHContainerSet.push_back(MILOHDirective(Kind, Args));		LOHContainerSet.push_back(MILOHDirective(Kind, Args));
LOHRelated.insert(Args.begin(), Args.end());		LOHRelated.insert(Args.begin(), Args.end());
}		}

SmallVectorImpl<ForwardedRegister> &getForwardedMustTailRegParms() {		SmallVectorImpl<ForwardedRegister> &getForwardedMustTailRegParms() {
return ForwardedMustTailRegParms;		return ForwardedMustTailRegParms;
}		}

		unsigned getTaggedBasePointerOffset() const {
		return TaggedBasePointerOffset;
		}
		void setTaggedBasePointerOffset(unsigned Offset) {
		TaggedBasePointerOffset = Offset;
		}

private:		private:
// Hold the lists of LOHs.		// Hold the lists of LOHs.
MILOHContainer LOHContainerSet;		MILOHContainer LOHContainerSet;
SetOfInstructions LOHRelated;		SetOfInstructions LOHRelated;

DenseMap<int, std::pair<unsigned, MCSymbol *>> JumpTableEntryInfo;		DenseMap<int, std::pair<unsigned, MCSymbol *>> JumpTableEntryInfo;
};		};

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_LIB_TARGET_AARCH64_AARCH64MACHINEFUNCTIONINFO_H		#endif // LLVM_LIB_TARGET_AARCH64_AARCH64MACHINEFUNCTIONINFO_H

llvm/trunk/lib/Target/AArch64/AArch64RegisterInfo.cpp

Show First 20 Lines • Show All 462 Lines • ▼ Show 20 Lines	void AArch64RegisterInfo::eliminateFrameIndex(MachineBasicBlock::iterator II,

if (MI.getOpcode() == TargetOpcode::LOCAL_ESCAPE) {		if (MI.getOpcode() == TargetOpcode::LOCAL_ESCAPE) {
MachineOperand &FI = MI.getOperand(FIOperandNum);		MachineOperand &FI = MI.getOperand(FIOperandNum);
Offset = TFI->getNonLocalFrameIndexReference(MF, FrameIndex);		Offset = TFI->getNonLocalFrameIndexReference(MF, FrameIndex);
FI.ChangeToImmediate(Offset);		FI.ChangeToImmediate(Offset);
return;		return;
}		}

// Modify MI as necessary to handle as much of 'Offset' as possible		if (MI.getOpcode() == AArch64::TAGPstack) {
		// TAGPstack must use the virtual frame register in its 3rd operand.
		const MachineFrameInfo &MFI = MF.getFrameInfo();
		const AArch64FunctionInfo *AFI = MF.getInfo<AArch64FunctionInfo>();
		FrameReg = MI.getOperand(3).getReg();
		Offset =
		MFI.getObjectOffset(FrameIndex) + AFI->getTaggedBasePointerOffset();
		} else {
Offset = TFI->resolveFrameIndexReference(		Offset = TFI->resolveFrameIndexReference(
MF, FrameIndex, FrameReg, /PreferFP=/false, /ForSimm=/true);		MF, FrameIndex, FrameReg, /PreferFP=/false, /ForSimm=/true);
		}

		// Modify MI as necessary to handle as much of 'Offset' as possible
if (rewriteAArch64FrameIndex(MI, FIOperandNum, FrameReg, Offset, TII))		if (rewriteAArch64FrameIndex(MI, FIOperandNum, FrameReg, Offset, TII))
return;		return;

assert((!RS \|\| !RS->isScavengingFrameIndex(FrameIndex)) &&		assert((!RS \|\| !RS->isScavengingFrameIndex(FrameIndex)) &&
"Emergency spill slot is out of reach");		"Emergency spill slot is out of reach");

// If we get here, the immediate doesn't fit into the instruction. We folded		// If we get here, the immediate doesn't fit into the instruction. We folded
// as much as possible above. Handle the rest, providing a register that is		// as much as possible above. Handle the rest, providing a register that is
▲ Show 20 Lines • Show All 55 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/AArch64/AArch64SelectionDAGInfo.h

	Show All 17 Lines
	namespace llvm {			namespace llvm {

	class AArch64SelectionDAGInfo : public SelectionDAGTargetInfo {			class AArch64SelectionDAGInfo : public SelectionDAGTargetInfo {
	public:			public:
	SDValue EmitTargetCodeForMemset(SelectionDAG &DAG, const SDLoc &dl,			SDValue EmitTargetCodeForMemset(SelectionDAG &DAG, const SDLoc &dl,
	SDValue Chain, SDValue Dst, SDValue Src,			SDValue Chain, SDValue Dst, SDValue Src,
	SDValue Size, unsigned Align, bool isVolatile,			SDValue Size, unsigned Align, bool isVolatile,
	MachinePointerInfo DstPtrInfo) const override;			MachinePointerInfo DstPtrInfo) const override;
				SDValue EmitTargetCodeForSetTag(SelectionDAG &DAG, const SDLoc &dl,
				SDValue Chain, SDValue Op1, SDValue Op2,
				MachinePointerInfo DstPtrInfo,
				bool ZeroData) const override;
	bool generateFMAsInMachineCombiner(CodeGenOpt::Level OptLevel) const override;			bool generateFMAsInMachineCombiner(CodeGenOpt::Level OptLevel) const override;
	};			};
	}			}

	#endif			#endif

llvm/trunk/lib/Target/AArch64/AArch64SelectionDAGInfo.cpp

Show First 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	if (bzeroName && (!SizeValue \|\| SizeValue->getZExtValue() > 256)) {
return CallResult.second;		return CallResult.second;
}		}
return SDValue();		return SDValue();
}		}
bool AArch64SelectionDAGInfo::generateFMAsInMachineCombiner(		bool AArch64SelectionDAGInfo::generateFMAsInMachineCombiner(
CodeGenOpt::Level OptLevel) const {		CodeGenOpt::Level OptLevel) const {
return OptLevel >= CodeGenOpt::Aggressive;		return OptLevel >= CodeGenOpt::Aggressive;
}		}

		static const int kSetTagLoopThreshold = 176;

		static SDValue EmitUnrolledSetTag(SelectionDAG &DAG, const SDLoc &dl,
		SDValue Chain, SDValue Ptr, uint64_t ObjSize,
		const MachineMemOperand *BaseMemOperand,
		bool ZeroData) {
		MachineFunction &MF = DAG.getMachineFunction();
		unsigned ObjSizeScaled = ObjSize / 16;

		SDValue TagSrc = Ptr;
		if (Ptr.getOpcode() == ISD::FrameIndex) {
		int FI = cast<FrameIndexSDNode>(Ptr)->getIndex();
		Ptr = DAG.getTargetFrameIndex(FI, MVT::i64);
		// A frame index operand may end up as [SP + offset] => it is fine to use SP
		// register as the tag source.
		TagSrc = DAG.getRegister(AArch64::SP, MVT::i64);
		}

		const unsigned OpCode1 = ZeroData ? AArch64ISD::STZG : AArch64ISD::STG;
		const unsigned OpCode2 = ZeroData ? AArch64ISD::STZ2G : AArch64ISD::ST2G;

		SmallVector<SDValue, 8> OutChains;
		unsigned OffsetScaled = 0;
		while (OffsetScaled < ObjSizeScaled) {
		if (ObjSizeScaled - OffsetScaled >= 2) {
		SDValue AddrNode = DAG.getMemBasePlusOffset(Ptr, OffsetScaled * 16, dl);
		SDValue St = DAG.getMemIntrinsicNode(
		OpCode2, dl, DAG.getVTList(MVT::Other),
		{Chain, TagSrc, AddrNode},
		MVT::v4i64,
		MF.getMachineMemOperand(BaseMemOperand, OffsetScaled * 16, 16 * 2));
		OffsetScaled += 2;
		OutChains.push_back(St);
		continue;
		}

		if (ObjSizeScaled - OffsetScaled > 0) {
		SDValue AddrNode = DAG.getMemBasePlusOffset(Ptr, OffsetScaled * 16, dl);
		SDValue St = DAG.getMemIntrinsicNode(
		OpCode1, dl, DAG.getVTList(MVT::Other),
		{Chain, TagSrc, AddrNode},
		MVT::v2i64,
		MF.getMachineMemOperand(BaseMemOperand, OffsetScaled * 16, 16));
		OffsetScaled += 1;
		OutChains.push_back(St);
		}
		}

		SDValue Res = DAG.getNode(ISD::TokenFactor, dl, MVT::Other, OutChains);
		return Res;
		}

		SDValue AArch64SelectionDAGInfo::EmitTargetCodeForSetTag(
		SelectionDAG &DAG, const SDLoc &dl, SDValue Chain, SDValue Addr,
		SDValue Size, MachinePointerInfo DstPtrInfo, bool ZeroData) const {
		uint64_t ObjSize = cast<ConstantSDNode>(Size)->getZExtValue();
		assert(ObjSize % 16 == 0);

		MachineFunction &MF = DAG.getMachineFunction();
		MachineMemOperand *BaseMemOperand = MF.getMachineMemOperand(
		DstPtrInfo, MachineMemOperand::MOStore, ObjSize, 16);

		bool UseSetTagRangeLoop =
		kSetTagLoopThreshold >= 0 && (int)ObjSize >= kSetTagLoopThreshold;
		if (!UseSetTagRangeLoop)
		return EmitUnrolledSetTag(DAG, dl, Chain, Addr, ObjSize, BaseMemOperand,
		ZeroData);

		if (ObjSize % 32 != 0) {
		SDNode *St1 = DAG.getMachineNode(
		ZeroData ? AArch64::STZGPostIndex : AArch64::STGPostIndex, dl,
		{MVT::i64, MVT::Other},
		{Addr, Addr, DAG.getTargetConstant(1, dl, MVT::i64), Chain});
		DAG.setNodeMemRefs(cast<MachineSDNode>(St1), {BaseMemOperand});
		ObjSize -= 16;
		Addr = SDValue(St1, 0);
		Chain = SDValue(St1, 1);
		}

		const EVT ResTys[] = {MVT::i64, MVT::i64, MVT::Other};
		SDValue Ops[] = {DAG.getConstant(ObjSize, dl, MVT::i64), Addr, Chain};
		SDNode *St = DAG.getMachineNode(
		ZeroData ? AArch64::STZGloop : AArch64::STGloop, dl, ResTys, Ops);

		DAG.setNodeMemRefs(cast<MachineSDNode>(St), {BaseMemOperand});
		return SDValue(St, 2);
		}

llvm/trunk/test/Analysis/ValueTracking/aarch64.irg.ll

	; RUN: opt -S -instsimplify -instcombine < %s \| FileCheck %s			; RUN: opt -S -instsimplify -instcombine < %s \| FileCheck %s

	; CHECK-LABEL: define void @checkNonnullIrg()			; CHECK-LABEL: define void @checkNonnullIrg()
	define void @checkNonnullIrg() {			define void @checkNonnullIrg() {
	; CHECK: %[[p:.]] = call i8 @llvm.aarch64.irg(i8* nonnull			; CHECK: %[[p:.]] = call i8 @llvm.aarch64.irg(i8* nonnull
	; CHECK: call void @use(i8* nonnull %[[p]])			; CHECK: call void @use(i8* nonnull %[[p]])
	entry:			entry:
	%0 = alloca i8, align 16			%0 = alloca i8, align 16

	%p = call i8* @llvm.aarch64.irg(i8* %0, i64 5)			%p = call i8* @llvm.aarch64.irg(i8* %0, i64 5)
	call void @use(i8* %p)			call void @use(i8* %p)

	ret void			ret void
	}			}

				; CHECK-LABEL: define void @checkNonnullTagp(
				define void @checkNonnullTagp(i8* %tag) {
				; CHECK: %[[p:.]] = call i8 @llvm.aarch64.tagp.p0i8(i8* nonnull %a, i8* %tag, i64 1)
				; CHECK: %[[p2:.]] = call i8 @llvm.aarch64.tagp.p0i8(i8* nonnull %[[p]], i8* %tag, i64 2)
				; CHECK: call void @use(i8* nonnull %[[p2]])
				entry:
				%a = alloca i8, align 8

				%p = call i8* @llvm.aarch64.tagp.p0i8(i8* %a, i8* %tag, i64 1)
				%p2 = call i8* @llvm.aarch64.tagp.p0i8(i8* %p, i8* %tag, i64 2)
				call void @use(i8* %p2)

				ret void
				}

	declare i8* @llvm.aarch64.irg(i8*, i64)			declare i8* @llvm.aarch64.irg(i8*, i64)
				declare i8* @llvm.aarch64.tagp.p0i8(i8, i8, i64)

	declare void @use(i8*)			declare void @use(i8*)

llvm/trunk/test/CodeGen/AArch64/irg.ll

				; RUN: llc < %s -mtriple=aarch64 -mattr=+mte \| FileCheck %s

				define i8* @irg_imm16(i8* %p) {
				entry:
				; CHECK-LABEL: irg_imm16:
				; CHECK: mov w[[R:[0-9]+]], #16
				; CHECK: irg x0, x0, x[[R]]
				; CHECK: ret
				%q = call i8* @llvm.aarch64.irg(i8* %p, i64 16)
				ret i8* %q
				}

				define i8* @irg_imm0(i8* %p) {
				entry:
				; CHECK-LABEL: irg_imm0:
				; CHECK: irg x0, x0{{$}}
				; CHECK: ret
				%q = call i8* @llvm.aarch64.irg(i8* %p, i64 0)
				ret i8* %q
				}

				define i8* @irg_reg(i8* %p, i64 %ex) {
				entry:
				; CHECK-LABEL: irg_reg:
				; CHECK: irg x0, x0, x1
				; CHECK: ret
				%q = call i8* @llvm.aarch64.irg(i8* %p, i64 %ex)
				ret i8* %q
				}

				; undef argument in irg is treated specially
				define i8* @irg_sp() {
				entry:
				; CHECK-LABEL: irg_sp:
				; CHECK: irg x0, sp{{$}}
				; CHECK: ret
				%q = call i8* @llvm.aarch64.irg.sp(i64 0)
				ret i8* %q
				}

				declare i8* @llvm.aarch64.irg(i8* %p, i64 %exclude)
				declare i8* @llvm.aarch64.irg.sp(i64 %exclude)

llvm/trunk/test/CodeGen/AArch64/irg_sp_tagp.ll

				; RUN: llc < %s -mtriple=aarch64 -mattr=+mte \| FileCheck %s

				define i8* @small_alloca() {
				entry:
				; CHECK-LABEL: small_alloca:
				; CHECK: irg [[R:x[0-9]+]], sp{{$}}
				; CHECK-NEXT: addg x0, [[R]], #0, #1
				; CHECK: ret
				%a = alloca i8, align 16
				%q = call i8* @llvm.aarch64.irg.sp(i64 0)
				%q1 = call i8* @llvm.aarch64.tagp.p0i8(i8* %a, i8* %q, i64 1)
				ret i8* %q1
				}

				; Two large allocas. One's offset overflows addg immediate.
				define void @huge_allocas() {
				entry:
				; CHECK-LABEL: huge_allocas:
				; CHECK: irg [[R:x[0-9]+]], sp{{$}}
				; CHECK: add [[TMP:x[0-9]+]], [[R]], #3088
				; CHECK: addg x0, [[TMP]], #1008, #1
				; CHECK: addg x1, [[R]], #0, #2
				; CHECK: bl use2
				%a = alloca i8, i64 4096, align 16
				%b = alloca i8, i64 4096, align 16
				%base = call i8* @llvm.aarch64.irg.sp(i64 0)
				%a_t = call i8* @llvm.aarch64.tagp.p0i8(i8* %a, i8* %base, i64 1)
				%b_t = call i8* @llvm.aarch64.tagp.p0i8(i8* %b, i8* %base, i64 2)
				call void @use2(i8* %a_t, i8* %b_t)
				ret void
				}

				; Realigned stack frame. IRG uses value of SP after realignment,
				; ADDG for the first stack allocation has offset 0.
				define void @realign() {
				entry:
				; CHECK-LABEL: realign:
				; CHECK: add x29, sp, #16
				; CHECK: and sp, x{{[0-9]*}}, #0xffffffffffffffc0
				; CHECK: irg [[R:x[0-9]+]], sp{{$}}
				; CHECK: addg x0, [[R]], #0, #1
				; CHECK: bl use
				%a = alloca i8, i64 4096, align 64
				%base = call i8* @llvm.aarch64.irg.sp(i64 0)
				%a_t = call i8* @llvm.aarch64.tagp.p0i8(i8* %a, i8* %base, i64 1)
				call void @use(i8* %a_t)
				ret void
				}

				; With a dynamic alloca, IRG has to use FP with non-zero offset.
				; ADDG offset for the single static alloca is still zero.
				define void @dynamic_alloca(i64 %size) {
				entry:
				; CHECK-LABEL: dynamic_alloca:
				; CHECK: sub [[R:x[0-9]+]], x29, #[[OFS:[0-9]+]]
				; CHECK: irg [[R]], [[R]]
				; CHECK: addg x1, [[R]], #0, #1
				; CHECK: sub x0, x29, #[[OFS]]
				; CHECK: bl use2
				%base = call i8* @llvm.aarch64.irg.sp(i64 0)
				%a = alloca i128, i64 %size, align 16
				%b = alloca i8, i64 16, align 16
				%b_t = call i8* @llvm.aarch64.tagp.p0i8(i8* %b, i8* %base, i64 1)
				call void @use2(i8* %b, i8* %b_t)
				ret void
				}

				; Both dynamic alloca and realigned frame.
				; After initial realignment, generate the base pointer.
				; IRG uses the base pointer w/o offset.
				; Offsets for tagged and untagged pointers to the same alloca match.
				define void @dynamic_alloca_and_realign(i64 %size) {
				entryz:
				; CHECK-LABEL: dynamic_alloca_and_realign:
				; CHECK: and sp, x{{.*}}, #0xffffffffffffffc0
				; CHECK: mov x19, sp
				; CHECK: irg [[R:x[0-9]+]], x19
				; CHECK: addg x1, [[R]], #[[OFS:[0-9]+]], #1
				; CHECK: add x0, x19, #[[OFS]]
				; CHECK: bl use2
				%base = call i8* @llvm.aarch64.irg.sp(i64 0)
				%a = alloca i128, i64 %size, align 64
				%b = alloca i8, i64 16, align 16
				%b_t = call i8* @llvm.aarch64.tagp.p0i8(i8* %b, i8* %base, i64 1)
				call void @use2(i8* %b, i8* %b_t)
				ret void
				}

				declare void @use(i8*)
				declare void @use2(i8, i8)

				declare i8* @llvm.aarch64.irg.sp(i64 %exclude)
				declare i8* @llvm.aarch64.tagp.p0i8(i8* %p, i8* %tag, i64 %ofs)

llvm/trunk/test/CodeGen/AArch64/settag.ll

				; RUN: llc < %s -mtriple=aarch64 -mattr=+mte \| FileCheck %s

				define void @stg1(i8* %p) {
				entry:
				; CHECK-LABEL: stg1:
				; CHECK: stg x0, [x0]
				; CHECK: ret
				call void @llvm.aarch64.settag(i8* %p, i64 16)
				ret void
				}

				define void @stg2(i8* %p) {
				entry:
				; CHECK-LABEL: stg2:
				; CHECK: st2g x0, [x0]
				; CHECK: ret
				call void @llvm.aarch64.settag(i8* %p, i64 32)
				ret void
				}

				define void @stg3(i8* %p) {
				entry:
				; CHECK-LABEL: stg3:
				; CHECK: stg x0, [x0, #32]
				; CHECK: st2g x0, [x0]
				; CHECK: ret
				call void @llvm.aarch64.settag(i8* %p, i64 48)
				ret void
				}

				define void @stg4(i8* %p) {
				entry:
				; CHECK-LABEL: stg4:
				; CHECK: st2g x0, [x0, #32]
				; CHECK: st2g x0, [x0]
				; CHECK: ret
				call void @llvm.aarch64.settag(i8* %p, i64 64)
				ret void
				}

				define void @stg5(i8* %p) {
				entry:
				; CHECK-LABEL: stg5:
				; CHECK: stg x0, [x0, #64]
				; CHECK: st2g x0, [x0, #32]
				; CHECK: st2g x0, [x0]
				; CHECK: ret
				call void @llvm.aarch64.settag(i8* %p, i64 80)
				ret void
				}

				define void @stg16(i8* %p) {
				entry:
				; CHECK-LABEL: stg16:
				; CHECK: mov {{(w\|x)}}[[R:[0-9]+]], #256
				; CHECK: st2g x0, [x0], #32
				; CHECK: sub x[[R]], x[[R]], #32
				; CHECK: cbnz x[[R]],
				; CHECK: ret
				call void @llvm.aarch64.settag(i8* %p, i64 256)
				ret void
				}

				define void @stg17(i8* %p) {
				entry:
				; CHECK-LABEL: stg17:
				; CHECK: mov {{(w\|x)}}[[R:[0-9]+]], #256
				; CHECK: stg x0, [x0], #16
				; CHECK: st2g x0, [x0], #32
				; CHECK: sub x[[R]], x[[R]], #32
				; CHECK: cbnz x[[R]],
				; CHECK: ret
				call void @llvm.aarch64.settag(i8* %p, i64 272)
				ret void
				}

				define void @stzg3(i8* %p) {
				entry:
				; CHECK-LABEL: stzg3:
				; CHECK: stzg x0, [x0, #32]
				; CHECK: stz2g x0, [x0]
				; CHECK: ret
				call void @llvm.aarch64.settag.zero(i8* %p, i64 48)
				ret void
				}

				define void @stzg17(i8* %p) {
				entry:
				; CHECK-LABEL: stzg17:
				; CHECK: mov {{w\|x}}[[R:[0-9]+]], #256
				; CHECK: stzg x0, [x0], #16
				; CHECK: stz2g x0, [x0], #32
				; CHECK: sub x[[R]], x[[R]], #32
				; CHECK: cbnz x[[R]],
				; CHECK: ret
				call void @llvm.aarch64.settag.zero(i8* %p, i64 272)
				ret void
				}

				define void @stg_alloca1() {
				entry:
				; CHECK-LABEL: stg_alloca1:
				; CHECK: stg sp, [sp]
				; CHECK: ret
				%a = alloca i8, i32 16, align 16
				call void @llvm.aarch64.settag(i8* %a, i64 16)
				ret void
				}

				define void @stg_alloca5() {
				entry:
				; CHECK-LABEL: stg_alloca5:
				; CHECK: stg sp, [sp, #64]
				; CHECK: st2g sp, [sp, #32]
				; CHECK: st2g sp, [sp]
				; CHECK: ret
				%a = alloca i8, i32 80, align 16
				call void @llvm.aarch64.settag(i8* %a, i64 80)
				ret void
				}

				define void @stg_alloca17() {
				entry:
				; CHECK-LABEL: stg_alloca17:
				; CHECK: mov [[P:x[0-9]+]], sp
				; CHECK: stg [[P]], {{\[}}[[P]]{{\]}}, #16
				; CHECK: mov {{w\|x}}[[R:[0-9]+]], #256
				; CHECK: st2g [[P]], {{\[}}[[P]]{{\]}}, #32
				; CHECK: sub x[[R]], x[[R]], #32
				; CHECK: cbnz x[[R]],
				; CHECK: ret
				%a = alloca i8, i32 272, align 16
				call void @llvm.aarch64.settag(i8* %a, i64 272)
				ret void
				}

				declare void @llvm.aarch64.settag(i8* %p, i64 %a)
				declare void @llvm.aarch64.settag.zero(i8* %p, i64 %a)

llvm/trunk/test/CodeGen/AArch64/stgp.ll

				; RUN: llc < %s -mtriple=aarch64 -mattr=+mte \| FileCheck %s

				define void @stgp0(i64 %a, i64 %b, i8* %p) {
				entry:
				; CHECK-LABEL: stgp0:
				; CHECK: stgp x0, x1, [x2]
				; CHECK: ret
				call void @llvm.aarch64.stgp(i8* %p, i64 %a, i64 %b)
				ret void
				}

				define void @stgp1004(i64 %a, i64 %b, i8* %p) {
				entry:
				; CHECK-LABEL: stgp1004:
				; CHECK: add [[R:x[0-9]+]], x2, #1004
				; CHECK: stgp x0, x1, {{\[}}[[R]]{{\]}}
				; CHECK: ret
				%q = getelementptr i8, i8* %p, i32 1004
				call void @llvm.aarch64.stgp(i8* %q, i64 %a, i64 %b)
				ret void
				}

				define void @stgp1008(i64 %a, i64 %b, i8* %p) {
				entry:
				; CHECK-LABEL: stgp1008:
				; CHECK: stgp x0, x1, [x2, #1008]
				; CHECK: ret
				%q = getelementptr i8, i8* %p, i32 1008
				call void @llvm.aarch64.stgp(i8* %q, i64 %a, i64 %b)
				ret void
				}

				define void @stgp1024(i64 %a, i64 %b, i8* %p) {
				entry:
				; CHECK-LABEL: stgp1024:
				; CHECK: add [[R:x[0-9]+]], x2, #1024
				; CHECK: stgp x0, x1, {{\[}}[[R]]{{\]}}
				; CHECK: ret
				%q = getelementptr i8, i8* %p, i32 1024
				call void @llvm.aarch64.stgp(i8* %q, i64 %a, i64 %b)
				ret void
				}

				define void @stgp_1024(i64 %a, i64 %b, i8* %p) {
				entry:
				; CHECK-LABEL: stgp_1024:
				; CHECK: stgp x0, x1, [x2, #-1024]
				; CHECK: ret
				%q = getelementptr i8, i8* %p, i32 -1024
				call void @llvm.aarch64.stgp(i8* %q, i64 %a, i64 %b)
				ret void
				}

				define void @stgp_1040(i64 %a, i64 %b, i8* %p) {
				entry:
				; CHECK-LABEL: stgp_1040:
				; CHECK: sub [[R:x[0-9]+]], x2, #1040
				; CHECK: stgp x0, x1, [x{{.*}}]
				; CHECK: ret
				%q = getelementptr i8, i8* %p, i32 -1040
				call void @llvm.aarch64.stgp(i8* %q, i64 %a, i64 %b)
				ret void
				}

				define void @stgp_alloca(i64 %a, i64 %b) {
				entry:
				; CHECK-LABEL: stgp_alloca:
				; CHECK: stgp x0, x1, [sp]
				; CHECK: stgp x1, x0, [sp, #16]
				; CHECK: ret
				%x = alloca i8, i32 32, align 16
				call void @llvm.aarch64.stgp(i8* %x, i64 %a, i64 %b)
				%x1 = getelementptr i8, i8* %x, i32 16
				call void @llvm.aarch64.stgp(i8* %x1, i64 %b, i64 %a)
				ret void
				}

				declare void @llvm.aarch64.stgp(i8* %p, i64 %a, i64 %b)

llvm/trunk/test/CodeGen/AArch64/tagp.ll

				; RUN: llc < %s -mtriple=aarch64 -mattr=+mte \| FileCheck %s

				define i8* @tagp2(i8* %p, i8* %tag) {
				entry:
				; CHECK-LABEL: tagp2:
				; CHECK: subp [[R:x[0-9]+]], x0, x1
				; CHECK: add [[R]], [[R]], x1
				; CHECK: addg x0, [[R]], #0, #2
				; CHECK: ret
				%q = call i8* @llvm.aarch64.tagp.p0i8(i8* %p, i8* %tag, i64 2)
				ret i8* %q
				}

				define i8* @irg_tagp_unrelated(i8* %p, i8* %q) {
				entry:
				; CHECK-LABEL: irg_tagp_unrelated:
				; CHECK: irg [[R0:x[0-9]+]], x0{{$}}
				; CHECK: subp [[R:x[0-9]+]], [[R0]], x1
				; CHECK: add [[R]], [[R0]], x1
				; CHECK: addg x0, [[R]], #0, #1
				; CHECK: ret
				%p1 = call i8* @llvm.aarch64.irg(i8* %p, i64 0)
				%q1 = call i8* @llvm.aarch64.tagp.p0i8(i8* %p1, i8* %q, i64 1)
				ret i8* %q1
				}

				define i8* @tagp_alloca(i8* %tag) {
				entry:
				; CHECK-LABEL: tagp_alloca:
				; CHECK: mov [[R0:x[0-9]+]], sp{{$}}
				; CHECK: subp [[R:x[0-9]+]], [[R0]], x0{{$}}
				; CHECK: add [[R]], [[R0]], x0{{$}}
				; CHECK: addg x0, [[R]], #0, #3
				; CHECK: ret
				%a = alloca i8, align 16
				%q = call i8* @llvm.aarch64.tagp.p0i8(i8* %a, i8* %tag, i64 3)
				ret i8* %q
				}

				declare i8* @llvm.aarch64.irg(i8* %p, i64 %exclude)
				declare i8* @llvm.aarch64.tagp.p0i8(i8* %p, i8* %tag, i64 %ofs)

This is an archive of the discontinued LLVM Phabricator instance.

Basic codegen for MTE stack tagging.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 210390

llvm/trunk/include/llvm/CodeGen/SelectionDAGTargetInfo.h

llvm/trunk/include/llvm/IR/IntrinsicsAArch64.td

llvm/trunk/lib/Analysis/ValueTracking.cpp

llvm/trunk/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/trunk/lib/Target/AArch64/AArch64ExpandPseudoInsts.cpp

llvm/trunk/lib/Target/AArch64/AArch64FrameLowering.cpp

llvm/trunk/lib/Target/AArch64/AArch64ISelDAGToDAG.cpp

llvm/trunk/lib/Target/AArch64/AArch64ISelLowering.h

llvm/trunk/lib/Target/AArch64/AArch64ISelLowering.cpp

llvm/trunk/lib/Target/AArch64/AArch64InstrFormats.td

llvm/trunk/lib/Target/AArch64/AArch64InstrInfo.cpp

llvm/trunk/lib/Target/AArch64/AArch64InstrInfo.td

llvm/trunk/lib/Target/AArch64/AArch64MachineFunctionInfo.h

llvm/trunk/lib/Target/AArch64/AArch64RegisterInfo.cpp

llvm/trunk/lib/Target/AArch64/AArch64SelectionDAGInfo.h

llvm/trunk/lib/Target/AArch64/AArch64SelectionDAGInfo.cpp

llvm/trunk/test/Analysis/ValueTracking/aarch64.irg.ll

llvm/trunk/test/CodeGen/AArch64/irg.ll

llvm/trunk/test/CodeGen/AArch64/irg_sp_tagp.ll

llvm/trunk/test/CodeGen/AArch64/settag.ll

llvm/trunk/test/CodeGen/AArch64/stgp.ll

llvm/trunk/test/CodeGen/AArch64/tagp.ll

Basic codegen for MTE stack tagging.
ClosedPublic