This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/CodeGen/
-
llvm/
-
CodeGen/
3
TargetInstrInfo.h
-
lib/
-
CodeGen/
9/15
ImplicitNullChecks.cpp
-
Target/
-
AArch64/
-
AArch64InstrInfo.h
-
AArch64InstrInfo.cpp
-
X86/
-
X86InstrInfo.h
2/3
X86InstrInfo.cpp
-
test/CodeGen/X86/
-
CodeGen/
-
X86/
1/1
implicit-null-check-negative.ll
2
implicit-null-check.ll
-
implicit-null-checks.mir

Differential D87148

[ImplicitNullCheck] Handle Nonzero faulting pages and complex addressing
ClosedPublic

Authored by anna on Sep 4 2020, 8:43 AM.

Download Raw Diff

Details

Reviewers

dantrushin
skatkov
reames

Commits

rG35cb45c533fb: [ImplicitNullChecks] Support complex addressing mode

Summary

This patch:

introduces a new TII API for calculating the address of a memory operation in complex addressing mode. This is a general function, which checks for both bas and index registers. This is used in isSuitableMemoryOp in ImplicitNullChecks and all the validity checks are done in this caller function. Earlier, we used to bail out if the memory operation had an index register or a scale != 1 (i.e. we supported offsets with simple addressing mode).
handles memory accesses (and makes the null checks implicit) where the faulting page is a non-zero one as

specified through a named module level metadata startaddress_faulting_pages introduced in this patch. The optional metadata is added by front-ends.

See #2 in action with added MIR tests for positive and negative case. #1
allows us to handle a case which was not handled earlier (see
implicit-null-checks.ll).

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

anna created this revision.Sep 4 2020, 8:43 AM

Herald added a project: Restricted Project. · View Herald TranscriptSep 4 2020, 8:43 AM

Herald added a subscriber: hiraditya. · View Herald Transcript

anna requested review of this revision.Sep 4 2020, 8:43 AM

Harbormaster completed remote builds in B70665: Diff 289967.Sep 4 2020, 8:44 AM

anna added a parent revision: D87108: [ImplicitNullCheck] Handle instructions that do not modify null behaviour of null checked reg.Sep 4 2020, 8:46 AM

While working on a follow-up change, I realized we don't need to separate out whether we're handling simple or complex addressing mode based on whether we fault in zero page or not. Both are orthogonal issues. The reason I coupled them is because we still need a way to get the "total constant address" *if it exists* and make a distinction between when we succeeded to get one versus when there was no such constant address apart from the offset.

handles complex addressing mode using base and index operands. All test cases pass.
I plan to add more counter examples once the basic idea is agreed upon.

anna added inline comments.Sep 7 2020, 7:52 PM

llvm/test/CodeGen/X86/implicit-null-check-negative.ll
132	I added these to show that none of the negative test cases start failing once we add the metadata with "other faulting pages".

Harbormaster completed remote builds in B70891: Diff 290395.Sep 7 2020, 8:21 PM

dantrushin added inline comments.Sep 8 2020, 7:29 AM

llvm/include/llvm/CodeGen/TargetInstrInfo.h
1305	These names are confusing. I've been scrolling back and forth for several minutes :) Why not to pass and name them separately?
llvm/lib/CodeGen/ImplicitNullChecks.cpp
390	Do you really need to check that `BaseRegUsedInConstAddr.first` is not zero? Would simple `BaseRegUsedInConstAddr.first != PointerReg` work here? I also found positive names more readable, e.g. `bool BaseRegIsNullChecked = BaseRegUsedInConstAddr.first == PointerReg` I think it is reasonable to assert `Scale !=0 \|\| IndexRegUsedInConstAddr.first == 0` and remove Scale check.
llvm/lib/Target/X86/X86InstrInfo.cpp
3680	Any reason to use `auto *` instead of `auto &` ? If I read it correctly, `BaseOp` is used to get base register only. Using reference you'd have less characters to type :)
3693	ditto. `auto &IndexOp = MI.getOperand()` ?

anna edited reviewers, added: reames; removed: philip.Sep 8 2020, 8:04 AM

addressed review comments. Added more testcases (positive and countercases).

anna added inline comments.Sep 10 2020, 10:08 AM

llvm/lib/CodeGen/ImplicitNullChecks.cpp
390	Changed the names and hopefully more readable. We need to check that the register exists before checking if it is equal to the null checked register because we don't want to fail when we don't have the register (in that case, we wont be using it in the addres calculation). Here we will state that the memory op is unsuitable and then fail. I realized this through some failing test case `imp_null_check_load` in implicit-null-checks.ll. There, we do not use the index register, so it is zero.

Thanks, it reads much better for me now.
Looks good.

anna marked 2 inline comments as done.Sep 10 2020, 10:44 AM

Harbormaster completed remote builds in B71252: Diff 291011.Sep 10 2020, 10:50 AM

anna edited the summary of this revision. (Show Details)Sep 10 2020, 10:58 AM

Must fix:

Your change does not include a LangRef update for the new metadata.
The name chosen for the metadata is ambiguous. I'd also suggest phrasing it in terms of ranges, not page starts. Maybe: guaranteed_faulting_ranges, or implicit_check_faulting_ranges?
As you note in the review, this is mixing two sets of changes in a way which makes it hard to reason about either in isolation. Please split. Either order is fine as you should be able to test either piece in isolation without the other.

llvm/include/llvm/CodeGen/TargetInstrInfo.h
1301	I find the terminology used here to be very confusing. A constant address to me is either a literal constant or a constant expression. You appear to be referring to complex addressing instead. I believe that your ConstantAddr is the Displacement field in a x86 complex addressing, is that true? If so, rename required. If not, please explain.
llvm/lib/CodeGen/ImplicitNullChecks.cpp
476	From your comment, it sounds like the existing code handles two built in ranges, not one. 0, PageSize 0x100..00, 0x111..11 (i.e. the zeroth page and kernel address range for Linux) I'd suggest adjusting comments and code structure to reflect same, and to make range handling (not page starts) explicit.
llvm/lib/Target/X86/X86InstrInfo.cpp
3701	This really doesn't seem like it belongs here. I think this could be cleanly moved into the caller, simplifying the behaviour of this routine greatly.

This revision now requires changes to proceed.Sep 11 2020, 9:32 AM

addressed review comments about splitting out patches. This patch handles complex addressing mode.

Harbormaster completed remote builds in B72824: Diff 294075.Sep 24 2020, 9:32 AM

Looks close to ready. Please address review comments, and we'll probably be ready for an LGTM.

llvm/include/llvm/CodeGen/TargetInstrInfo.h
980	"is an instruction"
llvm/lib/CodeGen/ImplicitNullChecks.cpp
406	These should be impossible, use asserts.
424	There's a potential overflow case here which needs consideration. Consider 0xF...F as constant and scale as 2. You probably need to return some information about the register width/wrapping of the addressing mode to do this in a target independent way.
428	Please return true or false, then set the per register flag. This is too much coupling.

anna added inline comments.Sep 24 2020, 9:49 AM

llvm/lib/CodeGen/ImplicitNullChecks.cpp
406	Will use assert for the multiplier and add a comment why RegUsedInAddr can be zero reg, i.e. something like: `movq 8(,%rdi,8), %rax` here the BaseReg is X86::NoRegister and ScaleReg is rdi.
424	good catch! will take a look.
428	yes, agreed.

reames requested changes to this revision.Sep 29 2020, 9:47 AM

This revision now requires changes to proceed.Sep 29 2020, 9:47 AM

addressed review comments. Bails out in case of overflow (added testcase for multiply overflow).

Harbormaster completed remote builds in B73357: Diff 295023.Sep 29 2020, 10:38 AM

LGTM, but with required changes.

llvm/lib/CodeGen/ImplicitNullChecks.cpp
393	Please add a check here that the size of BaseReg, ScaledReg, and PointerReg are all equal. I'm concerned about implicit sign extension of registers, and the interaction with the displacement code below. The code you have may be correct, but I'd prefer a explicit bail out just to make it easier to tell.
405	You could if you wished do Displacement mod 2^64, but I don't see any point in that.
411	Add comment here: "If a register used in the address is constant, fold it's effect into the displacement for ease of analysis."
432	Replace IsSignedImmVal with constant true. This isn't "is negative", this is "interpret this as a signed number instead of unsigned".
434	You should be able to just define the APInt in the definition below. The bitwidth will come from the RHS.
441	e,g, APInt Product = Also, remember to constant fold isSignedImmVal
llvm/test/CodeGen/X86/implicit-null-check.ll
625	You don't appear to actually have a test here which uses a displacement large enough to need a base register, please add one. (i.e. modify this test to add not 3526, but 0xFFFFF0000 (or something similarly large)

This revision is now accepted and ready to land.Sep 29 2020, 11:41 AM

anna marked 5 inline comments as done.Oct 7 2020, 1:05 PM

anna added inline comments.

llvm/test/CodeGen/X86/implicit-null-check.ll
625	I have a test in the negative test cases file above where the displacement is large and outside the faulting page. I'll modify it to a very large value so that it is in a register and not an immediate.

addressed review comments, rebased and updated for AArch64 (this patch is NFC in behaviour for AArch64).

Harbormaster completed remote builds in B74339: Diff 296776.Oct 7 2020, 1:38 PM

This revision was landed with ongoing or failed builds.Oct 7 2020, 5:55 PM

Closed by commit rG35cb45c533fb: [ImplicitNullChecks] Support complex addressing mode (authored by anna). · Explain Why

This revision was automatically updated to reflect the committed changes.

anna added a commit: rG35cb45c533fb: [ImplicitNullChecks] Support complex addressing mode.

Hey JFYI this created some unused variables. I fixed with https://reviews.llvm.org/D89022

Revision Contents

Path

Size

llvm/

include/

llvm/

CodeGen/

TargetInstrInfo.h

28 lines

lib/

CodeGen/

ImplicitNullChecks.cpp

92 lines

Target/

AArch64/

AArch64InstrInfo.h

4 lines

AArch64InstrInfo.cpp

18 lines

X86/

X86InstrInfo.h

7 lines

X86InstrInfo.cpp

39 lines

test/

CodeGen/

X86/

implicit-null-check-negative.ll

20 lines

implicit-null-check.ll

34 lines

implicit-null-checks.mir

45 lines

Diff 296835

llvm/include/llvm/CodeGen/TargetInstrInfo.h

Show First 20 Lines • Show All 74 Lines • ▼ Show 20 Lines
/// Used to describe a register and immediate addition.		/// Used to describe a register and immediate addition.
struct RegImmPair {		struct RegImmPair {
Register Reg;		Register Reg;
int64_t Imm;		int64_t Imm;

RegImmPair(Register Reg, int64_t Imm) : Reg(Reg), Imm(Imm) {}		RegImmPair(Register Reg, int64_t Imm) : Reg(Reg), Imm(Imm) {}
};		};

		/// Used to describe addressing mode similar to ExtAddrMode in CodeGenPrepare.
		/// It holds the register values, the scale value and the displacement.
		struct ExtAddrMode {
		Register BaseReg;
		Register ScaledReg;
		int64_t Scale;
		int64_t Displacement;
		};

//---------------------------------------------------------------------------		//---------------------------------------------------------------------------
///		///
/// TargetInstrInfo - Interface to description of machine instruction set		/// TargetInstrInfo - Interface to description of machine instruction set
///		///
class TargetInstrInfo : public MCInstrInfo {		class TargetInstrInfo : public MCInstrInfo {
public:		public:
TargetInstrInfo(unsigned CFSetupOpcode = ~0u, unsigned CFDestroyOpcode = ~0u,		TargetInstrInfo(unsigned CFSetupOpcode = ~0u, unsigned CFDestroyOpcode = ~0u,
unsigned CatchRetOpcode = ~0u, unsigned ReturnOpcode = ~0u)		unsigned CatchRetOpcode = ~0u, unsigned ReturnOpcode = ~0u)
▲ Show 20 Lines • Show All 872 Lines • ▼ Show 20 Lines	public:
/// immediate value and a physical register, and stores the result in		/// immediate value and a physical register, and stores the result in
/// the given physical register \c Reg, return a pair of the source		/// the given physical register \c Reg, return a pair of the source
/// register and the offset which has been added.		/// register and the offset which has been added.
virtual Optional<RegImmPair> isAddImmediate(const MachineInstr &MI,		virtual Optional<RegImmPair> isAddImmediate(const MachineInstr &MI,
Register Reg) const {		Register Reg) const {
return None;		return None;
}		}

		/// Returns true if MI is an instruction that defines Reg to have a constant
		reamesUnsubmitted Not Done Reply Inline Actions "is an instruction" reames: "is an instruction"
		/// value and the value is recorded in ImmVal. The ImmVal is a result that
		/// should be interpreted as modulo size of Reg.
		virtual bool getConstValDefinedInReg(const MachineInstr &MI,
		const Register Reg,
		int64_t &ImmVal) const {
		return false;
		}

/// Store the specified register of the given register class to the specified		/// Store the specified register of the given register class to the specified
/// stack frame index. The store instruction is to be added to the given		/// stack frame index. The store instruction is to be added to the given
/// machine basic block before the specified machine instruction. If isKill		/// machine basic block before the specified machine instruction. If isKill
/// is true, the register operand is the last use and must be marked kill.		/// is true, the register operand is the last use and must be marked kill.
virtual void storeRegToStackSlot(MachineBasicBlock &MBB,		virtual void storeRegToStackSlot(MachineBasicBlock &MBB,
MachineBasicBlock::iterator MI,		MachineBasicBlock::iterator MI,
Register SrcReg, bool isKill, int FrameIndex,		Register SrcReg, bool isKill, int FrameIndex,
const TargetRegisterClass *RC,		const TargetRegisterClass *RC,
▲ Show 20 Lines • Show All 286 Lines • ▼ Show 20 Lines	public:
/// true, the function also sets the operand position in the instruction		/// true, the function also sets the operand position in the instruction
/// for the base register and offset.		/// for the base register and offset.
virtual bool getBaseAndOffsetPosition(const MachineInstr &MI,		virtual bool getBaseAndOffsetPosition(const MachineInstr &MI,
unsigned &BasePos,		unsigned &BasePos,
unsigned &OffsetPos) const {		unsigned &OffsetPos) const {
return false;		return false;
}		}

		/// Target dependent implementation to get the values constituting the address
		/// MachineInstr that is accessing memory. These values are returned as a
		/// struct ExtAddrMode which contains all relevant information to make up the
		/// address.
		virtual Optional<ExtAddrMode>
		getAddrModeFromMemoryOp(const MachineInstr &MemI,
		const TargetRegisterInfo *TRI) const {
		return None;
		}

/// Returns true if MI's Def is NullValueReg, and the MI		/// Returns true if MI's Def is NullValueReg, and the MI
		reamesUnsubmitted Not Done Reply Inline Actions I find the terminology used here to be very confusing. A constant address to me is either a literal constant or a constant expression. You appear to be referring to complex addressing instead. I believe that your ConstantAddr is the Displacement field in a x86 complex addressing, is that true? If so, rename required. If not, please explain. reames: I find the terminology used here to be very confusing. A constant address to me is either a…
/// does not change the Zero value. i.e. cases such as rax = shr rax, X where		/// does not change the Zero value. i.e. cases such as rax = shr rax, X where
/// NullValueReg = rax. Note that if the NullValueReg is non-zero, this		/// NullValueReg = rax. Note that if the NullValueReg is non-zero, this
/// function can return true even if becomes zero. Specifically cases such as		/// function can return true even if becomes zero. Specifically cases such as
/// NullValueReg = shl NullValueReg, 63.		/// NullValueReg = shl NullValueReg, 63.
		dantrushinUnsubmitted Not Done Reply Inline Actions These names are confusing. I've been scrolling back and forth for several minutes :) Why not to pass and name them separately? dantrushin: These names are confusing. I've been scrolling back and forth for several minutes :) Why not to…
virtual bool preservesZeroValueInReg(const MachineInstr *MI,		virtual bool preservesZeroValueInReg(const MachineInstr *MI,
const Register NullValueReg,		const Register NullValueReg,
const TargetRegisterInfo *TRI) const {		const TargetRegisterInfo *TRI) const {
return false;		return false;
}		}

/// If the instruction is an increment of a constant value, return the amount.		/// If the instruction is an increment of a constant value, return the amount.
virtual bool getIncrementValue(const MachineInstr &MI, int &Value) const {		virtual bool getIncrementValue(const MachineInstr &MI, int &Value) const {
▲ Show 20 Lines • Show All 606 Lines • Show Last 20 Lines

llvm/lib/CodeGen/ImplicitNullChecks.cpp

Show First 20 Lines • Show All 372 Lines • ▼ Show 20 Lines	ImplicitNullChecks::isSuitableMemoryOp(const MachineInstr &MI,
bool OffsetIsScalable;		bool OffsetIsScalable;
const MachineOperand *BaseOp;		const MachineOperand *BaseOp;

// Implementation restriction for faulting_op insertion		// Implementation restriction for faulting_op insertion
// TODO: This could be relaxed if we find a test case which warrants it.		// TODO: This could be relaxed if we find a test case which warrants it.
if (MI.getDesc().getNumDefs() > 1)		if (MI.getDesc().getNumDefs() > 1)
return SR_Unsuitable;		return SR_Unsuitable;

// FIXME: This handles only simple addressing mode.		if (!MI.mayLoadOrStore() \|\| MI.isPredicable())
if (!TII->getMemOperandWithOffset(MI, BaseOp, Offset, OffsetIsScalable, TRI))		return SR_Unsuitable;
		auto AM = TII->getAddrModeFromMemoryOp(MI, TRI);
		if (!AM)
return SR_Unsuitable;		return SR_Unsuitable;
		auto AddrMode = *AM;
		const Register BaseReg = AddrMode.BaseReg, ScaledReg = AddrMode.ScaledReg;
		int64_t Displacement = AddrMode.Displacement;

// We need the base of the memory instruction to be same as the register		// We need the base of the memory instruction to be same as the register
		dantrushinUnsubmitted Not Done Reply Inline Actions Do you really need to check that `BaseRegUsedInConstAddr.first` is not zero? Would simple `BaseRegUsedInConstAddr.first != PointerReg` work here? I also found positive names more readable, e.g. `bool BaseRegIsNullChecked = BaseRegUsedInConstAddr.first == PointerReg` I think it is reasonable to assert `Scale !=0 \|\| IndexRegUsedInConstAddr.first == 0` and remove Scale check. dantrushin: Do you really need to check that `BaseRegUsedInConstAddr.first` is not zero? Would simple…
		annaAuthorUnsubmitted Done Reply Inline Actions Changed the names and hopefully more readable. We need to check that the register exists before checking if it is equal to the null checked register because we don't want to fail when we don't have the register (in that case, we wont be using it in the addres calculation). Here we will state that the memory op is unsuitable and then fail. I realized this through some failing test case `imp_null_check_load` in implicit-null-checks.ll. There, we do not use the index register, so it is zero. anna: Changed the names and hopefully more readable. We need to check that the register exists…
// where the null check is performed (i.e. PointerReg).		// where the null check is performed (i.e. PointerReg).
if (!BaseOp->isReg() \|\| BaseOp->getReg() != PointerReg)		if (BaseReg != PointerReg && ScaledReg != PointerReg)
return SR_Unsuitable;		return SR_Unsuitable;
		reamesUnsubmitted Done Reply Inline Actions Please add a check here that the size of BaseReg, ScaledReg, and PointerReg are all equal. I'm concerned about implicit sign extension of registers, and the interaction with the displacement code below. The code you have may be correct, but I'd prefer a explicit bail out just to make it easier to tell. reames: Please add a check here that the size of BaseReg, ScaledReg, and PointerReg are all equal. I'm…
		const MachineRegisterInfo &MRI = MI.getMF()->getRegInfo();
// Scalable offsets are a part of scalable vectors (SVE for AArch64). That		unsigned PointerRegSizeInBits = TRI->getRegSizeInBits(PointerReg, MRI);
// target is in-practice unsupported for ImplicitNullChecks.		// Bail out of the sizes of BaseReg, ScaledReg and PointerReg are not the
if (OffsetIsScalable)		// same.
		if ((BaseReg &&
		TRI->getRegSizeInBits(BaseReg, MRI) != PointerRegSizeInBits) \|\|
		(ScaledReg &&
		TRI->getRegSizeInBits(ScaledReg, MRI) != PointerRegSizeInBits))
return SR_Unsuitable;		return SR_Unsuitable;

if (!MI.mayLoadOrStore() \|\| MI.isPredicable())		// Returns true if RegUsedInAddr is used for calculating the displacement
		// depending on addressing mode. Also calculates the Displacement.
		reamesUnsubmitted Not Done Reply Inline Actions You could if you wished do Displacement mod 2^64, but I don't see any point in that. reames: You could if you wished do Displacement mod 2^64, but I don't see any point in that.
		auto CalculateDisplacementFromAddrMode = [&](Register RegUsedInAddr,
		reamesUnsubmitted Not Done Reply Inline Actions These should be impossible, use asserts. reames: These should be impossible, use asserts.
		annaAuthorUnsubmitted Done Reply Inline Actions Will use assert for the multiplier and add a comment why RegUsedInAddr can be zero reg, i.e. something like: `movq 8(,%rdi,8), %rax` here the BaseReg is X86::NoRegister and ScaleReg is rdi. anna: Will use assert for the multiplier and add a comment why RegUsedInAddr can be zero reg, i.e.
		int64_t Multiplier) {
		// The register can be NoRegister, which is defined as zero for all targets.
		// Consider instruction of interest as `movq 8(,%rdi,8), %rax`. Here the
		// ScaledReg is %rdi, while there is no BaseReg.
		if (!RegUsedInAddr)
		reamesUnsubmitted Done Reply Inline Actions Add comment here: "If a register used in the address is constant, fold it's effect into the displacement for ease of analysis." reames: Add comment here: "If a register used in the address is constant, fold it's effect into the…
		return false;
		assert(Multiplier && "expected to be non-zero!");
		MachineInstr *ModifyingMI = nullptr;
		for (auto It = std::next(MachineBasicBlock::const_reverse_iterator(&MI));
		It != MI.getParent()->rend(); It++) {
		const MachineInstr CurrMI = &It;
		if (CurrMI->modifiesRegister(RegUsedInAddr, TRI)) {
		ModifyingMI = const_cast<MachineInstr *>(CurrMI);
		break;
		}
		}
		if (!ModifyingMI)
		return false;
		reamesUnsubmitted Not Done Reply Inline Actions There's a potential overflow case here which needs consideration. Consider 0xF...F as constant and scale as 2. You probably need to return some information about the register width/wrapping of the addressing mode to do this in a target independent way. reames: There's a potential overflow case here which needs consideration. Consider 0xF...F as constant…
		annaAuthorUnsubmitted Done Reply Inline Actions good catch! will take a look. anna: good catch! will take a look.
		// Check for the const value defined in register by ModifyingMI. This means
		// all other previous values for that register has been invalidated.
		int64_t ImmVal;
		if (!TII->getConstValDefinedInReg(*ModifyingMI, RegUsedInAddr, ImmVal))
		reamesUnsubmitted Not Done Reply Inline Actions Please return true or false, then set the per register flag. This is too much coupling. reames: Please return true or false, then set the per register flag. This is too much coupling.
		annaAuthorUnsubmitted Done Reply Inline Actions yes, agreed. anna: yes, agreed.
		return false;
		// Calculate the reg size in bits, since this is needed for bailing out in
		// case of overflow.
		int32_t RegSizeInBits = TRI->getRegSizeInBits(RegUsedInAddr, MRI);
		reamesUnsubmitted Done Reply Inline Actions Replace IsSignedImmVal with constant true. This isn't "is negative", this is "interpret this as a signed number instead of unsigned". reames: Replace IsSignedImmVal with constant true. This isn't "is negative", this is "interpret this…
		APInt ImmValC(RegSizeInBits, ImmVal, true /IsSigned/);
		APInt MultiplierC(RegSizeInBits, Multiplier);
		reamesUnsubmitted Done Reply Inline Actions You should be able to just define the APInt in the definition below. The bitwidth will come from the RHS. reames: You should be able to just define the APInt in the definition below. The bitwidth will come…
		assert(MultiplierC.isStrictlyPositive() &&
		"expected to be a positive value!");
		bool IsOverflow;
		// Sign of the product depends on the sign of the ImmVal, since Multiplier
		// is always positive.
		APInt Product = ImmValC.smul_ov(MultiplierC, IsOverflow);
		if (IsOverflow)
		reamesUnsubmitted Done Reply Inline Actions e,g, APInt Product = Also, remember to constant fold isSignedImmVal reames: e,g, APInt Product = Also, remember to constant fold isSignedImmVal
		return false;
		APInt DisplacementC(64, Displacement, true /isSigned/);
		DisplacementC = Product.sadd_ov(DisplacementC, IsOverflow);
		if (IsOverflow)
		return false;

		// We only handle diplacements upto 64 bits wide.
		if (DisplacementC.getActiveBits() > 64)
		return false;
		Displacement = DisplacementC.getSExtValue();
		return true;
		};

		// If a register used in the address is constant, fold it's effect into the
		// displacement for ease of analysis.
		bool BaseRegIsConstVal = false, ScaledRegIsConstVal = false;
		if (CalculateDisplacementFromAddrMode(BaseReg, 1))
		BaseRegIsConstVal = true;
		if (CalculateDisplacementFromAddrMode(ScaledReg, AddrMode.Scale))
		ScaledRegIsConstVal = true;

		// The register which is not null checked should be part of the Displacement
		// calculation, otherwise we do not know whether the Displacement is made up
		// by some symbolic values.
		// This matters because we do not want to incorrectly assume that load from
		// falls in the zeroth faulting page in the "sane offset check" below.
		if ((BaseReg && BaseReg != PointerReg && !BaseRegIsConstVal) \|\|
		(ScaledReg && ScaledReg != PointerReg && !ScaledRegIsConstVal))
return SR_Unsuitable;		return SR_Unsuitable;

// We want the mem access to be issued at a sane offset from PointerReg,		// We want the mem access to be issued at a sane offset from PointerReg,
// so that if PointerReg is null then the access reliably page faults.		// so that if PointerReg is null then the access reliably page faults.
if (!(-PageSize < Offset && Offset < PageSize))		if (!(-PageSize < Displacement && Displacement < PageSize))
return SR_Unsuitable;		return SR_Unsuitable;

		reamesUnsubmitted Not Done Reply Inline Actions From your comment, it sounds like the existing code handles two built in ranges, not one. 0, PageSize 0x100..00, 0x111..11 (i.e. the zeroth page and kernel address range for Linux) I'd suggest adjusting comments and code structure to reflect same, and to make range handling (not page starts) explicit. reames: From your comment, it sounds like the existing code handles two built in ranges, not one. 0…
// Finally, check whether the current memory access aliases with previous one.		// Finally, check whether the current memory access aliases with previous one.
for (auto *PrevMI : PrevInsts) {		for (auto *PrevMI : PrevInsts) {
AliasResult AR = areMemoryOpsAliased(MI, PrevMI);		AliasResult AR = areMemoryOpsAliased(MI, PrevMI);
if (AR == AR_WillAliasEverything)		if (AR == AR_WillAliasEverything)
return SR_Impossible;		return SR_Impossible;
if (AR == AR_MayAlias)		if (AR == AR_MayAlias)
return SR_Unsuitable;		return SR_Unsuitable;
}		}
▲ Show 20 Lines • Show All 345 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64InstrInfo.h

Show First 20 Lines • Show All 107 Lines • ▼ Show 20 Lines	public:
static unsigned convertToFlagSettingOpc(unsigned Opc, bool &Is64Bit);		static unsigned convertToFlagSettingOpc(unsigned Opc, bool &Is64Bit);

/// Return true if this is a load/store that can be potentially paired/merged.		/// Return true if this is a load/store that can be potentially paired/merged.
bool isCandidateToMergeOrPair(const MachineInstr &MI) const;		bool isCandidateToMergeOrPair(const MachineInstr &MI) const;

/// Hint that pairing the given load or store is unprofitable.		/// Hint that pairing the given load or store is unprofitable.
static void suppressLdStPair(MachineInstr &MI);		static void suppressLdStPair(MachineInstr &MI);

		Optional<ExtAddrMode>
		getAddrModeFromMemoryOp(const MachineInstr &MemI,
		const TargetRegisterInfo *TRI) const override;

bool getMemOperandsWithOffsetWidth(		bool getMemOperandsWithOffsetWidth(
const MachineInstr &MI, SmallVectorImpl<const MachineOperand *> &BaseOps,		const MachineInstr &MI, SmallVectorImpl<const MachineOperand *> &BaseOps,
int64_t &Offset, bool &OffsetIsScalable, unsigned &Width,		int64_t &Offset, bool &OffsetIsScalable, unsigned &Width,
const TargetRegisterInfo *TRI) const override;		const TargetRegisterInfo *TRI) const override;

/// If \p OffsetIsScalable is set to 'true', the offset is scaled by `vscale`.		/// If \p OffsetIsScalable is set to 'true', the offset is scaled by `vscale`.
/// This is true for some SVE instructions like ldr/str that have a		/// This is true for some SVE instructions like ldr/str that have a
/// 'reg + imm' addressing mode where the immediate is an index to the		/// 'reg + imm' addressing mode where the immediate is an index to the
▲ Show 20 Lines • Show All 330 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp

Show First 20 Lines • Show All 2,138 Lines • ▼ Show 20 Lines	bool AArch64InstrInfo::getMemOperandsWithOffsetWidth(
const MachineOperand *BaseOp;		const MachineOperand *BaseOp;
if (!getMemOperandWithOffsetWidth(LdSt, BaseOp, Offset, OffsetIsScalable,		if (!getMemOperandWithOffsetWidth(LdSt, BaseOp, Offset, OffsetIsScalable,
Width, TRI))		Width, TRI))
return false;		return false;
BaseOps.push_back(BaseOp);		BaseOps.push_back(BaseOp);
return true;		return true;
}		}

		Optional<ExtAddrMode>
		AArch64InstrInfo::getAddrModeFromMemoryOp(const MachineInstr &MemI,
		const TargetRegisterInfo *TRI) const {
		const MachineOperand *Base; // Filled with the base operand of MI.
		int64_t Offset; // Filled with the offset of MI.
		bool OffsetIsScalable;
		if (!getMemOperandWithOffset(MemI, Base, Offset, OffsetIsScalable, TRI))
		return None;

		if (!Base->isReg())
		return None;
		ExtAddrMode AM;
		AM.BaseReg = Base->getReg();
		AM.Displacement = Offset;
		AM.ScaledReg = 0;
		return AM;
		}

bool AArch64InstrInfo::getMemOperandWithOffsetWidth(		bool AArch64InstrInfo::getMemOperandWithOffsetWidth(
const MachineInstr &LdSt, const MachineOperand *&BaseOp, int64_t &Offset,		const MachineInstr &LdSt, const MachineOperand *&BaseOp, int64_t &Offset,
bool &OffsetIsScalable, unsigned &Width,		bool &OffsetIsScalable, unsigned &Width,
const TargetRegisterInfo *TRI) const {		const TargetRegisterInfo *TRI) const {
assert(LdSt.mayLoadOrStore() && "Expected a memory operation.");		assert(LdSt.mayLoadOrStore() && "Expected a memory operation.");
// Handle only loads/stores with base register followed by immediate offset.		// Handle only loads/stores with base register followed by immediate offset.
if (LdSt.getNumExplicitOperands() == 3) {		if (LdSt.getNumExplicitOperands() == 3) {
// Non-paired instruction (e.g., ldr x1, [x0, #8]).		// Non-paired instruction (e.g., ldr x1, [x0, #8]).
▲ Show 20 Lines • Show All 4,847 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86InstrInfo.h

Show First 20 Lines • Show All 311 Lines • ▼ Show 20 Lines	void replaceBranchWithTailCall(MachineBasicBlock &MBB,
SmallVectorImpl<MachineOperand> &Cond,		SmallVectorImpl<MachineOperand> &Cond,
const MachineInstr &TailCall) const override;		const MachineInstr &TailCall) const override;

bool analyzeBranch(MachineBasicBlock &MBB, MachineBasicBlock *&TBB,		bool analyzeBranch(MachineBasicBlock &MBB, MachineBasicBlock *&TBB,
MachineBasicBlock *&FBB,		MachineBasicBlock *&FBB,
SmallVectorImpl<MachineOperand> &Cond,		SmallVectorImpl<MachineOperand> &Cond,
bool AllowModify) const override;		bool AllowModify) const override;

		Optional<ExtAddrMode>
		getAddrModeFromMemoryOp(const MachineInstr &MemI,
		const TargetRegisterInfo *TRI) const override;

		bool getConstValDefinedInReg(const MachineInstr &MI, const Register Reg,
		int64_t &ImmVal) const override;

bool preservesZeroValueInReg(const MachineInstr *MI,		bool preservesZeroValueInReg(const MachineInstr *MI,
const Register NullValueReg,		const Register NullValueReg,
const TargetRegisterInfo *TRI) const override;		const TargetRegisterInfo *TRI) const override;

bool getMemOperandsWithOffsetWidth(		bool getMemOperandsWithOffsetWidth(
const MachineInstr &LdSt,		const MachineInstr &LdSt,
SmallVectorImpl<const MachineOperand *> &BaseOps, int64_t &Offset,		SmallVectorImpl<const MachineOperand *> &BaseOps, int64_t &Offset,
bool &OffsetIsScalable, unsigned &Width,		bool &OffsetIsScalable, unsigned &Width,
▲ Show 20 Lines • Show All 300 Lines • Show Last 20 Lines

llvm/lib/Target/X86/X86InstrInfo.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,657 Lines • ▼ Show 20 Lines	case 64:
assert(STI.hasAVX512() && "Using 512-bit register requires AVX512");		assert(STI.hasAVX512() && "Using 512-bit register requires AVX512");
if (isStackAligned)		if (isStackAligned)
return load ? X86::VMOVAPSZrm : X86::VMOVAPSZmr;		return load ? X86::VMOVAPSZrm : X86::VMOVAPSZmr;
else		else
return load ? X86::VMOVUPSZrm : X86::VMOVUPSZmr;		return load ? X86::VMOVUPSZrm : X86::VMOVUPSZmr;
}		}
}		}

		Optional<ExtAddrMode>
		X86InstrInfo::getAddrModeFromMemoryOp(const MachineInstr &MemI,
		const TargetRegisterInfo *TRI) const {
		const MCInstrDesc &Desc = MemI.getDesc();
		int MemRefBegin = X86II::getMemoryOperandNo(Desc.TSFlags);
		if (MemRefBegin < 0)
		return None;

		MemRefBegin += X86II::getOperandBias(Desc);

		auto &BaseOp = MemI.getOperand(MemRefBegin + X86::AddrBaseReg);
		if (!BaseOp.isReg()) // Can be an MO_FrameIndex
		return None;

		const MachineOperand &DispMO = MemI.getOperand(MemRefBegin + X86::AddrDisp);
		dantrushinUnsubmitted Done Reply Inline Actions Any reason to use `auto ` instead of `auto &` ? If I read it correctly, `BaseOp` is used to get base register only. Using reference you'd have less characters to type :) dantrushin:* Any reason to use `auto *` instead of `auto &` ? If I read it correctly, `BaseOp` is used to…
		// Displacement can be symbolic
		if (!DispMO.isImm())
		return None;

		ExtAddrMode AM;
		AM.BaseReg = BaseOp.getReg();
		AM.ScaledReg = MemI.getOperand(MemRefBegin + X86::AddrIndexReg).getReg();
		AM.Scale = MemI.getOperand(MemRefBegin + X86::AddrScaleAmt).getImm();
		AM.Displacement = DispMO.getImm();
		return AM;
		}

		bool X86InstrInfo::getConstValDefinedInReg(const MachineInstr &MI,
		dantrushinUnsubmitted Done Reply Inline Actions ditto. `auto &IndexOp = MI.getOperand()` ? dantrushin: ditto. `auto &IndexOp = MI.getOperand()` ?
		const Register Reg,
		int64_t &ImmVal) const {
		if (MI.getOpcode() != X86::MOV32ri && MI.getOpcode() != X86::MOV64ri)
		return false;
		// Mov Src can be a global address.
		if (!MI.getOperand(1).isImm() \|\| MI.getOperand(0).getReg() != Reg)
		return false;
		ImmVal = MI.getOperand(1).getImm();
		reamesUnsubmitted Not Done Reply Inline Actions This really doesn't seem like it belongs here. I think this could be cleanly moved into the caller, simplifying the behaviour of this routine greatly. reames: This really doesn't seem like it belongs here. I think this could be cleanly moved into the…
		return true;
		}

bool X86InstrInfo::preservesZeroValueInReg(		bool X86InstrInfo::preservesZeroValueInReg(
const MachineInstr *MI, const Register NullValueReg,		const MachineInstr *MI, const Register NullValueReg,
const TargetRegisterInfo *TRI) const {		const TargetRegisterInfo *TRI) const {
if (!MI->modifiesRegister(NullValueReg, TRI))		if (!MI->modifiesRegister(NullValueReg, TRI))
return true;		return true;
switch (MI->getOpcode()) {		switch (MI->getOpcode()) {
// Shift right/left of a null unto itself is still a null, i.e. rax = shl rax		// Shift right/left of a null unto itself is still a null, i.e. rax = shl rax
// X.		// X.
▲ Show 20 Lines • Show All 5,308 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/implicit-null-check-negative.ll

Show First 20 Lines • Show All 123 Lines • ▼ Show 20 Lines	define i64 @imp_null_check_load_shift_add_addr(i64* %x, i64 %r) {
%y = ptrtoint i64* %x to i64		%y = ptrtoint i64* %x to i64
%shry = shl i64 %y, 6		%shry = shl i64 %y, 6
%shry.add = add i64 %shry, %r		%shry.add = add i64 %shry, %r
%y.ptr = inttoptr i64 %shry.add to i64*		%y.ptr = inttoptr i64 %shry.add to i64*
%x.loc = getelementptr i64, i64* %y.ptr, i64 1		%x.loc = getelementptr i64, i64* %y.ptr, i64 1
%t = load i64, i64* %x.loc		%t = load i64, i64* %x.loc
ret i64 %t		ret i64 %t
}		}

		annaAuthorUnsubmitted Done Reply Inline Actions I added these to show that none of the negative test cases start failing once we add the metadata with "other faulting pages". anna: I added these to show that none of the negative test cases start failing once we add the…
		; the memory op is not within faulting page.
		define i64 @imp_null_check_load_addr_outside_faulting_page(i64* %x) {
		entry:
		%c = icmp eq i64* %x, null
		br i1 %c, label %is_null, label %not_null, !make.implicit !0

		is_null:
		ret i64 42

		not_null:
		%y = ptrtoint i64* %x to i64
		%shry = shl i64 %y, 3
		%shry.add = add i64 %shry, 68719472640
		%y.ptr = inttoptr i64 %shry.add to i64*
		%x.loc = getelementptr i64, i64* %y.ptr, i64 1
		%t = load i64, i64* %x.loc
		ret i64 %t
		}

!0 = !{}		!0 = !{}

llvm/test/CodeGen/X86/implicit-null-check.ll

; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
; RUN: llc -verify-machineinstrs -O3 -mtriple=x86_64-apple-macosx -enable-implicit-null-checks < %s \| FileCheck %s		; RUN: llc -verify-machineinstrs -O3 -mtriple=x86_64-apple-macosx -enable-implicit-null-checks < %s \| FileCheck %s

define i32 @imp_null_check_load(i32* %x) {		define i32 @imp_null_check_load(i32* %x) {
; CHECK-LABEL: imp_null_check_load:		; CHECK-LABEL: imp_null_check_load:
; CHECK: ## %bb.0: ## %entry		; CHECK: ## %bb.0: ## %entry
; CHECK-NEXT: Ltmp0:		; CHECK-NEXT: Ltmp0:
; CHECK-NEXT: movl (%rdi), %eax ## on-fault: LBB0_1		; CHECK-NEXT: movl (%rdi), %eax ## on-fault: LBB0_1
; CHECK-NEXT: ## %bb.2: ## %not_null		; CHECK-NEXT: ## %bb.2: ## %not_null
▲ Show 20 Lines • Show All 578 Lines • ▼ Show 20 Lines	; CHECK-NEXT: retq
%y.ptr = inttoptr i64 %shry to i64*		%y.ptr = inttoptr i64 %shry to i64*
%x.loc = getelementptr i64, i64* %y.ptr, i64 1		%x.loc = getelementptr i64, i64* %y.ptr, i64 1
%t = load i64, i64* %x.loc		%t = load i64, i64* %x.loc
ret i64 %t		ret i64 %t
}		}

; Same as imp_null_check_load_shift_addr but shift is by 3 and this is now		; Same as imp_null_check_load_shift_addr but shift is by 3 and this is now
; converted into complex addressing.		; converted into complex addressing.
; TODO: Can be converted into implicit null check
define i64 @imp_null_check_load_shift_by_3_addr(i64* %x) {		define i64 @imp_null_check_load_shift_by_3_addr(i64* %x) {
; CHECK-LABEL: imp_null_check_load_shift_by_3_addr:		; CHECK-LABEL: imp_null_check_load_shift_by_3_addr:
; CHECK: ## %bb.0: ## %entry		; CHECK: ## %bb.0: ## %entry
; CHECK-NEXT: testq %rdi, %rdi		; CHECK-NEXT: Ltmp18:
; CHECK-NEXT: je LBB22_1		; CHECK-NEXT: movq 8(,%rdi,8), %rax ## on-fault: LBB22_1
; CHECK-NEXT: ## %bb.2: ## %not_null		; CHECK-NEXT: ## %bb.2: ## %not_null
; CHECK-NEXT: movq 8(,%rdi,8), %rax
; CHECK-NEXT: retq		; CHECK-NEXT: retq
; CHECK-NEXT: LBB22_1: ## %is_null		; CHECK-NEXT: LBB22_1: ## %is_null
; CHECK-NEXT: movl $42, %eax		; CHECK-NEXT: movl $42, %eax
; CHECK-NEXT: retq		; CHECK-NEXT: retq

entry:		entry:
%c = icmp eq i64* %x, null		%c = icmp eq i64* %x, null
br i1 %c, label %is_null, label %not_null, !make.implicit !0		br i1 %c, label %is_null, label %not_null, !make.implicit !0

is_null:		is_null:
ret i64 42		ret i64 42

not_null:		not_null:
%y = ptrtoint i64* %x to i64		%y = ptrtoint i64* %x to i64
%shry = shl i64 %y, 3		%shry = shl i64 %y, 3
%y.ptr = inttoptr i64 %shry to i64*		%y.ptr = inttoptr i64 %shry to i64*
%x.loc = getelementptr i64, i64* %y.ptr, i64 1		%x.loc = getelementptr i64, i64* %y.ptr, i64 1
%t = load i64, i64* %x.loc		%t = load i64, i64* %x.loc
ret i64 %t		ret i64 %t
}		}

		define i64 @imp_null_check_load_shift_add_addr(i64* %x) {
		; CHECK-LABEL: imp_null_check_load_shift_add_addr:
		; CHECK: ## %bb.0: ## %entry
		; CHECK: movq 3526(,%rdi,8), %rax ## on-fault: LBB23_1
		reamesUnsubmitted Not Done Reply Inline Actions You don't appear to actually have a test here which uses a displacement large enough to need a base register, please add one. (i.e. modify this test to add not 3526, but 0xFFFFF0000 (or something similarly large) reames: You don't appear to actually have a test here which uses a displacement large enough to need a…
		annaAuthorUnsubmitted Not Done Reply Inline Actions I have a test in the negative test cases file above where the displacement is large and outside the faulting page. I'll modify it to a very large value so that it is in a register and not an immediate. anna: I have a test in the negative test cases file above where the displacement is large and outside…
		; CHECK-NEXT: ## %bb.2: ## %not_null
		; CHECK-NEXT: retq
		; CHECK-NEXT: LBB23_1: ## %is_null
		; CHECK-NEXT: movl $42, %eax
		; CHECK-NEXT: retq

		entry:
		%c = icmp eq i64* %x, null
		br i1 %c, label %is_null, label %not_null, !make.implicit !0

		is_null:
		ret i64 42

		not_null:
		%y = ptrtoint i64* %x to i64
		%shry = shl i64 %y, 3
		%shry.add = add i64 %shry, 3518
		%y.ptr = inttoptr i64 %shry.add to i64*
		%x.loc = getelementptr i64, i64* %y.ptr, i64 1
		%t = load i64, i64* %x.loc
		ret i64 %t
		}
!0 = !{}		!0 = !{}

llvm/test/CodeGen/X86/implicit-null-checks.mir

Show First 20 Lines • Show All 371 Lines • ▼ Show 20 Lines	--- \|

not_null:		not_null:
ret i32 undef		ret i32 undef

is_null:		is_null:
ret i32 undef		ret i32 undef
}		}

		define i32 @imp_null_check_address_mul_overflow(i32* %x, i32 %a) {
		entry:
		%c = icmp eq i32* %x, null
		br i1 %c, label %is_null, label %not_null, !make.implicit !0

		is_null: ; preds = %entry
		ret i32 42

		not_null: ; preds = %entry
		%y = ptrtoint i32* %x to i32
		%y64 = zext i32 %y to i64
		%b = mul i64 %y64, 9223372036854775807 ; 0X0FFFF.. i.e. 2^63 - 1
		%z = trunc i64 %b to i32
		ret i32 %z
		}

attributes #0 = { "target-features"="+bmi,+bmi2" }		attributes #0 = { "target-features"="+bmi,+bmi2" }

!0 = !{}		!0 = !{}
...		...
---		---
name: imp_null_check_with_bitwise_op_0		name: imp_null_check_with_bitwise_op_0
# CHECK-LABEL: name: imp_null_check_with_bitwise_op_0		# CHECK-LABEL: name: imp_null_check_with_bitwise_op_0
alignment: 16		alignment: 16
▲ Show 20 Lines • Show All 923 Lines • ▼ Show 20 Lines	bb.1.not_null:
$eax = MOV32rr $edi		$eax = MOV32rr $edi
RETQ $eax		RETQ $eax

bb.2.is_null:		bb.2.is_null:
$eax = XOR32rr undef $eax, undef $eax, implicit-def dead $eflags		$eax = XOR32rr undef $eax, undef $eax, implicit-def dead $eflags
RETQ $eax		RETQ $eax

...		...
		---
		name: imp_null_check_address_mul_overflow
		# CHECK-LABEL: name: imp_null_check_address_mul_overflow
		# CHECK: bb.0.entry:
		# CHECK-NOT: FAULTING_OP
		alignment: 16
		tracksRegLiveness: true
		liveins:
		- { reg: '$rdi' }
		- { reg: '$rsi' }
		body: \|
		bb.0.entry:
		liveins: $rsi, $rdi

		TEST64rr $rdi, $rdi, implicit-def $eflags
		JCC_1 %bb.1, 4, implicit $eflags

		bb.2.not_null:
		liveins: $rdi, $rsi

		$rcx = MOV64ri -9223372036854775808
		$eax = MOV32rm killed $rdi, 2, $rcx, 0, $noreg, implicit-def $rax
		RETQ $eax

		bb.1.is_null:
		$eax = MOV32ri 42
		RETQ $eax

		...

This is an archive of the discontinued LLVM Phabricator instance.

[ImplicitNullCheck] Handle Nonzero faulting pages and complex addressingClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 296835

llvm/include/llvm/CodeGen/TargetInstrInfo.h

llvm/lib/CodeGen/ImplicitNullChecks.cpp

llvm/lib/Target/AArch64/AArch64InstrInfo.h

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp

llvm/lib/Target/X86/X86InstrInfo.h

llvm/lib/Target/X86/X86InstrInfo.cpp

llvm/test/CodeGen/X86/implicit-null-check-negative.ll

llvm/test/CodeGen/X86/implicit-null-check.ll

llvm/test/CodeGen/X86/implicit-null-checks.mir

[ImplicitNullCheck] Handle Nonzero faulting pages and complex addressing
ClosedPublic