Download Raw Diff

Details

Reviewers

sdesmalen
greened
cameron.mcinally
efriedma
rengolin
thegameg
rovka

Commits

rG6d1891c508fe: [AArch64] Fix offset calculation
rL375043: [AArch64] Fix offset calculation

Summary

r374772 changed Offset to be an int64_t but left NewOffset as an int,
which led to integer promotion issues in this calculation and resulted
in bad offset values. Promote NewOffset to int64_t as well to fix this,
and promote EmittableOffset as well, since its one user passes it to a
function which takes an int64_t anyway. This manifested as an
out-of-memory when building the Swift standard library for Android
aarch64. Test case suggested by Sander de Smalen!

Diff Detail

Repository

rG LLVM Github Monorepo

Build Status

Buildable 39671
Build 39708: arc lint + arc unit

Event Timeline

smeenai created this revision.Oct 15 2019, 10:21 PM

Herald added a project: Restricted Project. · View Herald TranscriptOct 15 2019, 10:21 PM

Herald added subscribers: llvm-commits, hiraditya, kristof.beyls. · View Herald Transcript

Harbormaster completed remote builds in B39621: Diff 225160.Oct 15 2019, 10:25 PM

I have no idea how to write a test for this; I'm completely unfamiliar with backends. I have a Swift compilation command that reproduces the OOM mentioned in the commit message, and I think I should be able to get IR or MIR from that, but I'd appreciate guidance crafting a test case if it's considered necessary for this commit.

smeenai marked an inline comment as done.Oct 15 2019, 10:29 PM

smeenai added inline comments.

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
3442	EmittableOffset is still an int ... idk if I should be promoting that too.

smeenai marked an inline comment as done.Oct 15 2019, 10:36 PM

smeenai added inline comments.

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
3438	Just casting NewOffset to int64_t over here is actually sufficient, but I figured changing the type entirely was better/cleaner.

Thanks for fixing this @smeenai!

In D69018#1710443, @smeenai wrote:

I have no idea how to write a test for this; I'm completely unfamiliar with backends. I have a Swift compilation command that reproduces the OOM mentioned in the commit message, and I think I should be able to get IR or MIR from that, but I'd appreciate guidance crafting a test case if it's considered necessary for this commit.

You can create a MIR test with a single instruction accessing a pre-allocated stackslot with a very large offset, and check that the offset is generated correctly.

e.g. the following MIR

---
name: D69018
tracksRegLiveness: true
fixedStack:
  - { id: 0, offset: 2147483648, size: 1}
body: |
  bb.0:
    $x0 = LDRXui %fixed-stack.0, 0
    RET_ReallyLR
...

compiles with your patch, but fails to complete (i.e. it keeps running) without it.

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
3442	That should be fine, because it will be used for the immediate field of an instruction.

smeenai marked an inline comment as done.Oct 16 2019, 9:35 AM

smeenai added inline comments.

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp
3442	I decided to change it to `int64_t` as well, since the only call site which uses this parameter passes it to `MachineOperand::ChangeToImmediate`, which takes an `int64_t` parameter.

Add test

In D69018#1711129, @sdesmalen wrote:
Thanks for fixing this @smeenai!

In D69018#1710443, @smeenai wrote:

I have no idea how to write a test for this; I'm completely unfamiliar with backends. I have a Swift compilation command that reproduces the OOM mentioned in the commit message, and I think I should be able to get IR or MIR from that, but I'd appreciate guidance crafting a test case if it's considered necessary for this commit.

You can create a MIR test with a single instruction accessing a pre-allocated stackslot with a very large offset, and check that the offset is generated correctly.

e.g. the following MIR
---
name: D69018
tracksRegLiveness: true
fixedStack:
  - { id: 0, offset: 2147483648, size: 1}
body: |
  bb.0:
    $x0 = LDRXui %fixed-stack.0, 0
    RET_ReallyLR
...
compiles with your patch, but fails to complete (i.e. it keeps running) without it.

Thanks for the test case! I added it.

Harbormaster completed remote builds in B39653: Diff 225249.Oct 16 2019, 9:41 AM

sdesmalen added inline comments.Oct 16 2019, 10:05 AM

llvm/test/CodeGen/AArch64/framelayout-large-offset.mir
2	nit: I would probably add `-run-pass=prologepilog`, to reduce possible interference from other passes.
15	Sorry I didn't notice this earlier when I posted the example MIR, but these `sub`s are wrong. The offset is at a positive distance from the SP, so it should use `add` here. If I change the offset from `2147483648` to `2147483632` that changes. So I expect some other changes are needed to fix this.

smeenai marked an inline comment as done.Oct 16 2019, 10:15 AM

smeenai added inline comments.

llvm/test/CodeGen/AArch64/framelayout-large-offset.mir
15	If I revert my change and r374772, the problem still persists, so that seems like a pre-existing issue?

Use -run-pass=prologepilog in test

smeenai marked 2 inline comments as done.Oct 16 2019, 10:30 AM

smeenai added inline comments.

llvm/test/CodeGen/AArch64/framelayout-large-offset.mir
15	Given that this is a pre-existing problem, would you be okay with submitting this to solve the OOM and then filing a bug for the incorrect offset calculation?

Harbormaster completed remote builds in B39658: Diff 225256.Oct 16 2019, 10:37 AM

thegameg added inline comments.Oct 16 2019, 10:42 AM

llvm/test/CodeGen/AArch64/framelayout-large-offset.mir
2	I believe this is missing a `\| FileCheck %s` to perform the actual checks.

sdesmalen added inline comments.Oct 16 2019, 10:47 AM

llvm/test/CodeGen/AArch64/framelayout-large-offset.mir
15	Sigh, I'm clearly having a bit of a senior moment here :) maxint is 2147483647 (forgot the -1 in my calculator). But with the fixed/updated offset, the result is the same with/without your patch, so we'll need something different to test it.

Fix test

smeenai marked 3 inline comments as done.Oct 16 2019, 10:52 AM

smeenai added inline comments.

llvm/test/CodeGen/AArch64/framelayout-large-offset.mir
2	Oops, forgot to add that back after testing something. Thanks!
15	The test as it stands does infinite loop/OOM without my patch though and completes successfully after.

Harbormaster completed remote builds in B39660: Diff 225263.Oct 16 2019, 10:56 AM

Update test

@sdesmalen How does the new test case look?

Harbormaster completed remote builds in B39671: Diff 225280.Oct 16 2019, 11:42 AM

In D69018#1711613, @smeenai wrote:

@sdesmalen How does the new test case look?

Thanks, the new test-case seems to cover the case well. It is out of range of the immediate and with NewOffset as int64_t the expression NewOffset * Scale should no longer be evaluated as unsigned.

LGTM

llvm/test/CodeGen/AArch64/framelayout-large-offset.mir
3	nit: You'll want to change the name of the test/function now.

This revision is now accepted and ready to land.Oct 16 2019, 1:29 PM

Thanks for the quick review!

llvm/test/CodeGen/AArch64/framelayout-large-offset.mir
3	Yup, will change before committing.

Closed by commit rG6d1891c508fe: [AArch64] Fix offset calculation (authored by smeenai). · Explain WhyOct 16 2019, 2:47 PM

This revision was automatically updated to reflect the committed changes.

Likely also fixes regressions for aarch64 linux kernel builds: https://travis-ci.com/ClangBuiltLinux/continuous-integration/jobs/246197698

Diff 225280

llvm/lib/Target/AArch64/AArch64InstrInfo.h

	Show First 20 Lines • Show All 329 Lines • ▼ Show 20 Lines
	/// If set, @p OutUseUnscaledOp will contain the whether @p MI should be			/// If set, @p OutUseUnscaledOp will contain the whether @p MI should be
	/// turned into an unscaled operator, which opcode is in @p OutUnscaledOp.			/// turned into an unscaled operator, which opcode is in @p OutUnscaledOp.
	/// If set, @p EmittableOffset contains the amount that can be set in @p MI			/// If set, @p EmittableOffset contains the amount that can be set in @p MI
	/// (possibly with @p OutUnscaledOp if OutUseUnscaledOp is true) and that			/// (possibly with @p OutUnscaledOp if OutUseUnscaledOp is true) and that
	/// is a legal offset.			/// is a legal offset.
	int isAArch64FrameOffsetLegal(const MachineInstr &MI, StackOffset &Offset,			int isAArch64FrameOffsetLegal(const MachineInstr &MI, StackOffset &Offset,
	bool *OutUseUnscaledOp = nullptr,			bool *OutUseUnscaledOp = nullptr,
	unsigned *OutUnscaledOp = nullptr,			unsigned *OutUnscaledOp = nullptr,
	int *EmittableOffset = nullptr);			int64_t *EmittableOffset = nullptr);

	static inline bool isUncondBranchOpcode(int Opc) { return Opc == AArch64::B; }			static inline bool isUncondBranchOpcode(int Opc) { return Opc == AArch64::B; }

	static inline bool isCondBranchOpcode(int Opc) {			static inline bool isCondBranchOpcode(int Opc) {
	switch (Opc) {			switch (Opc) {
	case AArch64::Bcc:			case AArch64::Bcc:
	case AArch64::CBZW:			case AArch64::CBZW:
	case AArch64::CBZX:			case AArch64::CBZX:
	▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp

Show First 20 Lines • Show All 3,362 Lines • ▼ Show 20 Lines	default:
return false;		return false;
}		}
}		}

int llvm::isAArch64FrameOffsetLegal(const MachineInstr &MI,		int llvm::isAArch64FrameOffsetLegal(const MachineInstr &MI,
StackOffset &SOffset,		StackOffset &SOffset,
bool *OutUseUnscaledOp,		bool *OutUseUnscaledOp,
unsigned *OutUnscaledOp,		unsigned *OutUnscaledOp,
int *EmittableOffset) {		int64_t *EmittableOffset) {
// Set output values in case of early exit.		// Set output values in case of early exit.
if (EmittableOffset)		if (EmittableOffset)
*EmittableOffset = 0;		*EmittableOffset = 0;
if (OutUseUnscaledOp)		if (OutUseUnscaledOp)
*OutUseUnscaledOp = false;		*OutUseUnscaledOp = false;
if (OutUnscaledOp)		if (OutUnscaledOp)
*OutUnscaledOp = 0;		*OutUnscaledOp = 0;

▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	if (useUnscaledOp &&
!AArch64InstrInfo::getMemOpInfo(*UnscaledOp, Scale, Width, MinOff, MaxOff))		!AArch64InstrInfo::getMemOpInfo(*UnscaledOp, Scale, Width, MinOff, MaxOff))
llvm_unreachable("unhandled opcode in isAArch64FrameOffsetLegal");		llvm_unreachable("unhandled opcode in isAArch64FrameOffsetLegal");

int64_t Remainder = Offset % Scale;		int64_t Remainder = Offset % Scale;
assert(!(Remainder && useUnscaledOp) &&		assert(!(Remainder && useUnscaledOp) &&
"Cannot have remainder when using unscaled op");		"Cannot have remainder when using unscaled op");

assert(MinOff < MaxOff && "Unexpected Min/Max offsets");		assert(MinOff < MaxOff && "Unexpected Min/Max offsets");
int NewOffset = Offset / Scale;		int64_t NewOffset = Offset / Scale;
if (MinOff <= NewOffset && NewOffset <= MaxOff)		if (MinOff <= NewOffset && NewOffset <= MaxOff)
Offset = Remainder;		Offset = Remainder;
else {		else {
NewOffset = NewOffset < 0 ? MinOff : MaxOff;		NewOffset = NewOffset < 0 ? MinOff : MaxOff;
Offset = Offset - NewOffset * Scale + Remainder;		Offset = Offset - NewOffset * Scale + Remainder;
		smeenaiAuthorUnsubmitted Done Reply Inline Actions Just casting NewOffset to int64_t over here is actually sufficient, but I figured changing the type entirely was better/cleaner. smeenai: Just casting NewOffset to int64_t over here is actually sufficient, but I figured changing the…
}		}

if (EmittableOffset)		if (EmittableOffset)
*EmittableOffset = NewOffset;		*EmittableOffset = NewOffset;
		smeenaiAuthorUnsubmitted Done Reply Inline Actions EmittableOffset is still an int ... idk if I should be promoting that too. smeenai: EmittableOffset is still an int ... idk if I should be promoting that too.
		sdesmalenUnsubmitted Not Done Reply Inline Actions That should be fine, because it will be used for the immediate field of an instruction. sdesmalen: That should be fine, because it will be used for the immediate field of an instruction.
		smeenaiAuthorUnsubmitted Done Reply Inline Actions I decided to change it to `int64_t` as well, since the only call site which uses this parameter passes it to `MachineOperand::ChangeToImmediate`, which takes an `int64_t` parameter. smeenai: I decided to change it to `int64_t` as well, since the only call site which uses this parameter…
if (OutUseUnscaledOp)		if (OutUseUnscaledOp)
*OutUseUnscaledOp = useUnscaledOp;		*OutUseUnscaledOp = useUnscaledOp;
if (OutUnscaledOp && UnscaledOp)		if (OutUnscaledOp && UnscaledOp)
OutUnscaledOp = UnscaledOp;		OutUnscaledOp = UnscaledOp;

if (IsMulVL)		if (IsMulVL)
SOffset = StackOffset(Offset, MVT::nxv1i8) +		SOffset = StackOffset(Offset, MVT::nxv1i8) +
StackOffset(SOffset.getBytes(), MVT::i8);		StackOffset(SOffset.getBytes(), MVT::i8);
Show All 15 Lines	if (Opcode == AArch64::ADDSXri \|\| Opcode == AArch64::ADDXri) {
emitFrameOffset(*MI.getParent(), MI, MI.getDebugLoc(),		emitFrameOffset(*MI.getParent(), MI, MI.getDebugLoc(),
MI.getOperand(0).getReg(), FrameReg, Offset, TII,		MI.getOperand(0).getReg(), FrameReg, Offset, TII,
MachineInstr::NoFlags, (Opcode == AArch64::ADDSXri));		MachineInstr::NoFlags, (Opcode == AArch64::ADDSXri));
MI.eraseFromParent();		MI.eraseFromParent();
Offset = StackOffset();		Offset = StackOffset();
return true;		return true;
}		}

int NewOffset;		int64_t NewOffset;
unsigned UnscaledOp;		unsigned UnscaledOp;
bool UseUnscaledOp;		bool UseUnscaledOp;
int Status = isAArch64FrameOffsetLegal(MI, Offset, &UseUnscaledOp,		int Status = isAArch64FrameOffsetLegal(MI, Offset, &UseUnscaledOp,
&UnscaledOp, &NewOffset);		&UnscaledOp, &NewOffset);
if (Status & AArch64FrameOffsetCanUpdate) {		if (Status & AArch64FrameOffsetCanUpdate) {
if (Status & AArch64FrameOffsetIsLegal)		if (Status & AArch64FrameOffsetIsLegal)
// Replace the FrameIndex with FrameReg.		// Replace the FrameIndex with FrameReg.
MI.getOperand(FrameRegIdx).ChangeToRegister(FrameReg, false);		MI.getOperand(FrameRegIdx).ChangeToRegister(FrameReg, false);
▲ Show 20 Lines • Show All 2,249 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/framelayout-large-offset.mir

This file was added.

				# RUN: llc -mtriple=aarch64-none-linux-gnu -run-pass=prologepilog %s -o - \| FileCheck %s
				---
				sdesmalenUnsubmitted Done Reply Inline Actions nit: I would probably add `-run-pass=prologepilog`, to reduce possible interference from other passes. sdesmalen: nit: I would probably add `-run-pass=prologepilog`, to reduce possible interference from other…
				thegamegUnsubmitted Done Reply Inline Actions I believe this is missing a `\| FileCheck %s` to perform the actual checks. thegameg: I believe this is missing a `\| FileCheck %s` to perform the actual checks.
				smeenaiAuthorUnsubmitted Done Reply Inline Actions Oops, forgot to add that back after testing something. Thanks! smeenai: Oops, forgot to add that back after testing something. Thanks!
				name: framelayout_large_offset
				sdesmalenUnsubmitted Not Done Reply Inline Actions nit: You'll want to change the name of the test/function now. sdesmalen: nit: You'll want to change the name of the test/function now.
				smeenaiAuthorUnsubmitted Done Reply Inline Actions Yup, will change before committing. smeenai: Yup, will change before committing.
				tracksRegLiveness: true
				fixedStack:
				- { id: 0, offset: 0, size: 1}
				body: \|
				bb.0:
				$x0 = LDURXi %fixed-stack.0, -264
				RET_ReallyLR
				...
				# CHECK: name: framelayout_large_offset
				# CHECK: body: \|
				# CHECK-NEXT: bb.0:
				# CHECK-NEXT: $x8 = SUBXri $sp, 8, 0
				sdesmalenUnsubmitted Done Reply Inline Actions Sorry I didn't notice this earlier when I posted the example MIR, but these `sub`s are wrong. The offset is at a positive distance from the SP, so it should use `add` here. If I change the offset from `2147483648` to `2147483632` that changes. So I expect some other changes are needed to fix this. sdesmalen: Sorry I didn't notice this earlier when I posted the example MIR, but these `sub`s are wrong.
				smeenaiAuthorUnsubmitted Done Reply Inline Actions If I revert my change and r374772, the problem still persists, so that seems like a pre-existing issue? smeenai: If I revert my change and r374772, the problem still persists, so that seems like a pre…
				smeenaiAuthorUnsubmitted Done Reply Inline Actions Given that this is a pre-existing problem, would you be okay with submitting this to solve the OOM and then filing a bug for the incorrect offset calculation? smeenai: Given that this is a pre-existing problem, would you be okay with submitting this to solve the…
				sdesmalenUnsubmitted Done Reply Inline Actions Sigh, I'm clearly having a bit of a senior moment here :) maxint is 2147483647 (forgot the -1 in my calculator). But with the fixed/updated offset, the result is the same with/without your patch, so we'll need something different to test it. sdesmalen: Sigh, I'm clearly having a bit of a senior moment here :) maxint is 2147483647 (forgot the -1…
				smeenaiAuthorUnsubmitted Done Reply Inline Actions The test as it stands does infinite loop/OOM without my patch though and completes successfully after. smeenai: The test as it stands does infinite loop/OOM without my patch though and completes successfully…
				# CHECK-NEXT: $x0 = LDURXi killed $x8, -256
				# CHECK-NEXT: RET_ReallyLR

This is an archive of the discontinued LLVM Phabricator instance.

[AArch64] Fix offset calculation
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 225280

llvm/lib/Target/AArch64/AArch64InstrInfo.h

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp

llvm/test/CodeGen/AArch64/framelayout-large-offset.mir

This is an archive of the discontinued LLVM Phabricator instance.

[AArch64] Fix offset calculationClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 225280

llvm/lib/Target/AArch64/AArch64InstrInfo.h

llvm/lib/Target/AArch64/AArch64InstrInfo.cpp

llvm/test/CodeGen/AArch64/framelayout-large-offset.mir

[AArch64] Fix offset calculation
ClosedPublic