Download Raw Diff

Details

Reviewers

fhahn
MatzeB
jmolloy
evandro

Commits

rG8d8cb1ad80b7: [AArch64] Avoid pairing loads when the base reg is modified

Summary

When pairing loads, we should check if in between the two loads the
base register has been modified. If that is the case then avoid pairing
them because the second load actually loads from a different address.

Diff Detail

Event Timeline

congzhe created this revision.Sep 1 2020, 9:39 AM

Herald added subscribers: llvm-commits, danielkiss, hiraditya, kristof.beyls. · View Herald TranscriptSep 1 2020, 9:39 AM

congzhe requested review of this revision.Sep 1 2020, 9:39 AM

Is it possible to add a MIR test for the issue? Also, it looks like the formatting is off a bit. Could you run clang-format-diff on the change?

This revision now requires changes to proceed.Sep 1 2020, 9:42 AM

In D86956#2249916, @fhahn wrote:

Is it possible to add a MIR test for the issue? Also, it looks like the formatting is off a bit. Could you run clang-format-diff on the change?

Thanks! Regarding the MIR test: similar to the test file in the other patch https://reviews.llvm.org/D86906, the MIR test file for this patch is very lengthy (over 1000 lines). I'm wondering if it looks fine if I use that test file, or I could spend time trying to prune it, although I'm not optimistic how much it can be reduced.

Harbormaster completed remote builds in B70255: Diff 289201.Sep 1 2020, 10:17 AM

Revision:

Fixed code style
Provided an mir test

In D86956#2249916, @fhahn wrote:

Is it possible to add a MIR test for the issue? Also, it looks like the formatting is off a bit. Could you run clang-format-diff on the change?

Thanks for the comment, now fixed the code style and provided a test case which has already been reduced using creduce and llvm-reduce.

congzhe updated this revision to Diff 292308.Sep 16 2020, 12:21 PM

Further reduced the mir test case.

congzhe updated this revision to Diff 292360.Sep 16 2020, 3:10 PM

congzhe updated this revision to Diff 292460.Sep 17 2020, 4:28 AM

fhahn added inline comments.Sep 17 2020, 10:44 AM

llvm/test/CodeGen/AArch64/aarch64-ldst-modified-baseReg.mir
18	It might be good to have test cases with stores & non-load instructions modifying the base?

congzhe retitled this revision from [AArch64LoadStoreOptimization] Bug fix in ldr to ldp conversion to [AArch64] Avoid pairing loads when the base reg is modified.Sep 17 2020, 3:58 PM

congzhe edited the summary of this revision. (Show Details)

congzhe updated this revision to Diff 293056.Sep 20 2020, 9:11 PM

congzhe added inline comments.Sep 20 2020, 9:28 PM

llvm/test/CodeGen/AArch64/aarch64-ldst-modified-baseReg.mir
18	Thanks for this comment, now added test cases where the base register or base address is modified with non-load instructions. However, the bug in AArch64 Load Store Optimization pass only exists when the base register is updated with load instructions in between two loads for pairing, i.e., ldr x9 [x10] ldr x10 [x8] ldr x10 [x10, 8]. For other instructions that modify the base register or base address with non-load instructions, the existing pass works correctly. The case that the base register is modified with non-load instructions is handled in lines 1626-1627 of the original AArch64 Load Store Optimization pass: if (!ModifiedRegUnits.available(BaseReg)) return E; Only when the pattern shown above occurs, this pass fails to handle it because it would hit a `continue` in the for-loop and would not reach lines 1626-1627. Still, I added these test cases where the base register or base address is modified with non-load instructions, but please note that these test cases already work correctly without the patch since the original AArch64 Load Store Optimization pass handles them well. I did not find similar tests in other test files that's why I added them, but maybe not adding them also makes sense. I'll appreciate it if you could let me know your thoughts.

@fhahn pinging reviewers :)
Comments are very much appreciated.

fhahn requested changes to this revision.Sep 30 2020, 3:54 AM

fhahn added inline comments.

llvm/lib/Target/AArch64/AArch64LoadStoreOptimizer.cpp
1580	then we cannot do the optimization?
1586	can we hoist the check outside of the containing if, i.e. to around line 1562? I think we should be able to bail out once the base reg is modified, because it won't get 'un-modified' so that should not rule out any valid pairs. Also, it is safer to do it earlier, otherwise we would need the check for each code path that returns a valid found pair (for example, we would probably also need it around line 1611)
llvm/test/CodeGen/AArch64/aarch64-ldst-modified-baseReg.mir
4	nit: spurious `to`, should that be `tries to convert load instructions`?
5	`a ldp` instruction?
6	`convertion` -> `conversion`?
18	Thanks for adding those tests! Even though it is already handled correctly, adding a few additional tests here for completeness makes sense to me.

This revision now requires changes to proceed.Sep 30 2020, 3:54 AM

congzhe updated this revision to Diff 295273.Sep 30 2020, 7:20 AM

LGTM, thanks!

llvm/lib/Target/AArch64/AArch64LoadStoreOptimizer.cpp
1564	nit: capitalize `For`.

This revision is now accepted and ready to land.Sep 30 2020, 7:22 AM

Thanks for all the comments, now addressed all of them @fhahn

llvm/lib/Target/AArch64/AArch64LoadStoreOptimizer.cpp
1580	Thanks, updated accordingly.
1586	Now hoisted the check outside of the containing if.
llvm/test/CodeGen/AArch64/aarch64-ldst-modified-baseReg.mir
6	Thanks for the comments, revised accordingly.

congzhe updated this revision to Diff 295307.Sep 30 2020, 8:50 AM

congzhe updated this revision to Diff 295310.Sep 30 2020, 8:55 AM

Closed by commit rG8d8cb1ad80b7: [AArch64] Avoid pairing loads when the base reg is modified (authored by congzhe, committed by dancgr). · Explain WhySep 30 2020, 10:08 AM

This revision was automatically updated to reflect the committed changes.

dancgr added a commit: rG8d8cb1ad80b7: [AArch64] Avoid pairing loads when the base reg is modified.

Diff 295273

llvm/lib/Target/AArch64/AArch64LoadStoreOptimizer.cpp

Show First 20 Lines • Show All 1,554 Lines • ▼ Show 20 Lines	if (areCandidatesToMergeOrPair(FirstMI, MI, Flags, TII) &&
// registers the same is UNPREDICTABLE and will result in an exception.		// registers the same is UNPREDICTABLE and will result in an exception.
if (MayLoad && Reg == getLdStRegOp(MI).getReg()) {		if (MayLoad && Reg == getLdStRegOp(MI).getReg()) {
LiveRegUnits::accumulateUsedDefed(MI, ModifiedRegUnits, UsedRegUnits,		LiveRegUnits::accumulateUsedDefed(MI, ModifiedRegUnits, UsedRegUnits,
TRI);		TRI);
MemInsns.push_back(&MI);		MemInsns.push_back(&MI);
continue;		continue;
}		}

		// If the BaseReg has been modified, then we cannot do the optimization.
		// for example, in the following pattern
		fhahnUnsubmitted Not Done Reply Inline Actions nit: capitalize `For`. fhahn: nit: capitalize `For`.
		// ldr x1 [x2]
		// ldr x2 [x3]
		// ldr x4 [x2, #8],
		// the first and third ldr cannot be converted to ldp x1, x4, [x2]
		if (!ModifiedRegUnits.available(BaseReg))
		return E;

// If the Rt of the second instruction was not modified or used between		// If the Rt of the second instruction was not modified or used between
// the two instructions and none of the instructions between the second		// the two instructions and none of the instructions between the second
// and first alias with the second, we can combine the second into the		// and first alias with the second, we can combine the second into the
// first.		// first.
if (ModifiedRegUnits.available(getLdStRegOp(MI).getReg()) &&		if (ModifiedRegUnits.available(getLdStRegOp(MI).getReg()) &&
!(MI.mayLoad() &&		!(MI.mayLoad() &&
!UsedRegUnits.available(getLdStRegOp(MI).getReg())) &&		!UsedRegUnits.available(getLdStRegOp(MI).getReg())) &&
!mayAlias(MI, MemInsns, AA)) {		!mayAlias(MI, MemInsns, AA)) {

Flags.setMergeForward(false);		Flags.setMergeForward(false);
		fhahnUnsubmitted Not Done Reply Inline Actions then we cannot do the optimization? fhahn: then we cannot do the optimization?
		congzheAuthorUnsubmitted Done Reply Inline Actions Thanks, updated accordingly. congzhe: Thanks, updated accordingly.
Flags.clearRenameReg();		Flags.clearRenameReg();
return MBBI;		return MBBI;
}		}

// Likewise, if the Rt of the first instruction is not modified or used		// Likewise, if the Rt of the first instruction is not modified or used
// between the two instructions and none of the instructions between the		// between the two instructions and none of the instructions between the
		fhahnUnsubmitted Not Done Reply Inline Actions can we hoist the check outside of the containing if, i.e. to around line 1562? I think we should be able to bail out once the base reg is modified, because it won't get 'un-modified' so that should not rule out any valid pairs. Also, it is safer to do it earlier, otherwise we would need the check for each code path that returns a valid found pair (for example, we would probably also need it around line 1611) fhahn: can we hoist the check outside of the containing if, i.e. to around line 1562? I think we…
		congzheAuthorUnsubmitted Done Reply Inline Actions Now hoisted the check outside of the containing if. congzhe: Now hoisted the check outside of the containing if.
// first and the second alias with the first, we can combine the first		// first and the second alias with the first, we can combine the first
// into the second.		// into the second.
if (!(MayLoad &&		if (!(MayLoad &&
!UsedRegUnits.available(getLdStRegOp(FirstMI).getReg())) &&		!UsedRegUnits.available(getLdStRegOp(FirstMI).getReg())) &&
!mayAlias(FirstMI, MemInsns, AA)) {		!mayAlias(FirstMI, MemInsns, AA)) {

if (ModifiedRegUnits.available(getLdStRegOp(FirstMI).getReg())) {		if (ModifiedRegUnits.available(getLdStRegOp(FirstMI).getReg())) {
Flags.setMergeForward(true);		Flags.setMergeForward(true);
▲ Show 20 Lines • Show All 568 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/aarch64-ldst-modified-baseReg.mir

This file was added.

				# RUN: llc -mtriple=aarch64-linux-gnu -verify-machineinstrs -run-pass=aarch64-ldst-opt %s -o - \| FileCheck %s
				#
				# When the AArch64 Load Store Optimization pass tries to convert load instructions
				# into a ldp instruction, and when the base register of the second ldr instruction
				fhahnUnsubmitted Not Done Reply Inline Actions nit: spurious `to`, should that be `tries to convert load instructions`? fhahn: nit: spurious `to`, should that be `tries to convert load instructions`?
				# has been modified in between these two ldr instructions, the conversion should not
				fhahnUnsubmitted Not Done Reply Inline Actions `a ldp` instruction? fhahn: `a ldp` instruction?
				# occur.
				fhahnUnsubmitted Not Done Reply Inline Actions `convertion` -> `conversion`? fhahn: `convertion` -> `conversion`?
				congzheAuthorUnsubmitted Done Reply Inline Actions Thanks for the comments, revised accordingly. congzhe: Thanks for the comments, revised accordingly.
				#
				# For example, for the following pattern:
				# ldr x9 [x10]
				# ldr x10 [x8]
				# ldr x10 [x10, 8],
				# the first and third ldr instructions cannot be converted to ldp x9, x10, [x10].
				#
				# CHECK-LABEL: name: ldr-modified-baseReg-no-ldp1
				# CHECK-NOT: LDP
				# CHECK: $x9 = LDRXui $x10, 1 :: (load 8)
				# CHECK: $x10 = LDURXi $x8, 1 :: (load 8)
				# CHECK: $x10 = LDRXui $x10, 0 :: (load 8)
				fhahnUnsubmitted Not Done Reply Inline Actions It might be good to have test cases with stores & non-load instructions modifying the base? fhahn: It might be good to have test cases with stores & non-load instructions modifying the base?
				congzheAuthorUnsubmitted Done Reply Inline Actions Thanks for this comment, now added test cases where the base register or base address is modified with non-load instructions. However, the bug in AArch64 Load Store Optimization pass only exists when the base register is updated with load instructions in between two loads for pairing, i.e., ldr x9 [x10] ldr x10 [x8] ldr x10 [x10, 8]. For other instructions that modify the base register or base address with non-load instructions, the existing pass works correctly. The case that the base register is modified with non-load instructions is handled in lines 1626-1627 of the original AArch64 Load Store Optimization pass: if (!ModifiedRegUnits.available(BaseReg)) return E; Only when the pattern shown above occurs, this pass fails to handle it because it would hit a `continue` in the for-loop and would not reach lines 1626-1627. Still, I added these test cases where the base register or base address is modified with non-load instructions, but please note that these test cases already work correctly without the patch since the original AArch64 Load Store Optimization pass handles them well. I did not find similar tests in other test files that's why I added them, but maybe not adding them also makes sense. I'll appreciate it if you could let me know your thoughts. congzhe: Thanks for this comment, now added test cases where the base register or base address is…
				fhahnUnsubmitted Not Done Reply Inline Actions Thanks for adding those tests! Even though it is already handled correctly, adding a few additional tests here for completeness makes sense to me. fhahn: Thanks for adding those tests! Even though it is already handled correctly, adding a few…
				# CHECK: RET
				---
				name: ldr-modified-baseReg-no-ldp1
				tracksRegLiveness: true
				body: \|
				bb.0:
				liveins: $x8, $x10

				$x9 = LDRXui $x10, 1 :: (load 8)
				$x10 = LDURXi $x8, 1 :: (load 8)
				$x10 = LDRXui $x10, 0 :: (load 8)
				RET undef $lr, implicit undef $w0
				...

				# CHECK-LABEL: name: str-modified-baseReg-no-stp1
				# CHECK-NOT: STP
				# CHECK: STRXui $x9, $x10, 1 :: (store 8)
				# CHECK: $x10 = LDRXui $x8, 0 :: (load 8)
				# CHECK: STRXui $x10, $x10, 0 :: (store 8)
				# CHECK: RET
				---
				name: str-modified-baseReg-no-stp1
				tracksRegLiveness: true
				body: \|
				bb.0:
				liveins: $x9, $x8, $x10

				STRXui $x9, $x10, 1 :: (store 8)
				$x10 = LDRXui $x8, 0 :: (load 8)
				STRXui $x10, $x10, 0 :: (store 8)
				RET undef $lr, implicit undef $w0
				...

				# CHECK-LABEL: name: ldr-modified-baseReg-no-ldp2
				# CHECK-NOT: LDP
				# CHECK: $x9 = LDRXui $x10, 1 :: (load 8)
				# CHECK: $x10 = MOVi64imm 13
				# CHECK: $x11 = LDRXui $x10, 0 :: (load 8)
				# CHECK: RET
				---
				name: ldr-modified-baseReg-no-ldp2
				tracksRegLiveness: true
				body: \|
				bb.0:
				liveins: $x8, $x10

				$x9 = LDRXui $x10, 1 :: (load 8)
				$x10 = MOVi64imm 13
				$x11 = LDRXui $x10, 0 :: (load 8)
				RET undef $lr, implicit undef $w0
				...

				# CHECK-LABEL: name: ldr-modified-baseReg-no-ldp3
				# CHECK-NOT: LDP
				# CHECK: $x9 = LDRXui $x10, 1 :: (load 8)
				# CHECK: $x10 = ADDXri $x8, $x11, 0
				# CHECK: $x12 = LDRXui $x10, 0 :: (load 8)
				# CHECK: RET
				---
				name: ldr-modified-baseReg-no-ldp3
				tracksRegLiveness: true
				body: \|
				bb.0:
				liveins: $x8, $x10, $x11

				$x9 = LDRXui $x10, 1 :: (load 8)
				$x10 = ADDXri $x8, $x11, 0
				$x12 = LDRXui $x10, 0 :: (load 8)
				RET undef $lr, implicit undef $w0
				...

				# CHECK-LABEL: name: ldr-modified-baseAddr-convert-to-ldp
				# CHECK: $x12, $x9 = LDPXi $x10, 0 :: (load 8)
				# CHECK: STRXui $x11, $x10, 1 :: (store 8)
				# CHECK: RET
				---
				name: ldr-modified-baseAddr-convert-to-ldp
				tracksRegLiveness: true
				body: \|
				bb.0:
				liveins: $x8, $x10, $x11

				$x9 = LDRXui $x10, 1 :: (load 8)
				STRXui $x11, $x10, 1 :: (store 8)
				$x12 = LDRXui $x10, 0 :: (load 8)
				RET undef $lr, implicit undef $w0
				...

This is an archive of the discontinued LLVM Phabricator instance.

[AArch64] Avoid pairing loads when the base reg is modified
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 295273

llvm/lib/Target/AArch64/AArch64LoadStoreOptimizer.cpp

llvm/test/CodeGen/AArch64/aarch64-ldst-modified-baseReg.mir

This is an archive of the discontinued LLVM Phabricator instance.

[AArch64] Avoid pairing loads when the base reg is modifiedClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 295273

llvm/lib/Target/AArch64/AArch64LoadStoreOptimizer.cpp

llvm/test/CodeGen/AArch64/aarch64-ldst-modified-baseReg.mir

[AArch64] Avoid pairing loads when the base reg is modified
ClosedPublic