Download Raw Diff

Details

Reviewers

dp
foad
arsenm
rampitec

Commits

rG9c66c02e2e7e: [AMDGPU][CodeGen] Match SMRDs with constant bases and register offsets.

Summary

Saves some add instructions on a couple Rage 2 shaders and is also a
prerequisite for a coming-soon change matching (register + immediate)
offsets.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	230 ms	x64 debian > BOLT.runtime/X86::user-func-reorder.c
	60,030 ms	x64 debian > MLIR.Examples/standalone::test.toy
	60,030 ms	x64 debian > libFuzzer.libFuzzer::value-profile-load.test

Event Timeline

kosarev created this revision.Jul 4 2022, 11:45 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 4 2022, 11:45 AM

Herald added subscribers: jsilvanus, kerbowa, hiraditya and 7 others. · View Herald Transcript

kosarev requested review of this revision.Jul 4 2022, 11:45 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 4 2022, 11:45 AM

Herald added subscribers: llvm-commits, wdng. · View Herald Transcript

D128836 covers the case for Global ISel.

Harbormaster completed remote builds in B173587: Diff 442129.Jul 4 2022, 12:42 PM

foad added inline comments.Jul 5 2022, 1:48 AM

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
36	When this patch and D128836 have both landed, can you add GISEL checks here please.
38	Please don't use unnamed values in tests. (You can run the IR through `opt -instnamer` to name them automatically.)
40	Do I understand correctly: in SDag this gep ends up as `(add %2, @0)` (not `(add @0, %2)`) because `@0` is a constant so it gets canonicalized to the RHS?

Rebased on top of D128171 and updated the test to use named values.

kosarev marked 2 inline comments as done.Jul 5 2022, 6:59 AM

kosarev added inline comments.

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
36	Done; the GCN checks now work for both the SDAG and GISEL runs.
40	That's right, for commutative nodes the combiner tries to make whatever seems a constant be the second operand.

LGTM, thanks!

This revision is now accepted and ready to land.Jul 5 2022, 7:04 AM

arsenm added inline comments.Jul 5 2022, 7:05 AM

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
1–2	Should use explicit global-isel=0 for dag test
34–35	Should test the operands
45	Doesn't test the 32-bit constant case

kosarev marked an inline comment as done.Jul 5 2022, 7:14 AM

kosarev added inline comments.

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
45	Why would we want to check that? Looks like there's nothing specific to 32-bit constants in the change?

arsenm added inline comments.Jul 5 2022, 7:15 AM

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
45	You have the call to Expand32BitAddress

Harbormaster completed remote builds in B173703: Diff 442293.Jul 5 2022, 7:47 AM

kosarev added inline comments.Jul 5 2022, 7:57 AM

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
45	Yes, which replicates the existing and already-tested logic. Will it eliminate the need for the test if I move the couples of `SelectSMRDOffset()` and `Expand32BitAddress()` calls into a separate function, something like `SelectSMRDBaseOffset()`? We would need it for matching (register + immediate) offsets anyway. Feels like that'd be a test case for what is actually not a special case.

arsenm added inline comments.Jul 5 2022, 8:17 AM

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
45	It's another permutation of a somewhat fragile case (which also works differently in globalisel). There's no reason to skimp on tests

kosarev added inline comments.Jul 6 2022, 6:51 AM

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
45	The `if ((Addr.getValueType() != MVT::i32 \|\| Addr->getFlags().hasNoUnsignedWrap()))` bit in `AMDGPUDAGToDAGISel::SelectSMRD()` means for such a test we would need the `nuw` flag be set on the `ISD::ADD` in the offset, which we only do for constant `getelementptr inbounds` indexes that in turn would always go to the RHS operand of the `ISD::ADD`, and thus can't help reproducing the case. D15544 gives some reasoning for raising `nuw` for non-negative constant offsets. If we think it's worth it, I could try to do the same for non-constants known to be small enough, but it again would help if we do not require cultivating things specific to the constant address space be a prerequisite for this generic change.

Addressed review notes.

kosarev marked 2 inline comments as done.Jul 6 2022, 8:12 AM

Harbormaster completed remote builds in B173912: Diff 442591.Jul 6 2022, 9:18 AM

Update the GISel test for it to be less dependent on code order.

kosarev added a child revision: D129381: [AMDGPU][CodeGen] Support (register + immediate) SMRD offsets..Jul 8 2022, 10:04 AM

Harbormaster completed remote builds in B174406: Diff 443271.Jul 8 2022, 10:33 AM

Match complex register SMRD offsets.

Could you improve the commit message? I don't know what "complex" means here. I think it is really about matching (constant base + register offset) in addition to (register base + constant offset).

Reworded commit message.

kosarev retitled this revision from [AMDGPU][CodeGen] Match complex register SMRD offsets. to [AMDGPU][CodeGen] Match SMRDs with constant bases and register offsets..Jul 11 2022, 7:50 AM

Right, the 'complex' part came from when I had a hypothesis that it is the size of the operand that matters. Thanks for catching.

Harbormaster completed remote builds in B174665: Diff 443639.Jul 11 2022, 8:28 AM

Still LGTM if Matt is happy.

arsenm added inline comments.Jul 11 2022, 11:21 AM

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
45	You can just add noinbounds to the getelementptr. It's not a special change and just testing the mechanics for what you already have here

kosarev added inline comments.Jul 11 2022, 12:48 PM

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
45	Sorry, I'm not sure I follow. What is noinbounds and how it should be used with the getelementptr, exactly?

kosarev added inline comments.Jul 13 2022, 11:00 AM

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
45	@arsenm Ping?

arsenm added inline comments.Jul 13 2022, 11:22 AM

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
45	You can try this: %i3 = getelementptr inbounds [4 x <2 x float>], with the addrspace(6) pointer

kosarev added inline comments.Jul 14 2022, 4:35 AM

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
45	It has already been explained in https://reviews.llvm.org/D129095#inline-1241075 that that wouldn't work. On top of that, we don't seem to support addrspace(6) constants in SDAG ISel currently; I get AMDGPU DAG->DAG Pattern Instruction Selection error whenever I try to use one. Regarding noinbounds, I don't see how your words explain what it is or how it can be useful for triggering the added code. Is that your comment still relevant? And just for the record, I don't understand the argument about the new Expand32BitAddress() call being a sufficient reason for another test case. We usually don't add tests for higher level code just because it employs more of some lower level functions. It's up to tests specifically for these lower level functions to make sure they can handle their intended uses cases, and as hopefully is obvious from https://reviews.llvm.org/D129381 that eliminates multiple calls to Expand32BitAddress(), this patch doesn't add any new use cases for that function. What has been changed is that we now support another permutation of operands in SelectSMRD(), but that is already covered with the added tests. addrspace(6) operands are no special in this regard, so adding more tests for them would mean we try to test combinations that are not special implementation-wise, which is again not how we write regression tests. And yes, unnecessary tests can actually be harmful -- they draw maintenance resources and attention and they hide cases that are really important. I believe some of our MC tests illustrate that. Overall, I feel somewhat confused about your two last comments. If you can reword what you have to suggest in a bit more explicit way, that will likely be a great help. Thanks.

Updated a comment in the .ll test to not mention complex operands.

Harbormaster completed remote builds in B175352: Diff 444590.Jul 14 2022, 6:50 AM

kosarev requested review of this revision.Jul 15 2022, 1:43 AM

arsenm accepted this revision.Jul 16 2022, 8:31 AM

arsenm added inline comments.

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll
45	The point isn't to test Expand32BitAddress but to test that it's used on this path. I guess if constants don't work for constant 32-bit that's a separate issue

This revision is now accepted and ready to land.Jul 16 2022, 8:31 AM

This revision was landed with ongoing or failed builds.Jul 18 2022, 3:19 AM

Closed by commit rG9c66c02e2e7e: [AMDGPU][CodeGen] Match SMRDs with constant bases and register offsets. (authored by kosarev). · Explain Why

This revision was automatically updated to reflect the committed changes.

kosarev added a commit: rG9c66c02e2e7e: [AMDGPU][CodeGen] Match SMRDs with constant bases and register offsets..

Diff 442293

llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp

Show First 20 Lines • Show All 1,985 Lines • ▼ Show 20 Lines	if ((Addr.getValueType() != MVT::i32 \|\|
} else if (getBaseWithOffsetUsingSplitOR(*CurDAG, Addr, N0, N1)) {		} else if (getBaseWithOffsetUsingSplitOR(*CurDAG, Addr, N0, N1)) {
assert(N0 && N1 && isa<ConstantSDNode>(N1));		assert(N0 && N1 && isa<ConstantSDNode>(N1));
}		}
if (N0 && N1) {		if (N0 && N1) {
if (SelectSMRDOffset(N1, Offset, Imm, Imm32Only)) {		if (SelectSMRDOffset(N1, Offset, Imm, Imm32Only)) {
SBase = Expand32BitAddress(N0);		SBase = Expand32BitAddress(N0);
return true;		return true;
}		}
		if (SelectSMRDOffset(N0, Offset, Imm, Imm32Only)) {
		SBase = Expand32BitAddress(N1);
		return true;
		}
}		}
return false;		return false;
}		}
if (!Imm)		if (!Imm)
return false;		return false;
SBase = Expand32BitAddress(Addr);		SBase = Expand32BitAddress(Addr);
Offset = CurDAG->getTargetConstant(0, SL, MVT::i32);		Offset = CurDAG->getTargetConstant(0, SL, MVT::i32);
return true;		return true;
▲ Show 20 Lines • Show All 974 Lines • Show Last 20 Lines

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll

	; Test that DAG->DAG ISel is able to pick up the S_LOAD_DWORDX4_SGPR instruction that fetches the offset			; RUN: llc -march=amdgcn -verify-machineinstrs -stop-after=amdgpu-isel -o - %s \| FileCheck -check-prefixes=GCN,SDAG %s
	; from a register.			; RUN: llc -march=amdgcn -global-isel -verify-machineinstrs -stop-after=amdgpu-isel -o - %s \| FileCheck -check-prefixes=GCN,GISEL %s
				arsenmUnsubmitted Done Reply Inline Actions Should use explicit global-isel=0 for dag test arsenm: Should use explicit global-isel=0 for dag test

	; RUN: llc -march=amdgcn -verify-machineinstrs -stop-after=amdgpu-isel -o - %s \| FileCheck -check-prefix=GCN %s
	; RUN: llc -march=amdgcn -global-isel -verify-machineinstrs -stop-after=amdgpu-isel -o - %s \| FileCheck -check-prefix=GISEL %s

	; GCN: %[[OFFSET:[0-9]+]]:sreg_32 = S_MOV_B32 target-flags(amdgpu-abs32-lo) @DescriptorBuffer			@0 = external dso_local addrspace(4) constant [4 x <2 x float>]
	; GCN: %{{[0-9]+}}:sgpr_128 = S_LOAD_DWORDX4_SGPR killed %{{[0-9]+}}, killed %[[OFFSET]], 0 :: (invariant load (s128) from %ir.13, addrspace 4)			@1 = external dso_local addrspace(4) constant i32

				; Test that DAG->DAG ISel is able to pick up the S_LOAD_DWORDX4_SGPR instruction that fetches the offset
				; from a register.
				; GCN-LABEL: name: test_load_zext
				; SDAG: %[[OFFSET:[0-9]+]]:sreg_32 = S_MOV_B32 target-flags(amdgpu-abs32-lo) @DescriptorBuffer
				; SDAG: %{{[0-9]+}}:sgpr_128 = S_LOAD_DWORDX4_SGPR killed %{{[0-9]+}}, killed %[[OFFSET]], 0 :: (invariant load (s128) from %ir.13, addrspace 4)
	; GISEL: $[[OFFSET:.*]] = S_MOV_B32 target-flags(amdgpu-abs32-lo) @DescriptorBuffer			; GISEL: $[[OFFSET:.*]] = S_MOV_B32 target-flags(amdgpu-abs32-lo) @DescriptorBuffer
	; GISEL: S_LOAD_DWORDX4_SGPR killed renamable {{.}}, killed renamable $[[OFFSET]], 0 :: (invariant load (<4 x s32>) from {{.}}, addrspace 4)			; GISEL: S_LOAD_DWORDX4_SGPR killed renamable {{.}}, killed renamable $[[OFFSET]], 0 :: (invariant load (<4 x s32>) from {{.}}, addrspace 4)

	define amdgpu_cs void @test_load_zext(i32 inreg %0, i32 inreg %1, i32 inreg %resNode0, i32 inreg %resNode1, <3 x i32> inreg %2, i32 inreg %3, <3 x i32> %4) local_unnamed_addr #2 {			define amdgpu_cs void @test_load_zext(i32 inreg %0, i32 inreg %1, i32 inreg %resNode0, i32 inreg %resNode1, <3 x i32> inreg %2, i32 inreg %3, <3 x i32> %4) local_unnamed_addr #2 {
	.entry:			.entry:
	%5 = call i64 @llvm.amdgcn.s.getpc() #3			%5 = call i64 @llvm.amdgcn.s.getpc() #3
	%6 = bitcast i64 %5 to <2 x i32>			%6 = bitcast i64 %5 to <2 x i32>
	%7 = insertelement <2 x i32> %6, i32 %resNode0, i32 0			%7 = insertelement <2 x i32> %6, i32 %resNode0, i32 0
	%8 = bitcast <2 x i32> %7 to i64			%8 = bitcast <2 x i32> %7 to i64
	%9 = inttoptr i64 %8 to [4294967295 x i8] addrspace(4)*			%9 = inttoptr i64 %8 to [4294967295 x i8] addrspace(4)*
	%10 = call i32 @llvm.amdgcn.reloc.constant(metadata !4)			%10 = call i32 @llvm.amdgcn.reloc.constant(metadata !4)
	%11 = zext i32 %10 to i64			%11 = zext i32 %10 to i64
	%12 = getelementptr [4294967295 x i8], [4294967295 x i8] addrspace(4)* %9, i64 0, i64 %11			%12 = getelementptr [4294967295 x i8], [4294967295 x i8] addrspace(4)* %9, i64 0, i64 %11
	%13 = bitcast i8 addrspace(4)* %12 to <4 x i32> addrspace(4)*, !amdgpu.uniform !5			%13 = bitcast i8 addrspace(4)* %12 to <4 x i32> addrspace(4)*, !amdgpu.uniform !5
	%14 = load <4 x i32>, <4 x i32> addrspace(4)* %13, align 16, !invariant.load !5			%14 = load <4 x i32>, <4 x i32> addrspace(4)* %13, align 16, !invariant.load !5
	%15 = call <4 x i32> @llvm.amdgcn.s.buffer.load.v4i32(<4 x i32> %14, i32 0, i32 0)			%15 = call <4 x i32> @llvm.amdgcn.s.buffer.load.v4i32(<4 x i32> %14, i32 0, i32 0)
	call void @llvm.amdgcn.raw.buffer.store.v4i32(<4 x i32> %15, <4 x i32> %14, i32 0, i32 0, i32 0)			call void @llvm.amdgcn.raw.buffer.store.v4i32(<4 x i32> %15, <4 x i32> %14, i32 0, i32 0, i32 0)
	ret void			ret void
	}			}

				; Make sure we match complex register offsets, which may come before the
				; base operand in the load's SDAG nodes.
				; GCN-LABEL: name: test_complex_reg_offset
				; CCN: S_LOAD_DWORD_IMM
				; GCN: S_LOAD_DWORD_SGPR
				arsenmUnsubmitted Done Reply Inline Actions Should test the operands arsenm: Should test the operands
				define amdgpu_ps void @test_complex_reg_offset(float addrspace(1)* %out) {
				foadUnsubmitted Done Reply Inline Actions When this patch and D128836 have both landed, can you add GISEL checks here please. foad: When this patch and D128836 have both landed, can you add GISEL checks here please.
				kosarevAuthorUnsubmitted Done Reply Inline Actions Done; the GCN checks now work for both the SDAG and GISEL runs. kosarev: Done; the GCN checks now work for both the SDAG and GISEL runs.
				%i = load i32, i32 addrspace(4)* @1
				%i1 = and i32 %i, 3
				foadUnsubmitted Done Reply Inline Actions Please don't use unnamed values in tests. (You can run the IR through `opt -instnamer` to name them automatically.) foad: Please don't use unnamed values in tests. (You can run the IR through `opt -instnamer` to name…
				%i2 = zext i32 %i1 to i64
				%i3 = getelementptr [4 x <2 x float>], [4 x <2 x float>] addrspace(4)* @0, i64 0, i64 %i2, i64 0
				foadUnsubmitted Not Done Reply Inline Actions Do I understand correctly: in SDag this gep ends up as `(add %2, @0)` (not `(add @0, %2)`) because `@0` is a constant so it gets canonicalized to the RHS? foad: Do I understand correctly: in SDag this gep ends up as `(add %2, @0) ` (not `(add @0, %2)`)…
				kosarevAuthorUnsubmitted Done Reply Inline Actions That's right, for commutative nodes the combiner tries to make whatever seems a constant be the second operand. kosarev: That's right, for commutative nodes the combiner tries to make whatever seems a constant be the…
				%i4 = load float, float addrspace(4)* %i3, align 4
				store float %i4, float addrspace(1)* %out
				ret void
				}

				arsenmUnsubmitted Not Done Reply Inline Actions Doesn't test the 32-bit constant case arsenm: Doesn't test the 32-bit constant case
				kosarevAuthorUnsubmitted Done Reply Inline Actions Why would we want to check that? Looks like there's nothing specific to 32-bit constants in the change? kosarev: Why would we want to check that? Looks like there's nothing specific to 32-bit constants in the…
				arsenmUnsubmitted Not Done Reply Inline Actions You have the call to Expand32BitAddress arsenm: You have the call to Expand32BitAddress
				kosarevAuthorUnsubmitted Done Reply Inline Actions Yes, which replicates the existing and already-tested logic. Will it eliminate the need for the test if I move the couples of `SelectSMRDOffset()` and `Expand32BitAddress()` calls into a separate function, something like `SelectSMRDBaseOffset()`? We would need it for matching (register + immediate) offsets anyway. Feels like that'd be a test case for what is actually not a special case. kosarev: Yes, which replicates the existing and already-tested logic. Will it eliminate the need for the…
				arsenmUnsubmitted Not Done Reply Inline Actions It's another permutation of a somewhat fragile case (which also works differently in globalisel). There's no reason to skimp on tests arsenm: It's another permutation of a somewhat fragile case (which also works differently in…
				kosarevAuthorUnsubmitted Done Reply Inline Actions The `if ((Addr.getValueType() != MVT::i32 \|\| Addr->getFlags().hasNoUnsignedWrap()))` bit in `AMDGPUDAGToDAGISel::SelectSMRD()` means for such a test we would need the `nuw` flag be set on the `ISD::ADD` in the offset, which we only do for constant `getelementptr inbounds` indexes that in turn would always go to the RHS operand of the `ISD::ADD`, and thus can't help reproducing the case. D15544 gives some reasoning for raising `nuw` for non-negative constant offsets. If we think it's worth it, I could try to do the same for non-constants known to be small enough, but it again would help if we do not require cultivating things specific to the constant address space be a prerequisite for this generic change. kosarev: The `if ((Addr.getValueType() != MVT::i32 \|\| Addr->getFlags().hasNoUnsignedWrap()))` bit in…
				arsenmUnsubmitted Not Done Reply Inline Actions You can just add noinbounds to the getelementptr. It's not a special change and just testing the mechanics for what you already have here arsenm: You can just add noinbounds to the getelementptr. It's not a special change and just testing…
				kosarevAuthorUnsubmitted Done Reply Inline Actions Sorry, I'm not sure I follow. What is noinbounds and how it should be used with the getelementptr, exactly? kosarev: Sorry, I'm not sure I follow. What is noinbounds and how it should be used with the…
				kosarevAuthorUnsubmitted Done Reply Inline Actions @arsenm Ping? kosarev: @arsenm Ping?
				arsenmUnsubmitted Not Done Reply Inline Actions You can try this: %i3 = getelementptr inbounds [4 x <2 x float>], with the addrspace(6) pointer arsenm: You can try this: ``` %i3 = getelementptr inbounds [4 x <2 x float>], ``` with the addrspace…
				kosarevAuthorUnsubmitted Done Reply Inline Actions It has already been explained in https://reviews.llvm.org/D129095#inline-1241075 that that wouldn't work. On top of that, we don't seem to support addrspace(6) constants in SDAG ISel currently; I get AMDGPU DAG->DAG Pattern Instruction Selection error whenever I try to use one. Regarding noinbounds, I don't see how your words explain what it is or how it can be useful for triggering the added code. Is that your comment still relevant? And just for the record, I don't understand the argument about the new Expand32BitAddress() call being a sufficient reason for another test case. We usually don't add tests for higher level code just because it employs more of some lower level functions. It's up to tests specifically for these lower level functions to make sure they can handle their intended uses cases, and as hopefully is obvious from https://reviews.llvm.org/D129381 that eliminates multiple calls to Expand32BitAddress(), this patch doesn't add any new use cases for that function. What has been changed is that we now support another permutation of operands in SelectSMRD(), but that is already covered with the added tests. addrspace(6) operands are no special in this regard, so adding more tests for them would mean we try to test combinations that are not special implementation-wise, which is again not how we write regression tests. And yes, unnecessary tests can actually be harmful -- they draw maintenance resources and attention and they hide cases that are really important. I believe some of our MC tests illustrate that. Overall, I feel somewhat confused about your two last comments. If you can reword what you have to suggest in a bit more explicit way, that will likely be a great help. Thanks. kosarev: It has already been explained in https://reviews.llvm.org/D129095#inline-1241075 that that…
				arsenmUnsubmitted Not Done Reply Inline Actions The point isn't to test Expand32BitAddress but to test that it's used on this path. I guess if constants don't work for constant 32-bit that's a separate issue arsenm: The point isn't to test Expand32BitAddress but to test that it's used on this path. I guess if…
	declare void @llvm.amdgcn.raw.buffer.store.v4i32(<4 x i32>, <4 x i32>, i32, i32, i32 immarg) #1			declare void @llvm.amdgcn.raw.buffer.store.v4i32(<4 x i32>, <4 x i32>, i32, i32, i32 immarg) #1

	; Function Attrs: nounwind readnone speculatable			; Function Attrs: nounwind readnone speculatable
	declare i32 @llvm.amdgcn.reloc.constant(metadata) #3			declare i32 @llvm.amdgcn.reloc.constant(metadata) #3

	; Function Attrs: nounwind readnone speculatable			; Function Attrs: nounwind readnone speculatable
	declare i64 @llvm.amdgcn.s.getpc() #3			declare i64 @llvm.amdgcn.s.getpc() #3

	Show All 27 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU][CodeGen] Match SMRDs with constant bases and register offsets.
ClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 442293

llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll

This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU][CodeGen] Match SMRDs with constant bases and register offsets.ClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 442293

llvm/lib/Target/AMDGPU/AMDGPUISelDAGToDAG.cpp

llvm/test/CodeGen/AMDGPU/amdgcn-load-offset-from-reg.ll

[AMDGPU][CodeGen] Match SMRDs with constant bases and register offsets.
ClosedPublic