This is an archive of the discontinued LLVM Phabricator instance.

Paths

llvm/lib/Target/AMDGPU/
- BUFInstructions.td
- FLATInstructions.td
- SIInstrInfo.td
- SMInstructions.td
Differential D109032

[AMDGPU][NFC] Alter ComplexPattern types to be consistent with their uses
Closed, Public

Authored by jrtc27 on Aug 31 2021, 5:00 PM.

Details

Reviewers

arsenm
foad
tstellar

Commits

rG3ee56eed2fc2: [AMDGPU][NFC] Alter ComplexPattern types to be consistent with their uses

Summary

When used as a non-leaf node, TableGen does not currently use the type
of a ComplexPattern for type inference, which also means it does not
check it doesn't conflict with the use. This differs from when used as a
leaf value, where the type is used for inference. Fixing that
discrepancy is something I intend to upstream as a subsequent review.

AMDGPU currently has several ComplexPatterns that are used in contexts
where they're expected to be an iPTR, and where using an iPTR instead of
a fixed-width integer type matters. With my locally-patched TableGen,
none of these mismatches result in type contradictions, but do change
the patterns and cause various failures to select. These changes to the
ComplexPatterns' types reflect how they are actually used, result in
bit-for-bit identical TableGen output (without my local TableGen patch),
and ensure that with improved type inference AMDGPU's backend will
continue to work.
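
One way to check the "bit-for-bit identical TableGen output" claim is to hash the generated files from a build before the change and a build after it. The sketch below is illustrative only: the two build directories and the helper names (file_digest, outputs_identical) are assumptions and are not part of this revision.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's bytes."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def outputs_identical(before_dir: Path, after_dir: Path, names: list[str]) -> dict[str, bool]:
    """Map each generated file name to whether both builds produced the same bytes.

    before_dir and after_dir are hypothetical directories holding the
    generated .inc files from the two builds being compared.
    """
    return {
        name: file_digest(before_dir / name) == file_digest(after_dir / name)
        for name in names
    }
```

Calling outputs_identical with the generated DAG ISel file for the target should report True for every entry if the change really is NFC; any False entry points at a file worth diffing by hand.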

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

Time	Test
1,290 ms	x64 debian > libomp.lock::omp_init_lock.c

Event Timeline

jrtc27 created this revision. Aug 31 2021, 5:00 PM

Herald added subscribers: kerbowa, hiraditya, t-tye and 6 others. Aug 31 2021, 5:00 PM

jrtc27 requested review of this revision. Aug 31 2021, 5:00 PM

Herald added a project: Restricted Project. Aug 31 2021, 5:00 PM

Herald added subscribers: llvm-commits, wdng.

I'm not sure where this is going without the context of the other patch.

I think iPTR is a flawed concept that doesn't work without the address space. AMDGPU does not want the type changing magic of iPTR when there are mixed and constant pointer sizes that never change, so the fixed types are more logical.

llvm/lib/Target/AMDGPU/SIInstrInfo.td
1320–1322	I think iPTR is a flawed concept that doesn't work without the address space, so I don't think AMDGPU should ever use it

I agree with you in a sense. It causes pain for us downstream to not have address spaces here, so we just add a new type for our other pointer representation downstream. Up until now that has been hacked together, but I'm trying to rewrite it to do things properly, and adding a new possible pointer type everywhere beyond iPTR causes a lot of believed-ambiguous-but-not patterns without better type inference in TableGen. However, if you look at the generated .inc files, you will see that in all the cases where these are used TableGen has already inferred that iPTR is in use. If I don't make this change but apply my subsequent TableGen patch to actually propagate the types as they should have been from the start, then the AMDGPU backend becomes very crash-prone, with failures to select all over the place, as it seems it really is expecting iPTR, not i32/i64, for various things. The changes here have zero effect on the generated files without my TableGen patch, and ensure that even with my TableGen patch the output remains unchanged (otherwise they go from iPTR to i32/i64). So this doesn't change behaviour; it just makes the code more accurately state what is actually currently going on.

jrtc27 added a child revision: D109035: [TableGen][SelectionDAG] Use ComplexPattern type for non-leaf nodes. Aug 31 2021, 5:23 PM

Harbormaster completed remote builds in B122013: Diff 369819. Aug 31 2021, 5:37 PM

For example, picking the first ComplexPattern here, MUBUFAddr64, I see 56 instances of MUBUFAddr64:{ in AMDGPUGenDAGISel.inc, all of which are MUBUFAddr64:{ *:[iPTR] }. Changing the ComplexPattern to have type iPTR rather than i64 ensures that that remains the case once the ComplexPattern type is included in the type inference, rather than altering behaviour by inferring MUBUFAddr64:{ *:[i64] } instead (which is, I imagine, what the author _intended_ it to be, but not what it _actually_ is currently, and does not work if that's actually the type used).
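
The count described above can be reproduced with a small script that tallies the inferred type recorded next to each use of a ComplexPattern in the generated pattern comments. This is a rough sketch: the function name tally_types is hypothetical, and the regular expression assumes the "Name:{ *:[type] }" comment spelling quoted in this thread.

```python
import re
from collections import Counter


def tally_types(text: str, pattern_name: str) -> Counter:
    """Count the inferred types attached to uses of one ComplexPattern.

    Looks for occurrences such as "MUBUFAddr64:{ *:[iPTR] }" in the text of
    a generated .inc file and counts the type names found inside [ ].
    """
    # \b stops the name from matching as the tail of a longer identifier.
    regex = re.compile(r"\b" + re.escape(pattern_name) + r":\{ \*:\[(\w+)\] \}")
    return Counter(regex.findall(text))
```

Reading AMDGPUGenDAGISel.inc into a string and calling tally_types(text, "MUBUFAddr64") would then show at a glance whether every use is iPTR or whether some uses carry a fixed-width type.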

Ping; does the above help explain why, despite what it may look like, this is actually NFC, down to the generated code?

The MUBUFAddr64 case isn't that interesting, since you're changing one form of "i64" to another form of "i64". I'm much more puzzled about how this is not blowing up on the cases that do use 32 bit pointers (e.g. all the DS* patterns)

llvm/lib/Target/AMDGPU/FLATInstructions.td
9–10	I would believe these are NFC
11	There's no way this is actually correct
llvm/lib/Target/AMDGPU/SIInstrInfo.td
1320–1322	These are also i32, I don't see how this is working

Those examples may not be what you intend them to be, but I can promise you that if you go read AMDGPUGenDAGISel.inc yourself you will find that every single one of them is currently matching an iPTR. For example:

// Src: (ld:{ *:[i32] } (ScratchOffset:{ *:[iPTR] } VGPR_32:{ *:[i32] }:$vaddr, i16:{ *:[i16] }:$offset))<<P:Predicate_unindexedload>><<P:Predicate_extload>><<P:Predicate_extloadi8_private>> - Complexity = 22
// Dst: (SCRATCH_LOAD_UBYTE:{ *:[i32] } ?:{ *:[i32] }:$vaddr, ?:{ *:[i16] }:$offset)

is what I see *without* any of my patches, so I'm just preserving that. Things blow up with failures to select, or poor codegen that regresses tests, when these instead infer i32. If you want to see for yourself, apply D109035, note that the TableGen output changes to honour the existing intended types and that many AMDGPU tests fail.

Ping regarding this explanation?

Ping?

Ping?

This is more of another datapoint that nothing with iPTR makes any sense; I don't think this really explains it.

This revision is now accepted and ready to land. Nov 1 2021, 2:33 PM

Oh I know, I'm cursed with having to bodge iPTR downstream even more than normal because we're SelectionDAG-based (and I don't see RISC-V switching to GISel any time soon)...

Thanks for the review

This revision was landed with ongoing or failed builds. Dec 2 2021, 11:05 PM

Closed by commit rG3ee56eed2fc2: [AMDGPU][NFC] Alter ComplexPattern types to be consistent with their uses (authored by jrtc27).

This revision was automatically updated to reflect the committed changes.

jrtc27 added a commit: rG3ee56eed2fc2: [AMDGPU][NFC] Alter ComplexPattern types to be consistent with their uses.

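Revision Contents

Path	Size
llvm/lib/Target/AMDGPU/BUFInstructions.td	8 lines
llvm/lib/Target/AMDGPU/FLATInstructions.td	10 lines
llvm/lib/Target/AMDGPU/SIInstrInfo.td	8 lines
llvm/lib/Target/AMDGPU/SMInstructions.td	10 lines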

Diff 369819

llvm/lib/Target/AMDGPU/BUFInstructions.td

 //===-- BUFInstructions.td - Buffer Instruction Definitions ---------------===//
 //
 // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
 // See https://llvm.org/LICENSE.txt for license information.
 // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
 //
 //===----------------------------------------------------------------------===//

-def MUBUFAddr64 : ComplexPattern<i64, 4, "SelectMUBUFAddr64">;
-def MUBUFOffset : ComplexPattern<i64, 3, "SelectMUBUFOffset">;
+def MUBUFAddr64 : ComplexPattern<iPTR, 4, "SelectMUBUFAddr64">;
+def MUBUFOffset : ComplexPattern<iPTR, 3, "SelectMUBUFOffset">;

-def MUBUFScratchOffen : ComplexPattern<i64, 4, "SelectMUBUFScratchOffen", [], [SDNPWantParent]>;
-def MUBUFScratchOffset : ComplexPattern<i64, 3, "SelectMUBUFScratchOffset", [], [SDNPWantParent], 20>;
+def MUBUFScratchOffen : ComplexPattern<iPTR, 4, "SelectMUBUFScratchOffen", [], [SDNPWantParent]>;
+def MUBUFScratchOffset : ComplexPattern<iPTR, 3, "SelectMUBUFScratchOffset", [], [SDNPWantParent], 20>;

 def BUFAddrKind {
   int Offset = 0;
   int OffEn = 1;
   int IdxEn = 2;
   int BothEn = 3;
   int Addr64 = 4;
 }

llvm/lib/Target/AMDGPU/FLATInstructions.td

 //===-- FLATInstructions.td - FLAT Instruction Definitions ----------------===//
 //
 // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
 // See https://llvm.org/LICENSE.txt for license information.
 // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
 //
 //===----------------------------------------------------------------------===//

-def FlatOffset : ComplexPattern<i64, 2, "SelectFlatOffset", [], [SDNPWantRoot], -10>;
-def GlobalOffset : ComplexPattern<i64, 2, "SelectGlobalOffset", [], [SDNPWantRoot], -10>;
+def FlatOffset : ComplexPattern<iPTR, 2, "SelectFlatOffset", [], [SDNPWantRoot], -10>;
+def GlobalOffset : ComplexPattern<iPTR, 2, "SelectGlobalOffset", [], [SDNPWantRoot], -10>;
    arsenm: I would believe these are NFC
-def ScratchOffset : ComplexPattern<i32, 2, "SelectScratchOffset", [], [SDNPWantRoot], -10>;
+def ScratchOffset : ComplexPattern<iPTR, 2, "SelectScratchOffset", [], [SDNPWantRoot], -10>;
    arsenm: There's no way this is actually correct

-def GlobalSAddr : ComplexPattern<i64, 3, "SelectGlobalSAddr", [], [SDNPWantRoot], -10>;
-def ScratchSAddr : ComplexPattern<i32, 2, "SelectScratchSAddr", [], [SDNPWantRoot], -10>;
+def GlobalSAddr : ComplexPattern<iPTR, 3, "SelectGlobalSAddr", [], [SDNPWantRoot], -10>;
+def ScratchSAddr : ComplexPattern<iPTR, 2, "SelectScratchSAddr", [], [SDNPWantRoot], -10>;

 //===----------------------------------------------------------------------===//
 // FLAT classes
 //===----------------------------------------------------------------------===//

 class FLAT_Pseudo<string opName, dag outs, dag ins,
                   string asmOps, list<dag> pattern=[]> :
   InstSI<outs, ins, "", pattern>,

llvm/lib/Target/AMDGPU/SIInstrInfo.td

 def PackedF16InputMods : PackedFPInputMods<PackedF16InputModsMatchClass>;
 def PackedI16InputMods : PackedIntInputMods<PackedI16InputModsMatchClass>;

 //===----------------------------------------------------------------------===//
 // Complex patterns
 //===----------------------------------------------------------------------===//

-def DS1Addr1Offset : ComplexPattern<i32, 2, "SelectDS1Addr1Offset">;
-def DS64Bit4ByteAligned : ComplexPattern<i32, 3, "SelectDS64Bit4ByteAligned">;
-def DS128Bit8ByteAligned : ComplexPattern<i64, 3, "SelectDS128Bit8ByteAligned">;
+def DS1Addr1Offset : ComplexPattern<iPTR, 2, "SelectDS1Addr1Offset">;
+def DS64Bit4ByteAligned : ComplexPattern<iPTR, 3, "SelectDS64Bit4ByteAligned">;
+def DS128Bit8ByteAligned : ComplexPattern<iPTR, 3, "SelectDS128Bit8ByteAligned">;
    arsenm: I think iPTR is a flawed concept that doesn't work without the address space, so I don't think AMDGPU should ever use it
    arsenm: These are also i32, I don't see how this is working

-def MOVRELOffset : ComplexPattern<i32, 2, "SelectMOVRELOffset">;
+def MOVRELOffset : ComplexPattern<iPTR, 2, "SelectMOVRELOffset">;

 def VOP3Mods0 : ComplexPattern<untyped, 4, "SelectVOP3Mods0">;
 def VOP3Mods : ComplexPattern<untyped, 2, "SelectVOP3Mods">;
 def VOP3NoMods : ComplexPattern<untyped, 1, "SelectVOP3NoMods">;
 // VOP3Mods, but the input source is known to never be NaN.
 def VOP3Mods_nnan : ComplexPattern<fAny, 2, "SelectVOP3Mods_NNaN">;

 def VOP3OMods : ComplexPattern<untyped, 3, "SelectVOP3OMods">;

llvm/lib/Target/AMDGPU/SMInstructions.td

   let GISelPredicateCode = [{
     getAddrModeInfo(MI, MRI, AddrInfo);

     if (hasVgprParts(AddrInfo))
       return false;
     return true;
   }];
 }

-def SMRDImm : ComplexPattern<i64, 2, "SelectSMRDImm">;
-def SMRDImm32 : ComplexPattern<i64, 2, "SelectSMRDImm32">;
-def SMRDSgpr : ComplexPattern<i64, 2, "SelectSMRDSgpr">;
-def SMRDBufferImm : ComplexPattern<i32, 1, "SelectSMRDBufferImm">;
-def SMRDBufferImm32 : ComplexPattern<i32, 1, "SelectSMRDBufferImm32">;
+def SMRDImm : ComplexPattern<iPTR, 2, "SelectSMRDImm">;
+def SMRDImm32 : ComplexPattern<iPTR, 2, "SelectSMRDImm32">;
+def SMRDSgpr : ComplexPattern<iPTR, 2, "SelectSMRDSgpr">;
+def SMRDBufferImm : ComplexPattern<iPTR, 1, "SelectSMRDBufferImm">;
+def SMRDBufferImm32 : ComplexPattern<iPTR, 1, "SelectSMRDBufferImm32">;

 multiclass SMRD_Pattern <string Instr, ValueType vt> {

   // 1. IMM offset
   def : GCNPat <
     (smrd_load (SMRDImm i64:$sbase, i32:$offset)),
     (vt (!cast<SM_Pseudo>(Instr#"_IMM") $sbase, $offset, 0))
   >;