This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
test/TableGen/
-
TableGen/
-
address-space-patfrags.td
-
utils/TableGen/
-
TableGen/
-
GlobalISelEmitter.cpp

Differential D64845

[GlobalISel] Check LLT size matches memory size for non-truncating stores.
ClosedPublic

Authored by aemerson on Jul 16 2019, 6:52 PM.

Download Raw Diff

Details

Reviewers

arsenm
dsanders

Commits

rG52e6d52f10dc: [GlobalISel] Check LLT size matches memory size for non-truncating stores.
rL367737: [GlobalISel] Check LLT size matches memory size for non-truncating stores.

Summary

This was causing a bug where non-truncating stores would be selected instead of truncating ones.

Diff Detail

Repository: rL LLVM

Event Timeline

aemerson created this revision.Jul 16 2019, 6:52 PM

Herald added a project: Restricted Project. · View Herald TranscriptJul 16 2019, 6:52 PM

Herald added subscribers: llvm-commits, Petar.Avramovic, rovka and 3 others. · View Herald Transcript

arsenm added inline comments.Jul 16 2019, 7:03 PM

llvm/test/CodeGen/AMDGPU/GlobalISel/inst-select-store-flat.mir
106 ↗	(On Diff #210230)	This is breaking this, producing a non-truncating store instead of the correct trunc store
llvm/test/CodeGen/AMDGPU/GlobalISel/inst-select-store-private.mir
55 ↗	(On Diff #210230)	This is breaking this, the correct trunc store now fails to select
llvm/utils/TableGen/GlobalISelEmitter.cpp
3314 ↗	(On Diff #210230)	Should also continue? Could make this a select on the match type to the one addPredicate call

aemerson marked an inline comment as done.Jul 16 2019, 7:12 PM

aemerson added inline comments.

llvm/utils/TableGen/GlobalISelEmitter.cpp
3314 ↗	(On Diff #210230)	Yeah, just seen this doesn't work. I'm not familiar with this code, but how do non-truncstore predicates end up on trunc stores such that it breaks?

arsenm added inline comments.Jul 16 2019, 7:22 PM

llvm/utils/TableGen/GlobalISelEmitter.cpp
3314 ↗	(On Diff #210230)	I don't exactly understand what the expected relationship between predicate bits set on a PatFrag and the PatFrags which used it is. It's possible something is wrong with the AMDGPU PatFrags. This system is pretty confusing, because each property is added in a different layer of PatFrags defined in TargetSelectionDAG.td. The AMDGPU PatFrags reproduce the same hierarchy, with the additional address space predicates. There is also the extra IsTruncStore bit which needs to be manually set on the base truncstore PatFrag. For some reason the derivative PatFrags do not have it set, as it should be implied. However, they still need to set IsStore, although I would expect these to both behave the same way.

arsenm added inline comments.Jul 16 2019, 7:26 PM

llvm/utils/TableGen/GlobalISelEmitter.cpp
3314 ↗	(On Diff #210230)	I'm also not sure the way this loop is structured makes sense. It seems to assume too much about what different predicates can be combined. The atomic predicates also don't work correctly

aemerson marked an inline comment as done.Jul 17 2019, 11:24 AM

aemerson added inline comments.

llvm/utils/TableGen/GlobalISelEmitter.cpp
3314 ↗	(On Diff #210230)	@dsanders any idea what to do?

dsanders added inline comments.Jul 18 2019, 5:48 PM

llvm/utils/TableGen/GlobalISelEmitter.cpp
3314 ↗	(On Diff #210230)	That `else` is suspicious. isTruncStore() means that the predicate needs to be added but !isTruncStore() can mean either no predicate is needed or a non-trunc store predicate is needed. I believe it needs to be: if (Predicate.isTruncStore()) add relevant predicate for trunc store if (Predicate.isNonTruncStore()) add relevant predicate for non-trunc store

dsanders added inline comments.Jul 18 2019, 6:21 PM

llvm/utils/TableGen/GlobalISelEmitter.cpp
3314 ↗	(On Diff #210230)	don't exactly understand what the expected relationship between predicate bits set on a PatFrag and the PatFrags which used it is. It's possible something is wrong with the AMDGPU PatFrags. This system is pretty confusing, because each property is added in a different layer of PatFrags defined in TargetSelectionDAG.td. The AMDGPU PatFrags reproduce the same hierarchy, with the additional address space predicates. The input we start with is pretty weird too and it's mostly a straight import of that. Suppose we start with the pattern: (truncstorei8 node:$x) when the PatFrag is expanded, we get: (truncstore node:$x)<<code-to-check-memvt==i8>> where the `<<some C++ code>>` is tablegen's way of representing predicates (this isn't quite true, it uses a short-hand name instead of the C++ when it's printing but the data in the pattern is the C++ code). Then after further expansion we get: (unindexedstore node:$x)<<code-to-check-memvt==i8>><<code-to-check-for-truncating>> (st node:$x)<<code-to-check-memvt==i8>><<code-to-check-for-truncating>><<code-to-check-for-unindexed>> That last line is how it looks on input to the GlobalISelEmitter before runOnPattern(). The first thing we needed to do was abstract out the C++ as it needed to be different for DAGISel vs GISel. To do that, each predicate was given a field in PatFrag. For these three predicates it was: ValueType MemoryVT = ?; bit IsTruncStore = ?; bit IsUnindexed = ?; Only one of these is set for each PatFrag as each one directly corresponds to the original C++ code. For example: IsTruncStore = 0 emits this code for DAGISel: !cast<StoreSDNode>(N)->isTruncatingStore(); I'm also not sure the way this loop is structured makes sense. It seems to assume too much about what different predicates can be combined. Each predicate can only be one C++ fragment. If you need multiple fragments (which happens when you have PatFrags inside PatFrag expansions) you have multiple Predicates, each with a different field set corresponding to a different fragment. The atomic predicates also don't work correctly Could you elaborate on this?

arsenm added inline comments.Jul 19 2019, 6:48 AM

llvm/utils/TableGen/GlobalISelEmitter.cpp
3314 ↗	(On Diff #210230)	The atomic predicates also don't work correctly Could you elaborate on this? I didn't spend much time looking at it to focus on one broken thing at a time, but the AMDGPU patterns for atomic_load/atomic_store don't work. They should be easy to handle as they ignore the ordering type and just need to preserve the atomicness in the MemOperand. I think the combination of IsStore and IsAtomic wasn't behaving as expected

nhaehnle removed a subscriber: nhaehnle.Jul 22 2019, 2:15 AM

Use isNonTruncStore() instead.

That change LGTM. Do you have a test case for your target?

This revision is now accepted and ready to land.Aug 2 2019, 4:04 PM

In D64845#1613091, @dsanders wrote:

That change LGTM. Do you have a test case for your target?

Thanks, it's only exposed with another patch though so I can't add that test here.

llvm/test/TableGen/address-space-patfrags.td
123 ↗	(On Diff #213137)	Just seen this incorrect comment, will fix in the commit.

Closed by commit rL367737: [GlobalISel] Check LLT size matches memory size for non-truncating stores. (authored by aemerson). · Explain WhyAug 2 2019, 4:32 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

trunk/

test/

TableGen/

address-space-patfrags.td

17 lines

utils/

TableGen/

GlobalISelEmitter.cpp

18 lines

Diff 213141

llvm/trunk/test/TableGen/address-space-patfrags.td

Show All 38 Lines	def inst_b : Instruction {
let InOperandList = (ins GPR32:$src);		let InOperandList = (ins GPR32:$src);
}		}

def inst_c : Instruction {		def inst_c : Instruction {
let OutOperandList = (outs);		let OutOperandList = (outs);
let InOperandList = (ins GPR32:$src0, GPR32:$src1);		let InOperandList = (ins GPR32:$src0, GPR32:$src1);
}		}

		def inst_d : Instruction {
		let OutOperandList = (outs);
		let InOperandList = (ins GPR32:$src0, GPR32:$src1);
		}

// SDAG: case 2: {		// SDAG: case 2: {
// SDAG-NEXT: // Predicate_pat_frag_b		// SDAG-NEXT: // Predicate_pat_frag_b
// SDAG-NEXT: SDNode *N = Node;		// SDAG-NEXT: SDNode *N = Node;
// SDAG-NEXT: (void)N;		// SDAG-NEXT: (void)N;
// SDAG-NEXT: unsigned AddrSpace = cast<MemSDNode>(N)->getAddressSpace();		// SDAG-NEXT: unsigned AddrSpace = cast<MemSDNode>(N)->getAddressSpace();
// SDAG-NEXT: if (AddrSpace != 123 && AddrSpace != 455)		// SDAG-NEXT: if (AddrSpace != 123 && AddrSpace != 455)
// SDAG-NEXT: return false;		// SDAG-NEXT: return false;
// SDAG-NEXT: if (cast<MemSDNode>(N)->getMemoryVT() != MVT::i32) return false;		// SDAG-NEXT: if (cast<MemSDNode>(N)->getMemoryVT() != MVT::i32) return false;
▲ Show 20 Lines • Show All 55 Lines • ▼ Show 20 Lines
// GISEL-NEXT: GIM_CheckAtomicOrdering, /MI/0, /Order/(int64_t)AtomicOrdering::NotAtomic,		// GISEL-NEXT: GIM_CheckAtomicOrdering, /MI/0, /Order/(int64_t)AtomicOrdering::NotAtomic,
// GISEL-NEXT: // MIs[0] src0		// GISEL-NEXT: // MIs[0] src0
// GISEL-NEXT: GIM_CheckType, /MI/0, /Op/0, /Type/GILLT_s32,		// GISEL-NEXT: GIM_CheckType, /MI/0, /Op/0, /Type/GILLT_s32,
def : Pat <		def : Pat <
(truncstore GPR32:$src0, GPR32:$src1),		(truncstore GPR32:$src0, GPR32:$src1),
(inst_c GPR32:$src0, GPR32:$src1)		(inst_c GPR32:$src0, GPR32:$src1)
>;		>;

// Test truncstore with specific MemoryVT		// Test non-truncstore has a size equal to LLT check.
// GISEL: GIM_Try, /On fail goto//Label 3/ {{[0-9]+}}, // Rule ID 3 //		// GISEL: GIM_Try, /On fail goto//Label 3/ {{[0-9]+}}, // Rule ID 3 //
// GISEL-NEXT: GIM_CheckNumOperands, /MI/0, /Expected/2,		// GISEL-NEXT: GIM_CheckNumOperands, /MI/0, /Expected/2,
// GISEL-NEXT: GIM_CheckOpcode, /MI/0, TargetOpcode::G_STORE,		// GISEL-NEXT: GIM_CheckOpcode, /MI/0, TargetOpcode::G_STORE,
		// GISEL-NEXT: GIM_CheckMemorySizeEqualToLLT, /MI/0, /MMO/0, /OpIdx/0,
		def : Pat <
		(store GPR32:$src0, GPR32:$src1),
		(inst_d GPR32:$src0, GPR32:$src1)
		>;

		// Test truncstore with specific MemoryVT
		// GISEL: GIM_Try, /On fail goto//Label 4/ {{[0-9]+}}, // Rule ID 4 //
		// GISEL-NEXT: GIM_CheckNumOperands, /MI/0, /Expected/2,
		// GISEL-NEXT: GIM_CheckOpcode, /MI/0, TargetOpcode::G_STORE,
// GISEL-NEXT: GIM_CheckMemorySizeLessThanLLT, /MI/0, /MMO/0, /OpIdx/0,		// GISEL-NEXT: GIM_CheckMemorySizeLessThanLLT, /MI/0, /MMO/0, /OpIdx/0,
// GISEL-NEXT: GIM_CheckMemoryAddressSpace, /MI/0, /MMO/0, /NumAddrSpace/2, /AddrSpace/123, /AddrSpace/455,		// GISEL-NEXT: GIM_CheckMemoryAddressSpace, /MI/0, /MMO/0, /NumAddrSpace/2, /AddrSpace/123, /AddrSpace/455,
// GISEL-NEXT: GIM_CheckMemorySizeEqualTo, /MI/0, /MMO/0, /Size/2,		// GISEL-NEXT: GIM_CheckMemorySizeEqualTo, /MI/0, /MMO/0, /Size/2,
def : Pat <		def : Pat <
(truncstorei16_addrspace GPR32:$src0, GPR32:$src1),		(truncstorei16_addrspace GPR32:$src0, GPR32:$src1),
(inst_c GPR32:$src0, GPR32:$src1)		(inst_c GPR32:$src0, GPR32:$src1)
>;		>;

llvm/trunk/utils/TableGen/GlobalISelEmitter.cpp

Show First 20 Lines • Show All 3,341 Lines • ▼ Show 20 Lines	if (Predicate.isLoad() && Predicate.isNonExtLoad()) {
continue;		continue;
}		}
if (Predicate.isLoad() && Predicate.isAnyExtLoad()) {		if (Predicate.isLoad() && Predicate.isAnyExtLoad()) {
InsnMatcher.addPredicate<MemoryVsLLTSizePredicateMatcher>(		InsnMatcher.addPredicate<MemoryVsLLTSizePredicateMatcher>(
0, MemoryVsLLTSizePredicateMatcher::LessThan, 0);		0, MemoryVsLLTSizePredicateMatcher::LessThan, 0);
continue;		continue;
}		}

if (Predicate.isStore() && Predicate.isTruncStore()) {		if (Predicate.isStore()) {
		if (Predicate.isTruncStore()) {
// FIXME: If MemoryVT is set, we end up with 2 checks for the MMO size.		// FIXME: If MemoryVT is set, we end up with 2 checks for the MMO size.
InsnMatcher.addPredicate<MemoryVsLLTSizePredicateMatcher>(		InsnMatcher.addPredicate<MemoryVsLLTSizePredicateMatcher>(
0, MemoryVsLLTSizePredicateMatcher::LessThan, 0);		0, MemoryVsLLTSizePredicateMatcher::LessThan, 0);
continue;		continue;
}		}
		if (Predicate.isNonTruncStore()) {
		// We need to check the sizes match here otherwise we could incorrectly
		// match truncating stores with non-truncating ones.
		InsnMatcher.addPredicate<MemoryVsLLTSizePredicateMatcher>(
		0, MemoryVsLLTSizePredicateMatcher::EqualTo, 0);
		}
		}

// No check required. We already did it by swapping the opcode.		// No check required. We already did it by swapping the opcode.
if (!SrcGIEquivOrNull->isValueUnset("IfSignExtend") &&		if (!SrcGIEquivOrNull->isValueUnset("IfSignExtend") &&
Predicate.isSignExtLoad())		Predicate.isSignExtLoad())
continue;		continue;

// No check required. We already did it by swapping the opcode.		// No check required. We already did it by swapping the opcode.
if (!SrcGIEquivOrNull->isValueUnset("IfZeroExtend") &&		if (!SrcGIEquivOrNull->isValueUnset("IfZeroExtend") &&
▲ Show 20 Lines • Show All 1,660 Lines • Show Last 20 Lines