This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
lib/Conversion/StandardToSPIRV/
-
Conversion/
-
StandardToSPIRV/
31/31
ConvertStandardToSPIRV.cpp
-
test/Conversion/StandardToSPIRV/
-
Conversion/
-
StandardToSPIRV/
7/7
std-ops-to-spirv.mlir

Differential D78974

[mlir][StandardToSPIRV] Emulate bitwidths not supported for load op.
ClosedPublic

Authored by hanchung on Apr 27 2020, 5:47 PM.

Download Raw Diff

Details

Reviewers

mravishankar
antiagainst
denis13

Commits

rG6601b65aedd0: [mlir][StandardToSPIRV] Emulate bitwidths not supported for load op.

Summary

The current implementation in SPIRVTypeConverter just unconditionally turns
everything into 32-bit if it doesn't meet the requirements of extensions or
capabilities. In this case, we can load a 32-bit value and then do bit
extraction to get the value.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

hanchung created this revision.Apr 27 2020, 5:47 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 27 2020, 5:47 PM

Herald added subscribers: llvm-commits, Kayjukh, frgossen and 12 others. · View Herald Transcript

I found that there is a bug in the patch, please wait me to fix it before review, thanks!

Harbormaster completed remote builds in B54906: Diff 260510.Apr 27 2020, 6:53 PM

Fix indexing issues.

Harbormaster completed remote builds in B54915: Diff 260530.Apr 27 2020, 7:57 PM

mravishankar requested changes to this revision.Apr 28 2020, 2:15 PM

mravishankar added inline comments.

mlir/lib/Conversion/StandardToSPIRV/ConvertStandardToSPIRV.cpp
103	Couple of things here This assumes bits < 32. Probably need to assert that as well. It would be nice to actually not specialize this to 32-bits. You could take the target integer type as an argument and the same logic should more or less hold.
123	This too could be generalized to handle any target integer width.
669	This will assert if this is not an integer. So it might be better to have a different pattern for load stores when the memref is integer type. So one pattern will implement this logic for integer type load/stores. Another pattern will be generic that will be type agnostic (and will return failure for integer types to not intersect with the other pattern)
683	Could we add a smarter logic here. We can try to find the "next highest power of 2" that is legal and use that instead.

This revision now requires changes to proceed.Apr 28 2020, 2:15 PM

Isn't this the kind of legalization that can be made on the std dialect itself as a pre-pass before the conversion to SPIRV? That would make all this logic reusable.

Awesome, thanks Hanhan for taking on this! Sorry for a lot of comments; but this is type availability in SPIR-V is quite nuanced. :)

mlir/lib/Conversion/StandardToSPIRV/ConvertStandardToSPIRV.cpp
103	Nit: s/bits/elementBits/
103	Do we need to pass in the op here? I think we just need the location and the last index? That way this function can be clearer that it is just adjusting an index into 32-bit arrays into another index into `bits`-bit arrays.
104	Just use normal OpBuilder?
121	Assert in the function regarding 1-D array?
124	Nit: s/bits/elementBits/?
125	Nit: this can just be normal OpBuilder?
129	assert llvm::isPowerOf2_32(bits)?
131	auto indices = llvm::to_vector<4>(op.indices())?
669	+1 to having separate patterns and reject not-handled cases early. It's okay to just implement integer for now and add others gradually.
670	The type conversion must factor in the storage class, which is carried as the memref memory space. This affects the converted element type. For example, if `StorageBuffer16BitAccess` is available then 16-bit integers in storage buffer class (which right now mapped to memory space `0`) does not need conversion. If we only consider the element type here it can be wrong because as long as `Int16` is not available, we will convert 16-bit integers to 32-bit. So here we should convert the whole memref type and then get the element type.
678	Just directly update `result` instead creating this local variable?
mlir/test/Conversion/StandardToSPIRV/std-ops-to-spirv.mlir
625	This needs to be updated: // Check that access chain indices are properly adjusted if non-32-bit types are emulated via 32-bit types.
635	What about creating separate functions for each type so that we have more focused and easier-to-read tests?
635	We will need tests with `StorageBuffer16BitAcess`/etc. capability.
637	I think we want to check the index calculation in detail for at least one of the case here given it's the crucial part of the adjusting. For others we might be able to just check the op name.

In D78974#2009336, @mehdi_amini wrote:

Isn't this the kind of legalization that can be made on the std dialect itself as a pre-pass before the conversion to SPIRV? That would make all this logic reusable.

Good question! But whether to do a specific type conversion is determined by the SPIR-V target environment and it can be quite nuanced. For example, if we only have StorageBuffer16Acesss capability then memref<i16, 0> will be fine but memref<i8,0>/memref<i16, 4>/etc. needs to be adjusted. There are many other similar capabilities like UniformAndStorageBuffer16BitAccess, *8BitAccess, {Int|Float}{8|16|64}, etc. This kind of information is only available when converting to SPIR-V and hide behind SPIRVTypeConverter. If this is to be implemented as a pre-pass operating on standard types, it's not quite clear to me how to solve the phase-ordering issue and rope the configuration there.
But regarding code reuse, I guess we might be able to extract some of the index adjusting logic out and change them to templated ones so one can also plug in std and other dialect ops to reuse.

antiagainst requested changes to this revision.Apr 29 2020, 9:21 AM

Address comments.

hanchung marked an inline comment as done.Apr 29 2020, 3:53 PM

hanchung added inline comments.

mlir/lib/Conversion/StandardToSPIRV/ConvertStandardToSPIRV.cpp
129	I make it to handle any target integer width, add `assert(targetBits % elementBits == 0);` in the beginning.
683	This depends on how type converter handles it. In this case, I followed your suggestion to generalize it -- making all 32-bit to convertedBit. Thus, if the typeConverter does try to find the "next highest power of 2", it will still work.
mlir/test/Conversion/StandardToSPIRV/std-ops-to-spirv.mlir
635	That's what I think after sent out for review...I planned to fix it in a later rivision.
635	I added one more test, although the exts and caps are more than I expected. Please let me know if this isn't the case you'd like me to add. Thanks!

In D78974#2010386, @antiagainst wrote:

In D78974#2009336, @mehdi_amini wrote:

Isn't this the kind of legalization that can be made on the std dialect itself as a pre-pass before the conversion to SPIRV? That would make all this logic reusable.

Good question! But whether to do a specific type conversion is determined by the SPIR-V target environment and it can be quite nuanced. For example, if we only have StorageBuffer16Acesss capability then memref<i16, 0> will be fine but memref<i8,0>/memref<i16, 4>/etc. needs to be adjusted. There are many other similar capabilities like UniformAndStorageBuffer16BitAccess, *8BitAccess, {Int|Float}{8|16|64}, etc. This kind of information is only available when converting to SPIR-V and hide behind SPIRVTypeConverter. If this is to be implemented as a pre-pass operating on standard types, it's not quite clear to me how to solve the phase-ordering issue and rope the configuration there.
But regarding code reuse, I guess we might be able to extract some of the index adjusting logic out and change them to templated ones so one can also plug in std and other dialect ops to reuse.

Yes, as Lei said, the information of available integer width is hidden in SPIRVTypeConverter. I think some of code reuse could be like having a method loadAndCast where it would rewrite a std load to loading a elementBits element and applying a shift and an and mask. It'd be great if there are other targets need this, so we can think more about how to reuse it.

Harbormaster failed remote builds in B55217: Diff 261075!Apr 29 2020, 4:14 PM

sync to master

hanchung added a child revision: D79143: [mlir][StandardToSPIRV] Add support for lowering integer casting..Apr 29 2020, 4:30 PM

Fix test.

Harbormaster failed remote builds in B55224: Diff 261088!Apr 29 2020, 5:50 PM

Harbormaster failed remote builds in B55231: Diff 261096!

mravishankar added inline comments.Apr 30 2020, 9:18 AM

mlir/lib/Conversion/StandardToSPIRV/ConvertStandardToSPIRV.cpp
112	idx and elemBitsValue seem to be the same...
117	This comment is hard to parse. Maybe more descriptive will help. Something along the lines Based on the extension/capabilities, certain integer bitwidths (`targetBits`) might not be supported. During conversion if a memref of an unsupported type is used, load/stores to this memref need to be modified to use a supported higher bitwidth (`elementBits`) and extracting the required bits. For a accessing a 1D array (spv.array or spv.rt_array), the last index is modified to load the bits needed. The extraction of the actual bits needed are handled separately.
135	This probably needs some explanation. If the accesschain is created while lowering a zero-rank memref, you have only one element in indices. You are just changing the element type here. This is still valid cause the host side would have to use the same bitwidth to store the scalar (Even though it needs lesser bitwidth).
139	use builder.replaceOpWithNewOp. I am assuming the older accesschain operation is dead and needs to be deleted.

Nice! I just have a few more nits.

mlir/lib/Conversion/StandardToSPIRV/ConvertStandardToSPIRV.cpp
100	What about: Assuming `index` is an index into a 1-D array with each element having `sourceBits`, returns the adjusted `index` by treating the 1-D array as having elements of `targetBits`? This means renaming `lastDim` to `index`.
104	Sorry for nitpicking again, but with `targetBits`, it's better to call `elementBits` as `sourceBits` then. ;) Simlarly for the next function.
104	What about naming it as `adjust1DArrayIndexForBitwidth`? It's nothing special to integer anymore.
122	What about naming it as `adjustAccessChainForBitwidth`?
mlir/test/Conversion/StandardToSPIRV/std-ops-to-spirv.mlir
654	It would be nice to test a 1-D memref here and with a index coming as function parameter.

antiagainst requested changes to this revision.Apr 30 2020, 11:50 AM

This revision now requires changes to proceed.Apr 30 2020, 11:50 AM

Address comments. Also found that scalar is not the case, so we can remove some checks and make the logic simpler.

hanchung added inline comments.Apr 30 2020, 3:13 PM

mlir/lib/Conversion/StandardToSPIRV/ConvertStandardToSPIRV.cpp
100	I think the method is not to adjust the index. Instead, it's calculating the offset of value from loaded value. When accessing the value from target 1-D array, multiple values are loaded in the same time. In this context, the method returns the offset where the `srcIdx` locates in the value. In the example, it's (x % 4) * 8, not (x % 4). I add more comments here, please take a look.
104	Yes, sourceBits is better. thanks!
112	Good catch, thanks!
135	I just found that there is no scalar case here because getElementPtr() always linearize the buffer. If it's a scalar, we still turn it to a 1D array.
139	The method looks more like returning an adjusted ptr to me, so we can focus more on how to build the ptr. I think keeping the replacement logic in the matchAndRewrite method is better. In this use case, we don't want it to be destroyed immediately because we still need some information from it later.

hanchung marked an inline comment as done.Apr 30 2020, 3:13 PM

Awesome, thanks Hanhan!

mlir/lib/Conversion/StandardToSPIRV/ConvertStandardToSPIRV.cpp
100	Oh yeah good point. :)

Harbormaster failed remote builds in B55380: Diff 261374!Apr 30 2020, 4:07 PM

THanks Hanhan!

This revision is now accepted and ready to land.Apr 30 2020, 6:39 PM

Closed by commit rG6601b65aedd0: [mlir][StandardToSPIRV] Emulate bitwidths not supported for load op. (authored by hanchung). · Explain WhyApr 30 2020, 7:33 PM

This revision was automatically updated to reflect the committed changes.

hanchung mentioned this in D79272: [mlir][StandardToSPIRV] Emulate bitwidths not supported for store op..May 1 2020, 3:56 PM

In D78974#2010386, @antiagainst wrote:

In D78974#2009336, @mehdi_amini wrote:

Isn't this the kind of legalization that can be made on the std dialect itself as a pre-pass before the conversion to SPIRV? That would make all this logic reusable.

Good question! But whether to do a specific type conversion is determined by the SPIR-V target environment and it can be quite nuanced. For example, if we only have StorageBuffer16Acesss capability then memref<i16, 0> will be fine but memref<i8,0>/memref<i16, 4>/etc. needs to be adjusted. There are many other similar capabilities like UniformAndStorageBuffer16BitAccess, *8BitAccess, {Int|Float}{8|16|64}, etc. This kind of information is only available when converting to SPIR-V and hide behind SPIRVTypeConverter. If this is to be implemented as a pre-pass operating on standard types, it's not quite clear to me how to solve the phase-ordering issue and rope the configuration there.
But regarding code reuse, I guess we might be able to extract some of the index adjusting logic out and change them to templated ones so one can also plug in std and other dialect ops to reuse.

OK, thanks for the explanation!

hanchung mentioned this in rG5d10613b6edc: [mlir][StandardToSPIRV] Emulate bitwidths not supported for store op..May 4 2020, 3:38 PM

Revision Contents

Path

Size

mlir/

lib/

Conversion/

StandardToSPIRV/

ConvertStandardToSPIRV.cpp

135 lines

test/

Conversion/

StandardToSPIRV/

std-ops-to-spirv.mlir

112 lines

Diff 261421

mlir/lib/Conversion/StandardToSPIRV/ConvertStandardToSPIRV.cpp

Show First 20 Lines • Show All 91 Lines • ▼ Show 20 Lines	LLVM_DEBUG(llvm::dbgs()
<< srcAttr << " illegal: cannot fit into converted type '"		<< srcAttr << " illegal: cannot fit into converted type '"
<< dstType << "'\n");		<< dstType << "'\n");
return FloatAttr();		return FloatAttr();
}		}

return builder.getF32FloatAttr(dstVal.convertToFloat());		return builder.getF32FloatAttr(dstVal.convertToFloat());
}		}

		/// Returns the offset of the value in `targetBits` representation. `srcIdx` is
		antiagainstUnsubmitted Done Reply Inline Actions What about: Assuming `index` is an index into a 1-D array with each element having `sourceBits`, returns the adjusted `index` by treating the 1-D array as having elements of `targetBits`? This means renaming `lastDim` to `index`. antiagainst: What about: Assuming `index` is an index into a 1-D array with each element having…
		hanchungAuthorUnsubmitted Done Reply Inline Actions I think the method is not to adjust the index. Instead, it's calculating the offset of value from loaded value. When accessing the value from target 1-D array, multiple values are loaded in the same time. In this context, the method returns the offset where the `srcIdx` locates in the value. In the example, it's (x % 4) * 8, not (x % 4). I add more comments here, please take a look. hanchung: I think the method is not to adjust the index. Instead, it's calculating the offset of value…
		antiagainstUnsubmitted Done Reply Inline Actions Oh yeah good point. :) antiagainst: Oh yeah good point. :)
		/// an index into a 1-D array with each element having `sourceBits`. When
		/// accessing an element in the array treating as having elements of
		/// `targetBits`, multiple values are loaded in the same time. The method
		mravishankarUnsubmitted Done Reply Inline Actions Couple of things here This assumes bits < 32. Probably need to assert that as well. It would be nice to actually not specialize this to 32-bits. You could take the target integer type as an argument and the same logic should more or less hold. mravishankar: Couple of things here 1) This assumes bits < 32. Probably need to assert that as well. 2) It…
		antiagainstUnsubmitted Done Reply Inline Actions Nit: s/bits/elementBits/ antiagainst: Nit: s/bits/elementBits/
		antiagainstUnsubmitted Done Reply Inline Actions Do we need to pass in the op here? I think we just need the location and the last index? That way this function can be clearer that it is just adjusting an index into 32-bit arrays into another index into `bits`-bit arrays. antiagainst: Do we need to pass in the op here? I think we just need the location and the last index? That…
		/// returns the offset where the `srcIdx` locates in the value. For example, if
		antiagainstUnsubmitted Done Reply Inline Actions Just use normal OpBuilder? antiagainst: Just use normal OpBuilder?
		antiagainstUnsubmitted Done Reply Inline Actions Sorry for nitpicking again, but with `targetBits`, it's better to call `elementBits` as `sourceBits` then. ;) Simlarly for the next function. antiagainst: Sorry for nitpicking again, but with `targetBits`, it's better to call `elementBits` as…
		hanchungAuthorUnsubmitted Done Reply Inline Actions Yes, sourceBits is better. thanks! hanchung: Yes, sourceBits is better. thanks!
		antiagainstUnsubmitted Done Reply Inline Actions What about naming it as `adjust1DArrayIndexForBitwidth`? It's nothing special to integer anymore. antiagainst: What about naming it as `adjust1DArrayIndexForBitwidth`? It's nothing special to integer…
		/// `sourceBits` equals to 8 and `targetBits` equals to 32, the x-th element is
		/// located at (x % 4) * 8. Because there are four elements in one i32, and one
		/// element has 8 bits.
		static Value getOffsetForBitwidth(Location loc, Value srcIdx, int sourceBits,
		int targetBits, OpBuilder &builder) {
		assert(targetBits % sourceBits == 0);
		IntegerType targetType = builder.getIntegerType(targetBits);
		IntegerAttr idxAttr =
		mravishankarUnsubmitted Done Reply Inline Actions idx and elemBitsValue seem to be the same... mravishankar: idx and elemBitsValue seem to be the same...
		hanchungAuthorUnsubmitted Done Reply Inline Actions Good catch, thanks! hanchung: Good catch, thanks!
		builder.getIntegerAttr(targetType, targetBits / sourceBits);
		auto idx = builder.create<spirv::ConstantOp>(loc, targetType, idxAttr);
		IntegerAttr srcBitsAttr = builder.getIntegerAttr(targetType, sourceBits);
		auto srcBitsValue =
		builder.create<spirv::ConstantOp>(loc, targetType, srcBitsAttr);
		mravishankarUnsubmitted Done Reply Inline Actions This comment is hard to parse. Maybe more descriptive will help. Something along the lines Based on the extension/capabilities, certain integer bitwidths (`targetBits`) might not be supported. During conversion if a memref of an unsupported type is used, load/stores to this memref need to be modified to use a supported higher bitwidth (`elementBits`) and extracting the required bits. For a accessing a 1D array (spv.array or spv.rt_array), the last index is modified to load the bits needed. The extraction of the actual bits needed are handled separately. mravishankar: This comment is hard to parse. Maybe more descriptive will help. Something along the lines…
		auto m = builder.create<spirv::SModOp>(loc, srcIdx, idx);
		return builder.create<spirv::IMulOp>(loc, targetType, m, srcBitsValue);
		}

		antiagainstUnsubmitted Done Reply Inline Actions Assert in the function regarding 1-D array? antiagainst: Assert in the function regarding 1-D array?
		/// Returns an adjusted spirv::AccessChainOp. Based on the
		antiagainstUnsubmitted Done Reply Inline Actions What about naming it as `adjustAccessChainForBitwidth`? antiagainst: What about naming it as `adjustAccessChainForBitwidth`?
		/// extension/capabilities, certain integer bitwidths `sourceBits` might not be
		mravishankarUnsubmitted Done Reply Inline Actions This too could be generalized to handle any target integer width. mravishankar: This too could be generalized to handle any target integer width.
		/// supported. During conversion if a memref of an unsupported type is used,
		antiagainstUnsubmitted Done Reply Inline Actions Nit: s/bits/elementBits/? antiagainst: Nit: s/bits/elementBits/?
		/// load/stores to this memref need to be modified to use a supported higher
		antiagainstUnsubmitted Done Reply Inline Actions Nit: this can just be normal OpBuilder? antiagainst: Nit: this can just be normal OpBuilder?
		/// bitwidth `targetBits` and extracting the required bits. For an accessing a
		/// 1D array (spv.array or spv.rt_array), the last index is modified to load the
		/// bits needed. The extraction of the actual bits needed are handled
		/// separately. Note that this only works for a 1-D tensor.
		antiagainstUnsubmitted Done Reply Inline Actions assert llvm::isPowerOf2_32(bits)? antiagainst: assert llvm::isPowerOf2_32(bits)?
		hanchungAuthorUnsubmitted Done Reply Inline Actions I make it to handle any target integer width, add `assert(targetBits % elementBits == 0);` in the beginning. hanchung: I make it to handle any target integer width, add `assert(targetBits % elementBits == 0);` in…
		static Value adjustAccessChainForBitwidth(SPIRVTypeConverter &typeConverter,
		spirv::AccessChainOp op,
		antiagainstUnsubmitted Done Reply Inline Actions auto indices = llvm::to_vector<4>(op.indices())? antiagainst: auto indices = llvm::to_vector<4>(op.indices())?
		int sourceBits, int targetBits,
		OpBuilder &builder) {
		assert(targetBits % sourceBits == 0);
		const auto loc = op.getLoc();
		mravishankarUnsubmitted Done Reply Inline Actions This probably needs some explanation. If the accesschain is created while lowering a zero-rank memref, you have only one element in indices. You are just changing the element type here. This is still valid cause the host side would have to use the same bitwidth to store the scalar (Even though it needs lesser bitwidth). mravishankar: This probably needs some explanation. If the accesschain is created while lowering a zero-rank…
		hanchungAuthorUnsubmitted Done Reply Inline Actions I just found that there is no scalar case here because getElementPtr() always linearize the buffer. If it's a scalar, we still turn it to a 1D array. hanchung: I just found that there is no scalar case here because getElementPtr() always linearize the…
		IntegerType targetType = builder.getIntegerType(targetBits);
		IntegerAttr attr =
		builder.getIntegerAttr(targetType, targetBits / sourceBits);
		auto idx = builder.create<spirv::ConstantOp>(loc, targetType, attr);
		mravishankarUnsubmitted Done Reply Inline Actions use builder.replaceOpWithNewOp. I am assuming the older accesschain operation is dead and needs to be deleted. mravishankar: use builder.replaceOpWithNewOp. I am assuming the older accesschain operation is dead and needs…
		hanchungAuthorUnsubmitted Done Reply Inline Actions The method looks more like returning an adjusted ptr to me, so we can focus more on how to build the ptr. I think keeping the replacement logic in the matchAndRewrite method is better. In this use case, we don't want it to be destroyed immediately because we still need some information from it later. hanchung: The method looks more like returning an adjusted ptr to me, so we can focus more on how to…
		auto lastDim = op.getOperation()->getOperand(op.getNumOperands() - 1);
		auto indices = llvm::to_vector<4>(op.indices());
		// There are two elements if this is a 1-D tensor.
		assert(indices.size() == 2);
		indices.back() = builder.create<spirv::SDivOp>(loc, lastDim, idx);
		Type t = typeConverter.convertType(op.component_ptr().getType());
		return builder.create<spirv::AccessChainOp>(loc, t, op.base_ptr(), indices);
		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Operation conversion		// Operation conversion
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

// Note that DRR cannot be used for the patterns in this file: we may need to		// Note that DRR cannot be used for the patterns in this file: we may need to
// convert type along the way, which requires ConversionPattern. DRR generates		// convert type along the way, which requires ConversionPattern. DRR generates
// normal RewritePattern.		// normal RewritePattern.

▲ Show 20 Lines • Show All 92 Lines • ▼ Show 20 Lines	public:
using SPIRVOpLowering<CmpIOp>::SPIRVOpLowering;		using SPIRVOpLowering<CmpIOp>::SPIRVOpLowering;

LogicalResult		LogicalResult
matchAndRewrite(CmpIOp cmpIOp, ArrayRef<Value> operands,		matchAndRewrite(CmpIOp cmpIOp, ArrayRef<Value> operands,
ConversionPatternRewriter &rewriter) const override;		ConversionPatternRewriter &rewriter) const override;
};		};

/// Converts std.load to spv.Load.		/// Converts std.load to spv.Load.
		class IntLoadOpPattern final : public SPIRVOpLowering<LoadOp> {
		public:
		using SPIRVOpLowering<LoadOp>::SPIRVOpLowering;

		LogicalResult
		matchAndRewrite(LoadOp loadOp, ArrayRef<Value> operands,
		ConversionPatternRewriter &rewriter) const override;
		};

		/// Converts std.load to spv.Load.
class LoadOpPattern final : public SPIRVOpLowering<LoadOp> {		class LoadOpPattern final : public SPIRVOpLowering<LoadOp> {
public:		public:
using SPIRVOpLowering<LoadOp>::SPIRVOpLowering;		using SPIRVOpLowering<LoadOp>::SPIRVOpLowering;

LogicalResult		LogicalResult
matchAndRewrite(LoadOp loadOp, ArrayRef<Value> operands,		matchAndRewrite(LoadOp loadOp, ArrayRef<Value> operands,
ConversionPatternRewriter &rewriter) const override;		ConversionPatternRewriter &rewriter) const override;
};		};
▲ Show 20 Lines • Show All 308 Lines • ▼ Show 20 Lines	#undef DISPATCH
return failure();		return failure();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// LoadOp		// LoadOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

LogicalResult		LogicalResult
		IntLoadOpPattern::matchAndRewrite(LoadOp loadOp, ArrayRef<Value> operands,
		ConversionPatternRewriter &rewriter) const {
		LoadOpOperandAdaptor loadOperands(operands);
		auto loc = loadOp.getLoc();
		auto memrefType = loadOp.memref().getType().cast<MemRefType>();
		if (!memrefType.getElementType().isSignlessInteger())
		return failure();
		spirv::AccessChainOp accessChainOp =
		spirv::getElementPtr(typeConverter, memrefType, loadOperands.memref(),
		loadOperands.indices(), loc, rewriter);

		int srcBits = memrefType.getElementType().getIntOrFloatBitWidth();
		auto dstType = typeConverter.convertType(memrefType)
		.cast<spirv::PointerType>()
		.getPointeeType()
		.cast<spirv::StructType>()
		.getElementType(0)
		.cast<spirv::ArrayType>()
		.getElementType();
		int dstBits = dstType.getIntOrFloatBitWidth();
		assert(dstBits % srcBits == 0);

		// If the rewrited load op has the same bit width, use the loading value
		// directly.
		if (srcBits == dstBits) {
		rewriter.replaceOpWithNewOp<spirv::LoadOp>(loadOp,
		accessChainOp.getResult());
		return success();
		}

		// Assume that getElementPtr() works linearizely. If it's a scalar, the method
		// still returns a linearized accessing. If the accessing is not linearized,
		// there will be offset issues.
		assert(accessChainOp.indices().size() == 2);
		Value adjustedPtr = adjustAccessChainForBitwidth(typeConverter, accessChainOp,
		srcBits, dstBits, rewriter);
		Value spvLoadOp = rewriter.create<spirv::LoadOp>(
		loc, dstType, adjustedPtr,
		loadOp.getAttrOfType<IntegerAttr>(
		spirv::attributeName<spirv::MemoryAccess>()),
		loadOp.getAttrOfType<IntegerAttr>("alignment"));

		// Shift the bits to the rightmost.
		// ____XXXX________ -> ____________XXXX
		Value lastDim = accessChainOp.getOperation()->getOperand(
		accessChainOp.getNumOperands() - 1);
		Value offset = getOffsetForBitwidth(loc, lastDim, srcBits, dstBits, rewriter);
		Value result = rewriter.create<spirv::ShiftRightArithmeticOp>(
		loc, spvLoadOp.getType(), spvLoadOp, offset);

		// Apply the mask to extract corresponding bits.
		Value mask = rewriter.create<spirv::ConstantOp>(
		loc, dstType, rewriter.getIntegerAttr(dstType, (1 << srcBits) - 1));
		result = rewriter.create<spirv::BitwiseAndOp>(loc, dstType, result, mask);
		rewriter.replaceOp(loadOp, result);

		assert(accessChainOp.use_empty());
		rewriter.eraseOp(accessChainOp);

		return success();
		}

		LogicalResult
LoadOpPattern::matchAndRewrite(LoadOp loadOp, ArrayRef<Value> operands,		LoadOpPattern::matchAndRewrite(LoadOp loadOp, ArrayRef<Value> operands,
ConversionPatternRewriter &rewriter) const {		ConversionPatternRewriter &rewriter) const {
LoadOpOperandAdaptor loadOperands(operands);		LoadOpOperandAdaptor loadOperands(operands);
auto loadPtr = spirv::getElementPtr(		auto memrefType = loadOp.memref().getType().cast<MemRefType>();
typeConverter, loadOp.memref().getType().cast<MemRefType>(),		if (memrefType.getElementType().isSignlessInteger())
loadOperands.memref(), loadOperands.indices(), loadOp.getLoc(), rewriter);		return failure();
		auto loadPtr =
		spirv::getElementPtr(typeConverter, memrefType, loadOperands.memref(),
		loadOperands.indices(), loadOp.getLoc(), rewriter);
rewriter.replaceOpWithNewOp<spirv::LoadOp>(loadOp, loadPtr);		rewriter.replaceOpWithNewOp<spirv::LoadOp>(loadOp, loadPtr);
return success();		return success();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ReturnOp		// ReturnOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		mravishankarUnsubmitted Done Reply Inline Actions This will assert if this is not an integer. So it might be better to have a different pattern for load stores when the memref is integer type. So one pattern will implement this logic for integer type load/stores. Another pattern will be generic that will be type agnostic (and will return failure for integer types to not intersect with the other pattern) mravishankar: This will assert if this is not an integer. So it might be better to have a different pattern…
		antiagainstUnsubmitted Done Reply Inline Actions +1 to having separate patterns and reject not-handled cases early. It's okay to just implement integer for now and add others gradually. antiagainst: +1 to having separate patterns and reject not-handled cases early. It's okay to just implement…

		antiagainstUnsubmitted Done Reply Inline Actions The type conversion must factor in the storage class, which is carried as the memref memory space. This affects the converted element type. For example, if `StorageBuffer16BitAccess` is available then 16-bit integers in storage buffer class (which right now mapped to memory space `0`) does not need conversion. If we only consider the element type here it can be wrong because as long as `Int16` is not available, we will convert 16-bit integers to 32-bit. So here we should convert the whole memref type and then get the element type. antiagainst: The type conversion must factor in the storage class, which is carried as the memref memory…
LogicalResult		LogicalResult
ReturnOpPattern::matchAndRewrite(ReturnOp returnOp, ArrayRef<Value> operands,		ReturnOpPattern::matchAndRewrite(ReturnOp returnOp, ArrayRef<Value> operands,
ConversionPatternRewriter &rewriter) const {		ConversionPatternRewriter &rewriter) const {
if (returnOp.getNumOperands()) {		if (returnOp.getNumOperands()) {
return failure();		return failure();
}		}
rewriter.replaceOpWithNewOp<spirv::ReturnOp>(returnOp);		rewriter.replaceOpWithNewOp<spirv::ReturnOp>(returnOp);
return success();		return success();
		antiagainstUnsubmitted Done Reply Inline Actions Just directly update `result` instead creating this local variable? antiagainst: Just directly update `result` instead creating this local variable?
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// SelectOp		// SelectOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		mravishankarUnsubmitted Done Reply Inline Actions Could we add a smarter logic here. We can try to find the "next highest power of 2" that is legal and use that instead. mravishankar: Could we add a smarter logic here. We can try to find the "next highest power of 2" that is…
		hanchungAuthorUnsubmitted Done Reply Inline Actions This depends on how type converter handles it. In this case, I followed your suggestion to generalize it -- making all 32-bit to convertedBit. Thus, if the typeConverter does try to find the "next highest power of 2", it will still work. hanchung: This depends on how type converter handles it. In this case, I followed your suggestion to…

LogicalResult		LogicalResult
SelectOpPattern::matchAndRewrite(SelectOp op, ArrayRef<Value> operands,		SelectOpPattern::matchAndRewrite(SelectOp op, ArrayRef<Value> operands,
ConversionPatternRewriter &rewriter) const {		ConversionPatternRewriter &rewriter) const {
SelectOpOperandAdaptor selectOperands(operands);		SelectOpOperandAdaptor selectOperands(operands);
rewriter.replaceOpWithNewOp<spirv::SelectOp>(op, selectOperands.condition(),		rewriter.replaceOpWithNewOp<spirv::SelectOp>(op, selectOperands.condition(),
selectOperands.true_value(),		selectOperands.true_value(),
selectOperands.false_value());		selectOperands.false_value());
▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines	patterns.insert<
UnaryAndBinaryOpPattern<SubIOp, spirv::ISubOp>,		UnaryAndBinaryOpPattern<SubIOp, spirv::ISubOp>,
UnaryAndBinaryOpPattern<TanhOp, spirv::GLSLTanhOp>,		UnaryAndBinaryOpPattern<TanhOp, spirv::GLSLTanhOp>,
UnaryAndBinaryOpPattern<UnsignedDivIOp, spirv::UDivOp>,		UnaryAndBinaryOpPattern<UnsignedDivIOp, spirv::UDivOp>,
UnaryAndBinaryOpPattern<UnsignedRemIOp, spirv::UModOp>,		UnaryAndBinaryOpPattern<UnsignedRemIOp, spirv::UModOp>,
UnaryAndBinaryOpPattern<UnsignedShiftRightOp, spirv::ShiftRightLogicalOp>,		UnaryAndBinaryOpPattern<UnsignedShiftRightOp, spirv::ShiftRightLogicalOp>,
BitwiseOpPattern<AndOp, spirv::LogicalAndOp, spirv::BitwiseAndOp>,		BitwiseOpPattern<AndOp, spirv::LogicalAndOp, spirv::BitwiseAndOp>,
BitwiseOpPattern<OrOp, spirv::LogicalOrOp, spirv::BitwiseOrOp>,		BitwiseOpPattern<OrOp, spirv::LogicalOrOp, spirv::BitwiseOrOp>,
BoolCmpIOpPattern, ConstantCompositeOpPattern, ConstantScalarOpPattern,		BoolCmpIOpPattern, ConstantCompositeOpPattern, ConstantScalarOpPattern,
CmpFOpPattern, CmpIOpPattern, LoadOpPattern, ReturnOpPattern,		CmpFOpPattern, CmpIOpPattern, IntLoadOpPattern, LoadOpPattern,
SelectOpPattern, StoreOpPattern,		ReturnOpPattern, SelectOpPattern, StoreOpPattern,
TypeCastingOpPattern<SIToFPOp, spirv::ConvertSToFOp>,		TypeCastingOpPattern<SIToFPOp, spirv::ConvertSToFOp>,
TypeCastingOpPattern<FPExtOp, spirv::FConvertOp>,		TypeCastingOpPattern<FPExtOp, spirv::FConvertOp>,
TypeCastingOpPattern<FPTruncOp, spirv::FConvertOp>, XOrOpPattern>(		TypeCastingOpPattern<FPTruncOp, spirv::FConvertOp>, XOrOpPattern>(
context, typeConverter);		context, typeConverter);
}		}
} // namespace mlir		} // namespace mlir

mlir/test/Conversion/StandardToSPIRV/std-ops-to-spirv.mlir

Show First 20 Lines • Show All 613 Lines • ▼ Show 20 Lines	func @load_store_zero_rank_int(%arg0: memref<i32>, %arg1: memref<i32>) {
// CHECK-SAME: [[ZERO2]], [[ZERO2]]		// CHECK-SAME: [[ZERO2]], [[ZERO2]]
// CHECK-SAME: ] :		// CHECK-SAME: ] :
// CHECK: spv.Store "StorageBuffer" %{{.*}} : i32		// CHECK: spv.Store "StorageBuffer" %{{.*}} : i32
store %0, %arg1[] : memref<i32>		store %0, %arg1[] : memref<i32>
return		return
}		}

} // end module		} // end module

		// -----

		// Check that access chain indices are properly adjusted if non-32-bit types are
		antiagainstUnsubmitted Done Reply Inline Actions This needs to be updated: // Check that access chain indices are properly adjusted if non-32-bit types are emulated via 32-bit types. antiagainst: This needs to be updated: // Check that access chain indices are properly adjusted if non-32…
		// emulated via 32-bit types.
		// TODO: Test i64 type.
		module attributes {
		spv.target_env = #spv.target_env<
		#spv.vce<v1.0, [Shader], [SPV_KHR_storage_buffer_storage_class]>,
		{max_compute_workgroup_invocations = 128 : i32,
		max_compute_workgroup_size = dense<[128, 128, 64]> : vector<3xi32>}>
		} {

		// CHECK-LABEL: @load_i8
		antiagainstUnsubmitted Done Reply Inline Actions What about creating separate functions for each type so that we have more focused and easier-to-read tests? antiagainst: What about creating separate functions for each type so that we have more focused and easier-to…
		hanchungAuthorUnsubmitted Done Reply Inline Actions That's what I think after sent out for review...I planned to fix it in a later rivision. hanchung: That's what I think after sent out for review...I planned to fix it in a later rivision.
		antiagainstUnsubmitted Done Reply Inline Actions We will need tests with `StorageBuffer16BitAcess`/etc. capability. antiagainst: We will need tests with `StorageBuffer16BitAcess`/etc. capability.
		hanchungAuthorUnsubmitted Done Reply Inline Actions I added one more test, although the exts and caps are more than I expected. Please let me know if this isn't the case you'd like me to add. Thanks! hanchung: I added one more test, although the exts and caps are more than I expected. Please let me know…
		func @load_i8(%arg0: memref<i8>) {
		// CHECK: %[[ZERO:.+]] = spv.constant 0 : i32
		antiagainstUnsubmitted Done Reply Inline Actions I think we want to check the index calculation in detail for at least one of the case here given it's the crucial part of the adjusting. For others we might be able to just check the op name. antiagainst: I think we want to check the index calculation in detail for at least one of the case here…
		// CHECK: %[[FOUR1:.+]] = spv.constant 4 : i32
		// CHECK: %[[QUOTIENT:.+]] = spv.SDiv %[[ZERO]], %[[FOUR1]] : i32
		// CHECK: %[[PTR:.+]] = spv.AccessChain %{{.+}}[%[[ZERO]], %[[QUOTIENT]]]
		// CHECK: %[[LOAD:.+]] = spv.Load "StorageBuffer" %[[PTR]]
		// CHECK: %[[FOUR2:.+]] = spv.constant 4 : i32
		// CHECK: %[[EIGHT:.+]] = spv.constant 8 : i32
		// CHECK: %[[IDX:.+]] = spv.SMod %[[ZERO]], %[[FOUR2]] : i32
		// CHECK: %[[BITS:.+]] = spv.IMul %[[IDX]], %[[EIGHT]] : i32
		// CHECK: %[[VALUE:.+]] = spv.ShiftRightArithmetic %[[LOAD]], %[[BITS]] : i32, i32
		// CHECK: %[[MASK:.+]] = spv.constant 255 : i32
		// CHECK: spv.BitwiseAnd %[[VALUE]], %[[MASK]] : i32
		%0 = load %arg0[] : memref<i8>
		return
		}

		// CHECK-LABEL: @load_i16
		// CHECK: (%[[ARG0:.+]]: {{.*}}, %[[ARG1:.+]]: i32)
		antiagainstUnsubmitted Done Reply Inline Actions It would be nice to test a 1-D memref here and with a index coming as function parameter. antiagainst: It would be nice to test a 1-D memref here and with a index coming as function parameter.
		func @load_i16(%arg0: memref<10xi16>, %index : index) {
		// CHECK: %[[ONE:.+]] = spv.constant 1 : i32
		// CHECK: %[[FLAT_IDX:.+]] = spv.IMul %[[ONE]], %[[ARG1]] : i32
		// CHECK: %[[ZERO:.+]] = spv.constant 0 : i32
		// CHECK: %[[TWO1:.+]] = spv.constant 2 : i32
		// CHECK: %[[QUOTIENT:.+]] = spv.SDiv %[[FLAT_IDX]], %[[TWO1]] : i32
		// CHECK: %[[PTR:.+]] = spv.AccessChain %{{.+}}[%[[ZERO]], %[[QUOTIENT]]]
		// CHECK: %[[LOAD:.+]] = spv.Load "StorageBuffer" %[[PTR]]
		// CHECK: %[[TWO2:.+]] = spv.constant 2 : i32
		// CHECK: %[[SIXTEEN:.+]] = spv.constant 16 : i32
		// CHECK: %[[IDX:.+]] = spv.SMod %[[FLAT_IDX]], %[[TWO2]] : i32
		// CHECK: %[[BITS:.+]] = spv.IMul %[[IDX]], %[[SIXTEEN]] : i32
		// CHECK: %[[VALUE:.+]] = spv.ShiftRightArithmetic %[[LOAD]], %[[BITS]] : i32, i32
		// CHECK: %[[MASK:.+]] = spv.constant 65535 : i32
		// CHECK: spv.BitwiseAnd %[[VALUE]], %[[MASK]] : i32
		%0 = load %arg0[%index] : memref<10xi16>
		return
		}

		// CHECK-LABEL: @load_i32
		func @load_i32(%arg0: memref<i32>) {
		// CHECK-NOT: spv.SDiv
		// CHECK: spv.Load
		// CHECK-NOT: spv.ShiftRightArithmetic
		%0 = load %arg0[] : memref<i32>
		return
		}

		// CHECK-LABEL: @load_f32
		func @load_f32(%arg0: memref<f32>) {
		// CHECK-NOT: spv.SDiv
		// CHECK: spv.Load
		// CHECK-NOT: spv.ShiftRightArithmetic
		%0 = load %arg0[] : memref<f32>
		return
		}

		} // end module

		// -----

		// Check that access chain indices are properly adjusted if non-16/32-bit types
		// are emulated via 32-bit types.
		module attributes {
		spv.target_env = #spv.target_env<
		#spv.vce<v1.0, [Int16, StorageBuffer16BitAccess, Shader],
		[SPV_KHR_storage_buffer_storage_class, SPV_KHR_16bit_storage]>,
		{max_compute_workgroup_invocations = 128 : i32,
		max_compute_workgroup_size = dense<[128, 128, 64]> : vector<3xi32>}>
		} {

		// CHECK-LABEL: @load_i8
		func @load_i8(%arg0: memref<i8>) {
		// CHECK: %[[ZERO:.+]] = spv.constant 0 : i32
		// CHECK: %[[FOUR1:.+]] = spv.constant 4 : i32
		// CHECK: %[[QUOTIENT:.+]] = spv.SDiv %[[ZERO]], %[[FOUR1]] : i32
		// CHECK: %[[PTR:.+]] = spv.AccessChain %{{.+}}[%[[ZERO]], %[[QUOTIENT]]]
		// CHECK: %[[LOAD:.+]] = spv.Load "StorageBuffer" %[[PTR]]
		// CHECK: %[[FOUR2:.+]] = spv.constant 4 : i32
		// CHECK: %[[EIGHT:.+]] = spv.constant 8 : i32
		// CHECK: %[[IDX:.+]] = spv.SMod %[[ZERO]], %[[FOUR2]] : i32
		// CHECK: %[[BITS:.+]] = spv.IMul %[[IDX]], %[[EIGHT]] : i32
		// CHECK: %[[VALUE:.+]] = spv.ShiftRightArithmetic %[[LOAD]], %[[BITS]] : i32, i32
		// CHECK: %[[MASK:.+]] = spv.constant 255 : i32
		// CHECK: spv.BitwiseAnd %[[VALUE]], %[[MASK]] : i32
		%0 = load %arg0[] : memref<i8>
		return
		}

		// CHECK-LABEL: @load_i16
		func @load_i16(%arg0: memref<i16>) {
		// CHECK-NOT: spv.SDiv
		// CHECK: spv.Load
		// CHECK-NOT: spv.ShiftRightArithmetic
		%0 = load %arg0[] : memref<i16>
		return
		}

		} // end module