Diff 453185

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td

Show First 20 Lines • Show All 1,754 Lines • ▼ Show 20 Lines	def Vector_MaskedStoreOp :
let hasFolder = 1;		let hasFolder = 1;
let hasVerifier = 1;		let hasVerifier = 1;
}		}

def Vector_GatherOp :		def Vector_GatherOp :
Vector_Op<"gather">,		Vector_Op<"gather">,
Arguments<(ins Arg<AnyShaped, "", [MemRead]>:$base,		Arguments<(ins Arg<AnyShaped, "", [MemRead]>:$base,
Variadic<Index>:$indices,		Variadic<Index>:$indices,
VectorOfRankAndType<[1], [AnyInteger, Index]>:$index_vec,		VectorOf<[AnyInteger, Index]>:$index_vec,
VectorOfRankAndType<[1], [I1]>:$mask,		VectorOf<[I1]>:$mask,
VectorOfRank<[1]>:$pass_thru)>,		AnyVector:$pass_thru)>,
Results<(outs VectorOfRank<[1]>:$result)> {		Results<(outs AnyVector:$result)> {

let summary = [{		let summary = [{
gathers elements from memory or ranked tensor into a vector as defined by an		gathers elements from memory or ranked tensor into a vector as defined by an
index vector and mask		index vector and mask
}];		}];

let description = [{		let description = [{
The gather operation gathers elements from memory or ranked tensor into a		The gather operation gathers elements from memory or ranked tensor into a
1-D vector as defined by a base with indices and an additional 1-D index		n-D vector as defined by a base with indices and an additional n-D index
vector, but only if the corresponding bit is set in a 1-D mask vector.		vector (each index is a 1-D offset on the base), but only if the
Otherwise, the element is taken from a 1-D pass-through vector. Informally		corresponding bit is set in a n-D mask vector. Otherwise, the element is
the semantics are:		taken from a n-D pass-through vector. Informally the semantics are:
```		```
result[0] := mask[0] ? base[index[0]] : pass_thru[0]		result[0] := mask[0] ? base[index[0]] : pass_thru[0]
result[1] := mask[1] ? base[index[1]] : pass_thru[1]		result[1] := mask[1] ? base[index[1]] : pass_thru[1]
etc.		etc.
```		```
The vector dialect leaves out-of-bounds behavior undefined.		The vector dialect leaves out-of-bounds behavior undefined.

The gather operation can be used directly where applicable, or can be used		The gather operation can be used directly where applicable, or can be used
during progressively lowering to bring other memory operations closer to		during progressively lowering to bring other memory operations closer to
hardware ISA support for a gather. The semantics of the operation closely		hardware ISA support for a gather.
		dcaballeUnsubmitted Done Reply Inline Actions I think we should remove this reference to LLVM since the operation is evolving in a different way? dcaballe: I think we should remove this reference to LLVM since the operation is evolving in a different…
correspond to those of the `llvm.masked.gather`
[intrinsic](https://llvm.org/docs/LangRef.html#llvm-masked-gather-intrinsics).

Examples:		Examples:

```mlir		```mlir
%0 = vector.gather %base[%c0][%v], %mask, %pass_thru		%0 = vector.gather %base[%c0][%v], %mask, %pass_thru
: memref<?xf32>, vector<16xi32>, vector<16xi1>, vector<16xf32> into vector<16xf32>		: memref<?xf32>, vector<2x16xi32>, vector<2x16xi1>, vector<2x16xf32> into vector<2x16xf32>

%1 = vector.gather %base[%i, %j][%v], %mask, %pass_thru		%1 = vector.gather %base[%i, %j][%v], %mask, %pass_thru
: memref<16x16xf32>, vector<16xi32>, vector<16xi1>, vector<16xf32> into vector<16xf32>		: memref<16x16xf32>, vector<16xi32>, vector<16xi1>, vector<16xf32> into vector<16xf32>
```		```
}];		}];
let extraClassDeclaration = [{		let extraClassDeclaration = [{
ShapedType getBaseType() {		ShapedType getBaseType() {
return getBase().getType().cast<ShapedType>();		return getBase().getType().cast<ShapedType>();
▲ Show 20 Lines • Show All 903 Lines • Show Last 20 Lines

mlir/lib/Conversion/LLVMCommon/VectorPattern.cpp

Show First 20 Lines • Show All 76 Lines • ▼ Show 20 Lines	void LLVM::detail::nDVectorIterate(const LLVM::detail::NDVectorTypeInfo &info,
}		}
}		}

LogicalResult LLVM::detail::handleMultidimensionalVectors(		LogicalResult LLVM::detail::handleMultidimensionalVectors(
Operation *op, ValueRange operands, LLVMTypeConverter &typeConverter,		Operation *op, ValueRange operands, LLVMTypeConverter &typeConverter,
std::function<Value(Type, ValueRange)> createOperand,		std::function<Value(Type, ValueRange)> createOperand,
ConversionPatternRewriter &rewriter) {		ConversionPatternRewriter &rewriter) {
auto resultNDVectorType = op->getResult(0).getType().cast<VectorType>();		auto resultNDVectorType = op->getResult(0).getType().cast<VectorType>();

SmallVector<Type> operand1DVectorTypes;
for (Value operand : op->getOperands()) {
auto operandNDVectorType = operand.getType().cast<VectorType>();
auto operandTypeInfo =
extractNDVectorTypeInfo(operandNDVectorType, typeConverter);
operand1DVectorTypes.push_back(operandTypeInfo.llvm1DVectorTy);
}
auto resultTypeInfo =		auto resultTypeInfo =
extractNDVectorTypeInfo(resultNDVectorType, typeConverter);		extractNDVectorTypeInfo(resultNDVectorType, typeConverter);
auto result1DVectorTy = resultTypeInfo.llvm1DVectorTy;		auto result1DVectorTy = resultTypeInfo.llvm1DVectorTy;
auto resultNDVectoryTy = resultTypeInfo.llvmNDVectorTy;		auto resultNDVectoryTy = resultTypeInfo.llvmNDVectorTy;
auto loc = op->getLoc();		auto loc = op->getLoc();
Value desc = rewriter.create<LLVM::UndefOp>(loc, resultNDVectoryTy);		Value desc = rewriter.create<LLVM::UndefOp>(loc, resultNDVectoryTy);
nDVectorIterate(resultTypeInfo, rewriter, [&](ArrayRef<int64_t> position) {		nDVectorIterate(resultTypeInfo, rewriter, [&](ArrayRef<int64_t> position) {
// For this unrolled `position` corresponding to the `linearIndex`^th		// For this unrolled `position` corresponding to the `linearIndex`^th
Show All 37 Lines

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

Show First 20 Lines • Show All 85 Lines • ▼ Show 20 Lines	LogicalResult getMemRefAlignment(LLVMTypeConverter &typeConverter,
// TODO: this should use the MLIR data layout when it becomes available and		// TODO: this should use the MLIR data layout when it becomes available and
// stop depending on translation.		// stop depending on translation.
llvm::LLVMContext llvmContext;		llvm::LLVMContext llvmContext;
align = LLVM::TypeToLLVMIRTranslator(llvmContext)		align = LLVM::TypeToLLVMIRTranslator(llvmContext)
.getPreferredAlignment(elementTy, typeConverter.getDataLayout());		.getPreferredAlignment(elementTy, typeConverter.getDataLayout());
return success();		return success();
}		}

// Add an index vector component to a base pointer. This almost always succeeds		// Check if the last stride is non-unit or the memory space is not zero.
// unless the last stride is non-unit or the memory space is not zero.		static LogicalResult isMemRefTypeSupported(MemRefType memRefType) {
		dcaballeUnsubmitted Not Done Reply Inline Actions Great refactoring, thanks! dcaballe: Great refactoring, thanks!
static LogicalResult getIndexedPtrs(ConversionPatternRewriter &rewriter,
Location loc, Value memref, Value base,
Value index, MemRefType memRefType,
VectorType vType, Value &ptrs) {
int64_t offset;		int64_t offset;
SmallVector<int64_t, 4> strides;		SmallVector<int64_t, 4> strides;
auto successStrides = getStridesAndOffset(memRefType, strides, offset);		auto successStrides = getStridesAndOffset(memRefType, strides, offset);
if (failed(successStrides) \|\| strides.back() != 1 \|\|		if (failed(successStrides) \|\| strides.back() != 1 \|\|
memRefType.getMemorySpaceAsInt() != 0)		memRefType.getMemorySpaceAsInt() != 0)
return failure();		return failure();
auto pType = MemRefDescriptor(memref).getElementPtrType();
auto ptrsType = LLVM::getFixedVectorType(pType, vType.getDimSize(0));
ptrs = rewriter.create<LLVM::GEPOp>(loc, ptrsType, base, index);
return success();		return success();
}		}

		// Add an index vector component to a base pointer.
		static Value getIndexedPtrs(ConversionPatternRewriter &rewriter, Location loc,
		MemRefType memRefType, Value llvmMemref, Value base,
		Value index, uint64_t vLen) {
		assert(succeeded(isMemRefTypeSupported(memRefType)) &&
		dcaballeUnsubmitted Done Reply Inline Actions if `isMemRefTypeSupported` is a pre-condition, we should add an assert. dcaballe: if `isMemRefTypeSupported` is a pre-condition, we should add an assert.
		"unsupported memref type");
		auto pType = MemRefDescriptor(llvmMemref).getElementPtrType();
		auto ptrsType = LLVM::getFixedVectorType(pType, vLen);
		return rewriter.create<LLVM::GEPOp>(loc, ptrsType, base, index);
		}

// Casts a strided element pointer to a vector pointer. The vector pointer		// Casts a strided element pointer to a vector pointer. The vector pointer
// will be in the same address space as the incoming memref type.		// will be in the same address space as the incoming memref type.
static Value castDataPtr(ConversionPatternRewriter &rewriter, Location loc,		static Value castDataPtr(ConversionPatternRewriter &rewriter, Location loc,
Value ptr, MemRefType memRefType, Type vt) {		Value ptr, MemRefType memRefType, Type vt) {
auto pType = LLVM::LLVMPointerType::get(vt, memRefType.getMemorySpaceAsInt());		auto pType = LLVM::LLVMPointerType::get(vt, memRefType.getMemorySpaceAsInt());
return rewriter.create<LLVM::BitcastOp>(loc, pType, ptr);		return rewriter.create<LLVM::BitcastOp>(loc, pType, ptr);
}		}

▲ Show 20 Lines • Show All 132 Lines • ▼ Show 20 Lines
class VectorGatherOpConversion		class VectorGatherOpConversion
: public ConvertOpToLLVMPattern<vector::GatherOp> {		: public ConvertOpToLLVMPattern<vector::GatherOp> {
public:		public:
using ConvertOpToLLVMPattern<vector::GatherOp>::ConvertOpToLLVMPattern;		using ConvertOpToLLVMPattern<vector::GatherOp>::ConvertOpToLLVMPattern;

LogicalResult		LogicalResult
matchAndRewrite(vector::GatherOp gather, OpAdaptor adaptor,		matchAndRewrite(vector::GatherOp gather, OpAdaptor adaptor,
ConversionPatternRewriter &rewriter) const override {		ConversionPatternRewriter &rewriter) const override {
auto loc = gather->getLoc();
MemRefType memRefType = gather.getBaseType().dyn_cast<MemRefType>();		MemRefType memRefType = gather.getBaseType().dyn_cast<MemRefType>();
assert(memRefType && "The base should be bufferized");		assert(memRefType && "The base should be bufferized");

		if (failed(isMemRefTypeSupported(memRefType)))
		return failure();

		auto loc = gather->getLoc();

// Resolve alignment.		// Resolve alignment.
unsigned align;		unsigned align;
if (failed(getMemRefAlignment(*getTypeConverter(), memRefType, align)))		if (failed(getMemRefAlignment(*getTypeConverter(), memRefType, align)))
return failure();		return failure();

// Resolve address.
Value ptrs;
VectorType vType = gather.getVectorType();
Value ptr = getStridedElementPtr(loc, memRefType, adaptor.getBase(),		Value ptr = getStridedElementPtr(loc, memRefType, adaptor.getBase(),
adaptor.getIndices(), rewriter);		adaptor.getIndices(), rewriter);
if (failed(getIndexedPtrs(rewriter, loc, adaptor.getBase(), ptr,		Value base = adaptor.getBase();
adaptor.getIndexVec(), memRefType, vType, ptrs)))
return failure();

		auto llvmNDVectorTy = adaptor.getIndexVec().getType();
		// Handle the simple case of 1-D vector.
		if (!llvmNDVectorTy.isa<LLVM::LLVMArrayType>()) {
		auto vType = gather.getVectorType();
		// Resolve address.
		Value ptrs = getIndexedPtrs(rewriter, loc, memRefType, base, ptr,
		adaptor.getIndexVec(),
		/vLen=/vType.getDimSize(0));
// Replace with the gather intrinsic.		// Replace with the gather intrinsic.
rewriter.replaceOpWithNewOp<LLVM::masked_gather>(		rewriter.replaceOpWithNewOp<LLVM::masked_gather>(
gather, typeConverter->convertType(vType), ptrs, adaptor.getMask(),		gather, typeConverter->convertType(vType), ptrs, adaptor.getMask(),
adaptor.getPassThru(), rewriter.getI32IntegerAttr(align));		adaptor.getPassThru(), rewriter.getI32IntegerAttr(align));
return success();		return success();
}		}

		auto callback = [align, memRefType, base, ptr, loc, &rewriter](
		Type llvm1DVectorTy, ValueRange vectorOperands) {
		// Resolve address.
		Value ptrs = getIndexedPtrs(
		rewriter, loc, memRefType, base, ptr, /index=/vectorOperands[0],
		LLVM::getVectorNumElements(llvm1DVectorTy).getFixedValue());
		// Create the gather intrinsic.
		return rewriter.create<LLVM::masked_gather>(
		loc, llvm1DVectorTy, ptrs, /mask=/vectorOperands[1],
		/passThru=/vectorOperands[2], rewriter.getI32IntegerAttr(align));
		};
		ValueRange vectorOperands = {adaptor.getIndexVec(), adaptor.getMask(),
		adaptor.getPassThru()};
		return LLVM::detail::handleMultidimensionalVectors(
		gather, vectorOperands, *getTypeConverter(), callback, rewriter);
		}
};		};

/// Conversion pattern for a vector.scatter.		/// Conversion pattern for a vector.scatter.
class VectorScatterOpConversion		class VectorScatterOpConversion
: public ConvertOpToLLVMPattern<vector::ScatterOp> {		: public ConvertOpToLLVMPattern<vector::ScatterOp> {
public:		public:
using ConvertOpToLLVMPattern<vector::ScatterOp>::ConvertOpToLLVMPattern;		using ConvertOpToLLVMPattern<vector::ScatterOp>::ConvertOpToLLVMPattern;

LogicalResult		LogicalResult
matchAndRewrite(vector::ScatterOp scatter, OpAdaptor adaptor,		matchAndRewrite(vector::ScatterOp scatter, OpAdaptor adaptor,
ConversionPatternRewriter &rewriter) const override {		ConversionPatternRewriter &rewriter) const override {
auto loc = scatter->getLoc();		auto loc = scatter->getLoc();
MemRefType memRefType = scatter.getMemRefType();		MemRefType memRefType = scatter.getMemRefType();

		if (failed(isMemRefTypeSupported(memRefType)))
		return failure();

// Resolve alignment.		// Resolve alignment.
unsigned align;		unsigned align;
if (failed(getMemRefAlignment(*getTypeConverter(), memRefType, align)))		if (failed(getMemRefAlignment(*getTypeConverter(), memRefType, align)))
return failure();		return failure();

// Resolve address.		// Resolve address.
Value ptrs;
VectorType vType = scatter.getVectorType();		VectorType vType = scatter.getVectorType();
Value ptr = getStridedElementPtr(loc, memRefType, adaptor.getBase(),		Value ptr = getStridedElementPtr(loc, memRefType, adaptor.getBase(),
adaptor.getIndices(), rewriter);		adaptor.getIndices(), rewriter);
if (failed(getIndexedPtrs(rewriter, loc, adaptor.getBase(), ptr,		Value ptrs =
adaptor.getIndexVec(), memRefType, vType, ptrs)))		getIndexedPtrs(rewriter, loc, memRefType, adaptor.getBase(), ptr,
return failure();		adaptor.getIndexVec(), /vLen=/vType.getDimSize(0));

// Replace with the scatter intrinsic.		// Replace with the scatter intrinsic.
rewriter.replaceOpWithNewOp<LLVM::masked_scatter>(		rewriter.replaceOpWithNewOp<LLVM::masked_scatter>(
scatter, adaptor.getValueToStore(), ptrs, adaptor.getMask(),		scatter, adaptor.getValueToStore(), ptrs, adaptor.getMask(),
rewriter.getI32IntegerAttr(align));		rewriter.getI32IntegerAttr(align));
return success();		return success();
}		}
};		};
▲ Show 20 Lines • Show All 989 Lines • Show Last 20 Lines

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

Show First 20 Lines • Show All 43 Lines • ▼ Show 20 Lines

/// Helper enum to classify mask value.		/// Helper enum to classify mask value.
enum class MaskFormat {		enum class MaskFormat {
AllTrue = 0,		AllTrue = 0,
AllFalse = 1,		AllFalse = 1,
Unknown = 2,		Unknown = 2,
};		};

/// Helper method to classify a 1-D mask value. Currently, the method		/// Helper method to classify a mask value. Currently, the method
/// looks "under the hood" of a constant value with dense attributes		/// looks "under the hood" of a constant value with dense attributes
/// and a constant mask operation (since the client may be called at		/// and a constant mask operation (since the client may be called at
/// various stages during progressive lowering).		/// various stages during progressive lowering).
static MaskFormat get1DMaskFormat(Value mask) {		static MaskFormat getMaskFormat(Value mask) {
if (auto c = mask.getDefiningOp<arith::ConstantOp>()) {		if (auto c = mask.getDefiningOp<arith::ConstantOp>()) {
// Inspect constant dense values. We count up for bits that		// Inspect constant dense values. We count up for bits that
// are set, count down for bits that are cleared, and bail		// are set, count down for bits that are cleared, and bail
// when a mix is detected.		// when a mix is detected.
if (auto denseElts = c.getValue().dyn_cast<DenseIntElementsAttr>()) {		if (auto denseElts = c.getValue().dyn_cast<DenseIntElementsAttr>()) {
int64_t val = 0;		int64_t val = 0;
for (bool b : denseElts.getValues<bool>())		for (bool b : denseElts.getValues<bool>())
if (b && val >= 0)		if (b && val >= 0)
val++;		val++;
else if (!b && val <= 0)		else if (!b && val <= 0)
val--;		val--;
else		else
return MaskFormat::Unknown;		return MaskFormat::Unknown;
if (val > 0)		if (val > 0)
return MaskFormat::AllTrue;		return MaskFormat::AllTrue;
if (val < 0)		if (val < 0)
return MaskFormat::AllFalse;		return MaskFormat::AllFalse;
}		}
} else if (auto m = mask.getDefiningOp<ConstantMaskOp>()) {		} else if (auto m = mask.getDefiningOp<ConstantMaskOp>()) {
// Inspect constant mask index. If the index exceeds the		// Inspect constant mask index. If the index exceeds the
// dimension size, all bits are set. If the index is zero		// dimension size, all bits are set. If the index is zero
// or less, no bits are set.		// or less, no bits are set.
ArrayAttr masks = m.getMaskDimSizes();		ArrayAttr masks = m.getMaskDimSizes();
assert(masks.size() == 1);		auto shape = m.getType().getShape();
int64_t i = masks[0].cast<IntegerAttr>().getInt();		bool allTrue = true;
int64_t u = m.getType().getDimSize(0);		bool allFalse = true;
if (i >= u)		for (auto pair : llvm::zip(masks, shape)) {
		int64_t i = std::get<0>(pair).cast<IntegerAttr>().getInt();
		int64_t u = std::get<1>(pair);
		if (i < u)
		allTrue = false;
		if (i > 0)
		allFalse = false;
		}
		if (allTrue)
return MaskFormat::AllTrue;		return MaskFormat::AllTrue;
if (i <= 0)		if (allFalse)
return MaskFormat::AllFalse;		return MaskFormat::AllFalse;
}		}
return MaskFormat::Unknown;		return MaskFormat::Unknown;
}		}

// Helper for verifying combining kinds in contractions and reductions.		// Helper for verifying combining kinds in contractions and reductions.
static bool isSupportedCombiningKind(CombiningKind combiningKind,		static bool isSupportedCombiningKind(CombiningKind combiningKind,
Type elementType) {		Type elementType) {
▲ Show 20 Lines • Show All 3,881 Lines • ▼ Show 20 Lines
}		}

namespace {		namespace {
class MaskedLoadFolder final : public OpRewritePattern<MaskedLoadOp> {		class MaskedLoadFolder final : public OpRewritePattern<MaskedLoadOp> {
public:		public:
using OpRewritePattern<MaskedLoadOp>::OpRewritePattern;		using OpRewritePattern<MaskedLoadOp>::OpRewritePattern;
LogicalResult matchAndRewrite(MaskedLoadOp load,		LogicalResult matchAndRewrite(MaskedLoadOp load,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
switch (get1DMaskFormat(load.getMask())) {		switch (getMaskFormat(load.getMask())) {
case MaskFormat::AllTrue:		case MaskFormat::AllTrue:
rewriter.replaceOpWithNewOp<vector::LoadOp>(		rewriter.replaceOpWithNewOp<vector::LoadOp>(
load, load.getType(), load.getBase(), load.getIndices());		load, load.getType(), load.getBase(), load.getIndices());
return success();		return success();
case MaskFormat::AllFalse:		case MaskFormat::AllFalse:
rewriter.replaceOp(load, load.getPassThru());		rewriter.replaceOp(load, load.getPassThru());
return success();		return success();
case MaskFormat::Unknown:		case MaskFormat::Unknown:
Show All 34 Lines
}		}

namespace {		namespace {
class MaskedStoreFolder final : public OpRewritePattern<MaskedStoreOp> {		class MaskedStoreFolder final : public OpRewritePattern<MaskedStoreOp> {
public:		public:
using OpRewritePattern<MaskedStoreOp>::OpRewritePattern;		using OpRewritePattern<MaskedStoreOp>::OpRewritePattern;
LogicalResult matchAndRewrite(MaskedStoreOp store,		LogicalResult matchAndRewrite(MaskedStoreOp store,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
switch (get1DMaskFormat(store.getMask())) {		switch (getMaskFormat(store.getMask())) {
case MaskFormat::AllTrue:		case MaskFormat::AllTrue:
rewriter.replaceOpWithNewOp<vector::StoreOp>(		rewriter.replaceOpWithNewOp<vector::StoreOp>(
store, store.getValueToStore(), store.getBase(), store.getIndices());		store, store.getValueToStore(), store.getBase(), store.getIndices());
return success();		return success();
case MaskFormat::AllFalse:		case MaskFormat::AllFalse:
rewriter.eraseOp(store);		rewriter.eraseOp(store);
return success();		return success();
case MaskFormat::Unknown:		case MaskFormat::Unknown:
Show All 26 Lines	LogicalResult GatherOp::verify() {

if (!baseType.isa<MemRefType, RankedTensorType>())		if (!baseType.isa<MemRefType, RankedTensorType>())
return emitOpError("requires base to be a memref or ranked tensor type");		return emitOpError("requires base to be a memref or ranked tensor type");

if (resVType.getElementType() != baseType.getElementType())		if (resVType.getElementType() != baseType.getElementType())
return emitOpError("base and result element type should match");		return emitOpError("base and result element type should match");
if (llvm::size(getIndices()) != baseType.getRank())		if (llvm::size(getIndices()) != baseType.getRank())
return emitOpError("requires ") << baseType.getRank() << " indices";		return emitOpError("requires ") << baseType.getRank() << " indices";
if (resVType.getDimSize(0) != indVType.getDimSize(0))		if (resVType.getShape() != indVType.getShape())
return emitOpError("expected result dim to match indices dim");		return emitOpError("expected result dim to match indices dim");
if (resVType.getDimSize(0) != maskVType.getDimSize(0))		if (resVType.getShape() != maskVType.getShape())
return emitOpError("expected result dim to match mask dim");		return emitOpError("expected result dim to match mask dim");
if (resVType != getPassThruVectorType())		if (resVType != getPassThruVectorType())
return emitOpError("expected pass_thru of same type as result type");		return emitOpError("expected pass_thru of same type as result type");
return success();		return success();
}		}

namespace {		namespace {
class GatherFolder final : public OpRewritePattern<GatherOp> {		class GatherFolder final : public OpRewritePattern<GatherOp> {
public:		public:
using OpRewritePattern<GatherOp>::OpRewritePattern;		using OpRewritePattern<GatherOp>::OpRewritePattern;
LogicalResult matchAndRewrite(GatherOp gather,		LogicalResult matchAndRewrite(GatherOp gather,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
switch (get1DMaskFormat(gather.getMask())) {		switch (getMaskFormat(gather.getMask())) {
case MaskFormat::AllTrue:		case MaskFormat::AllTrue:
return failure(); // no unmasked equivalent		return failure(); // no unmasked equivalent
case MaskFormat::AllFalse:		case MaskFormat::AllFalse:
rewriter.replaceOp(gather, gather.getPassThru());		rewriter.replaceOp(gather, gather.getPassThru());
return success();		return success();
case MaskFormat::Unknown:		case MaskFormat::Unknown:
return failure();		return failure();
}		}
Show All 29 Lines
}		}

namespace {		namespace {
class ScatterFolder final : public OpRewritePattern<ScatterOp> {		class ScatterFolder final : public OpRewritePattern<ScatterOp> {
public:		public:
using OpRewritePattern<ScatterOp>::OpRewritePattern;		using OpRewritePattern<ScatterOp>::OpRewritePattern;
LogicalResult matchAndRewrite(ScatterOp scatter,		LogicalResult matchAndRewrite(ScatterOp scatter,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
switch (get1DMaskFormat(scatter.getMask())) {		switch (getMaskFormat(scatter.getMask())) {
case MaskFormat::AllTrue:		case MaskFormat::AllTrue:
return failure(); // no unmasked equivalent		return failure(); // no unmasked equivalent
case MaskFormat::AllFalse:		case MaskFormat::AllFalse:
rewriter.eraseOp(scatter);		rewriter.eraseOp(scatter);
return success();		return success();
case MaskFormat::Unknown:		case MaskFormat::Unknown:
return failure();		return failure();
}		}
Show All 29 Lines
}		}

namespace {		namespace {
class ExpandLoadFolder final : public OpRewritePattern<ExpandLoadOp> {		class ExpandLoadFolder final : public OpRewritePattern<ExpandLoadOp> {
public:		public:
using OpRewritePattern<ExpandLoadOp>::OpRewritePattern;		using OpRewritePattern<ExpandLoadOp>::OpRewritePattern;
LogicalResult matchAndRewrite(ExpandLoadOp expand,		LogicalResult matchAndRewrite(ExpandLoadOp expand,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
switch (get1DMaskFormat(expand.getMask())) {		switch (getMaskFormat(expand.getMask())) {
case MaskFormat::AllTrue:		case MaskFormat::AllTrue:
rewriter.replaceOpWithNewOp<vector::LoadOp>(		rewriter.replaceOpWithNewOp<vector::LoadOp>(
expand, expand.getType(), expand.getBase(), expand.getIndices());		expand, expand.getType(), expand.getBase(), expand.getIndices());
return success();		return success();
case MaskFormat::AllFalse:		case MaskFormat::AllFalse:
rewriter.replaceOp(expand, expand.getPassThru());		rewriter.replaceOp(expand, expand.getPassThru());
return success();		return success();
case MaskFormat::Unknown:		case MaskFormat::Unknown:
Show All 28 Lines
}		}

namespace {		namespace {
class CompressStoreFolder final : public OpRewritePattern<CompressStoreOp> {		class CompressStoreFolder final : public OpRewritePattern<CompressStoreOp> {
public:		public:
using OpRewritePattern<CompressStoreOp>::OpRewritePattern;		using OpRewritePattern<CompressStoreOp>::OpRewritePattern;
LogicalResult matchAndRewrite(CompressStoreOp compress,		LogicalResult matchAndRewrite(CompressStoreOp compress,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
switch (get1DMaskFormat(compress.getMask())) {		switch (getMaskFormat(compress.getMask())) {
case MaskFormat::AllTrue:		case MaskFormat::AllTrue:
rewriter.replaceOpWithNewOp<vector::StoreOp>(		rewriter.replaceOpWithNewOp<vector::StoreOp>(
compress, compress.getValueToStore(), compress.getBase(),		compress, compress.getValueToStore(), compress.getBase(),
compress.getIndices());		compress.getIndices());
return success();		return success();
case MaskFormat::AllFalse:		case MaskFormat::AllFalse:
rewriter.eraseOp(compress);		rewriter.eraseOp(compress);
return success();		return success();
▲ Show 20 Lines • Show All 901 Lines • Show Last 20 Lines

mlir/test/Conversion/VectorToLLVM/vector-to-llvm.mlir

	Show First 20 Lines • Show All 1,912 Lines • ▼ Show 20 Lines

	// CHECK-LABEL: func @gather_op_index			// CHECK-LABEL: func @gather_op_index
	// CHECK: %[[P:.]] = llvm.getelementptr %{{.}}[%{{.*}}] : (!llvm.ptr<i64>, vector<3xi64>) -> !llvm.vec<3 x ptr<i64>>			// CHECK: %[[P:.]] = llvm.getelementptr %{{.}}[%{{.*}}] : (!llvm.ptr<i64>, vector<3xi64>) -> !llvm.vec<3 x ptr<i64>>
	// CHECK: %[[G:.]] = llvm.intr.masked.gather %{{.}}, %{{.}}, %{{.}} {alignment = 8 : i32} : (!llvm.vec<3 x ptr<i64>>, vector<3xi1>, vector<3xi64>) -> vector<3xi64>			// CHECK: %[[G:.]] = llvm.intr.masked.gather %{{.}}, %{{.}}, %{{.}} {alignment = 8 : i32} : (!llvm.vec<3 x ptr<i64>>, vector<3xi1>, vector<3xi64>) -> vector<3xi64>
	// CHECK: %{{.*}} = builtin.unrealized_conversion_cast %[[G]] : vector<3xi64> to vector<3xindex>			// CHECK: %{{.*}} = builtin.unrealized_conversion_cast %[[G]] : vector<3xi64> to vector<3xindex>

	// -----			// -----

				func.func @gather_op_multi_dims(%arg0: memref<?xf32>, %arg1: vector<2x3xi32>, %arg2: vector<2x3xi1>, %arg3: vector<2x3xf32>) -> vector<2x3xf32> {
				%0 = arith.constant 0: index
				%1 = vector.gather %arg0[%0][%arg1], %arg2, %arg3 : memref<?xf32>, vector<2x3xi32>, vector<2x3xi1>, vector<2x3xf32> into vector<2x3xf32>
				return %1 : vector<2x3xf32>
				}

				// CHECK-LABEL: func @gather_op_multi_dims
				// CHECK: %[[B:.]] = llvm.getelementptr %{{.}} : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>
				// CHECK: %[[I0:.]] = llvm.extractvalue %{{.}}[0] : !llvm.array<2 x vector<3xi32>>
				// CHECK: %[[M0:.]] = llvm.extractvalue %{{.}}[0] : !llvm.array<2 x vector<3xi1>>
				// CHECK: %[[S0:.]] = llvm.extractvalue %{{.}}[0] : !llvm.array<2 x vector<3xf32>>
				// CHECK: %[[P0:.*]] = llvm.getelementptr %[[B]][%[[I0]]] : (!llvm.ptr<f32>, vector<3xi32>) -> !llvm.vec<3 x ptr<f32>>
				// CHECK: %[[G0:.*]] = llvm.intr.masked.gather %[[P0]], %[[M0]], %[[S0]] {alignment = 4 : i32} : (!llvm.vec<3 x ptr<f32>>, vector<3xi1>, vector<3xf32>) -> vector<3xf32>
				// CHECK: %{{.}} = llvm.insertvalue %[[G0]], %{{.}}[0] : !llvm.array<2 x vector<3xf32>>
				// CHECK: %[[I1:.]] = llvm.extractvalue %{{.}}[1] : !llvm.array<2 x vector<3xi32>>
				// CHECK: %[[M1:.]] = llvm.extractvalue %{{.}}[1] : !llvm.array<2 x vector<3xi1>>
				// CHECK: %[[S1:.]] = llvm.extractvalue %{{.}}[1] : !llvm.array<2 x vector<3xf32>>
				// CHECK: %[[P1:.*]] = llvm.getelementptr %[[B]][%[[I1]]] : (!llvm.ptr<f32>, vector<3xi32>) -> !llvm.vec<3 x ptr<f32>>
				// CHECK: %[[G1:.*]] = llvm.intr.masked.gather %[[P1]], %[[M1]], %[[S1]] {alignment = 4 : i32} : (!llvm.vec<3 x ptr<f32>>, vector<3xi1>, vector<3xf32>) -> vector<3xf32>
				// CHECK: %{{.}} = llvm.insertvalue %[[G1]], %{{.}}[1] : !llvm.array<2 x vector<3xf32>>

				// -----

				func.func @gather_op_with_mask(%arg0: memref<?xf32>, %arg1: vector<2x3xi32>, %arg2: vector<2x3xf32>) -> vector<2x3xf32> {
				%0 = arith.constant 0: index
				%1 = vector.constant_mask [1, 2] : vector<2x3xi1>
				%2 = vector.gather %arg0[%0][%arg1], %1, %arg2 : memref<?xf32>, vector<2x3xi32>, vector<2x3xi1>, vector<2x3xf32> into vector<2x3xf32>
				return %2 : vector<2x3xf32>
				}

				// CHECK-LABEL: func @gather_op_with_mask
				// CHECK: %[[G0:.]] = llvm.intr.masked.gather %{{.}}, %{{.}}, %{{.}} {alignment = 4 : i32} : (!llvm.vec<3 x ptr<f32>>, vector<3xi1>, vector<3xf32>) -> vector<3xf32>
				// CHECK: %[[G1:.]] = llvm.intr.masked.gather %{{.}}, %{{.}}, %{{.}} {alignment = 4 : i32} : (!llvm.vec<3 x ptr<f32>>, vector<3xi1>, vector<3xf32>) -> vector<3xf32>

				// -----

				func.func @gather_op_with_zero_mask(%arg0: memref<?xf32>, %arg1: vector<2x3xi32>, %arg2: vector<2x3xf32>) -> vector<2x3xf32> {
				%0 = arith.constant 0: index
				%1 = vector.constant_mask [0, 0] : vector<2x3xi1>
				%2 = vector.gather %arg0[%0][%arg1], %1, %arg2 : memref<?xf32>, vector<2x3xi32>, vector<2x3xi1>, vector<2x3xf32> into vector<2x3xf32>
				return %2 : vector<2x3xf32>
				}

				// CHECK-LABEL: func @gather_op_with_zero_mask
				// CHECK-SAME: (%{{.}}: memref<?xf32>, %{{.}}: vector<2x3xi32>, %[[S:.*]]: vector<2x3xf32>)
				// CHECK-NOT: %{{.*}} = llvm.intr.masked.gather
				// CHECK: return %[[S]] : vector<2x3xf32>

				// -----

	func.func @gather_2d_op(%arg0: memref<4x4xf32>, %arg1: vector<4xi32>, %arg2: vector<4xi1>, %arg3: vector<4xf32>) -> vector<4xf32> {			func.func @gather_2d_op(%arg0: memref<4x4xf32>, %arg1: vector<4xi32>, %arg2: vector<4xi1>, %arg3: vector<4xf32>) -> vector<4xf32> {
	%0 = arith.constant 3 : index			%0 = arith.constant 3 : index
	%1 = vector.gather %arg0[%0, %0][%arg1], %arg2, %arg3 : memref<4x4xf32>, vector<4xi32>, vector<4xi1>, vector<4xf32> into vector<4xf32>			%1 = vector.gather %arg0[%0, %0][%arg1], %arg2, %arg3 : memref<4x4xf32>, vector<4xi32>, vector<4xi1>, vector<4xf32> into vector<4xf32>
	return %1 : vector<4xf32>			return %1 : vector<4xf32>
	}			}

	// CHECK-LABEL: func @gather_2d_op			// CHECK-LABEL: func @gather_2d_op
	// CHECK: %[[B:.]] = llvm.getelementptr %{{.}}[%{{.*}}] : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>			// CHECK: %[[B:.]] = llvm.getelementptr %{{.}}[%{{.*}}] : (!llvm.ptr<f32>, i64) -> !llvm.ptr<f32>
	▲ Show 20 Lines • Show All 120 Lines • Show Last 20 Lines

mlir/test/Dialect/Vector/invalid.mlir

Show First 20 Lines • Show All 1,299 Lines • ▼ Show 20 Lines	%0 = vector.gather %base[%c0][%indices], %mask, %pass_thru
: memref<?x?xf64>, vector<16xi32>, vector<16xi1>, vector<16xf64> into vector<16xf64>		: memref<?x?xf64>, vector<16xi32>, vector<16xi1>, vector<16xf64> into vector<16xf64>
}		}

// -----		// -----

func.func @gather_rank_mismatch(%base: memref<?xf32>, %indices: vector<16xi32>,		func.func @gather_rank_mismatch(%base: memref<?xf32>, %indices: vector<16xi32>,
%mask: vector<16xi1>, %pass_thru: vector<16xf32>) {		%mask: vector<16xi1>, %pass_thru: vector<16xf32>) {
%c0 = arith.constant 0 : index		%c0 = arith.constant 0 : index
// expected-error@+1 {{'vector.gather' op result #0 must be of ranks 1, but got 'vector<2x16xf32>'}}		// expected-error@+1 {{'vector.gather' op expected result dim to match indices dim}}
%0 = vector.gather %base[%c0][%indices], %mask, %pass_thru		%0 = vector.gather %base[%c0][%indices], %mask, %pass_thru
: memref<?xf32>, vector<16xi32>, vector<16xi1>, vector<16xf32> into vector<2x16xf32>		: memref<?xf32>, vector<16xi32>, vector<16xi1>, vector<16xf32> into vector<2x16xf32>
}		}

// -----		// -----

func.func @gather_dim_indices_mismatch(%base: memref<?xf32>, %indices: vector<17xi32>,		func.func @gather_dim_indices_mismatch(%base: memref<?xf32>, %indices: vector<17xi32>,
%mask: vector<16xi1>, %pass_thru: vector<16xf32>) {		%mask: vector<16xi1>, %pass_thru: vector<16xf32>) {
▲ Show 20 Lines • Show All 292 Lines • Show Last 20 Lines

mlir/test/Dialect/Vector/ops.mlir

	Show First 20 Lines • Show All 671 Lines • ▼ Show 20 Lines
	// CHECK-LABEL: @gather_on_tensor			// CHECK-LABEL: @gather_on_tensor
	func.func @gather_on_tensor(%base: tensor<?xf32>, %v: vector<16xi32>, %mask: vector<16xi1>, %pass_thru: vector<16xf32>) -> vector<16xf32> {			func.func @gather_on_tensor(%base: tensor<?xf32>, %v: vector<16xi32>, %mask: vector<16xi1>, %pass_thru: vector<16xf32>) -> vector<16xf32> {
	%c0 = arith.constant 0 : index			%c0 = arith.constant 0 : index
	// CHECK: vector.gather %{{.}}[%{{.}}] [%{{.}}], %{{.}}, %{{.*}} : tensor<?xf32>, vector<16xi32>, vector<16xi1>, vector<16xf32> into vector<16xf32>			// CHECK: vector.gather %{{.}}[%{{.}}] [%{{.}}], %{{.}}, %{{.*}} : tensor<?xf32>, vector<16xi32>, vector<16xi1>, vector<16xf32> into vector<16xf32>
	%0 = vector.gather %base[%c0][%v], %mask, %pass_thru : tensor<?xf32>, vector<16xi32>, vector<16xi1>, vector<16xf32> into vector<16xf32>			%0 = vector.gather %base[%c0][%v], %mask, %pass_thru : tensor<?xf32>, vector<16xi32>, vector<16xi1>, vector<16xf32> into vector<16xf32>
	return %0 : vector<16xf32>			return %0 : vector<16xf32>
	}			}

				// CHECK-LABEL: @gather_multi_dims
				func.func @gather_multi_dims(%base: tensor<?xf32>, %v: vector<2x16xi32>, %mask: vector<2x16xi1>, %pass_thru: vector<2x16xf32>) -> vector<2x16xf32> {
				%c0 = arith.constant 0 : index
				// CHECK: vector.gather %{{.}}[%{{.}}] [%{{.}}], %{{.}}, %{{.*}} : tensor<?xf32>, vector<2x16xi32>, vector<2x16xi1>, vector<2x16xf32> into vector<2x16xf32>
				%0 = vector.gather %base[%c0][%v], %mask, %pass_thru : tensor<?xf32>, vector<2x16xi32>, vector<2x16xi1>, vector<2x16xf32> into vector<2x16xf32>
				return %0 : vector<2x16xf32>
				}

	// CHECK-LABEL: @expand_and_compress			// CHECK-LABEL: @expand_and_compress
	func.func @expand_and_compress(%base: memref<?xf32>, %mask: vector<16xi1>, %pass_thru: vector<16xf32>) {			func.func @expand_and_compress(%base: memref<?xf32>, %mask: vector<16xi1>, %pass_thru: vector<16xf32>) {
	%c0 = arith.constant 0 : index			%c0 = arith.constant 0 : index
	// CHECK: %[[X:.]] = vector.expandload %{{.}}[%{{.}}], %{{.}}, %{{.*}} : memref<?xf32>, vector<16xi1>, vector<16xf32> into vector<16xf32>			// CHECK: %[[X:.]] = vector.expandload %{{.}}[%{{.}}], %{{.}}, %{{.*}} : memref<?xf32>, vector<16xi1>, vector<16xf32> into vector<16xf32>
	%0 = vector.expandload %base[%c0], %mask, %pass_thru : memref<?xf32>, vector<16xi1>, vector<16xf32> into vector<16xf32>			%0 = vector.expandload %base[%c0], %mask, %pass_thru : memref<?xf32>, vector<16xi1>, vector<16xf32> into vector<16xf32>
	// CHECK: vector.compressstore %{{.}}[%{{.}}], %{{.*}}, %[[X]] : memref<?xf32>, vector<16xi1>, vector<16xf32>			// CHECK: vector.compressstore %{{.}}[%{{.}}], %{{.*}}, %[[X]] : memref<?xf32>, vector<16xi1>, vector<16xf32>
	vector.compressstore %base[%c0], %mask, %0 : memref<?xf32>, vector<16xi1>, vector<16xf32>			vector.compressstore %base[%c0], %mask, %0 : memref<?xf32>, vector<16xi1>, vector<16xf32>
	return			return
	▲ Show 20 Lines • Show All 98 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[MLIR]Extend vector.gather to support n-D result
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 453185

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td

mlir/lib/Conversion/LLVMCommon/VectorPattern.cpp

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

mlir/test/Conversion/VectorToLLVM/vector-to-llvm.mlir

mlir/test/Dialect/Vector/invalid.mlir

mlir/test/Dialect/Vector/ops.mlir

This is an archive of the discontinued LLVM Phabricator instance.

[MLIR]Extend vector.gather to support n-D resultClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 453185

mlir/include/mlir/Dialect/Vector/IR/VectorOps.td

mlir/lib/Conversion/LLVMCommon/VectorPattern.cpp

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

mlir/test/Conversion/VectorToLLVM/vector-to-llvm.mlir

mlir/test/Dialect/Vector/invalid.mlir

mlir/test/Dialect/Vector/ops.mlir

[MLIR]Extend vector.gather to support n-D result
ClosedPublic