Diff 242805

mlir/include/mlir/Dialect/VectorOps/VectorOps.td

Show First 20 Lines • Show All 365 Lines • ▼ Show 20 Lines	let extraClassDeclaration = [{
static StringRef getSizesAttrName() { return "sizes"; }		static StringRef getSizesAttrName() { return "sizes"; }
static StringRef getStridesAttrName() { return "strides"; }		static StringRef getStridesAttrName() { return "strides"; }
}];		}];
let assemblyFormat = [{		let assemblyFormat = [{
$vector `,` $sizes `,` $strides attr-dict `:` type($vector) `into`		$vector `,` $sizes `,` $strides attr-dict `:` type($vector) `into`
type(results)		type(results)
}];		}];
}		}

		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code +def Vector_FMAOp : + Op<Vector_Dialect, "fma", [NoSideEffect, + AllTypesMatch<["lhs", "rhs", "acc", "result"]>]>, + Arguments<(ins AnyVector:$lhs, AnyVector:$rhs, AnyVector:$acc)>, + Results<(outs AnyVector:$result)> { + let summary = "vector fused multiply-add"; + let description = [{ + Multiply-add expressions operate on n-D vectors and compute a fused + pointwise multiply followed by accumulate. In the particular case of + lowering to LLVM, this is guaranteed to lower to the `llvm.fmuladd.` + intrinsic. + + Example: + + + %3 = vector.fma %0, %1, %2: vector<8x16xf32> + ` + }]; + // Fully specified by traits. + let verifier = ?; + let assemblyFormat = "$lhs `,` $rhs `,` $acc attr-dict `:` type($lhs)"; + let builders = [OpBuilder< + "Builder b, OperationState &result, Value lhs, Value rhs, Value acc", + "build(b, result, lhs.getType(), lhs, rhs, acc);">]; + let extraClassDeclaration = [{ + VectorType getVectorType() { return lhs().getType().cast<VectorType>(); } + }]; +} + Lint: Pre-merge checks: clang-format: please reformat the code ``` +def Vector_FMAOp : + Op<Vector_Dialect, "fma"…
		def Vector_FMAOp :
		Op<Vector_Dialect, "fma", [NoSideEffect,
		AllTypesMatch<["lhs", "rhs", "acc", "result"]>]>,
		Arguments<(ins AnyVector:$lhs, AnyVector:$rhs, AnyVector:$acc)>,
		Results<(outs AnyVector:$result)> {
		let summary = "vector fused multiply-add";
		let description = [{
		Multiply-add expressions operate on n-D vectors and compute a fused
		aartbikUnsubmitted Done Reply Inline Actions typo: that operate (plural) but more in general, can you describe the semantics in a bit more detail than this? In particular, the lowering part to llvm could be mentioned at one point as motivation to have this, but it seems a bit strange to mention that in the very first sentence already. aartbik: typo: that operate (plural) but more in general, can you describe the semantics in a bit more…
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Made the description more general. There is a notion of guaranteed fmuladd in the LLVM case that I kept. nicolasvasilache: Made the description more general. There is a notion of guaranteed fmuladd in the LLVM case…
		pointwise multiply followed by accumulate. In the particular case of
		aartbikUnsubmitted Done Reply Inline Actions much better, and the mention of llvm later is fine of course I would still say something about that all shapes need to match exactly (rank and dimensions), it is implied by the syntax, and of course enforced by the traits, but why not be a bit more explicit in the doc :-) aartbik: much better, and the mention of llvm later is fine of course I would still say something about…
		lowering to LLVM, this is guaranteed to lower to the `llvm.fmuladd.*`
		fhahnUnsubmitted Done Reply Inline Actions IIRC llvm.fmuladd does not specify the rounding behaviour (mul and add may or may not be fused). The way I read the description here it seems like fusion is ecpexted for vector.fma unless I am missing something. It might be worth clarifying the rounding behaviour fhahn: IIRC llvm.fmuladd does not specify the rounding behaviour (mul and add may or may not be fused).
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Thanks @fhahn ! I overlooked the semantics section that clearly states to use fma. What I want is indeed fma, will adapt the revision accordingly. nicolasvasilache: Thanks @fhahn ! I overlooked the semantics section that clearly states to use fma. What I want…
		intrinsic.

		Example:

		```
		%3 = vector.fma %0, %1, %2: vector<8x16xf32>
		```
		}];
		// Fully specified by traits.
		let verifier = ?;
		let assemblyFormat = "$lhs `,` $rhs `,` $acc attr-dict `:` type($lhs)";
		let builders = [OpBuilder<
		"Builder *b, OperationState &result, Value lhs, Value rhs, Value acc",
		"build(b, result, lhs.getType(), lhs, rhs, acc);">];
		let extraClassDeclaration = [{
		VectorType getVectorType() { return lhs().getType().cast<VectorType>(); }
		}];
		}

def Vector_InsertElementOp :		def Vector_InsertElementOp :
Vector_Op<"insertelement", [NoSideEffect,		Vector_Op<"insertelement", [NoSideEffect,
PredOpTrait<"source operand and result have same element type",		PredOpTrait<"source operand and result have same element type",
TCresVTEtIsSameAsOpBase<0, 0>>,		TCresVTEtIsSameAsOpBase<0, 0>>,
PredOpTrait<"dest operand and result have same type",		PredOpTrait<"dest operand and result have same type",
TCresIsSameAsOpBase<0, 1>>]>,		TCresIsSameAsOpBase<0, 1>>]>,
Arguments<(ins AnyType:$source, AnyVector:$dest, AnyInteger:$position)>,		Arguments<(ins AnyType:$source, AnyVector:$dest, AnyInteger:$position)>,
Results<(outs AnyVector)> {		Results<(outs AnyVector)> {
▲ Show 20 Lines • Show All 796 Lines • Show Last 20 Lines

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

Show First 20 Lines • Show All 403 Lines • ▼ Show 20 Lines	matchAndRewrite(Operation *op, ArrayRef<Value> operands,
auto constant = rewriter.create<LLVM::ConstantOp>(loc, i64Type, position);		auto constant = rewriter.create<LLVM::ConstantOp>(loc, i64Type, position);
extracted =		extracted =
rewriter.create<LLVM::ExtractElementOp>(loc, extracted, constant);		rewriter.create<LLVM::ExtractElementOp>(loc, extracted, constant);
rewriter.replaceOp(op, extracted);		rewriter.replaceOp(op, extracted);

return matchSuccess();		return matchSuccess();
}		}
};		};

		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code +/// Conversion pattern that turns a vector.fma on a 1-D vector +/// into an llvm.intr.fmuladd. This is a trivial 1-1 conversion. +/// This does not match vectors of n >= 2 rank. +/// +/// Example: +/// +/ vector.fma %a, %a, %a : vector<8xf32> +/ ` +/ is converted to: +/ ` +/ llvm.intr.fmuladd{{.}}: (...) -> !llvm<"<8 x float>"> +/ ` +class VectorFMAOp1DConversion : public LLVMOpLowering { +public: + explicit VectorFMAOp1DConversion(MLIRContext context, + LLVMTypeConverter &typeConverter) + : LLVMOpLowering(vector::FMAOp::getOperationName(), context, + typeConverter) {} + + PatternMatchResult + matchAndRewrite(Operation op, ArrayRef<Value> operands, + ConversionPatternRewriter &rewriter) const override { + auto adaptor = vector::FMAOpOperandAdaptor(operands); + vector::FMAOp fmaOp = cast<vector::FMAOp>(op); + VectorType vType = fmaOp.getVectorType(); + if (vType.getRank() != 1) + return matchFailure(); + rewriter.replaceOpWithNewOp<LLVM::FMulAddOp>(op, adaptor.lhs(), + adaptor.rhs(), adaptor.acc()); + return matchSuccess(); + } +}; + Lint: Pre-merge checks:* clang-format: please reformat the code ``` +/// Conversion pattern that turns a vector.fma on a…
		/// Conversion pattern that turns a vector.fma on a 1-D vector
		dcaballeUnsubmitted Done Reply Inline Actions Adding doc about what this pattern is matching and what is not would be great dcaballe: Adding doc about what this pattern is matching and what is not would be great
		/// into an llvm.intr.fmuladd. This is a trivial 1-1 conversion.
		/// This does not match vectors of n >= 2 rank.
		///
		/// Example:
		/// ```
		/// vector.fma %a, %a, %a : vector<8xf32>
		/// ```
		/// is converted to:
		/// ```
		/// llvm.intr.fmuladd{{.*}}: (...) -> !llvm<"<8 x float>">
		/// ```
		class VectorFMAOp1DConversion : public LLVMOpLowering {
		public:
		explicit VectorFMAOp1DConversion(MLIRContext *context,
		LLVMTypeConverter &typeConverter)
		: LLVMOpLowering(vector::FMAOp::getOperationName(), context,
		typeConverter) {}

		PatternMatchResult
		matchAndRewrite(Operation *op, ArrayRef<Value> operands,
		ConversionPatternRewriter &rewriter) const override {
		auto adaptor = vector::FMAOpOperandAdaptor(operands);
		vector::FMAOp fmaOp = cast<vector::FMAOp>(op);
		VectorType vType = fmaOp.getVectorType();
		if (vType.getRank() != 1)
		return matchFailure();
		rewriter.replaceOpWithNewOp<LLVM::FMulAddOp>(op, adaptor.lhs(),
		adaptor.rhs(), adaptor.acc());
		return matchSuccess();
		}
		};

class VectorInsertElementOpConversion : public LLVMOpLowering {		class VectorInsertElementOpConversion : public LLVMOpLowering {
public:		public:
explicit VectorInsertElementOpConversion(MLIRContext *context,		explicit VectorInsertElementOpConversion(MLIRContext *context,
LLVMTypeConverter &typeConverter)		LLVMTypeConverter &typeConverter)
: LLVMOpLowering(vector::InsertElementOp::getOperationName(), context,		: LLVMOpLowering(vector::InsertElementOp::getOperationName(), context,
typeConverter) {}		typeConverter) {}

PatternMatchResult		PatternMatchResult
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	if (sourceType.isa<VectorType>()) {
loc, llvmResultType, adaptor.dest(), adaptor.source(),		loc, llvmResultType, adaptor.dest(), adaptor.source(),
positionArrayAttr);		positionArrayAttr);
rewriter.replaceOp(op, inserted);		rewriter.replaceOp(op, inserted);
return matchSuccess();		return matchSuccess();
}		}

// Potential extraction of 1-D vector from array.		// Potential extraction of 1-D vector from array.
auto *context = op->getContext();		auto *context = op->getContext();
Value extracted = adaptor.dest();		Value extracted = adaptor.dest();
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code +/// Rank reducing rewrite for n-D FMA into (n-1)-D FMA where n > 1. +/// +/// Example: +/// +/ vector.fma %b, %b, %b : vector<2x4xf32> +/ ` +/ is rewritten into: +/ ` +/ vector.extractvalue {{.}}[0] : vector<2x4xf32> +/ vector.fma{{.}} : vector<4xf32> +/ vector.insertvalue {{.}}, {{.}}[0] : vector<4xf32> into vector<2x4xf32> +/ vector.extractvalue {{.}}[1] : vector<2x4xf32> +/ vector.fma{{.}} : vector<4xf32> +/ vector.insertvalue {{.}}, {{.}}[1] : vector<4xf32> into vector<2x4xf32> +/// ` +class VectorFMAOpNDRewritePattern : public OpRewritePattern<FMAOp> { +public: + using OpRewritePattern<FMAOp>::OpRewritePattern; + + PatternMatchResult matchAndRewrite(FMAOp op, + PatternRewriter &rewriter) const override { + auto vType = op.getVectorType(); + if (vType.getRank() < 2) + return matchFailure(); + + auto loc = op.getLoc(); + auto elemType = vType.getElementType(); + Value zero = rewriter.create<ConstantOp>(loc, elemType, + rewriter.getZeroAttr(elemType)); + Value desc = rewriter.create<SplatOp>(loc, vType, zero); + for (int64_t i = 0, e = vType.getShape().front(); i != e; ++i) { + Value extrLHS = rewriter.create<ExtractOp>(loc, op.lhs(), i); + Value extrRHS = rewriter.create<ExtractOp>(loc, op.rhs(), i); + Value extrACC = rewriter.create<ExtractOp>(loc, op.acc(), i); + Value fma = rewriter.create<FMAOp>(loc, extrLHS, extrRHS, extrACC); + desc = rewriter.create<InsertOp>(loc, fma, desc, i); + } + rewriter.replaceOp(op, desc); + return matchSuccess(); + } +}; + Lint: Pre-merge checks: clang-format: please reformat the code ``` +/// Rank reducing rewrite for n-D FMA into (n-1)-D…
auto positionAttrs = positionArrayAttr.getValue();		auto positionAttrs = positionArrayAttr.getValue();
auto position = positionAttrs.back().cast<IntegerAttr>();		auto position = positionAttrs.back().cast<IntegerAttr>();
auto oneDVectorType = destVectorType;		auto oneDVectorType = destVectorType;
if (positionAttrs.size() > 1) {		if (positionAttrs.size() > 1) {
oneDVectorType = reducedVectorTypeBack(destVectorType);		oneDVectorType = reducedVectorTypeBack(destVectorType);
auto nMinusOnePositionAttrs =		auto nMinusOnePositionAttrs =
ArrayAttr::get(positionAttrs.drop_back(), context);		ArrayAttr::get(positionAttrs.drop_back(), context);
extracted = rewriter.create<LLVM::ExtractValueOp>(		extracted = rewriter.create<LLVM::ExtractValueOp>(
Show All 17 Lines	if (positionAttrs.size() > 1) {
nMinusOnePositionAttrs);		nMinusOnePositionAttrs);
}		}

rewriter.replaceOp(op, inserted);		rewriter.replaceOp(op, inserted);
return matchSuccess();		return matchSuccess();
}		}
};		};

		/// Rank reducing rewrite for n-D FMA into (n-1)-D FMA where n > 1.
		dcaballeUnsubmitted Done Reply Inline Actions I think adding a small example to the doc would help better understand what this patter is doing. dcaballe: I think adding a small example to the doc would help better understand what this patter is…
		///
		/// Example:
		/// ```
		/// vector.fma %b, %b, %b : vector<2x4xf32>
		/// ```
		/// is rewritten into:
		/// ```
		/// vector.extractvalue {{.*}}[0] : vector<2x4xf32>
		aartbikUnsubmitted Done Reply Inline Actions this is not a check pattern, so why not fill out the %a, %b, %c and ssa vars for readability? aartbik: this is not a check pattern, so why not fill out the %a, %b, %c and ssa vars for readability?
		/// vector.fma{{.*}} : vector<4xf32>
		/// vector.insertvalue {{.}}, {{.}}[0] : vector<4xf32> into vector<2x4xf32>
		/// vector.extractvalue {{.*}}[1] : vector<2x4xf32>
		/// vector.fma{{.*}} : vector<4xf32>
		/// vector.insertvalue {{.}}, {{.}}[1] : vector<4xf32> into vector<2x4xf32>
		/// ```
		class VectorFMAOpNDRewritePattern : public OpRewritePattern<FMAOp> {
		public:
		using OpRewritePattern<FMAOp>::OpRewritePattern;

		PatternMatchResult matchAndRewrite(FMAOp op,
		PatternRewriter &rewriter) const override {
		auto vType = op.getVectorType();
		if (vType.getRank() < 2)
		return matchFailure();

		auto loc = op.getLoc();
		auto elemType = vType.getElementType();
		Value zero = rewriter.create<ConstantOp>(loc, elemType,
		rewriter.getZeroAttr(elemType));
		Value desc = rewriter.create<SplatOp>(loc, vType, zero);
		for (int64_t i = 0, e = vType.getShape().front(); i != e; ++i) {
		Value extrLHS = rewriter.create<ExtractOp>(loc, op.lhs(), i);
		Value extrRHS = rewriter.create<ExtractOp>(loc, op.rhs(), i);
		Value extrACC = rewriter.create<ExtractOp>(loc, op.acc(), i);
		Value fma = rewriter.create<FMAOp>(loc, extrLHS, extrRHS, extrACC);
		desc = rewriter.create<InsertOp>(loc, fma, desc, i);
		}
		rewriter.replaceOp(op, desc);
		return matchSuccess();
		}
		};

// When ranks are different, InsertStridedSlice needs to extract a properly		// When ranks are different, InsertStridedSlice needs to extract a properly
// ranked vector from the destination vector into which to insert. This pattern		// ranked vector from the destination vector into which to insert. This pattern
// only takes care of this part and forwards the rest of the conversion to		// only takes care of this part and forwards the rest of the conversion to
// another pattern that converts InsertStridedSlice for operands of the same		// another pattern that converts InsertStridedSlice for operands of the same
// rank.		// rank.
//		//
// RewritePattern for InsertStridedSliceOp where source and destination vectors		// RewritePattern for InsertStridedSliceOp where source and destination vectors
// have different ranks. In this case:		// have different ranks. In this case:
▲ Show 20 Lines • Show All 362 Lines • ▼ Show 20 Lines	private:
Operation getPrintFloat(Operation op) const {		Operation getPrintFloat(Operation op) const {
LLVM::LLVMDialect *dialect = lowering.getDialect();		LLVM::LLVMDialect *dialect = lowering.getDialect();
return getPrint(op, dialect, "print_f32",		return getPrint(op, dialect, "print_f32",
LLVM::LLVMType::getFloatTy(dialect));		LLVM::LLVMType::getFloatTy(dialect));
}		}
Operation getPrintDouble(Operation op) const {		Operation getPrintDouble(Operation op) const {
LLVM::LLVMDialect *dialect = lowering.getDialect();		LLVM::LLVMDialect *dialect = lowering.getDialect();
return getPrint(op, dialect, "print_f64",		return getPrint(op, dialect, "print_f64",
LLVM::LLVMType::getDoubleTy(dialect));		LLVM::LLVMType::getDoubleTy(dialect));
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - patterns.insert<VectorInsertStridedSliceOpDifferentRankRewritePattern, + patterns.insert<VectorFMAOpNDRewritePattern, + VectorInsertStridedSliceOpDifferentRankRewritePattern, Lint: Pre-merge checks: clang-format: please reformat the code ``` - patterns.
}		}
Operation getPrintOpen(Operation op) const {		Operation getPrintOpen(Operation op) const {
return getPrint(op, lowering.getDialect(), "print_open", {});		return getPrint(op, lowering.getDialect(), "print_open", {});
}		}
Operation getPrintClose(Operation op) const {		Operation getPrintClose(Operation op) const {
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - VectorInsertElementOpConversion, VectorInsertOpConversion, - VectorOuterProductOpConversion, VectorTypeCastOpConversion, - VectorPrintOpConversion>(ctx, converter); + VectorFMAOp1DConversion, VectorInsertElementOpConversion, + VectorInsertOpConversion, VectorOuterProductOpConversion, + VectorTypeCastOpConversion, VectorPrintOpConversion>( + ctx, converter); Lint: Pre-merge checks: clang-format: please reformat the code ``` - VectorInsertElementOpConversion…
return getPrint(op, lowering.getDialect(), "print_close", {});		return getPrint(op, lowering.getDialect(), "print_close", {});
}		}
Operation getPrintComma(Operation op) const {		Operation getPrintComma(Operation op) const {
return getPrint(op, lowering.getDialect(), "print_comma", {});		return getPrint(op, lowering.getDialect(), "print_comma", {});
}		}
Operation getPrintNewline(Operation op) const {		Operation getPrintNewline(Operation op) const {
return getPrint(op, lowering.getDialect(), "print_newline", {});		return getPrint(op, lowering.getDialect(), "print_newline", {});
}		}
▲ Show 20 Lines • Show All 53 Lines • ▼ Show 20 Lines
};		};

} // namespace		} // namespace

/// Populate the given list with patterns that convert from Vector to LLVM.		/// Populate the given list with patterns that convert from Vector to LLVM.
void mlir::populateVectorToLLVMConversionPatterns(		void mlir::populateVectorToLLVMConversionPatterns(
LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {		LLVMTypeConverter &converter, OwningRewritePatternList &patterns) {
MLIRContext *ctx = converter.getDialect()->getContext();		MLIRContext *ctx = converter.getDialect()->getContext();
patterns.insert<VectorInsertStridedSliceOpDifferentRankRewritePattern,		patterns.insert<VectorFMAOpNDRewritePattern,
		VectorInsertStridedSliceOpDifferentRankRewritePattern,
VectorInsertStridedSliceOpSameRankRewritePattern,		VectorInsertStridedSliceOpSameRankRewritePattern,
VectorStridedSliceOpConversion>(ctx);		VectorStridedSliceOpConversion>(ctx);
patterns.insert<VectorBroadcastOpConversion, VectorShuffleOpConversion,		patterns.insert<VectorBroadcastOpConversion, VectorShuffleOpConversion,
VectorExtractElementOpConversion, VectorExtractOpConversion,		VectorExtractElementOpConversion, VectorExtractOpConversion,
VectorInsertElementOpConversion, VectorInsertOpConversion,		VectorFMAOp1DConversion, VectorInsertElementOpConversion,
VectorOuterProductOpConversion, VectorTypeCastOpConversion,		VectorInsertOpConversion, VectorOuterProductOpConversion,
VectorPrintOpConversion>(ctx, converter);		VectorTypeCastOpConversion, VectorPrintOpConversion>(
		ctx, converter);
		dcaballeUnsubmitted Done Reply Inline Actions Rename `VectorFMAOpRewritePattern` and `VectorFMAOpConversion` to be more aligned with what they match? You'll probably find something better but something along the lines of `SingleDimVectorFMAOpRewritePattern` and `MultiDimVectorFMAOpConversion` dcaballe: Rename `VectorFMAOpRewritePattern` and `VectorFMAOpConversion` to be more aligned with what…
}		}

namespace {		namespace {
struct LowerVectorToLLVMPass : public ModulePass<LowerVectorToLLVMPass> {		struct LowerVectorToLLVMPass : public ModulePass<LowerVectorToLLVMPass> {
void runOnModule() override;		void runOnModule() override;
};		};
} // namespace		} // namespace

Show All 32 Lines

mlir/test/Conversion/VectorToLLVM/vector-to-llvm.mlir

	Show First 20 Lines • Show All 603 Lines • ▼ Show 20 Lines
	// CHECK: %[[s0:.*]] = llvm.mlir.constant(dense<0.000000e+00> : vector<1x1xf32>) : !llvm<"[1 x <1 x float>]">			// CHECK: %[[s0:.*]] = llvm.mlir.constant(dense<0.000000e+00> : vector<1x1xf32>) : !llvm<"[1 x <1 x float>]">
	// CHECK: %[[s1:.*]] = llvm.extractvalue %[[A]][2] : !llvm<"[3 x <3 x float>]">			// CHECK: %[[s1:.*]] = llvm.extractvalue %[[A]][2] : !llvm<"[3 x <3 x float>]">
	// CHECK: %[[s3:.*]] = llvm.mlir.constant(dense<0.000000e+00> : vector<1xf32>) : !llvm<"<1 x float>">			// CHECK: %[[s3:.*]] = llvm.mlir.constant(dense<0.000000e+00> : vector<1xf32>) : !llvm<"<1 x float>">
	// CHECK: %[[s4:.*]] = llvm.mlir.constant(2 : index) : !llvm.i64			// CHECK: %[[s4:.*]] = llvm.mlir.constant(2 : index) : !llvm.i64
	// CHECK: %[[s5:.*]] = llvm.extractelement %[[s1]][%[[s4]] : !llvm.i64] : !llvm<"<3 x float>">			// CHECK: %[[s5:.*]] = llvm.extractelement %[[s1]][%[[s4]] : !llvm.i64] : !llvm<"<3 x float>">
	// CHECK: %[[s6:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64			// CHECK: %[[s6:.*]] = llvm.mlir.constant(0 : index) : !llvm.i64
	// CHECK: %[[s7:.*]] = llvm.insertelement %[[s5]], %[[s3]][%[[s6]] : !llvm.i64] : !llvm<"<1 x float>">			// CHECK: %[[s7:.*]] = llvm.insertelement %[[s5]], %[[s3]][%[[s6]] : !llvm.i64] : !llvm<"<1 x float>">
	// CHECK: %[[s8:.*]] = llvm.insertvalue %[[s7]], %[[s0]][0] : !llvm<"[1 x <1 x float>]">			// CHECK: %[[s8:.*]] = llvm.insertvalue %[[s7]], %[[s0]][0] : !llvm<"[1 x <1 x float>]">
	// CHECK: llvm.return %[[s8]] : !llvm<"[1 x <1 x float>]">			// CHECK: llvm.return %[[s8]] : !llvm<"[1 x <1 x float>]">
				Lint: Pre-merge checks Inline Actions clang-format: please reformat the code + +// CHECK-LABEL: llvm.func @vector_fma +func @vector_fma(%a: vector<8xf32>, %b: vector<2x4xf32>) + -> (vector<8xf32>, vector<2x4xf32>) { + // CHECK: llvm.intr.fmuladd{{.}}: (!llvm<"<8 x float>">, !llvm<"<8 x float>">, !llvm<"<8 x float>">) -> !llvm<"<8 x float>"> + %0 = vector.fma %a, %a, %a : vector<8xf32> + // CHECK: %[[b00:.]] = llvm.extractvalue {{.}}[0] : !llvm<"[2 x <4 x float>]"> + // CHECK: %[[b01:.]] = llvm.extractvalue {{.}}[0] : !llvm<"[2 x <4 x float>]"> + // CHECK: %[[b02:.]] = llvm.extractvalue {{.}}[0] : !llvm<"[2 x <4 x float>]"> + // CHECK: %[[B0:.]] = "llvm.intr.fmuladd"(%[[b00]], %[[b01]], %[[b02]]) : + // CHECK-SAME: (!llvm<"<4 x float>">, !llvm<"<4 x float>">, !llvm<"<4 x float>">) -> !llvm<"<4 x float>"> + // CHECK: llvm.insertvalue %[[B0]], {{.}}[0] : !llvm<"[2 x <4 x float>]"> + // CHECK: %[[b10:.]] = llvm.extractvalue {{.}}[1] : !llvm<"[2 x <4 x float>]"> + // CHECK: %[[b11:.]] = llvm.extractvalue {{.}}[1] : !llvm<"[2 x <4 x float>]"> + // CHECK: %[[b12:.]] = llvm.extractvalue {{.}}[1] : !llvm<"[2 x <4 x float>]"> + // CHECK: %[[B1:.]] = "llvm.intr.fmuladd"(%[[b10]], %[[b11]], %[[b12]]) : + // CHECK-SAME: (!llvm<"<4 x float>">, !llvm<"<4 x float>">, !llvm<"<4 x float>">) -> !llvm<"<4 x float>"> + // CHECK: llvm.insertvalue %[[B1]], {{.}}[1] : !llvm<"[2 x <4 x float>]"> + %1 = vector.fma %b, %b, %b : vector<2x4xf32> + return %0, %1: vector<8xf32>, vector<2x4xf32> +} + Lint: Pre-merge checks:* clang-format: please reformat the code ``` + +// CHECK-LABEL: llvm.func @vector_fma +func…

				// CHECK-LABEL: llvm.func @vector_fma
				aartbikUnsubmitted Done Reply Inline Actions note that something like // CHECK-SAME: %[[A:arg[0-9]+]]: can be used if you want to match the argument inputs as well, see e.g. insert_strided_slice3 (probably need to rename a/b into arg0/arg1 to be sure, although I think that renaming happens anyway) aartbik: note that something like // CHECK-SAME: %[[A:arg[0-9]+]]: can be used if you want to match…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions I do not see this as necessary, there is a 1-1 mapping that is already captured by the type. nicolasvasilache: I do not see this as necessary, there is a 1-1 mapping that is already captured by the type.
				aartbikUnsubmitted Done Reply Inline Actions It was of course just a suggestion :-) Regardless, I find %[[arg0]] easier to parse by eye than {{.}} and it is the direction we are generally taking with new tests. But I won't insist. aartbik:* It was of course just a suggestion :-) Regardless, I find %[[arg0]] easier to parse by eye…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions ok, I am sensitive to general simplicity, maybe erring on the side of terseness is counter-productive, I'll just update it :) nicolasvasilache: ok, I am sensitive to general simplicity, maybe erring on the side of terseness is counter…
				func @vector_fma(%a: vector<8xf32>, %b: vector<2x4xf32>)
				-> (vector<8xf32>, vector<2x4xf32>) {
				// CHECK: llvm.intr.fmuladd{{.*}}: (!llvm<"<8 x float>">, !llvm<"<8 x float>">, !llvm<"<8 x float>">) -> !llvm<"<8 x float>">
				%0 = vector.fma %a, %a, %a : vector<8xf32>
				aartbikUnsubmitted Done Reply Inline Actions perhaps use the %[[s0:.]] = .... to capture the values and make sure they are used where expected? aartbik:* perhaps use the %[[s0:.*]] = .... to capture the values and make sure they are used where…
				dcaballeUnsubmitted Done Reply Inline Actions +1 dcaballe: +1
				// CHECK: %[[b00:.]] = llvm.extractvalue {{.}}[0] : !llvm<"[2 x <4 x float>]">
				// CHECK: %[[b01:.]] = llvm.extractvalue {{.}}[0] : !llvm<"[2 x <4 x float>]">
				// CHECK: %[[b02:.]] = llvm.extractvalue {{.}}[0] : !llvm<"[2 x <4 x float>]">
				// CHECK: %[[B0:.*]] = "llvm.intr.fmuladd"(%[[b00]], %[[b01]], %[[b02]]) :
				// CHECK-SAME: (!llvm<"<4 x float>">, !llvm<"<4 x float>">, !llvm<"<4 x float>">) -> !llvm<"<4 x float>">
				// CHECK: llvm.insertvalue %[[B0]], {{.*}}[0] : !llvm<"[2 x <4 x float>]">
				// CHECK: %[[b10:.]] = llvm.extractvalue {{.}}[1] : !llvm<"[2 x <4 x float>]">
				// CHECK: %[[b11:.]] = llvm.extractvalue {{.}}[1] : !llvm<"[2 x <4 x float>]">
				// CHECK: %[[b12:.]] = llvm.extractvalue {{.}}[1] : !llvm<"[2 x <4 x float>]">
				// CHECK: %[[B1:.*]] = "llvm.intr.fmuladd"(%[[b10]], %[[b11]], %[[b12]]) :
				// CHECK-SAME: (!llvm<"<4 x float>">, !llvm<"<4 x float>">, !llvm<"<4 x float>">) -> !llvm<"<4 x float>">
				// CHECK: llvm.insertvalue %[[B1]], {{.*}}[1] : !llvm<"[2 x <4 x float>]">
				%1 = vector.fma %b, %b, %b : vector<2x4xf32>
				return %0, %1: vector<8xf32>, vector<2x4xf32>
				}

mlir/test/Dialect/VectorOps/ops.mlir

Show First 20 Lines • Show All 226 Lines • ▼ Show 20 Lines	func @reshape(%arg0 : vector<3x2x4xf32>) -> (vector<2x3x4xf32>) {
%c6 = constant 6 : index		%c6 = constant 6 : index
// CHECK: %[[C9:.*]] = constant 9 : index		// CHECK: %[[C9:.*]] = constant 9 : index
%c9 = constant 9 : index		%c9 = constant 9 : index
// CHECK: vector.reshape %{{.*}}, [%[[C3]], %[[C6]]], [%[[C2]], %[[C9]]], [4] : vector<3x2x4xf32> to vector<2x3x4xf32>		// CHECK: vector.reshape %{{.*}}, [%[[C3]], %[[C6]]], [%[[C2]], %[[C9]]], [4] : vector<3x2x4xf32> to vector<2x3x4xf32>
%1 = vector.reshape %arg0, [%c3, %c6], [%c2, %c9], [4]		%1 = vector.reshape %arg0, [%c3, %c6], [%c2, %c9], [4]
: vector<3x2x4xf32> to vector<2x3x4xf32>		: vector<3x2x4xf32> to vector<2x3x4xf32>

return %1 : vector<2x3x4xf32>		return %1 : vector<2x3x4xf32>
}		}
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code + +// CHECK-LABEL: @vector_fma +func @vector_fma(%a: vector<8xf32>, %b: vector<8x4xf32>) { + // CHECK: vector.fma %{{.}} : vector<8xf32> + vector.fma %a, %a, %a : vector<8xf32> + // CHECK: vector.fma %{{.}} : vector<8x4xf32> + vector.fma %b, %b, %b : vector<8x4xf32> + return +} Lint: Pre-merge checks: clang-format: please reformat the code ``` + +// CHECK-LABEL: @vector_fma +func @vector_fma(%a…

		// CHECK-LABEL: @vector_fma
		func @vector_fma(%a: vector<8xf32>, %b: vector<8x4xf32>) {
		// CHECK: vector.fma %{{.*}} : vector<8xf32>
		vector.fma %a, %a, %a : vector<8xf32>
		// CHECK: vector.fma %{{.*}} : vector<8x4xf32>
		vector.fma %b, %b, %b : vector<8x4xf32>
		return
		}

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][VectorOps] Introduce a `vector.fma` op that works on n-D vectors and lowers to `llvm.intrin.fmuladd`
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 242805

mlir/include/mlir/Dialect/VectorOps/VectorOps.td

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

mlir/test/Conversion/VectorToLLVM/vector-to-llvm.mlir

mlir/test/Dialect/VectorOps/ops.mlir

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][VectorOps] Introduce a `vector.fma` op that works on n-D vectors and lowers to `llvm.intrin.fmuladd`ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 242805

mlir/include/mlir/Dialect/VectorOps/VectorOps.td

mlir/lib/Conversion/VectorToLLVM/ConvertVectorToLLVM.cpp

mlir/test/Conversion/VectorToLLVM/vector-to-llvm.mlir

mlir/test/Dialect/VectorOps/ops.mlir

[mlir][VectorOps] Introduce a `vector.fma` op that works on n-D vectors and lowers to `llvm.intrin.fmuladd`
ClosedPublic