Download Raw Diff

Details

Reviewers

antiagainst
Mogball
ThomasRaoux
mravishankar
dcaballe
grosser
aartbik
nicolasvasilache

Commits

rG8c2ea14436c8: [mlir][vector] Fold scalar vector.extract of non-splat n-D constants

Summary

Add a new pattern to fold vector.extract over n-D constants that extract scalars.
The previous code handled ND splat constants only. The new pattern is conservative and does handle sub-vector constants.

This is to aid the arith::EmulateWideInt pass which emits a lot of 2-element vector constants.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

kuhar created this revision.Sep 12 2022, 6:33 PM

Herald added a reviewer: grosser. · View Herald TranscriptSep 12 2022, 6:33 PM

Herald added a reviewer: aartbik. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: bzcheeseman, sdasgup3, wenzhicui and 17 others. · View Herald Transcript

kuhar requested review of this revision.Sep 12 2022, 6:33 PM

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptSep 12 2022, 6:33 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald Transcript

Harbormaster completed remote builds in B186278: Diff 459619.Sep 12 2022, 6:59 PM

Thanks, @kuhar! LGTM. Just a few comments!

mlir/lib/Dialect/Vector/IR/VectorOps.cpp
1575–1576	I think this condition is guaranteed by definition of the vector type.
1577–1578	Would it be too much work to also add support for n-D cases where a scalar is extracted? It would be very useful in general! Having it working for n-D would be great if it's just about adding some extra logic to compute the position. Supporting a sub-vector extraction would require more work that we can add in the future.
1587	I wonder if it would make sense to merge this pattern with the one above. They current share some code and when we eventually support n-D cases, the would be even more similar, except by the way the constant value is extracted, right?

kuhar marked an inline comment as done.Sep 13 2022, 8:07 AM

kuhar added inline comments.

mlir/lib/Dialect/Vector/IR/VectorOps.cpp
1575–1576	Thanks, I wanted to exclude scalable vectors, but now I see I used the wrong predicate for this.
1577–1578	I considered this but was warned by @antiagainst to be careful not to blow up the size of the IR. Thinking about it more, I think this would be fine if the extracted element was a scalar instead of a nested vector constant, because we would always replace one `vector.extract` with one `arith.constant` with a single `APInt/APFloat`. IE I think it would be safe to support n-D cases when all positions are provided, but not when some dims are still left. Does this make sense?

Fixed scalable vector check

Harbormaster completed remote builds in B186374: Diff 459753.Sep 13 2022, 8:25 AM

Mogball added inline comments.Sep 13 2022, 9:47 AM

mlir/lib/Dialect/Vector/IR/VectorOps.cpp
1570	Please match this using `m_Constant`. The defining op might be a different kind of constant op.

dcaballe added inline comments.Sep 13 2022, 12:18 PM

mlir/lib/Dialect/Vector/IR/VectorOps.cpp
1577–1578	IE I think it would be safe to support n-D cases when all positions are provided, but not when some dims are still left. Does this make sense? Exactly! That's what I meant. Regarding the sub-vector extraction, I'm not sure I understand the IR size concern. Could you please elaborate? This folding is something that will eventually happen in LLVM so I think the sooner we expose these constants the better.

kuhar added inline comments.Sep 13 2022, 12:43 PM

mlir/lib/Dialect/Vector/IR/VectorOps.cpp
1577–1578	Regarding the sub-vector extraction, I'm not sure I understand the IR size concern. Could you please elaborate? This folding is something that will eventually happen in LLVM so I think the sooner we expose these constants the better. Say you have a large n-D constant vector: `%v = arith.constant dence<...> : vector<...>` and perform a series of extractions for sub-vectors. If you encode it with `vector.extract`, each extraction needs `< n * 64-bit` positions. If you convert it to a series of constants, each representing some sub-vector, these constants may be much larger than the positions. I don't know if the in-memory attributes will be deduplicated or not, but this seems like something that can negatively affect the printing/parsing time in pathological cases.

Handle n-D vectors. Match constants vectors.

kuhar retitled this revision from [mlir][vector] Fold vector.extract of non-splat 1D constants to [mlir][vector] Fold scalar vector.extract of non-splat n-D constants.Sep 13 2022, 2:27 PM

kuhar edited the summary of this revision. (Show Details)

kuhar marked an inline comment as done.

kuhar added inline comments.

mlir/lib/Dialect/Vector/IR/VectorOps.cpp
1587	IMO the new pattern is sufficiently different that merging the two does not make the code cleaner.

Thanks! LGTM

mlir/lib/Dialect/Vector/IR/VectorOps.cpp
1577–1578	If you convert it to a series of constants, each representing some sub-vector, these constants may be much larger than the positions. I don't know if the in-memory attributes will be deduplicated or not, but this seems like something that can negatively affect the printing/parsing time in pathological cases. It's ok for me if we don't want to support the sub-vector cases for now but I don't think making printing/parsing slower would justify not introducing that canonicalization. Actually, splitting a massive constant into smaller ones could even help. I would expect identical constants to be deduplicated. Again, I think this will happen in LLVM anyways.

This revision is now accepted and ready to land.Sep 13 2022, 2:39 PM

Harbormaster completed remote builds in B186452: Diff 459873.Sep 13 2022, 2:41 PM

Mogball accepted this revision.Sep 13 2022, 4:45 PM

Mogball added inline comments.

mlir/lib/Dialect/Vector/IR/VectorOps.cpp
1584	Can you drop a TODO to support other element types, like complex values?

Added a TODO to support more types. Rebased.

This revision was landed with ongoing or failed builds.Sep 13 2022, 5:31 PM

Closed by commit rG8c2ea14436c8: [mlir][vector] Fold scalar vector.extract of non-splat n-D constants (authored by kuhar). · Explain Why

This revision was automatically updated to reflect the committed changes.

kuhar added a commit: rG8c2ea14436c8: [mlir][vector] Fold scalar vector.extract of non-splat n-D constants.

Harbormaster completed remote builds in B186495: Diff 459929.Sep 13 2022, 5:51 PM

Diff 459930

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

Show First 20 Lines • Show All 1,528 Lines • ▼ Show 20 Lines	if (extractResultRank < broadcastSrcRank)
return failure();		return failure();
rewriter.replaceOpWithNewOp<vector::BroadcastOp>(		rewriter.replaceOpWithNewOp<vector::BroadcastOp>(
extractOp, extractOp.getType(), source);		extractOp, extractOp.getType(), source);
return success();		return success();
}		}
};		};

// Pattern to rewrite a ExtractOp(splat ConstantOp) -> ConstantOp.		// Pattern to rewrite a ExtractOp(splat ConstantOp) -> ConstantOp.
class ExtractOpConstantFolder final : public OpRewritePattern<ExtractOp> {		class ExtractOpSplatConstantFolder final : public OpRewritePattern<ExtractOp> {
public:		public:
using OpRewritePattern::OpRewritePattern;		using OpRewritePattern::OpRewritePattern;

LogicalResult matchAndRewrite(ExtractOp extractOp,		LogicalResult matchAndRewrite(ExtractOp extractOp,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
// Return if 'extractStridedSliceOp' operand is not defined by a		// Return if 'ExtractOp' operand is not defined by a splat vector
// ConstantOp.		// ConstantOp.
auto constantOp = extractOp.getVector().getDefiningOp<arith::ConstantOp>();		Value sourceVector = extractOp.getVector();
if (!constantOp)		Attribute vectorCst;
		if (!matchPattern(sourceVector, m_Constant(&vectorCst)))
return failure();		return failure();
auto dense = constantOp.getValue().dyn_cast<SplatElementsAttr>();		auto splat = vectorCst.dyn_cast<SplatElementsAttr>();
if (!dense)		if (!splat)
return failure();		return failure();
Attribute newAttr = dense.getSplatValue<Attribute>();		Attribute newAttr = splat.getSplatValue<Attribute>();
if (auto vecDstType = extractOp.getType().dyn_cast<VectorType>())		if (auto vecDstType = extractOp.getType().dyn_cast<VectorType>())
newAttr = DenseElementsAttr::get(vecDstType, newAttr);		newAttr = DenseElementsAttr::get(vecDstType, newAttr);
rewriter.replaceOpWithNewOp<arith::ConstantOp>(extractOp, newAttr);		rewriter.replaceOpWithNewOp<arith::ConstantOp>(extractOp, newAttr);
return success();		return success();
}		}
};		};

		// Pattern to rewrite a ExtractOp(vector<...xT> ConstantOp)[...] -> ConstantOp,
		// where the position array specifies a scalar element.
		class ExtractOpScalarVectorConstantFolder final
		: public OpRewritePattern<ExtractOp> {
		public:
		using OpRewritePattern::OpRewritePattern;

		LogicalResult matchAndRewrite(ExtractOp extractOp,
		PatternRewriter &rewriter) const override {
		// Return if 'ExtractOp' operand is not defined by a compatible vector
		// ConstantOp.
		MogballUnsubmitted Done Reply Inline Actions Please match this using `m_Constant`. The defining op might be a different kind of constant op. Mogball: Please match this using `m_Constant`. The defining op might be a different kind of constant op.
		Value sourceVector = extractOp.getVector();
		Attribute vectorCst;
		if (!matchPattern(sourceVector, m_Constant(&vectorCst)))
		return failure();

		auto vecTy = sourceVector.getType().cast<VectorType>();
		dcaballeUnsubmitted Done Reply Inline Actions I think this condition is guaranteed by definition of the vector type. dcaballe: I think this condition is guaranteed by definition of the vector type.
		kuharAuthorUnsubmitted Done Reply Inline Actions Thanks, I wanted to exclude scalable vectors, but now I see I used the wrong predicate for this. kuhar: Thanks, I wanted to exclude scalable vectors, but now I see I used the wrong predicate for this.
		Type elemTy = vecTy.getElementType();
		ArrayAttr positions = extractOp.getPosition();
		dcaballeUnsubmitted Done Reply Inline Actions Would it be too much work to also add support for n-D cases where a scalar is extracted? It would be very useful in general! Having it working for n-D would be great if it's just about adding some extra logic to compute the position. Supporting a sub-vector extraction would require more work that we can add in the future. dcaballe: Would it be too much work to also add support for n-D cases where a scalar is extracted? It…
		kuharAuthorUnsubmitted Done Reply Inline Actions I considered this but was warned by @antiagainst to be careful not to blow up the size of the IR. Thinking about it more, I think this would be fine if the extracted element was a scalar instead of a nested vector constant, because we would always replace one `vector.extract` with one `arith.constant` with a single `APInt/APFloat`. IE I think it would be safe to support n-D cases when all positions are provided, but not when some dims are still left. Does this make sense? kuhar: I considered this but was warned by @antiagainst to be careful not to blow up the size of the…
		dcaballeUnsubmitted Not Done Reply Inline Actions IE I think it would be safe to support n-D cases when all positions are provided, but not when some dims are still left. Does this make sense? Exactly! That's what I meant. Regarding the sub-vector extraction, I'm not sure I understand the IR size concern. Could you please elaborate? This folding is something that will eventually happen in LLVM so I think the sooner we expose these constants the better. dcaballe: > IE I think it would be safe to support n-D cases when all positions are provided, but not…
		kuharAuthorUnsubmitted Done Reply Inline Actions Regarding the sub-vector extraction, I'm not sure I understand the IR size concern. Could you please elaborate? This folding is something that will eventually happen in LLVM so I think the sooner we expose these constants the better. Say you have a large n-D constant vector: `%v = arith.constant dence<...> : vector<...>` and perform a series of extractions for sub-vectors. If you encode it with `vector.extract`, each extraction needs `< n * 64-bit` positions. If you convert it to a series of constants, each representing some sub-vector, these constants may be much larger than the positions. I don't know if the in-memory attributes will be deduplicated or not, but this seems like something that can negatively affect the printing/parsing time in pathological cases. kuhar: > Regarding the sub-vector extraction, I'm not sure I understand the IR size concern. Could you…
		dcaballeUnsubmitted Not Done Reply Inline Actions If you convert it to a series of constants, each representing some sub-vector, these constants may be much larger than the positions. I don't know if the in-memory attributes will be deduplicated or not, but this seems like something that can negatively affect the printing/parsing time in pathological cases. It's ok for me if we don't want to support the sub-vector cases for now but I don't think making printing/parsing slower would justify not introducing that canonicalization. Actually, splitting a massive constant into smaller ones could even help. I would expect identical constants to be deduplicated. Again, I think this will happen in LLVM anyways. dcaballe: > If you convert it to a series of constants, each representing some sub-vector, these…
		if (vecTy.isScalable())
		return failure();
		// Do not allow extracting sub-vectors to limit the size of the generated
		// constants.
		if (vecTy.getRank() != static_cast<int64_t>(positions.size()))
		return failure();
		MogballUnsubmitted Not Done Reply Inline Actions Can you drop a TODO to support other element types, like complex values? Mogball: Can you drop a TODO to support other element types, like complex values?
		// TODO: Handle more element types, e.g., complex values.
		if (!elemTy.isIntOrIndexOrFloat())
		return failure();
		dcaballeUnsubmitted Not Done Reply Inline Actions I wonder if it would make sense to merge this pattern with the one above. They current share some code and when we eventually support n-D cases, the would be even more similar, except by the way the constant value is extracted, right? dcaballe: I wonder if it would make sense to merge this pattern with the one above. They current share…
		kuharAuthorUnsubmitted Done Reply Inline Actions IMO the new pattern is sufficiently different that merging the two does not make the code cleaner. kuhar: IMO the new pattern is sufficiently different that merging the two does not make the code…

		// The splat case is handled by `ExtractOpSplatConstantFolder`.
		auto dense = vectorCst.dyn_cast<DenseElementsAttr>();
		if (!dense \|\| dense.isSplat())
		return failure();

		// Calculate the flattened position.
		int64_t elemPosition = 0;
		int64_t innerElems = 1;
		for (auto [dimSize, positionInDim] :
		llvm::reverse(llvm::zip(vecTy.getShape(), positions))) {
		int64_t positionVal = positionInDim.cast<IntegerAttr>().getInt();
		elemPosition += positionVal * innerElems;
		innerElems *= dimSize;
		}

		Attribute newAttr;
		if (vecTy.getElementType().isIntOrIndex()) {
		auto values = to_vector(dense.getValues<APInt>());
		newAttr = IntegerAttr::get(extractOp.getType(), values[elemPosition]);
		} else if (vecTy.getElementType().isa<FloatType>()) {
		auto values = to_vector(dense.getValues<APFloat>());
		newAttr = FloatAttr::get(extractOp.getType(), values[elemPosition]);
		}
		assert(newAttr && "Unhandled case");

		rewriter.replaceOpWithNewOp<arith::ConstantOp>(extractOp, newAttr);
		return success();
		}
		};

} // namespace		} // namespace

void ExtractOp::getCanonicalizationPatterns(RewritePatternSet &results,		void ExtractOp::getCanonicalizationPatterns(RewritePatternSet &results,
MLIRContext *context) {		MLIRContext *context) {
results.add<ExtractOpConstantFolder, ExtractOpFromBroadcast>(context);		results.add<ExtractOpSplatConstantFolder, ExtractOpScalarVectorConstantFolder,
		ExtractOpFromBroadcast>(context);
}		}

static void populateFromInt64AttrArray(ArrayAttr arrayAttr,		static void populateFromInt64AttrArray(ArrayAttr arrayAttr,
SmallVectorImpl<int64_t> &results) {		SmallVectorImpl<int64_t> &results) {
for (auto attr : arrayAttr)		for (auto attr : arrayAttr)
results.push_back(attr.cast<IntegerAttr>().getInt());		results.push_back(attr.cast<IntegerAttr>().getInt());
}		}

▲ Show 20 Lines • Show All 3,582 Lines • Show Last 20 Lines

mlir/test/Dialect/Vector/canonicalize.mlir

Show First 20 Lines • Show All 1,361 Lines • ▼ Show 20 Lines	func.func @insert_extract_to_broadcast(%arg0 : vector<1x1x4xf32>,
%arg1 : vector<4xf32>) -> (vector<4xf32>, vector<1x1x4xf32>) {		%arg1 : vector<4xf32>) -> (vector<4xf32>, vector<1x1x4xf32>) {
%0 = vector.extract %arg0[0, 0] : vector<1x1x4xf32>		%0 = vector.extract %arg0[0, 0] : vector<1x1x4xf32>
%1 = vector.insert %arg1, %arg0 [0, 0] : vector<4xf32> into vector<1x1x4xf32>		%1 = vector.insert %arg1, %arg0 [0, 0] : vector<4xf32> into vector<1x1x4xf32>
return %0, %1 : vector<4xf32>, vector<1x1x4xf32>		return %0, %1 : vector<4xf32>, vector<1x1x4xf32>
}		}

// -----		// -----

// CHECK-LABEL: extract_constant		// CHECK-LABEL: func.func @extract_splat_constant
// CHECK-DAG: %[[CST1:.*]] = arith.constant 1 : i32		// CHECK-DAG: %[[CST1:.*]] = arith.constant 1 : i32
// CHECK-DAG: %[[CST0:.*]] = arith.constant dense<2.000000e+00> : vector<7xf32>		// CHECK-DAG: %[[CST0:.*]] = arith.constant dense<2.000000e+00> : vector<7xf32>
// CHECK: return %[[CST0]], %[[CST1]] : vector<7xf32>, i32		// CHECK-NEXT: return %[[CST0]], %[[CST1]] : vector<7xf32>, i32
func.func @extract_constant() -> (vector<7xf32>, i32) {		func.func @extract_splat_constant() -> (vector<7xf32>, i32) {
%cst = arith.constant dense<2.000000e+00> : vector<29x7xf32>		%cst = arith.constant dense<2.000000e+00> : vector<29x7xf32>
%cst_1 = arith.constant dense<1> : vector<4x37x9xi32>		%cst_1 = arith.constant dense<1> : vector<4x37x9xi32>
%0 = vector.extract %cst[2] : vector<29x7xf32>		%0 = vector.extract %cst[2] : vector<29x7xf32>
%1 = vector.extract %cst_1[1, 4, 5] : vector<4x37x9xi32>		%1 = vector.extract %cst_1[1, 4, 5] : vector<4x37x9xi32>
return %0, %1 : vector<7xf32>, i32		return %0, %1 : vector<7xf32>, i32
}		}

// -----		// -----

		// CHECK-LABEL: func.func @extract_1d_constant
		// CHECK-DAG: %[[I32CST:.*]] = arith.constant 3 : i32
		// CHECK-DAG: %[[IDXCST:.*]] = arith.constant 1 : index
		// CHECK-DAG: %[[F32CST:.*]] = arith.constant 2.000000e+00 : f32
		// CHECK-NEXT: return %[[I32CST]], %[[IDXCST]], %[[F32CST]] : i32, index, f32
		func.func @extract_1d_constant() -> (i32, index, f32) {
		%icst = arith.constant dense<[1, 2, 3, 4]> : vector<4xi32>
		%e = vector.extract %icst[2] : vector<4xi32>
		%idx_cst = arith.constant dense<[0, 1, 2]> : vector<3xindex>
		%f = vector.extract %idx_cst[1] : vector<3xindex>
		%fcst = arith.constant dense<[2.000000e+00, 3.000000e+00, 4.000000e+00]> : vector<3xf32>
		%g = vector.extract %fcst[0] : vector<3xf32>
		return %e, %f, %g : i32, index, f32
		}

		// -----

		// CHECK-LABEL: func.func @extract_2d_constant
		// CHECK-DAG: %[[ACST:.*]] = arith.constant 0 : i32
		// CHECK-DAG: %[[BCST:.*]] = arith.constant 2 : i32
		// CHECK-DAG: %[[CCST:.*]] = arith.constant 3 : i32
		// CHECK-DAG: %[[DCST:.*]] = arith.constant 5 : i32
		// CHECK-NEXT: return %[[ACST]], %[[BCST]], %[[CCST]], %[[DCST]] : i32, i32, i32, i32
		func.func @extract_2d_constant() -> (i32, i32, i32, i32) {
		%cst = arith.constant dense<[[0, 1, 2], [3, 4, 5]]> : vector<2x3xi32>
		%a = vector.extract %cst[0, 0] : vector<2x3xi32>
		%b = vector.extract %cst[0, 2] : vector<2x3xi32>
		%c = vector.extract %cst[1, 0] : vector<2x3xi32>
		%d = vector.extract %cst[1, 2] : vector<2x3xi32>
		return %a, %b, %c, %d : i32, i32, i32, i32
		}

		// -----

		// CHECK-LABEL: func.func @extract_3d_constant
		// CHECK-DAG: %[[ACST:.*]] = arith.constant 0 : i32
		// CHECK-DAG: %[[BCST:.*]] = arith.constant 1 : i32
		// CHECK-DAG: %[[CCST:.*]] = arith.constant 9 : i32
		// CHECK-DAG: %[[DCST:.*]] = arith.constant 10 : i32
		// CHECK-NEXT: return %[[ACST]], %[[BCST]], %[[CCST]], %[[DCST]] : i32, i32, i32, i32
		func.func @extract_3d_constant() -> (i32, i32, i32, i32) {
		%cst = arith.constant dense<[[[0, 1], [2, 3], [4, 5]], [[6, 7], [8, 9], [10, 11]]]> : vector<2x3x2xi32>
		%a = vector.extract %cst[0, 0, 0] : vector<2x3x2xi32>
		%b = vector.extract %cst[0, 0, 1] : vector<2x3x2xi32>
		%c = vector.extract %cst[1, 1, 1] : vector<2x3x2xi32>
		%d = vector.extract %cst[1, 2, 0] : vector<2x3x2xi32>
		return %a, %b, %c, %d : i32, i32, i32, i32
		}

		// -----

// CHECK-LABEL: extract_extract_strided		// CHECK-LABEL: extract_extract_strided
// CHECK-SAME: %[[A:.*]]: vector<32x16x4xf16>		// CHECK-SAME: %[[A:.*]]: vector<32x16x4xf16>
// CHECK: %[[V:.*]] = vector.extract %[[A]][9, 7] : vector<32x16x4xf16>		// CHECK: %[[V:.*]] = vector.extract %[[A]][9, 7] : vector<32x16x4xf16>
// CHECK: return %[[V]] : vector<4xf16>		// CHECK: return %[[V]] : vector<4xf16>
func.func @extract_extract_strided(%arg0: vector<32x16x4xf16>) -> vector<4xf16> {		func.func @extract_extract_strided(%arg0: vector<32x16x4xf16>) -> vector<4xf16> {
%1 = vector.extract_strided_slice %arg0		%1 = vector.extract_strided_slice %arg0
{offsets = [7, 3], sizes = [10, 8], strides = [1, 1]} :		{offsets = [7, 3], sizes = [10, 8], strides = [1, 1]} :
vector<32x16x4xf16> to vector<10x8x4xf16>		vector<32x16x4xf16> to vector<10x8x4xf16>
▲ Show 20 Lines • Show All 351 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][vector] Fold scalar vector.extract of non-splat n-D constants
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 459930

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

mlir/test/Dialect/Vector/canonicalize.mlir

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][vector] Fold scalar vector.extract of non-splat n-D constantsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 459930

mlir/lib/Dialect/Vector/IR/VectorOps.cpp

mlir/test/Dialect/Vector/canonicalize.mlir

[mlir][vector] Fold scalar vector.extract of non-splat n-D constants
ClosedPublic