
Details

Reviewers

ThomasRaoux
aartbik
nicolasvasilache
dcaballe
mravishankar

Summary

After combining vector.contract with vector.broadcast, the resulting
vector.contract may have no parallel or reduction dimension that is
accessed by both the LHS and the RHS. Such ops can fail vector.contract
verification, so explicitly guard against these cases.
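
To make the failure mode concrete, here is a minimal self-contained sketch of the kind of guard the summary describes. It is not the MLIR code: `DimMap` and `accessesDim` are hypothetical stand-ins for `AffineMap` and a result-index lookup, and iterator types are modeled as plain strings.

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for an affine map whose results are plain dimensions:
// entry i is the iteration dimension read by operand dimension i.
using DimMap = std::vector<unsigned>;

// True if `map` reads iteration dimension `dim` anywhere.
bool accessesDim(const DimMap &map, unsigned dim) {
  return std::find(map.begin(), map.end(), dim) != map.end();
}

// True if at least one reduction dimension is read by both LHS and RHS.
bool hasReductionPair(const DimMap &lhs, const DimMap &rhs,
                      const std::vector<std::string> &iteratorTypes) {
  // Every map entry must name an existing iteration dimension.
  for (unsigned d : lhs)
    assert(d < iteratorTypes.size());
  for (unsigned d : rhs)
    assert(d < iteratorTypes.size());

  for (unsigned dim = 0; dim < iteratorTypes.size(); ++dim) {
    if (iteratorTypes[dim] != "reduction")
      continue;
    if (accessesDim(lhs, dim) && accessesDim(rhs, dim))
      return true;
  }
  return false;
}
```

With iterator types `{"parallel", "reduction"}`, the maps `{1}` and `{1, 0}` share reduction dimension d1. Assuming that folding a leading-dimension broadcast turns the RHS map into `{0}`, no reduction pair remains and a rewrite guarded this way would bail out.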

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

antiagainst created this revision. Apr 12 2022, 5:41 AM

Herald added a reviewer: aartbik. Apr 12 2022, 5:41 AM

Herald added a reviewer: mravishankar.

Herald added a project: Restricted Project.

Herald added subscribers: sdasgup3, wenzhicui, wrengr and 18 others.

antiagainst requested review of this revision. Apr 12 2022, 5:41 AM

Herald added a reviewer: nicolasvasilache. Apr 12 2022, 5:41 AM

Herald added a project: Restricted Project.

Herald added subscribers: stephenneuendorffer, nicolasvasilache.

Harbormaster completed remote builds in B159214: Diff 422195. Apr 12 2022, 6:22 AM

Update

antiagainst retitled this revision from [mlir][vector] Fix CombineContractBroadcast for all parallel cases to [mlir][vector] Fix CombineContractBroadcast for invalid cases. Apr 12 2022, 6:30 AM

antiagainst edited the summary of this revision.

Update

Harbormaster completed remote builds in B159231: Diff 422217. Apr 12 2022, 7:04 AM

mravishankar resigned from this revision. Apr 12 2022, 1:46 PM

ThomasRaoux added inline comments. Apr 14 2022, 7:56 PM

mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp
1058	this is only correct for float and add kind of contract?

Address comments

antiagainst marked an inline comment as done. Apr 15 2022, 6:24 AM

antiagainst added inline comments.

mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp
1058	Ah, great catch! Fixed.

Harbormaster completed remote builds in B159819: Diff 423078. Apr 15 2022, 6:49 AM

Update to fix another invalid case

antiagainst edited the summary of this revision. Apr 25 2022, 5:11 AM

Harbormaster completed remote builds in B161143: Diff 424879. Apr 25 2022, 5:51 AM

ThomasRaoux added inline comments. May 10 2022, 12:48 PM

mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp

1066–1076

I don't think we can really remove the reduction dimensions unless they are unit dimensions?
For instance:

func.func @contract_broadcast_fma(%a: vector<4xf32>, %b: vector<4xf32>, %c: vector<4xf32>) -> vector<4xf32> {
  %bcast_a = vector.broadcast %a : vector<4xf32> to vector<1x2x4xf32>
  %bcast_b = vector.broadcast %b : vector<4xf32> to vector<1x2x4xf32>
  %contract = vector.contract {
    indexing_maps = [affine_map<(d0, d1, d2) -> (d1, d2, d0)>, affine_map<(d0, d1, d2) -> (d1, d2, d0)>, affine_map<(d0, d1, d2) -> (d0)>],
    iterator_types = ["parallel", "reduction", "reduction"], kind = #vector.kind<add>
  } %bcast_a, %bcast_b, %c : vector<1x2x4xf32>, vector<1x2x4xf32> into vector<4xf32>
  return %contract: vector<4xf32>
}

shouldn't become:

func.func @contract_broadcast_fma(%arg0: vector<4xf32>, %arg1: vector<4xf32>, %arg2: vector<4xf32>) -> vector<4xf32> {
  %0 = vector.fma %arg0, %arg1, %arg2 : vector<4xf32>
  return %0 : vector<4xf32>
}
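
A quick numerical check of this objection, sketched in plain C++ rather than MLIR. `contractBroadcast` and `elementwiseFma` are hypothetical helpers: the first models the contraction above, where both operands are broadcasts of 4-element vectors and `reductionExtent` is the product of the reduction dimension sizes (1 x 2 = 2 in the example); the second models what a `vector.fma` would compute.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

using Vec4 = std::array<float, 4>;

// Models the contraction when both operands are broadcasts of `a` and `b`:
// every point of the reduction space contributes the same product a[i] * b[i],
// so the result is c[i] + reductionExtent * a[i] * b[i].
Vec4 contractBroadcast(const Vec4 &a, const Vec4 &b, const Vec4 &c,
                       std::size_t reductionExtent) {
  assert(reductionExtent > 0);
  Vec4 result = c;
  for (std::size_t i = 0; i < result.size(); ++i)
    for (std::size_t r = 0; r < reductionExtent; ++r)
      result[i] += a[i] * b[i];
  return result;
}

// Models the proposed replacement: a single multiply-add per element.
Vec4 elementwiseFma(const Vec4 &a, const Vec4 &b, const Vec4 &c) {
  Vec4 result = c;
  for (std::size_t i = 0; i < result.size(); ++i)
    result[i] += a[i] * b[i];
  return result;
}
```

Under this model the two agree only when `reductionExtent` is 1, that is, when every reduction dimension is a unit dimension. With an extent of 2, each product is counted twice, so dropping the reduction changes the result.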

1069–1074

I find it a bit odd that we have lowering to fma in the pattern that tries to combine contract and broadcast. Why can't this be a separate pattern that the user calls if they want this kind of lowering?

Remove vector.fma generation

Herald added a subscriber: bzcheeseman. May 23 2022, 12:15 PM

antiagainst marked 2 inline comments as done. May 23 2022, 12:15 PM

antiagainst added inline comments.

mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp
1066–1076	You are right. This is problematic. Removed now.
1069–1074	Makes sense. Removed this part now.

antiagainst edited the summary of this revision. May 23 2022, 12:16 PM

Harbormaster completed remote builds in B165891: Diff 431450. May 23 2022, 12:29 PM

Fix func.func

Harbormaster completed remote builds in B165906: Diff 431468. May 23 2022, 1:44 PM

ThomasRaoux added inline comments. May 23 2022, 4:04 PM

mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp
1051–1060	nit: can we reverse the condition to do early exit like the rest of the function?
1053	I don't understand why we need the second condition? As long as we have a reduction being used it should be okay to generate the new contraction op?

Address comments

antiagainst marked 2 inline comments as done. Jun 28 2022, 10:50 AM

antiagainst added inline comments.

mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp
1051–1060	Done.
1053	It's required by contraction op verification. For full reduction, valid contraction op expects scalar accumulator/result https://github.com/llvm/llvm-project/blob/main/mlir/lib/Dialect/Vector/IR/VectorOps.cpp#L674-L676. So we need to make sure we are checking that here. The second test case checks this.
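
The rule cited in this reply can be illustrated with a simplified model. This is a sketch under assumptions, not the verifier in VectorOps.cpp: `DimMap`, `expectedResultRank`, and `resultRankMatches` are hypothetical names, maps are plain dimension lists, and only ranks (not shapes) are compared. The assumed rule is that operand dimensions that are not contracted survive into the result, so when none survive the accumulator/result has to be a scalar (rank 0).

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for an affine map whose results are plain dimensions.
using DimMap = std::vector<unsigned>;

static bool accesses(const DimMap &map, unsigned dim) {
  return std::find(map.begin(), map.end(), dim) != map.end();
}

// Counts the operand dimensions that are assumed to survive into the result:
// every LHS dimension that is not contracted, plus every RHS dimension that
// the LHS does not also read (shared ones are contracted or already counted).
unsigned expectedResultRank(const DimMap &lhs, const DimMap &rhs,
                            const std::vector<std::string> &iteratorTypes) {
  unsigned rank = 0;
  for (unsigned dim : lhs) {
    assert(dim < iteratorTypes.size());
    bool contracted = iteratorTypes[dim] == "reduction" && accesses(rhs, dim);
    if (!contracted)
      ++rank;
  }
  for (unsigned dim : rhs) {
    assert(dim < iteratorTypes.size());
    if (!accesses(lhs, dim))
      ++rank;
  }
  return rank;
}

// Rank 0 stands for a scalar accumulator/result.
bool resultRankMatches(const DimMap &lhs, const DimMap &rhs,
                       const std::vector<std::string> &iteratorTypes,
                       unsigned resultRank) {
  return expectedResultRank(lhs, rhs, iteratorTypes) == resultRank;
}
```

In this model a full reduction such as maps `{0}` and `{0}` with a single reduction iterator yields rank 0, so a vector accumulator would be rejected. Likewise, maps `{1}` and `{0}` with iterators `{"parallel", "reduction"}` no longer match a rank-1 result, which is the situation the extra check is meant to catch.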

Harbormaster completed remote builds in B172538: Diff 440694. Jun 28 2022, 12:25 PM

It would be nice to refactor this to call directly into a new verifyContractionOpImpl helper, in the same way that we have tensor::verifyInsertSliceOp.

Herald added a subscriber: anlunx. Jul 4 2022, 9:36 AM

antiagainst planned changes to this revision. Sep 20 2022, 9:40 AM

antiagainst marked 2 inline comments as done.

Herald added a reviewer: dcaballe. Sep 20 2022, 9:40 AM

Diff 422214

mlir/lib/Dialect/Vector/Transforms/VectorTransforms.cpp

[... 929 lines not shown ...]	if (!changed)
return failure();		return failure();
rewriter.replaceOpWithNewOp<vector::ContractionOp>(		rewriter.replaceOpWithNewOp<vector::ContractionOp>(
contractOp, lhs, rhs, contractOp.getAcc(),		contractOp, lhs, rhs, contractOp.getAcc(),
rewriter.getAffineMapArrayAttr(maps), contractOp.getIteratorTypes());		rewriter.getAffineMapArrayAttr(maps), contractOp.getIteratorTypes());
return success();		return success();
}		}
};		};

		/// Returns true if the given `lhsMap` and `rhsMap` from a vector.contract op
		/// has a reduction dimension access pair.
		static bool hasReductionPair(AffineMap lhsMap, AffineMap rhsMap,
		ArrayAttr iteratorTypes) {
		for (const auto &it : llvm::enumerate(iteratorTypes)) {
		if (!isReductionIterator(it.value()))
		continue;
		auto lhsDim = getResultIndex(lhsMap, it.index());
		auto rhsDim = getResultIndex(rhsMap, it.index());
		if (lhsDim && rhsDim)
		return true;
		}
		return false;
		}

/// Merge BroadcastOp into ContractionOp user.		/// Merge BroadcastOp into ContractionOp user.
/// Ex:		/// Ex:
/// ```		/// ```
/// %0 = vector.broadcast %arg0 : vector<32x16xf32> to vector<8x32x16xf32>		/// %0 = vector.broadcast %arg0 : vector<32x16xf32> to vector<8x32x16xf32>
/// %1 = vector.contract {indexing_maps = [		/// %1 = vector.contract {indexing_maps = [
/// affine_map<(d0, d1, d2) -> (d0, d1, d2)>,		/// affine_map<(d0, d1, d2) -> (d0, d1, d2)>,
/// affine_map<(d0, d1, d2) -> (d0, d1, d2)>,		/// affine_map<(d0, d1, d2) -> (d0, d1, d2)>,
/// affine_map<(d0, d1, d2) -> (d0, d1)>],		/// affine_map<(d0, d1, d2) -> (d0, d1)>],
[... 53 lines not shown ...]	for (Value *operand : {&lhs, &rhs}) {
AffineMap::get(broadcast.getVectorType().getRank(), 0, originalDims,		AffineMap::get(broadcast.getVectorType().getRank(), 0, originalDims,
contractOp.getContext());		contractOp.getContext());
map = broadcastMap.compose(map);		map = broadcastMap.compose(map);
*operand = broadcast.getSource();		*operand = broadcast.getSource();
changed = true;		changed = true;
}		}
if (!changed)		if (!changed)
return failure();		return failure();

		ArrayAttr iteratorTypes = contractOp.getIteratorTypes();
		// We need to make sure at least one reduction dimension is actually used to
		// generate valid vector.contract ops.
		if (hasReductionPair(maps[0], maps[1], iteratorTypes)) {
rewriter.replaceOpWithNewOp<vector::ContractionOp>(		rewriter.replaceOpWithNewOp<vector::ContractionOp>(
contractOp, lhs, rhs, contractOp.getAcc(),		contractOp, lhs, rhs, contractOp.getAcc(),
rewriter.getAffineMapArrayAttr(maps), contractOp.getIteratorTypes());		rewriter.getAffineMapArrayAttr(maps), contractOp.getIteratorTypes());
		} else {
		// After combining, all access are through parallel dimensions. It can be
		// simplified into a vector.fma op if all maps are the same.
		if (!llvm::is_splat(maps))
		return failure();
		rewriter.replaceOpWithNewOp<vector::FMAOp>(contractOp, lhs, rhs,
		contractOp.getAcc());
		}
return success();		return success();
}		}
};		};

/// Reorders cast(broadcast) to broadcast(cast). This makes broadcast ops and		/// Reorders cast(broadcast) to broadcast(cast). This makes broadcast ops and
/// contraction ops closer, which kicks in CombineContractBroadcast pattern when		/// contraction ops closer, which kicks in CombineContractBroadcast pattern when
/// casting ops are around these operations.		/// casting ops are around these operations.
/// Ex:		/// Ex:
/// ```		/// ```
/// %0 = vector.broadcast %arg0 : vector<32x16xi8> to vector<8x32x16xi8>		/// %0 = vector.broadcast %arg0 : vector<32x16xi8> to vector<8x32x16xi8>
/// %1 = arith.extsi %0 : vector<8x32x16xi8> to vector<8x32x16xi32>		/// %1 = arith.extsi %0 : vector<8x32x16xi8> to vector<8x32x16xi32>
/// ```		/// ```
/// Gets converted to:		/// Gets converted to:
/// ```		/// ```
/// %0 = arith.extsi %0 : vector<32x16xi8> to vector<32x16xi32>		/// %0 = arith.extsi %0 : vector<32x16xi8> to vector<32x16xi32>
/// %1 = vector.broadcast %arg0 : vector<32x16xi32> to vector<8x32x16xi32>		/// %1 = vector.broadcast %arg0 : vector<32x16xi32> to vector<8x32x16xi32>
/// ```		/// ```
struct ReorderCastOpsOnBroadcast		struct ReorderCastOpsOnBroadcast
: public OpInterfaceRewritePattern<CastOpInterface> {		: public OpInterfaceRewritePattern<CastOpInterface> {
using OpInterfaceRewritePattern<CastOpInterface>::OpInterfaceRewritePattern;		using OpInterfaceRewritePattern<CastOpInterface>::OpInterfaceRewritePattern;

LogicalResult matchAndRewrite(CastOpInterface op,		LogicalResult matchAndRewrite(CastOpInterface op,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
if (op->getNumOperands() != 1)		if (op->getNumOperands() != 1)
return failure();		return failure();
auto bcastOp = op->getOperand(0).getDefiningOp<vector::BroadcastOp>();		auto bcastOp = op->getOperand(0).getDefiningOp<vector::BroadcastOp>();
if (!bcastOp)		if (!bcastOp)
return failure();		return failure();

Type castResTy = getElementTypeOrSelf(op->getResult(0));		Type castResTy = getElementTypeOrSelf(op->getResult(0));
if (auto vecTy = bcastOp.getSourceType().dyn_cast<VectorType>())		if (auto vecTy = bcastOp.getSourceType().dyn_cast<VectorType>())
castResTy = VectorType::get(vecTy.getShape(), castResTy);		castResTy = VectorType::get(vecTy.getShape(), castResTy);
auto castOp =		auto castOp =
rewriter.create(op->getLoc(), op->getName().getIdentifier(),		rewriter.create(op->getLoc(), op->getName().getIdentifier(),
bcastOp.getSource(), castResTy, op->getAttrs());		bcastOp.getSource(), castResTy, op->getAttrs());
rewriter.replaceOpWithNewOp<vector::BroadcastOp>(		rewriter.replaceOpWithNewOp<vector::BroadcastOp>(
op, op->getResult(0).getType(), castOp->getResult(0));		op, op->getResult(0).getType(), castOp->getResult(0));
return success();		return success();
}		}
};		};

/// Reorders cast(transpose) to transpose(cast). This makes broadcast ops and		/// Reorders cast(transpose) to transpose(cast). This makes broadcast ops and
/// contraction ops closer, which kicks in CombineContractTranspose pattern when		/// contraction ops closer, which kicks in CombineContractTranspose pattern when
/// casting ops are around these operations.		/// casting ops are around these operations.
/// Ex:		/// Ex:
/// ```		/// ```
/// %0 = vector.transpose %arg0, [2, 0, 1]		/// %0 = vector.transpose %arg0, [2, 0, 1]
[... 1,616 lines not shown ...]

mlir/test/Dialect/Vector/vector-reduce-to-contract.mlir

[... 80 lines not shown ...]	func @contract_broadcast(
%cst = arith.constant dense<0.000000e+00> : vector<8x32xf32>		%cst = arith.constant dense<0.000000e+00> : vector<8x32xf32>
%0 = vector.broadcast %arg0 : vector<32x16xf32> to vector<8x32x16xf32>		%0 = vector.broadcast %arg0 : vector<32x16xf32> to vector<8x32x16xf32>
%1 = vector.contract {indexing_maps = [#map0, #map0, #map1],		%1 = vector.contract {indexing_maps = [#map0, #map0, #map1],
iterator_types = ["parallel", "parallel", "reduction"],		iterator_types = ["parallel", "parallel", "reduction"],
kind = #vector.kind<add>} %0, %arg1, %cst : vector<8x32x16xf32>, vector<8x32x16xf32> into vector<8x32xf32>		kind = #vector.kind<add>} %0, %arg1, %cst : vector<8x32x16xf32>, vector<8x32x16xf32> into vector<8x32xf32>
return %1 : vector<8x32xf32>		return %1 : vector<8x32xf32>
}		}

		// -----

		// CHECK-LABEL: contract_broadcast_fma
		// CHECK-SAME: (%[[A:.+]]: vector<4xf32>, %[[B:.+]]: vector<4xf32>, %[[C:.+]]: vector<4xf32>)
		// CHECK: %[[FMA:.+]] = vector.fma %[[A]], %[[B]], %[[C]] : vector<4xf32>
		// CHECK: return %[[FMA]] : vector<4xf32
		func @contract_broadcast_fma(%a: vector<4xf32>, %b: vector<4xf32>, %c: vector<4xf32>) -> vector<4xf32> {
		%bcast_a = vector.broadcast %a : vector<4xf32> to vector<1x1x4xf32>
		%bcast_b = vector.broadcast %b : vector<4xf32> to vector<1x1x4xf32>
		%contract = vector.contract {
		indexing_maps = [affine_map<(d0, d1, d2) -> (d1, d2, d0)>, affine_map<(d0, d1, d2) -> (d1, d2, d0)>, affine_map<(d0, d1, d2) -> (d0)>],
		iterator_types = ["parallel", "reduction", "reduction"], kind = #vector.kind<add>
		} %bcast_a, %bcast_b, %c : vector<1x1x4xf32>, vector<1x1x4xf32> into vector<4xf32>
		return %contract: vector<4xf32>
		}

		// -----

		// CHECK-LABEL: func @contract_broadcast_no_reduction_pair
		// CHECK: vector.broadcast
		// CHECK: vector.contract
		func @contract_broadcast_no_reduction_pair(%a: vector<1xf32>, %b: vector<4xf32>, %c: vector<4xf32>) -> vector<4xf32> {
		%bcast = vector.broadcast %b : vector<4xf32> to vector<1x4xf32>
		%contract = vector.contract {
		indexing_maps = [affine_map<(d0, d1) -> (d1)>, affine_map<(d0, d1) -> (d1, d0)>, affine_map<(d0, d1) -> (d0)>],
		iterator_types = ["parallel", "reduction"], kind = #vector.kind<add>
		} %a, %bcast, %c : vector<1xf32>, vector<1x4xf32> into vector<4xf32>
		return %contract: vector<4xf32>
		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Reorder casting ops and vector ops. The casting ops have almost identical		// Reorder casting ops and vector ops. The casting ops have almost identical
// pattern, so only arith.extsi op is tested.		// pattern, so only arith.extsi op is tested.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

// -----		// -----

func @broadcast_vector_extsi(%a : vector<4xi8>) -> vector<2x4xi32> {		func @broadcast_vector_extsi(%a : vector<4xi8>) -> vector<2x4xi32> {
[... 26 lines not shown ...]

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][vector] Fix CombineContractBroadcast for invalid cases
Changes Planned · Public
