This is an archive of the discontinued LLVM Phabricator instance.

[MLIR][LinAlg] Detensoring CF cost-model: look forward.
ClosedPublic

Authored by ergawy on Apr 14 2021, 2:13 AM.

Download Raw Diff

Details

Reviewers

nicolasvasilache
silvas

Commits

rG0b05207e45ef: [MLIR][LinAlg] Detensoring CF cost-model: look forward.

Summary

This patch extends the control-flow cost-model for detensoring by
implementing a forward-looking pass on block arguments that should be
detensored. This makes sure that if a (to-be-detensored) block argument
"escapes" its block through the terminator, then the successor arguments
are also detensored.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

ergawy created this revision.Apr 14 2021, 2:13 AM

Herald added subscribers: dcaballe, cota, mravishankar and 17 others. · View Herald TranscriptApr 14 2021, 2:13 AM

ergawy requested review of this revision.Apr 14 2021, 2:13 AM

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptApr 14 2021, 2:13 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: limo1996, stephenneuendorffer, nicolasvasilache. · View Herald Transcript

ergawy added a reviewer: silvas.Apr 14 2021, 2:13 AM

ergawy added inline comments.Apr 14 2021, 2:15 AM

mlir/lib/Dialect/Linalg/Transforms/Detensorize.cpp
495	For now, I ignored this part since it might complicate the algorithm by quite a bit. I prefer to integrate detensoring in IREE after patch first and derive further changes to the cost-model on a need-by-need basis. WDYT?

Harbormaster completed remote builds in B98645: Diff 337377.Apr 14 2021, 3:02 AM

silvas added inline comments.Apr 14 2021, 11:32 AM

mlir/lib/Dialect/Linalg/Transforms/Detensorize.cpp
474	You should be able to fold this into the worklist traversal above.

silvas added inline comments.Apr 14 2021, 11:39 AM

mlir/lib/Dialect/Linalg/Transforms/Detensorize.cpp
474	To elaborate on that, think of this as finding connected components on the undirected graph where Value's are nodes, and an undirected edge exists between Value's (v1, v2) when: v2 is an operand of the defining op of v1 (or vice versa), and the defining op is a detensorable op v2 is a block argument and v1 is a successor arg that corresponds to it in a predecessor (or vice versa). You only need one worklist traversal (which is a DFS) to discover this. The only thing that is slightly custom is that when you traverse enumerate the edges incident to a node, you need to check both situations 1. and 2. Note that the edge is undirected, but the IR data structures have a directedness (mathematically, the above definition is the same if we omit "or vice versa", but I kept it for clarity).

silvas added inline comments.Apr 14 2021, 11:42 AM

mlir/lib/Dialect/Linalg/Transforms/Detensorize.cpp
495	I would prioritize integrating this into IREE. If you structure the code as a DFS over the graph I described above, none of these additions should be much code or very complicated.

Handle review comments:

Fold the 2 phases of the cost-mode into one.

ergawy marked 3 inline comments as done.Apr 16 2021, 12:37 AM

ergawy added inline comments.

mlir/lib/Dialect/Linalg/Transforms/Detensorize.cpp
474	That's awesome. Thanks a lot! Folded the 2 phases into 1.

ergawy marked an inline comment as done.Apr 16 2021, 12:37 AM

Harbormaster completed remote builds in B99098: Diff 338010.Apr 16 2021, 1:16 AM

Awesome. Looks great!

mlir/lib/Dialect/Linalg/Transforms/Detensorize.cpp
347	could you elaborate a bit more on 2.1 and 2.2? I liked the level of detail you had for 1.1 and 1.2
425	typo: FromElemntsOp

This revision is now accepted and ready to land.Apr 19 2021, 2:21 PM

Add more detailed docs.

This revision was landed with ongoing or failed builds.Apr 20 2021, 12:02 AM

Closed by commit rG0b05207e45ef: [MLIR][LinAlg] Detensoring CF cost-model: look forward. (authored by ergawy). · Explain Why

This revision was automatically updated to reflect the committed changes.

ergawy added a commit: rG0b05207e45ef: [MLIR][LinAlg] Detensoring CF cost-model: look forward..

Harbormaster completed remote builds in B99627: Diff 338730.Apr 20 2021, 12:29 AM

Revision Contents

Path

Size

mlir/

lib/

Dialect/

Linalg/

Transforms/

Detensorize.cpp

62 lines

test/

Dialect/

Linalg/

	detensorize_0d.mlir
	detensorized_0d.mlir

detensorize_if.mlir

157 lines

detensorize_while.mlir

10 lines

detensorized_0d.mlir

Diff 338731

mlir/lib/Dialect/Linalg/Transforms/Detensorize.cpp

Show First 20 Lines • Show All 294 Lines • ▼ Show 20 Lines	void compute(FuncOp func, DetensorizeTypeConverter typeConverter,
SmallVector<Value> workList;		SmallVector<Value> workList;

func.walk(		func.walk(
[&](CondBranchOp condBr) { workList.push_back(condBr.condition()); });		[&](CondBranchOp condBr) { workList.push_back(condBr.condition()); });

DenseSet<Value> visitedValues;		DenseSet<Value> visitedValues;
DenseSet<Operation *> visitedOps;		DenseSet<Operation *> visitedOps;

		// For a (to-be-detesored) value, check if it "escapes" the block by being
		// passed to terminator. If it does, then workList is updated with the
		// corresponding argument to the successor block.
		auto updateWorkListWithSuccessorArguments =
		[&](Value value, BranchOpInterface terminator) {
		if (!terminator)
		return;

		for (auto operandIdx :
		llvm::seq<unsigned>(0, terminator->getOperands().size())) {
		Value operand = terminator->getOperand(operandIdx);

		if (operand == value) {
		auto succBlockArg =
		terminator.getSuccessorBlockArgument(operandIdx);

		if (succBlockArg && !blockArgsToDetensor.count(*succBlockArg))
		workList.push_back(*succBlockArg);
		}
		}
		};

while (!workList.empty()) {		while (!workList.empty()) {
Value currentItem = workList.pop_back_val();		Value currentItem = workList.pop_back_val();

if (!visitedValues.insert(currentItem).second)		if (!visitedValues.insert(currentItem).second)
continue;		continue;

// The current item is defined by a block argument.		// 1 - Look forward:
if (auto bbarg = currentItem.dyn_cast<BlockArgument>()) {		// 1.1 - If currentItem escapes to one or more successors, add
		// the corresponding successor arguments to workList.
		updateWorkListWithSuccessorArguments(
		currentItem, dyn_cast<BranchOpInterface>(
		currentItem.getParentBlock()->getTerminator()));

		// 1.2 - For each user of currentItem, add the defined values to
		// workList. This way, the user ops can be inspected later if they are
		// detensorable and if so, their operands will be added to workList to
		// potentially discover other parts of the detensorable component.
		for (auto *user : currentItem.getUsers())
		for (Value result : user->getResults())
		workList.push_back(result);

		// 2 - Look backward:
		// 2.1 - The current item is defined by a block argument. If the owner
		silvasUnsubmitted Done Reply Inline Actions could you elaborate a bit more on 2.1 and 2.2? I liked the level of detail you had for 1.1 and 1.2 silvas: could you elaborate a bit more on 2.1 and 2.2? I liked the level of detail you had for 1.1 and…
		// block is a non-entry one, then:
		// * Add the argument to blockArgsToDetensor.
		// * Walk the use-def chain backwards to add each predecessor's
		// terminator-operands corresponding to currentItem to workList.
		if (currentItem.dyn_cast<BlockArgument>()) {
BlockArgument currentItemBlockArgument =		BlockArgument currentItemBlockArgument =
currentItem.cast<BlockArgument>();		currentItem.cast<BlockArgument>();
Block *ownerBlock = currentItemBlockArgument.getOwner();		Block *ownerBlock = currentItemBlockArgument.getOwner();

// Function arguments are not detensored/converted.		// Function arguments are not detensored/converted.
if (&*ownerBlock->getParent()->begin() == ownerBlock)		if (&*ownerBlock->getParent()->begin() == ownerBlock)
continue;		continue;

Show All 30 Lines	void compute(FuncOp func, DetensorizeTypeConverter typeConverter,
continue;		continue;
}		}

Operation *currentItemDefiningOp = currentItem.getDefiningOp();		Operation *currentItemDefiningOp = currentItem.getDefiningOp();

if (!visitedOps.insert(currentItemDefiningOp).second)		if (!visitedOps.insert(currentItemDefiningOp).second)
continue;		continue;

// The current item is computed by a GenericOp.		// 2.2 - The current item is computed by a GenericOp. If the op should
		// be detensored, then:
		// * Add it to opsToDetensor.
		// * Add its operands to workList to discover other parts of the
		// potentially detensorable component.
if (auto genericOp = dyn_cast<GenericOp>(currentItemDefiningOp)) {		if (auto genericOp = dyn_cast<GenericOp>(currentItemDefiningOp)) {
// The op was encountered already, no need to inspect it again.		// The op was encountered already, no need to inspect it again.
if (opsToDetensor.count(genericOp))		if (opsToDetensor.count(genericOp))
continue;		continue;

// TODO: For now, we give up if any of the control-flow components		// TODO: For now, we give up if any of the control-flow components
// in a function is not detensorable. Fix that.		// in a function is not detensorable. Fix that.
if (!shouldBeDetensored(genericOp, typeConverter)) {		if (!shouldBeDetensored(genericOp, typeConverter)) {
opsToDetensor.clear();		opsToDetensor.clear();
blockArgsToDetensor.clear();		blockArgsToDetensor.clear();
return;		return;
}		}

opsToDetensor.insert(genericOp);		opsToDetensor.insert(genericOp);

for (Value genericOpOperand : genericOp.inputs())		for (Value genericOpOperand : genericOp.inputs())
workList.push_back(genericOpOperand);		workList.push_back(genericOpOperand);

continue;		continue;
}		}

// The current item is the result of a FromElemntsOp, it will be		// 2.3 - The current item is the result of a FromElementsOp, it will be
		silvasUnsubmitted Done Reply Inline Actions typo: FromElemntsOp silvas: typo: FromElemntsOp
// trivially detensored later as part of canonicalization patterns		// trivially detensored later as part of canonicalization patterns
// applied at the end of detensoring.		// applied at the end of detensoring.
//		//
// Note: No need to check whether the result type of this op is		// Note: No need to check whether the result type of this op is
// detensorable since if it wasn't we wouldn't reach that point in the		// detensorable since if it wasn't we wouldn't reach that point in the
// work list.		// work list.
if (dyn_cast<tensor::FromElementsOp>(currentItemDefiningOp))		if (dyn_cast<tensor::FromElementsOp>(currentItemDefiningOp))
continue;		continue;

// The current item is the result of a scalar op, add all its operands		// 2.4 - The current item is the result of a scalar op, add all its
// to the work list.		// operands to the work list.
if (llvm::all_of(		if (llvm::all_of(
currentItemDefiningOp->getResultTypes(),		currentItemDefiningOp->getResultTypes(),
[&](Type resultType) { return resultType.isIntOrFloat(); }))		[&](Type resultType) { return resultType.isIntOrFloat(); }))
for (Value scalarOpOperand : currentItemDefiningOp->getOperands())		for (Value scalarOpOperand : currentItemDefiningOp->getOperands())
workList.push_back(scalarOpOperand);		workList.push_back(scalarOpOperand);
}		}
}		}
};		};
Show All 21 Lines	void runOnFunction() override {
RewritePatternSet patterns(context);		RewritePatternSet patterns(context);
ConversionTarget target(*context);		ConversionTarget target(*context);
DenseSet<Operation *> opsToDetensor;		DenseSet<Operation *> opsToDetensor;
DenseMap<Operation *, DenseSet<int>> detensorableBranchOps;		DenseMap<Operation *, DenseSet<int>> detensorableBranchOps;
DenseSet<BlockArgument> blockArgsToDetensor;		DenseSet<BlockArgument> blockArgsToDetensor;

if (aggressiveMode.getValue()) {		if (aggressiveMode.getValue()) {
AggressiveDetensoringModel costModel;		AggressiveDetensoringModel costModel;
costModel.compute(getFunction(), typeConverter, opsToDetensor,		costModel.compute(getFunction(), typeConverter, opsToDetensor,
		silvasUnsubmitted Done Reply Inline Actions You should be able to fold this into the worklist traversal above. silvas: You should be able to fold this into the worklist traversal above.
		silvasUnsubmitted Done Reply Inline Actions To elaborate on that, think of this as finding connected components on the undirected graph where Value's are nodes, and an undirected edge exists between Value's (v1, v2) when: v2 is an operand of the defining op of v1 (or vice versa), and the defining op is a detensorable op v2 is a block argument and v1 is a successor arg that corresponds to it in a predecessor (or vice versa). You only need one worklist traversal (which is a DFS) to discover this. The only thing that is slightly custom is that when you traverse enumerate the edges incident to a node, you need to check both situations 1. and 2. Note that the edge is undirected, but the IR data structures have a directedness (mathematically, the above definition is the same if we omit "or vice versa", but I kept it for clarity). silvas: To elaborate on that, think of this as finding connected components on the undirected graph…
		ergawyAuthorUnsubmitted Done Reply Inline Actions That's awesome. Thanks a lot! Folded the 2 phases into 1. ergawy: That's awesome. Thanks a lot! Folded the 2 phases into 1.
blockArgsToDetensor);		blockArgsToDetensor);

} else {		} else {
PureControlFlowDetectionModel costModel;		PureControlFlowDetectionModel costModel;
costModel.compute(getFunction(), typeConverter, opsToDetensor,		costModel.compute(getFunction(), typeConverter, opsToDetensor,
blockArgsToDetensor);		blockArgsToDetensor);
}		}

detensorableBranchOps =		detensorableBranchOps =
CostModel::computeBranchOpDetensoring(blockArgsToDetensor);		CostModel::computeBranchOpDetensoring(blockArgsToDetensor);

target.addDynamicallyLegalOp<GenericOp>(		target.addDynamicallyLegalOp<GenericOp>(
[&](GenericOp op) { return !opsToDetensor.count(op); });		[&](GenericOp op) { return !opsToDetensor.count(op); });

target.addDynamicallyLegalOp<FuncOp>([&](FuncOp op) {		target.addDynamicallyLegalOp<FuncOp>([&](FuncOp op) {
// A function is legal if all of its non-entry blocks are legal. We		// A function is legal if all of its non-entry blocks are legal. We
// don't legalize the entry block (i.e. the function's signature) since		// don't legalize the entry block (i.e. the function's signature)
// detensoring can't happen along external calling convention		// since detensoring can't happen along external calling convention
// boundaries, which we conservatively approximate as all function		// boundaries, which we conservatively approximate as all function
// signatures.		// signatures.
return llvm::all_of(llvm::drop_begin(op.getBody(), 1), [&](Block &block) {		return llvm::all_of(llvm::drop_begin(op.getBody(), 1), [&](Block &block) {
		ergawyAuthorUnsubmitted Done Reply Inline Actions For now, I ignored this part since it might complicate the algorithm by quite a bit. I prefer to integrate detensoring in IREE after patch first and derive further changes to the cost-model on a need-by-need basis. WDYT? ergawy: For now, I ignored this part since it might complicate the algorithm by quite a bit. I prefer…
		silvasUnsubmitted Done Reply Inline Actions I would prioritize integrating this into IREE. If you structure the code as a DFS over the graph I described above, none of these additions should be much code or very complicated. silvas: I would prioritize integrating this into IREE. If you structure the code as a DFS over the…
if (llvm::any_of(blockArgsToDetensor, [&](BlockArgument blockArgument) {		if (llvm::any_of(blockArgsToDetensor, [&](BlockArgument blockArgument) {
return blockArgument.getOwner() == &block &&		return blockArgument.getOwner() == &block &&
!typeConverter.isLegal(blockArgument.getType());		!typeConverter.isLegal(blockArgument.getType());
})) {		})) {
return false;		return false;
}		}
return true;		return true;
});		});
▲ Show 20 Lines • Show All 59 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/detensorize_0d.mlir

This file was moved from mlir/test/Dialect/Linalg/detensorized_0d.mlir.

The contents of this file were not changed.

mlir/test/Dialect/Linalg/detensorize_if.mlir

// RUN: mlir-opt %s -allow-unregistered-dialect -linalg-detensorize \| FileCheck %s		// RUN: mlir-opt %s -split-input-file -allow-unregistered-dialect -linalg-detensorize \| FileCheck %s

#map0 = affine_map<() -> ()>		#map0 = affine_map<() -> ()>

#attrs = {		#attrs = {
indexing_maps = [#map0, #map0, #map0],		indexing_maps = [#map0, #map0, #map0],
iterator_types = []		iterator_types = []
}		}

Show All 33 Lines	^bb3(%10: tensor<i32>): // pred: ^bb1
return %10 : tensor<i32>		return %10 : tensor<i32>
}		}

// CHECK-LABEL: func @main()		// CHECK-LABEL: func @main()
// CHECK-NEXT: constant 0		// CHECK-NEXT: constant 0
// CHECK-NEXT: constant 10		// CHECK-NEXT: constant 10
// CHECK-NEXT: br ^[[bb1:.]](%{{.}}: i32)		// CHECK-NEXT: br ^[[bb1:.]](%{{.}}: i32)
// CHECK-NEXT: ^[[bb1]](%{{.*}}: i32):		// CHECK-NEXT: ^[[bb1]](%{{.*}}: i32):
// CHECK-NEXT: tensor.from_elements %{{.*}}
// CHECK-NEXT: linalg.tensor_reshape %{{.*}}
// CHECK-NEXT: cmpi slt, %{{.}}, %{{.}}		// CHECK-NEXT: cmpi slt, %{{.}}, %{{.}}
// CHECK-NEXT: cond_br %{{.}}, ^[[bb2:.]](%{{.}} : tensor<i32>), ^bb3(%{{.}} : tensor<i32>)		// CHECK-NEXT: cond_br %{{.}}, ^[[bb2:.]](%{{.}} : i32), ^bb3(%{{.}} : i32)
// CHECK-NEXT: ^[[bb2]](%{{.*}}: tensor<i32>)		// CHECK-NEXT: ^[[bb2]](%{{.*}}: i32)
// CHECK-NEXT: linalg.init_tensor
// CHECK-NEXT: linalg.generic
// CHECK-NEXT: ^{{.}}(%{{.}}: i32, %{{.}}: i32, %{{.}}: i32)
// CHECK-NEXT: addi %{{.}}, %{{.}}		// CHECK-NEXT: addi %{{.}}, %{{.}}
// CHECK-NEXT: linalg.yield %{{.*}}		// CHECK-NEXT: br ^[[bb3:.]](%{{.}} : i32)
// CHECK-NEXT: } -> tensor<i32>		// CHECK-NEXT: ^[[bb3]](%{{.*}}: i32)
// CHECK-NEXT: br ^[[bb3:.]](%{{.}} : tensor<i32>)		// CHECK-NEXT: tensor.from_elements %{{.*}} : tensor<1xi32>
// CHECK-NEXT: ^[[bb3]](%{{.*}}: tensor<i32>)		// CHECK-NEXT: linalg.tensor_reshape %{{.*}} [] : tensor<1xi32> into tensor<i32>
		// CHECK-NEXT: return %{{.*}}
		// CHECK-NEXT: }

		// -----

		// Similar to the above test with one change: one of the block after the
		// if-condition passes/forwards its tensor argument to another block.

		#map0 = affine_map<() -> ()>

		#attrs = {
		indexing_maps = [#map0, #map0, #map0],
		iterator_types = []
		}

		func @main() -> (tensor<i32>) attributes {} {
		%c0 = constant 0 : i32
		%0 = tensor.from_elements %c0 : tensor<1xi32>
		%reshaped0 = linalg.tensor_reshape %0 [] : tensor<1xi32> into tensor<i32>
		%c10 = constant 10 : i32
		%1 = tensor.from_elements %c10 : tensor<1xi32>
		%reshaped1 = linalg.tensor_reshape %1 [] : tensor<1xi32> into tensor<i32>
		br ^bb1(%reshaped0 : tensor<i32>)

		^bb1(%2: tensor<i32>): // 2 preds: ^bb0, ^bb2
		%3 = linalg.init_tensor [] : tensor<i1>
		%4 = linalg.generic #attrs
		ins(%2, %reshaped1 : tensor<i32>, tensor<i32>)
		outs(%3 : tensor<i1>) {
		^bb0(%arg0: i32, %arg1: i32, %arg2: i1): // no predecessors
		%8 = cmpi slt, %arg0, %arg1 : i32
		linalg.yield %8 : i1
		} -> tensor<i1>
		%5 = tensor.extract %4[] : tensor<i1>
		cond_br %5, ^bb2(%2 : tensor<i32>), ^bb3(%2 : tensor<i32>)

		^bb2(%6: tensor<i32>): // pred: ^bb1
		%7 = linalg.init_tensor [] : tensor<i32>
		%8 = linalg.generic #attrs
		ins(%6, %6 : tensor<i32>, tensor<i32>)
		outs(%7 : tensor<i32>) {
		^bb0(%arg0: i32, %arg1: i32, %arg2: i32): // no predecessors
		%9 = addi %arg0, %arg1 : i32
		linalg.yield %9 : i32
		} -> tensor<i32>
		br ^bb3(%8 : tensor<i32>)

		^bb3(%10: tensor<i32>): // pred: ^bb1
		br ^bb4(%10 : tensor<i32>)

		^bb4(%11: tensor<i32>): // pred: ^bb1
		return %11 : tensor<i32>
		}

		// CHECK-LABEL: func @main()
		// CHECK-NEXT: constant 0
		// CHECK-NEXT: constant 10
		// CHECK-NEXT: br ^[[bb1:.]](%{{.}}: i32)
		// CHECK-NEXT: ^[[bb1]](%{{.*}}: i32):
		// CHECK-NEXT: cmpi slt, %{{.}}, %{{.}}
		// CHECK-NEXT: cond_br %{{.}}, ^[[bb2:.]](%{{.}} : i32), ^bb3(%{{.}} : i32)
		// CHECK-NEXT: ^[[bb2]](%{{.*}}: i32)
		// CHECK-NEXT: addi %{{.}}, %{{.}}
		// CHECK-NEXT: br ^[[bb3:.]](%{{.}} : i32)
		// CHECK-NEXT: ^[[bb3]](%{{.*}}: i32)
		// CHECK-NEXT: br ^[[bb4:.]](%{{.}} : i32)
		// CHECK-NEXT: ^[[bb4]](%{{.*}}: i32)
		// CHECK-NEXT: tensor.from_elements %{{.*}} : tensor<1xi32>
		// CHECK-NEXT: linalg.tensor_reshape %{{.*}} [] : tensor<1xi32> into tensor<i32>
		// CHECK-NEXT: return %{{.*}}
		// CHECK-NEXT: }

		// -----

		#map0 = affine_map<() -> ()>

		#attrs = {
		indexing_maps = [#map0, #map0, #map0],
		iterator_types = []
		}

		func @main() -> (tensor<i32>) attributes {} {
		%c0 = constant 0 : i32
		%0 = tensor.from_elements %c0 : tensor<1xi32>
		%reshaped0 = linalg.tensor_reshape %0 [] : tensor<1xi32> into tensor<i32>
		%c10 = constant 10 : i32
		%1 = tensor.from_elements %c10 : tensor<1xi32>
		%reshaped1 = linalg.tensor_reshape %1 [] : tensor<1xi32> into tensor<i32>
		br ^bb1(%reshaped0 : tensor<i32>)

		^bb1(%2: tensor<i32>): // 2 preds: ^bb0, ^bb2
		%3 = linalg.init_tensor [] : tensor<i1>
		%4 = linalg.generic #attrs
		ins(%2, %reshaped1 : tensor<i32>, tensor<i32>)
		outs(%3 : tensor<i1>) {
		^bb0(%arg0: i32, %arg1: i32, %arg2: i1): // no predecessors
		%8 = cmpi slt, %arg0, %arg1 : i32
		linalg.yield %8 : i1
		} -> tensor<i1>
		%5 = tensor.extract %4[] : tensor<i1>
		// This cond_br intentionally has bb2 as it's target for both branches. This
		// is to make sure that the "forward phase" of the cost-model correctly adds
		// the users of a block argument (in this case bb2's argument) to the work
		// list.
		cond_br %5, ^bb2(%2 : tensor<i32>), ^bb2(%2 : tensor<i32>)

		^bb2(%6: tensor<i32>): // pred: ^bb1
		%12 = tensor.from_elements %c10 : tensor<1xi32>
		%reshaped12 = linalg.tensor_reshape %12 [] : tensor<1xi32> into tensor<i32>
		%7 = linalg.init_tensor [] : tensor<i32>
		%8 = linalg.generic #attrs
		ins(%6, %reshaped12 : tensor<i32>, tensor<i32>)
		outs(%7 : tensor<i32>) {
		^bb0(%arg0: i32, %arg1: i32, %arg2: i32): // no predecessors
		%9 = addi %arg0, %arg1 : i32
		linalg.yield %9 : i32
		} -> tensor<i32>
		br ^bb3(%8 : tensor<i32>)

		^bb3(%10: tensor<i32>): // pred: ^bb1
		return %10 : tensor<i32>
		}

		// CHECK-LABEL: func @main()
		// CHECK-NEXT: constant 0
		// CHECK-NEXT: constant 10
		// CHECK-NEXT: br ^[[bb1:.]](%{{.}}: i32)
		// CHECK-NEXT: ^[[bb1]](%{{.*}}: i32):
		// CHECK-NEXT: cmpi slt, %{{.}}, %{{.}}
		// CHECK-NEXT: cond_br %{{.}}, ^[[bb2:.]](%{{.}} : i32), ^bb2(%{{.}} : i32)
		// CHECK-NEXT: ^[[bb2]](%{{.*}}: i32)
		// CHECK-NEXT: addi %{{.}}, %{{.}}
		// CHECK-NEXT: br ^[[bb3:.]](%{{.}} : i32)
		// CHECK-NEXT: ^[[bb3]](%{{.*}}: i32)
		// CHECK-NEXT: tensor.from_elements %{{.*}} : tensor<1xi32>
		// CHECK-NEXT: linalg.tensor_reshape %{{.*}} [] : tensor<1xi32> into tensor<i32>
// CHECK-NEXT: return %{{.*}}		// CHECK-NEXT: return %{{.*}}
// CHECK-NEXT: }		// CHECK-NEXT: }

mlir/test/Dialect/Linalg/detensorize_while.mlir

	Show First 20 Lines • Show All 56 Lines • ▼ Show 20 Lines

	// Test detensoring only ops involed in control-flow.			// Test detensoring only ops involed in control-flow.
	//			//
	// DET-CF-LABEL: func @main			// DET-CF-LABEL: func @main
	// DET-CF-SAME: (%{{.}}: tensor<i32>, %{{.}}: tensor<i32>)			// DET-CF-SAME: (%{{.}}: tensor<i32>, %{{.}}: tensor<i32>)
	// DET-CF: tensor.extract {{.*}}			// DET-CF: tensor.extract {{.*}}
	// DET-CF: br ^[[bb1:.]](%{{.}} : i32)			// DET-CF: br ^[[bb1:.]](%{{.}} : i32)
	// DET-CF: ^[[bb1]](%{{.*}}: i32)			// DET-CF: ^[[bb1]](%{{.*}}: i32)
	// DET-CF-DAG tensor.from_elements {{.*}}			// DET-CF: cmpi slt, {{.*}}
	// DET-CF-DAG: linalg.tensor_reshape {{.*}}			// DET-CF: cond_br {{.}}, ^[[bb2:.]](%{{.}} : i32), ^[[bb3:.]](%{{.*}} : i32)
	// DET-CF-DAG: cmpi slt, {{.*}}
	// DET-CF: cond_br {{.}}, ^[[bb2:.]](%{{.}} : i32), ^[[bb3:.]](%{{.*}} : tensor<i32>)
	// DET-CF: ^[[bb2]](%{{.*}}: i32)			// DET-CF: ^[[bb2]](%{{.*}}: i32)
	// DET-CF: addi {{.*}}			// DET-CF: addi {{.*}}
	// DET-CF: br ^[[bb1]](%{{.*}} : i32)			// DET-CF: br ^[[bb1]](%{{.*}} : i32)
	// DET-CF: ^[[bb3]](%{{.*}}: tensor<i32>)			// DET-CF: ^[[bb3]](%{{.*}}: i32)
				// DET-CF: tensor.from_elements %{{.*}} : tensor<1xi32>
				// DET-CF: linalg.tensor_reshape %{{.*}} [] : tensor<1xi32> into tensor<i32>
	// DET-CF: return %{{.*}} : tensor<i32>			// DET-CF: return %{{.*}} : tensor<i32>

mlir/test/Dialect/Linalg/detensorized_0d.mlir

This file was moved to mlir/test/Dialect/Linalg/detensorize_0d.mlir.