Diff 328277

mlir/lib/Dialect/SCF/SCF.cpp

Show First 20 Lines • Show All 554 Lines • ▼ Show 20 Lines	if ((lbValue + stepValue).sge(ubValue)) {
llvm::append_range(blockArgs, op.getIterOperands());		llvm::append_range(blockArgs, op.getIterOperands());
replaceOpWithRegion(rewriter, op, op.getLoopBody(), blockArgs);		replaceOpWithRegion(rewriter, op, op.getLoopBody(), blockArgs);
return success();		return success();
}		}

return failure();		return failure();
}		}
};		};

		/// Canonicalize the iter_args of an scf::ForOp that involve a tensor_load and
		/// for which only the last loop iteration is actually visible outside of the
		/// loop. The canonicalization looks for a pattern such as:
		/// ```
		/// %t0 = ... : tensor_type
		/// %0 = scf.for ... iter_args(%bb0 : %t0) -> (tensor_type) {
		/// ...
		/// // %m is either tensor_to_memref(%bb00) or defined above the loop
		/// %m... : memref_type
		/// ... // uses of %m with potential inplace updates
		/// %new_tensor = tensor_load %m : memref_type
		/// ...
		/// scf.yield %new_tensor : tensor_type
		/// }
		/// ```
		///
		/// `%bb0` may have either 0 or 1 use. If it has 1 use it must be exactly a
		/// `%m = tensor_to_memref %bb0` op that feeds into the yielded `tensor_load`
		rriddleUnsubmitted Done Reply Inline Actions Where are you checking that it feeds into the tensor_load? rriddle: Where are you checking that it feeds into the tensor_load?
		/// op.
		///
		/// If no aliasing write of `%new_tensor` occurs between tensor_load and yield
		silvasUnsubmitted Done Reply Inline Actions it is a bit strange to talk about aliasing of a tensor. can you reword? silvas: it is a bit strange to talk about aliasing of a tensor. can you reword?
		/// then the value %0 visible outside of the loop is the last `tensor_load`
		/// produced in the loop.
		///
		/// For now, we approximate the absence of aliasing by only supporting the case
		/// when the tensor_load is the operation immediately preceding the yield.
		///
		/// The canonicalization rewrites the pattern as:
		/// ```
		/// // %m is either a tensor_to_memref or defined above
		/// %m... : memref_type
		/// scf.for ... { // no iter_args
		/// ... // uses of %m with potential inplace updates
		/// }
		/// %0 = tensor_load %m : memref_type
		/// ```
		struct LastTensorLoadCanonicalization : public OpRewritePattern<ForOp> {
		rriddleUnsubmitted Done Reply Inline Actions This pattern looks like something that would be better handled by LICM+a simpler canonicalization that forwarded yield results that are from outside of the loop. rriddle: This pattern looks like something that would be better handled by LICM+a simpler…
		rriddleUnsubmitted Done Reply Inline Actions Wait, nvm. Missed the iteration handling here. rriddle: Wait, nvm. Missed the iteration handling here.
		using OpRewritePattern<ForOp>::OpRewritePattern;

		LogicalResult matchAndRewrite(ForOp forOp,
		PatternRewriter &rewriter) const override {
		Location loc = forOp.getLoc();
		DenseMap<Value, Value> replacements;
		for (BlockArgument bbArg : forOp.getRegionIterArgs()) {
		ftynseUnsubmitted Done Reply Inline Actions This should never happen, I'd assert instead. ftynse: This should never happen, I'd assert instead.
		rriddleUnsubmitted Done Reply Inline Actions Why assert at all? Isn't this already verified by the op? rriddle: Why assert at all? Isn't this already verified by the op?
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions fair enough. nicolasvasilache: fair enough.
		ftynseUnsubmitted Done Reply Inline Actions I've seen numerous asserts (and wrote some) on op state that are enforced by the verifier. I do this as part of "assert liberally" policy in the LLVM style guide, often in situations where I suspect the op may evolve and having an assertion will make it easier to find the places that need changing. ftynse: I've seen numerous asserts (and wrote some) on op state that are enforced by the verifier. I do…
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions I'm def. +1 on defensive asserts. After considering your comments here I found it quite unlikely that scf::ForOp will evolve to multi-blocks in the future. Given the current state of the codebase I'd prefer to see a new type of op. I'd still lean towards defensive style in general, I can revive the assert if there is a stronger feeling, I don't have a strong preference here. nicolasvasilache: I'm def. +1 on defensive asserts. After considering your comments here I found it quite…
		mehdi_aminiUnsubmitted Not Done Reply Inline Actions I'm +1 on assert as well: these are both debugging and documentation help. It duplicates the verifier sometimes, but that should be OK: if it wasn't in the verifier then it couldn't be an assert in the first place :) mehdi_amini: I'm +1 on assert as well: these are both debugging and documentation help. It duplicates the…
		unsigned idx = bbArg.getArgNumber() - /numIv=/1;
		auto yieldOp = cast<scf::YieldOp>(forOp.region().front().getTerminator());
		Value yieldVal = yieldOp->getOperand(idx);
		auto tensorLoadOp = yieldVal.getDefiningOp<TensorLoadOp>();
		bool isTensor = bbArg.getType().isa<TensorType>();

		TensorToMemrefOp tensorToMemRefOp;
		if (bbArg.hasOneUse())
		tensorToMemRefOp =
		dyn_cast<TensorToMemrefOp>(*bbArg.getUsers().begin());
		if (!isTensor \|\| !tensorLoadOp \|\|
		(!bbArg.use_empty() && !tensorToMemRefOp))
		continue;
		// TODO: Any aliasing write of tensorLoadOp.memref() nested under `forOp`
		// must be before `tensorLoadOp` in the block so that the lastWrite
		// property is not subject to additional side-effects.
		// For now, we only support the case when tensorLoadOp appears immediately
		// before the terminator.
		if (tensorLoadOp->getNextNode() != yieldOp)
		continue;
		// Clone the optional tensorToMemRefOp before forOp.
		if (tensorToMemRefOp) {
		rewriter.setInsertionPoint(forOp);
		rewriter.replaceOpWithNewOp<TensorToMemrefOp>(
		tensorToMemRefOp, tensorToMemRefOp.memref().getType(),
		tensorToMemRefOp.tensor());
		}
		// Clone the tensorLoad after forOp.
		rewriter.setInsertionPointAfter(forOp);
		Value newTensorLoad =
		rewriter.create<TensorLoadOp>(loc, tensorLoadOp.memref());
		Value forOpResult = forOp.getResult(bbArg.getArgNumber());
		replacements.insert(std::make_pair(forOpResult, newTensorLoad));
		// Make the terminator just yield the bbArg, the old tensorLoadOp + the
		ftynseUnsubmitted Done Reply Inline Actions Code motion is not undoable by "root update", only modifications to the op state are, you'll have to recreate the op at the new position. ftynse: Code motion is not undoable by "root update", only modifications to the op state are, you'll…
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions thanks! nicolasvasilache: thanks!
		// old bbArg (that is now directly yielded) will canonicalize away.
		rewriter.startRootUpdate(yieldOp);
		yieldOp.setOperand(idx, bbArg);
		rewriter.finalizeRootUpdate(yieldOp);
		}
		return success(!replacements.empty());
		ftynseUnsubmitted Done Reply Inline Actions Same ftynse: Same
		}
		};
} // namespace		} // namespace

void ForOp::getCanonicalizationPatterns(OwningRewritePatternList &results,		void ForOp::getCanonicalizationPatterns(OwningRewritePatternList &results,
MLIRContext *context) {		MLIRContext *context) {
results.insert<ForOpIterArgsFolder, SimplifyTrivialLoops>(context);		results.insert<ForOpIterArgsFolder, SimplifyTrivialLoops,
		LastTensorLoadCanonicalization>(context);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// IfOp		// IfOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

void IfOp::build(OpBuilder &builder, OperationState &result, Value cond,		void IfOp::build(OpBuilder &builder, OperationState &result, Value cond,
		ftynseUnsubmitted Done Reply Inline Actions I'm confused by this comment. You want to replace a subset of `forOp` results with values produced by hoisted ops. I don't see how side-effects are relevant here. Furthermore, the documentation above and the test show that `iter_args` are removed, but I don't see the corresponding bbargs removed from the loop, it is just cloned. I expect something weird to happen if the bbargs that correspond to hoisted tensor updates are not trailing in the list. ftynse: I'm confused by this comment. You want to replace a subset of `forOp` results with values…
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions I'll rephrase but basically rewriter only has a `replaceOp` method that unconditionally deletes the op. So I I replace forOp it just gets dropped, even it has side effects. nicolasvasilache: I'll rephrase but basically rewriter only has a `replaceOp` method that unconditionally deletes…
		rriddleUnsubmitted Done Reply Inline Actions Can't you use replaceOpWithIf for this? Or provide an easier entry point to that function for what you want to do here? It isn't supported in dialect conversion ATM, but should work for canonicalization patterns. rriddle: Can't you use replaceOpWithIf for this? Or provide an easier entry point to that function for…
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Actually, I can just do nothin and let other canonicalizations kick in when appropriate... nicolasvasilache: Actually, I can just do nothin and let other canonicalizations kick in when appropriate...
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Ugh no .. that is wrong, I am stupidly missing replacements .. sorry nicolasvasilache: Ugh no .. that is wrong, I am stupidly missing replacements .. sorry
		nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions @rriddle ok replaceOpWithIf wins. It would be great to find a naming scheme that conveys the fact that replaceOp replace unconditionally whereas replaceOpWithIf does not replace ever and it is the user's responsibility to eraseOp if allUsesReplaced is set to true (if they so wish). I had completely overlooked the fact that not all replaceOpXXX do a blanket eraseOp. nicolasvasilache: @rriddle ok replaceOpWithIf wins. It would be great to find a naming scheme that conveys the…
bool withElseRegion) {		bool withElseRegion) {
build(builder, result, /resultTypes=/llvm::None, cond, withElseRegion);		build(builder, result, /resultTypes=/llvm::None, cond, withElseRegion);
}		}

void IfOp::build(OpBuilder &builder, OperationState &result,		void IfOp::build(OpBuilder &builder, OperationState &result,
TypeRange resultTypes, Value cond, bool withElseRegion) {		TypeRange resultTypes, Value cond, bool withElseRegion) {
auto addTerminator = [&](OpBuilder &nested, Location loc) {		auto addTerminator = [&](OpBuilder &nested, Location loc) {
if (resultTypes.empty())		if (resultTypes.empty())
▲ Show 20 Lines • Show All 760 Lines • Show Last 20 Lines

mlir/test/Dialect/SCF/canonicalize.mlir

	// RUN: mlir-opt %s -pass-pipeline='func(canonicalize)' \| FileCheck %s			// RUN: mlir-opt %s -pass-pipeline='func(canonicalize)' -split-input-file \| FileCheck %s

				// -----

	func @single_iteration(%A: memref<?x?x?xi32>) {			func @single_iteration(%A: memref<?x?x?xi32>) {
	%c0 = constant 0 : index			%c0 = constant 0 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	%c2 = constant 2 : index			%c2 = constant 2 : index
	%c3 = constant 3 : index			%c3 = constant 3 : index
	%c6 = constant 6 : index			%c6 = constant 6 : index
	%c7 = constant 7 : index			%c7 = constant 7 : index
	▲ Show 20 Lines • Show All 128 Lines • ▼ Show 20 Lines
	// CHECK-NEXT: %[[a:.*]] = call @make_i32() : () -> i32			// CHECK-NEXT: %[[a:.*]] = call @make_i32() : () -> i32
	// CHECK-NEXT: %[[b:.*]] = call @make_i32() : () -> i32			// CHECK-NEXT: %[[b:.*]] = call @make_i32() : () -> i32
	// CHECK-NEXT: %[[r1:.]] = scf.for {{.}} iter_args(%arg4 = %[[a]]) -> (i32) {			// CHECK-NEXT: %[[r1:.]] = scf.for {{.}} iter_args(%arg4 = %[[a]]) -> (i32) {
	// CHECK-NEXT: %[[c:.*]] = call @make_i32() : () -> i32			// CHECK-NEXT: %[[c:.*]] = call @make_i32() : () -> i32
	// CHECK-NEXT: scf.yield %[[c]] : i32			// CHECK-NEXT: scf.yield %[[c]] : i32
	// CHECK-NEXT: }			// CHECK-NEXT: }
	// CHECK-NEXT: return %[[a]], %[[r1]], %[[b]] : i32, i32, i32			// CHECK-NEXT: return %[[a]], %[[r1]], %[[b]] : i32, i32, i32

				// -----

	// CHECK-LABEL: @replace_true_if			// CHECK-LABEL: @replace_true_if
	func @replace_true_if() {			func @replace_true_if() {
	%true = constant true			%true = constant true
	// CHECK-NOT: scf.if			// CHECK-NOT: scf.if
	// CHECK: "test.op"			// CHECK: "test.op"
	scf.if %true {			scf.if %true {
	"test.op"() : () -> ()			"test.op"() : () -> ()
	scf.yield			scf.yield
	}			}
	return			return
	}			}

				// -----

	// CHECK-LABEL: @remove_false_if			// CHECK-LABEL: @remove_false_if
	func @remove_false_if() {			func @remove_false_if() {
	%false = constant false			%false = constant false
	// CHECK-NOT: scf.if			// CHECK-NOT: scf.if
	// CHECK-NOT: "test.op"			// CHECK-NOT: "test.op"
	scf.if %false {			scf.if %false {
	"test.op"() : () -> ()			"test.op"() : () -> ()
	scf.yield			scf.yield
	}			}
	return			return
	}			}

				// -----

	// CHECK-LABEL: @replace_true_if_with_values			// CHECK-LABEL: @replace_true_if_with_values
	func @replace_true_if_with_values() {			func @replace_true_if_with_values() {
	%true = constant true			%true = constant true
	// CHECK-NOT: scf.if			// CHECK-NOT: scf.if
	// CHECK: %[[VAL:.*]] = "test.op"			// CHECK: %[[VAL:.*]] = "test.op"
	%0 = scf.if %true -> (i32) {			%0 = scf.if %true -> (i32) {
	%1 = "test.op"() : () -> i32			%1 = "test.op"() : () -> i32
	scf.yield %1 : i32			scf.yield %1 : i32
	} else {			} else {
	%2 = "test.other_op"() : () -> i32			%2 = "test.other_op"() : () -> i32
	scf.yield %2 : i32			scf.yield %2 : i32
	}			}
	// CHECK: "test.consume"(%[[VAL]])			// CHECK: "test.consume"(%[[VAL]])
	"test.consume"(%0) : (i32) -> ()			"test.consume"(%0) : (i32) -> ()
	return			return
	}			}

				// -----

	// CHECK-LABEL: @replace_false_if_with_values			// CHECK-LABEL: @replace_false_if_with_values
	func @replace_false_if_with_values() {			func @replace_false_if_with_values() {
	%false = constant false			%false = constant false
	// CHECK-NOT: scf.if			// CHECK-NOT: scf.if
	// CHECK: %[[VAL:.*]] = "test.other_op"			// CHECK: %[[VAL:.*]] = "test.other_op"
	%0 = scf.if %false -> (i32) {			%0 = scf.if %false -> (i32) {
	%1 = "test.op"() : () -> i32			%1 = "test.op"() : () -> i32
	scf.yield %1 : i32			scf.yield %1 : i32
	} else {			} else {
	%2 = "test.other_op"() : () -> i32			%2 = "test.other_op"() : () -> i32
	scf.yield %2 : i32			scf.yield %2 : i32
	}			}
	// CHECK: "test.consume"(%[[VAL]])			// CHECK: "test.consume"(%[[VAL]])
	"test.consume"(%0) : (i32) -> ()			"test.consume"(%0) : (i32) -> ()
	return			return
	}			}

				// -----

	// CHECK-LABEL: @remove_zero_iteration_loop			// CHECK-LABEL: @remove_zero_iteration_loop
	func @remove_zero_iteration_loop() {			func @remove_zero_iteration_loop() {
	%c42 = constant 42 : index			%c42 = constant 42 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	// CHECK: %[[INIT:.*]] = "test.init"			// CHECK: %[[INIT:.*]] = "test.init"
	%init = "test.init"() : () -> i32			%init = "test.init"() : () -> i32
	// CHECK-NOT: scf.for			// CHECK-NOT: scf.for
	%0 = scf.for %i = %c42 to %c1 step %c1 iter_args(%arg = %init) -> (i32) {			%0 = scf.for %i = %c42 to %c1 step %c1 iter_args(%arg = %init) -> (i32) {
	%1 = "test.op"(%i, %arg) : (index, i32) -> i32			%1 = "test.op"(%i, %arg) : (index, i32) -> i32
	scf.yield %1 : i32			scf.yield %1 : i32
	}			}
	// CHECK: "test.consume"(%[[INIT]])			// CHECK: "test.consume"(%[[INIT]])
	"test.consume"(%0) : (i32) -> ()			"test.consume"(%0) : (i32) -> ()
	return			return
	}			}

				// -----

	// CHECK-LABEL: @remove_zero_iteration_loop_vals			// CHECK-LABEL: @remove_zero_iteration_loop_vals
	func @remove_zero_iteration_loop_vals(%arg0: index) {			func @remove_zero_iteration_loop_vals(%arg0: index) {
	%c2 = constant 2 : index			%c2 = constant 2 : index
	// CHECK: %[[INIT:.*]] = "test.init"			// CHECK: %[[INIT:.*]] = "test.init"
	%init = "test.init"() : () -> i32			%init = "test.init"() : () -> i32
	// CHECK-NOT: scf.for			// CHECK-NOT: scf.for
	// CHECK-NOT: test.op			// CHECK-NOT: test.op
	%0 = scf.for %i = %arg0 to %arg0 step %c2 iter_args(%arg = %init) -> (i32) {			%0 = scf.for %i = %arg0 to %arg0 step %c2 iter_args(%arg = %init) -> (i32) {
	%1 = "test.op"(%i, %arg) : (index, i32) -> i32			%1 = "test.op"(%i, %arg) : (index, i32) -> i32
	scf.yield %1 : i32			scf.yield %1 : i32
	}			}
	// CHECK: "test.consume"(%[[INIT]])			// CHECK: "test.consume"(%[[INIT]])
	"test.consume"(%0) : (i32) -> ()			"test.consume"(%0) : (i32) -> ()
	return			return
	}			}

				// -----

	// CHECK-LABEL: @replace_single_iteration_loop_1			// CHECK-LABEL: @replace_single_iteration_loop_1
	func @replace_single_iteration_loop_1() {			func @replace_single_iteration_loop_1() {
	// CHECK: %[[LB:.*]] = constant 42			// CHECK: %[[LB:.*]] = constant 42
	%c42 = constant 42 : index			%c42 = constant 42 : index
	%c43 = constant 43 : index			%c43 = constant 43 : index
	%c1 = constant 1 : index			%c1 = constant 1 : index
	// CHECK: %[[INIT:.*]] = "test.init"			// CHECK: %[[INIT:.*]] = "test.init"
	%init = "test.init"() : () -> i32			%init = "test.init"() : () -> i32
	// CHECK-NOT: scf.for			// CHECK-NOT: scf.for
	// CHECK: %[[VAL:.*]] = "test.op"(%[[LB]], %[[INIT]])			// CHECK: %[[VAL:.*]] = "test.op"(%[[LB]], %[[INIT]])
	%0 = scf.for %i = %c42 to %c43 step %c1 iter_args(%arg = %init) -> (i32) {			%0 = scf.for %i = %c42 to %c43 step %c1 iter_args(%arg = %init) -> (i32) {
	%1 = "test.op"(%i, %arg) : (index, i32) -> i32			%1 = "test.op"(%i, %arg) : (index, i32) -> i32
	scf.yield %1 : i32			scf.yield %1 : i32
	}			}
	// CHECK: "test.consume"(%[[VAL]])			// CHECK: "test.consume"(%[[VAL]])
	"test.consume"(%0) : (i32) -> ()			"test.consume"(%0) : (i32) -> ()
	return			return
	}			}

				// -----

	// CHECK-LABEL: @replace_single_iteration_loop_2			// CHECK-LABEL: @replace_single_iteration_loop_2
	func @replace_single_iteration_loop_2() {			func @replace_single_iteration_loop_2() {
	// CHECK: %[[LB:.*]] = constant 5			// CHECK: %[[LB:.*]] = constant 5
	%c5 = constant 5 : index			%c5 = constant 5 : index
	%c6 = constant 6 : index			%c6 = constant 6 : index
	%c11 = constant 11 : index			%c11 = constant 11 : index
	// CHECK: %[[INIT:.*]] = "test.init"			// CHECK: %[[INIT:.*]] = "test.init"
	%init = "test.init"() : () -> i32			%init = "test.init"() : () -> i32
	// CHECK-NOT: scf.for			// CHECK-NOT: scf.for
	// CHECK: %[[VAL:.*]] = "test.op"(%[[LB]], %[[INIT]])			// CHECK: %[[VAL:.*]] = "test.op"(%[[LB]], %[[INIT]])
	%0 = scf.for %i = %c5 to %c11 step %c6 iter_args(%arg = %init) -> (i32) {			%0 = scf.for %i = %c5 to %c11 step %c6 iter_args(%arg = %init) -> (i32) {
	%1 = "test.op"(%i, %arg) : (index, i32) -> i32			%1 = "test.op"(%i, %arg) : (index, i32) -> i32
	scf.yield %1 : i32			scf.yield %1 : i32
	}			}
	// CHECK: "test.consume"(%[[VAL]])			// CHECK: "test.consume"(%[[VAL]])
	"test.consume"(%0) : (i32) -> ()			"test.consume"(%0) : (i32) -> ()
	return			return
	}			}

				// -----

	// CHECK-LABEL: @replace_single_iteration_loop_non_unit_step			// CHECK-LABEL: @replace_single_iteration_loop_non_unit_step
	func @replace_single_iteration_loop_non_unit_step() {			func @replace_single_iteration_loop_non_unit_step() {
	// CHECK: %[[LB:.*]] = constant 42			// CHECK: %[[LB:.*]] = constant 42
	%c42 = constant 42 : index			%c42 = constant 42 : index
	%c47 = constant 47 : index			%c47 = constant 47 : index
	%c5 = constant 5 : index			%c5 = constant 5 : index
	// CHECK: %[[INIT:.*]] = "test.init"			// CHECK: %[[INIT:.*]] = "test.init"
	%init = "test.init"() : () -> i32			%init = "test.init"() : () -> i32
	// CHECK-NOT: scf.for			// CHECK-NOT: scf.for
	// CHECK: %[[VAL:.*]] = "test.op"(%[[LB]], %[[INIT]])			// CHECK: %[[VAL:.*]] = "test.op"(%[[LB]], %[[INIT]])
	%0 = scf.for %i = %c42 to %c47 step %c5 iter_args(%arg = %init) -> (i32) {			%0 = scf.for %i = %c42 to %c47 step %c5 iter_args(%arg = %init) -> (i32) {
	%1 = "test.op"(%i, %arg) : (index, i32) -> i32			%1 = "test.op"(%i, %arg) : (index, i32) -> i32
	scf.yield %1 : i32			scf.yield %1 : i32
	}			}
	// CHECK: "test.consume"(%[[VAL]])			// CHECK: "test.consume"(%[[VAL]])
	"test.consume"(%0) : (i32) -> ()			"test.consume"(%0) : (i32) -> ()
	return			return
	}			}

				// -----

	// CHECK-LABEL: @remove_empty_parallel_loop			// CHECK-LABEL: @remove_empty_parallel_loop
	func @remove_empty_parallel_loop(%lb: index, %ub: index, %s: index) {			func @remove_empty_parallel_loop(%lb: index, %ub: index, %s: index) {
	// CHECK: %[[INIT:.*]] = "test.init"			// CHECK: %[[INIT:.*]] = "test.init"
	%init = "test.init"() : () -> f32			%init = "test.init"() : () -> f32
	// CHECK-NOT: scf.parallel			// CHECK-NOT: scf.parallel
	// CHECK-NOT: test.produce			// CHECK-NOT: test.produce
	// CHECK-NOT: test.transform			// CHECK-NOT: test.transform
	%0 = scf.parallel (%i, %j, %k) = (%lb, %ub, %lb) to (%ub, %ub, %ub) step (%s, %s, %s) init(%init) -> f32 {			%0 = scf.parallel (%i, %j, %k) = (%lb, %ub, %lb) to (%ub, %ub, %ub) step (%s, %s, %s) init(%init) -> f32 {
	%1 = "test.produce"() : () -> f32			%1 = "test.produce"() : () -> f32
	scf.reduce(%1) : f32 {			scf.reduce(%1) : f32 {
	^bb0(%lhs: f32, %rhs: f32):			^bb0(%lhs: f32, %rhs: f32):
	%2 = "test.transform"(%lhs, %rhs) : (f32, f32) -> f32			%2 = "test.transform"(%lhs, %rhs) : (f32, f32) -> f32
	scf.reduce.return %2 : f32			scf.reduce.return %2 : f32
	}			}
	scf.yield			scf.yield
	}			}
	// CHECK: "test.consume"(%[[INIT]])			// CHECK: "test.consume"(%[[INIT]])
	"test.consume"(%0) : (f32) -> ()			"test.consume"(%0) : (f32) -> ()
	return			return
	}			}

				// -----

				func private @process(%0 : memref<128x128xf32>)

				// CHECK-LABEL: last_value
				// CHECK-SAME: %[[T0:[0-9a-z]*]]: tensor<128x128xf32>
				// CHECK-SAME: %[[T1:[0-9a-z]*]]: tensor<128x128xf32>
				// CHECK-SAME: %[[M0:[0-9a-z]*]]: memref<128x128xf32>
				func @last_value(%t0: tensor<128x128xf32>, %t1: tensor<128x128xf32>,
				silvasUnsubmitted Done Reply Inline Actions add a test case where the safety check fires and the pattern fails to apply? silvas: add a test case where the safety check fires and the pattern fails to apply?
				%m0: memref<128x128xf32>,
				%lb : index, %ub : index, %step : index)
				-> (tensor<128x128xf32>, tensor<128x128xf32>)
				{
				// CHECK-NEXT: %[[M1:.*]] = tensor_to_memref %[[T1]] : memref<128x128xf32>
				// CHECK-NEXT: scf.for
				// CHECK-NOT: iter_args
				%0:2 = scf.for %arg0 = %lb to %ub step %step iter_args(%arg1 = %t0, %arg2 = %t1)
				-> (tensor<128x128xf32>, tensor<128x128xf32>)
				{
				%m1 = tensor_to_memref %arg2 : memref<128x128xf32>

				// CHECK-NEXT: call @process(%[[M0]]) : (memref<128x128xf32>) -> ()
				call @process(%m0) : (memref<128x128xf32>) -> ()

				// CHECK-NEXT: call @process(%[[M1]]) : (memref<128x128xf32>) -> ()
				call @process(%m1) : (memref<128x128xf32>) -> ()

				// All this stuff goes away
				%1 = tensor_load %m0 : memref<128x128xf32>
				%2 = tensor_load %m1 : memref<128x128xf32>
				scf.yield %1, %2 : tensor<128x128xf32>, tensor<128x128xf32>
				// CHECK-NEXT: }
				}

				// CHECK-NEXT: %[[R0:.*]] = tensor_load %[[M0]] : memref<128x128xf32>
				// CHECK-NEXT: %[[R1:.*]] = tensor_load %[[M1]] : memref<128x128xf32>
				// CHECK-NEXT: return %[[R0]], %[[R1]] : tensor<128x128xf32>, tensor<128x128xf32>
				return %0#0, %0#1 : tensor<128x128xf32>, tensor<128x128xf32>
				}
				silvasUnsubmitted Done Reply Inline Actions is this needed after the recent patch you landed? silvas: is this needed after the recent patch you landed?
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions I reverted it locally so when I update it'll go away. nicolasvasilache: I reverted it locally so when I update it'll go away.

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][scf] Canonicalize scf.for last tensor iteration result.
ClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 328277

mlir/lib/Dialect/SCF/SCF.cpp

mlir/test/Dialect/SCF/canonicalize.mlir

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][scf] Canonicalize scf.for last tensor iteration result.ClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 328277

mlir/lib/Dialect/SCF/SCF.cpp

mlir/test/Dialect/SCF/canonicalize.mlir

[mlir][scf] Canonicalize scf.for last tensor iteration result.
ClosedPublic