This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/
-
mlir/
-
Dialect/
-
Affine/Transforms/
-
Transforms/
2/2
Transforms.h
-
Tensor/
-
CMakeLists.txt
-
TransformOps/
1/1
CMakeLists.txt
-
TensorTransformOps.h
3/3
TensorTransformOps.td
-
Transforms/
6/6
Transforms.h
-
InitAllDialects.h
-
Interfaces/
1/1
ValueBoundsOpInterface.h
-
lib/
-
Dialect/
-
Affine/Transforms/
-
Transforms/
-
ReifyValueBounds.cpp
-
Tensor/
-
TransformOps/
-
CMakeLists.txt
1/1
TensorTransformOps.cpp
-
Transforms/
-
CMakeLists.txt
1/1
IndependenceTransforms.cpp
-
Interfaces/
1/1
ValueBoundsOpInterface.cpp
-
test/Dialect/Tensor/
-
Dialect/
-
Tensor/
-
transform-op-make-loop-independent.mlir
-
utils/bazel/llvm-project-overlay/mlir/
-
bazel/
-
llvm-project-overlay/
-
mlir/
-
BUILD.bazel

Differential D143910

[mlir][tensor] Add transform to make tensor.pad/empty loop-independent
ClosedPublic

Authored by springerm on Feb 13 2023, 6:21 AM.

Download Raw Diff

Details

Reviewers

dcaballe
nicolasvasilache
bondhugula

Commits

rG77124386feb6: [mlir][tensor] Add transform to make tensor.pad loop-independent

Summary

Add a transform to make tensor.pad and tensor.empty ops independent of SCF loop IVs. Such ops can then be hoisted.

E.g.:

scf.for %iv = %lb to %ub step %step {
  %high = affine.apply affine_map<(d0)[s0] -> (s0 - d0)> (%i)[%ub]
  %p = tensor.pad %t low[5] high[%high] ...
  ...
}

Is transformed to:

%high_new = affine.apply affine_map<()[s0, s1] -> (-s0 + s1)> ()[%lb, %ub]
%p_hoistable = tensor.pad %t low[5] high[%high_new]
%dim = tensor.dim %t, %c0
%size = affine.apply affine_map<(d0)[s0, s1] -> (-d0 + s0 + s1 + 5)>(%iv)[%ub, %dim]
%slice = tensor.extract_slice %p_hoistable [0] [%size] [1]

Depends On: D146524

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

springerm created this revision.Feb 13 2023, 6:21 AM

Herald added a reviewer: bondhugula. · View Herald TranscriptFeb 13 2023, 6:21 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: hanchung, Moerafaat, bzcheeseman and 22 others. · View Herald Transcript

springerm requested review of this revision.Feb 13 2023, 6:21 AM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 13 2023, 6:21 AM

Herald added a subscriber: stephenneuendorffer. · View Herald Transcript

@dcaballe The same infrastructure can be used for computing upper bounds for memref allocations.

Harbormaster completed remote builds in B213412: Diff 496952.Feb 13 2023, 6:21 AM

Herald added a subscriber: jsetoain. · View Herald TranscriptFeb 13 2023, 6:21 AM

add op documentation

Harbormaster completed remote builds in B213587: Diff 497222.Feb 14 2023, 12:09 AM

springerm added a parent revision: D143909: [mlir][SCF][Utils][NFC] Make some utils public for better reuse.Feb 22 2023, 1:10 AM

nicolasvasilache added inline comments.Feb 23 2023, 9:17 AM

mlir/include/mlir/Dialect/SCF/Utils/AffineCanonicalizationUtils.h
17 ↗	(On Diff #497222)	Isn't the best practice here to just forward-declare ? I am unclear why you reverted to a mix of forward-declaration for OpBuilder and include for AffineMap/Value/ValueRange ?
mlir/include/mlir/Dialect/Tensor/TransformOps/CMakeLists.txt
7	This fails to build for me with: CMake Error at /usr/local/google/home/ntv/github/llvm-project/mlir/include/mlir/Dialect/Tensor/TransformOps/CMakeLists.txt:6 (add_mlir_doc): add_mlir_doc Function invoked with incorrect arguments for function named: add_mlir_doc
mlir/include/mlir/Dialect/Tensor/TransformOps/TensorTransformOps.td
27	Please explain a little more what this entails because it involves increasing the tensor size/dimensionality.
mlir/include/mlir/Dialect/Tensor/Transforms/Transforms.h
39	The footprint in this example can be too high: you'd want to take the ceildiv by %step.
mlir/lib/Dialect/SCF/Utils/AffineCanonicalizationUtils.cpp
87 ↗	(On Diff #497222)	Can we use `step 1` instead here and mention that this is conservative but less precise in the context of this method?

springerm marked 4 inline comments as done.Feb 27 2023, 1:24 AM

springerm added inline comments.

mlir/include/mlir/Dialect/SCF/Utils/AffineCanonicalizationUtils.h
17 ↗	(On Diff #497222)	I thought I needed the definition so that it can be used in `FailureOr<...>`. But it actually works without.
mlir/include/mlir/Dialect/Tensor/Transforms/Transforms.h
39	I tried to do that but `FlatAffineConstraints` was unable to compute an upper bound in that case. It's unclear why this is happening, I'm looking into this. Maybe the system of inequalities is getting too complex (with various "semi-affine exprs"...). It looks like `FlatAffineConstraints` must be extended.

address comments

Harbormaster completed remote builds in B216159: Diff 500698.Feb 27 2023, 1:54 AM

springerm added inline comments.Feb 27 2023, 2:16 AM

mlir/include/mlir/Dialect/Tensor/Transforms/Transforms.h
39	This is indeed due to a shortcoming in `FlatAffineValueConstraints`: // TODO: Whenever there are local variables in the dependence // constraints, we'll conservatively over-approximate, since we don't // always explicitly compute them above (in the while loop). It is worth fixing this? Will probably take a while to understand and rewrite a 200 LOC function.

nicolasvasilache added inline comments.Feb 27 2023, 6:15 AM

mlir/include/mlir/Dialect/Tensor/Transforms/Transforms.h
39	Yes, this is the same rationale as using `step 1` below. Can we update the doc to make this explicit? I.e. divide by %step and mention that in the case of a symbol, we over-approximate with setting `step` to `1` to circumvent the limitation of the analysis.
mlir/lib/Dialect/SCF/Utils/AffineCanonicalizationUtils.cpp
86 ↗	(On Diff #500698)	if we use step 1, this is not an optional anymore and there is a bit of simplification in your code down the line

nicolasvasilache added inline comments.Feb 28 2023, 5:41 AM

mlir/include/mlir/Dialect/Tensor/TransformOps/TensorTransformOps.td
23	Can we rename this transform to `transform.loop_privatize` or something similar and have it take the op to privatize and the number of enclosing loops above which we want to hoist ? This should be kept in sync and compose with the recently landed `hoist_pad`. We can later evolve the syntax from num_loops to something better, informed by how we want these things to compose.
mlir/lib/Dialect/Tensor/Transforms/LoopHoisting.cpp
21 ↗	(On Diff #500698)	The filename is misleading here, this is not performing hoisting but privatization that will later enable hoisting. Can we move this functionality to a `LoopPrivatization.cpp` file ? As a followup, we should integrate the usage of privatization its usage into `HoistPadding.cpp`. We also now have `SubsetHoisting.cpp` for mechanical parts related to actual hoisting of loop-independent quantities that will also come in handy..

springerm marked 4 inline comments as done.Mar 1 2023, 1:58 AM

springerm added inline comments.

mlir/include/mlir/Dialect/Tensor/TransformOps/TensorTransformOps.td
23	This does not privatize the tensor/op though. It simply changes the size of the tensor. Added the number of loops to the op.
mlir/lib/Dialect/Tensor/Transforms/LoopHoisting.cpp
21 ↗	(On Diff #500698)	How about `LoopTransforms.cpp`? This transformation does not do any privatization. (Assuming "privatization" = "making a private copy of a tensor for each loop iteration".)

address comments

Harbormaster completed remote builds in B216670: Diff 501427.Mar 1 2023, 2:26 AM

update

Harbormaster completed remote builds in B216682: Diff 501448.Mar 1 2023, 3:48 AM

reimplement with ValueBoundsOpInterface

springerm retitled this revision from [mlir][tensor] Add transform to make tensor.pad loop-independent to [mlir][tensor] Add transform to make tensor.pad/empty loop-independent.Mar 21 2023, 6:36 AM

springerm edited the summary of this revision. (Show Details)

Herald added a subscriber: Groverkss. · View Herald TranscriptMar 21 2023, 6:36 AM

springerm edited parent revisions, added: D146524: [mlir][Arith] ValueBoundsOpInterface: Reify with Arith ops; removed: D143909: [mlir][SCF][Utils][NFC] Make some utils public for better reuse.Mar 21 2023, 6:36 AM

Harbormaster completed remote builds in B220709: Diff 506957.Mar 21 2023, 7:45 AM

Does this update change the way this is supposed to interact with HoistPadding ?
Or in other words, how do you see this interacting with HoistPadding ?

In D143910#4212287, @nicolasvasilache wrote:

Does this update change the way this is supposed to interact with HoistPadding ?
Or in other words, how do you see this interacting with HoistPadding ?

HoistPadding does not require this functionality, it just clones the entire loop nest. So no interaction with HoistPadding.

This revision is already a month old and was never landed, but is useful for Diego as an example how to make ops (memref.alloca in his example) hoistable. So I reimplemented it so that it takes advantage of ValueBoundsOpInterface.

springerm mentioned this in D146870: [mlir][Interfaces] ValueBoundsOpInterface: Support IntegerTypes.Mar 25 2023, 4:35 AM

springerm added a child revision: D146870: [mlir][Interfaces] ValueBoundsOpInterface: Support IntegerTypes.Mar 25 2023, 4:35 AM

springerm mentioned this in D145681: [mlir][Interfaces] Add ValueBoundsOpInterface and tensor dialect op impl.Mar 30 2023, 9:22 AM

rebase

Harbormaster completed remote builds in B222970: Diff 509973.Mar 31 2023, 6:27 AM

Looking at the ValueBounds part only for now.

mlir/include/mlir/Dialect/Affine/Transforms/Transforms.h
91	I can't follow from the description... This is converting an Affine-based value bound into an Arith-based value bound?
mlir/include/mlir/Interfaces/ValueBoundsOpInterface.h
138	typos Could you please elaborate a bit more on what "independent of the values in independencies" mean?
mlir/lib/Interfaces/ValueBoundsOpInterface.cpp
377	Something like this in the header doc is what I was hoping for :)

Herald added a subscriber: bviyer. · View Herald TranscriptApr 6 2023, 4:40 PM

address comments

springerm added inline comments.Apr 20 2023, 6:42 PM

mlir/include/mlir/Dialect/Affine/Transforms/Transforms.h
91	The name was confusing. I renamed the function and added some more documentation.

Harbormaster completed remote builds in B227038: Diff 515554.Apr 20 2023, 6:48 PM

rebase

springerm mentioned this in D149316: [mlir][memref] Add transform to make alloca ops loop-independent.Apr 26 2023, 6:23 PM

springerm added a child revision: D149316: [mlir][memref] Add transform to make alloca ops loop-independent.Apr 26 2023, 6:24 PM

Harbormaster completed remote builds in B228469: Diff 517420.Apr 26 2023, 7:15 PM

Thanks!

mlir/include/mlir/Dialect/Tensor/Transforms/Transforms.h
39	What happened with this? Was it addressed?
mlir/lib/Dialect/Tensor/TransformOps/TensorTransformOps.cpp
67	nit: ub to var
mlir/lib/Dialect/Tensor/Transforms/IndependenceTransforms.cpp
80	nit: ub to var

This revision is now accepted and ready to land.Apr 26 2023, 10:06 PM

This revision was landed with ongoing or failed builds.Apr 27 2023, 7:47 PM

Closed by commit rG77124386feb6: [mlir][tensor] Add transform to make tensor.pad loop-independent (authored by springerm). · Explain Why

This revision was automatically updated to reflect the committed changes.

springerm marked 3 inline comments as done.

springerm added a commit: rG77124386feb6: [mlir][tensor] Add transform to make tensor.pad loop-independent.

springerm added inline comments.Apr 27 2023, 7:52 PM

mlir/include/mlir/Dialect/Tensor/Transforms/Transforms.h
39	There is a TODO in `SCF/IR/ValueBoundsOpInterfaceImpl.cpp`. We can't do any better at the moment.

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

Affine/

Transforms/

Transforms.h

14 lines

Tensor/

CMakeLists.txt

1 line

TransformOps/

CMakeLists.txt

6 lines

TensorTransformOps.h

8 lines

TensorTransformOps.td

64 lines

Transforms/

Transforms.h

39 lines

InitAllDialects.h

2 lines

Interfaces/

ValueBoundsOpInterface.h

25 lines

lib/

Dialect/

Affine/

Transforms/

ReifyValueBounds.cpp

9 lines

Tensor/

TransformOps/

CMakeLists.txt

6 lines

TensorTransformOps.cpp

81 lines

Transforms/

CMakeLists.txt

3 lines

IndependenceTransforms.cpp

136 lines

Interfaces/

ValueBoundsOpInterface.cpp

36 lines

test/

Dialect/

Tensor/

transform-op-make-loop-independent.mlir

151 lines

utils/

bazel/

llvm-project-overlay/

mlir/

BUILD.bazel

38 lines

Diff 517775

mlir/include/mlir/Dialect/Affine/Transforms/Transforms.h

	Show All 9 Lines
	// dialect.			// dialect.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef MLIR_DIALECT_AFFINE_TRANSFORMS_TRANSFORMS_H			#ifndef MLIR_DIALECT_AFFINE_TRANSFORMS_TRANSFORMS_H
	#define MLIR_DIALECT_AFFINE_TRANSFORMS_TRANSFORMS_H			#define MLIR_DIALECT_AFFINE_TRANSFORMS_TRANSFORMS_H

	#include "mlir/Interfaces/ValueBoundsOpInterface.h"			#include "mlir/Interfaces/ValueBoundsOpInterface.h"
				#include "mlir/Support/LLVM.h"
	#include "mlir/Support/LogicalResult.h"			#include "mlir/Support/LogicalResult.h"

	namespace mlir {			namespace mlir {
				class AffineMap;
	class Location;			class Location;
	class OpBuilder;			class OpBuilder;
	class OpFoldResult;			class OpFoldResult;
	class RewritePatternSet;			class RewritePatternSet;
	class RewriterBase;			class RewriterBase;
	class Value;			class Value;

	namespace presburger {			namespace presburger {
	▲ Show 20 Lines • Show All 51 Lines • ▼ Show 20 Lines
	/// By default, lower/equal bounds are closed and upper bounds are open. If			/// By default, lower/equal bounds are closed and upper bounds are open. If
	/// `closedUB` is set to "true", upper bounds are also closed.			/// `closedUB` is set to "true", upper bounds are also closed.
	FailureOr<OpFoldResult> reifyShapedValueDimBound(			FailureOr<OpFoldResult> reifyShapedValueDimBound(
	OpBuilder &b, Location loc, presburger::BoundType type, Value value,			OpBuilder &b, Location loc, presburger::BoundType type, Value value,
	int64_t dim,			int64_t dim,
	ValueBoundsConstraintSet::StopConditionFn stopCondition = nullptr,			ValueBoundsConstraintSet::StopConditionFn stopCondition = nullptr,
	bool closedUB = false);			bool closedUB = false);

				/// Materialize an already computed bound with Affine dialect ops.
				///
				dcaballeUnsubmitted Done Reply Inline Actions I can't follow from the description... This is converting an Affine-based value bound into an Arith-based value bound? dcaballe: I can't follow from the description... This is converting an Affine-based value bound into an…
				springermAuthorUnsubmitted Done Reply Inline Actions The name was confusing. I renamed the function and added some more documentation. springerm: The name was confusing. I renamed the function and added some more documentation.
				/// * `ValueBoundsOpInterface::computeBound` computes bounds but does not
				/// create IR. It is dialect independent.
				/// * `materializeComputedBound` materializes computed bounds with Affine
				/// dialect ops.
				/// * `reifyIndexValueBound`/`reifyShapedValueDimBound` are a combination of
				/// the two functions mentioned above.
				OpFoldResult materializeComputedBound(
				OpBuilder &b, Location loc, AffineMap boundMap,
				ArrayRef<std::pair<Value, std::optional<int64_t>>> mapOperands);

	} // namespace affine			} // namespace affine
	} // namespace mlir			} // namespace mlir

	#endif // MLIR_DIALECT_AFFINE_TRANSFORMS_TRANSFORMS_H			#endif // MLIR_DIALECT_AFFINE_TRANSFORMS_TRANSFORMS_H

mlir/include/mlir/Dialect/Tensor/CMakeLists.txt

	add_subdirectory(IR)			add_subdirectory(IR)
	add_subdirectory(Transforms)			add_subdirectory(Transforms)
				add_subdirectory(TransformOps)

mlir/include/mlir/Dialect/Tensor/TransformOps/CMakeLists.txt

This file was added.

				set(LLVM_TARGET_DEFINITIONS TensorTransformOps.td)
				mlir_tablegen(TensorTransformOps.h.inc -gen-op-decls)
				mlir_tablegen(TensorTransformOps.cpp.inc -gen-op-defs)
				add_public_tablegen_target(MLIRTensorTransformOpsIncGen)

				add_mlir_doc(TensorTransformOps TensorTransformOps Dialects/ -gen-op-doc)
				nicolasvasilacheUnsubmitted Done Reply Inline Actions This fails to build for me with: CMake Error at /usr/local/google/home/ntv/github/llvm-project/mlir/include/mlir/Dialect/Tensor/TransformOps/CMakeLists.txt:6 (add_mlir_doc): add_mlir_doc Function invoked with incorrect arguments for function named: add_mlir_doc nicolasvasilache: This fails to build for me with: ``` CMake Error at /usr/local/google/home/ntv/github/llvm…

mlir/include/mlir/Dialect/Tensor/TransformOps/TensorTransformOps.h

	//===- TensorTransformOps.h - Tensor transformation ops ---------- C++ --===//			//===- TensorTransformOps.h - Tensor transformation ops ---------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef MLIR_DIALECT_TENSOR_TRANSFORMOPS_TENSORTRANSFORMOPS_H			#ifndef MLIR_DIALECT_TENSOR_TRANSFORMOPS_TENSORTRANSFORMOPS_H
	#define MLIR_DIALECT_TENSOR_TRANSFORMOPS_TENSORTRANSFORMOPS_H			#define MLIR_DIALECT_TENSOR_TRANSFORMOPS_TENSORTRANSFORMOPS_H

	#include "mlir/Dialect/PDL/IR/PDLTypes.h"			#include "mlir/Dialect/PDL/IR/PDLTypes.h"
	#include "mlir/Dialect/Transform/IR/TransformOps.h"			#include "mlir/Dialect/Transform/IR/TransformOps.h"
				#include "mlir/Dialect/Transform/IR/TransformTypes.h"
	#include "mlir/IR/OpImplementation.h"			#include "mlir/IR/OpImplementation.h"
	#include "mlir/IR/PatternMatch.h"			#include "mlir/IR/PatternMatch.h"

	namespace mlir {			namespace mlir {
				class DialectRegistry;

	namespace tensor {			namespace tensor {

	/// A specialized TrackingListener for transform ops that operate on tensor IR.			/// A specialized TrackingListener for transform ops that operate on tensor IR.
	/// This listener skips cast-like tensor ops when looking for payload op			/// This listener skips cast-like tensor ops when looking for payload op
	/// replacements.			/// replacements.
	class TrackingListener : public transform::TrackingListener {			class TrackingListener : public transform::TrackingListener {
	public:			public:
	using transform::TrackingListener::TrackingListener;			using transform::TrackingListener::TrackingListener;

	protected:			protected:
	Operation findReplacementOp(Operation op,			Operation findReplacementOp(Operation op,
	ValueRange newValues) const override;			ValueRange newValues) const override;
	};			};

				void registerTransformDialectExtension(DialectRegistry &registry);

	} // namespace tensor			} // namespace tensor
	} // namespace mlir			} // namespace mlir

				#define GET_OP_CLASSES
				#include "mlir/Dialect/Tensor/TransformOps/TensorTransformOps.h.inc"

	#endif // MLIR_DIALECT_TENSOR_TRANSFORMOPS_TENSORTRANSFORMOPS_H			#endif // MLIR_DIALECT_TENSOR_TRANSFORMOPS_TENSORTRANSFORMOPS_H

mlir/include/mlir/Dialect/Tensor/TransformOps/TensorTransformOps.td

This file was added.

				//===- TensorTransformOps.td - Tensor transformation ops ---- tablegen --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef TENSOR_TRANSFORM_OPS
				#define TENSOR_TRANSFORM_OPS

				include "mlir/Dialect/PDL/IR/PDLTypes.td"
				include "mlir/Dialect/Transform/IR/TransformDialect.td"
				include "mlir/Dialect/Transform/IR/TransformInterfaces.td"
				include "mlir/Dialect/Transform/IR/TransformTypes.td"
				include "mlir/Interfaces/SideEffectInterfaces.td"
				include "mlir/IR/OpBase.td"

				def Transform_TensorPadOp : Transform_ConcreteOpType<"tensor.pad">;

				def MakeLoopIndependentOp
				: Op<Transform_Dialect, "tensor.make_loop_independent",
				[FunctionalStyleTransformOpTrait, MemoryEffectsOpInterface,
				nicolasvasilacheUnsubmitted Done Reply Inline Actions Can we rename this transform to `transform.loop_privatize` or something similar and have it take the op to privatize and the number of enclosing loops above which we want to hoist ? This should be kept in sync and compose with the recently landed `hoist_pad`. We can later evolve the syntax from num_loops to something better, informed by how we want these things to compose. nicolasvasilache: Can we rename this transform to `transform.loop_privatize` or something similar and have it…
				springermAuthorUnsubmitted Done Reply Inline Actions This does not privatize the tensor/op though. It simply changes the size of the tensor. Added the number of loops to the op. springerm: This does not privatize the tensor/op though. It simply changes the size of the tensor. Added…
				TransformOpInterface, TransformEachOpTrait]> {
				let description = [{
				Rewrite the targeted ops such that their index-typed operands no longer
				depend on any loop induction variable of the `num_loop` enclosing `scf.for`
				nicolasvasilacheUnsubmitted Done Reply Inline Actions Please explain a little more what this entails because it involves increasing the tensor size/dimensionality. nicolasvasilache: Please explain a little more what this entails because it involves increasing the tensor…
				loops. I.e., compute an upper bound that is independent of any such loop IV
				for every tensor dimension. The transformed op could then be hoisted from
				the `num_loop` enclosing loops. To preserve the original semantics, place a
				`tensor.extract_slice` inside the loop.

				Currently supported operations are:
				- tensor.empty: Replaced with a new tensor.empty with upper bound sizes,
				followed by a tensor.extract_slice.
				- tensor.pad: Replaced by an upper bound padding, followed by a
				tensor.extract_slice.

				#### Return modes

				This operation fails if at least one induction variable could not be
				eliminated. In case the targeted op is already independent of induction
				variables, this transform succeeds and returns the unmodified target op.

				Otherwise, the returned handle points to a subset of the produced ops:
				- tensor.empty: The returned handle points to the tensor.extract_slice op.
				- tensor.pad: The returned handle points to the tensor.extract_slice op.

				This transform op consumes the target handle and produces a result handle.
				}];

				let arguments = (ins PDL_Operation:$target, I64Attr:$num_loops);
				let results = (outs PDL_Operation:$transformed);
				let assemblyFormat = "$target attr-dict";

				let extraClassDeclaration = [{
				::mlir::DiagnosedSilenceableFailure applyToOne(
				::mlir::Operation *target,
				::mlir::transform::ApplyToEachResultList &results,
				::mlir::transform::TransformState &state);
				}];
				}

				#endif // TENSOR_TRANSFORM_OPS

mlir/include/mlir/Dialect/Tensor/Transforms/Transforms.h

Show All 30 Lines	FailureOr<TilingResult> replaceExtractSliceWithTiledProducer(
OpBuilder &builder, tensor::ExtractSliceOp sliceOp, OpResult producerOp);		OpBuilder &builder, tensor::ExtractSliceOp sliceOp, OpResult producerOp);

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Populate functions.		// Populate functions.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// Collects a set of patterns to rewrite ops within the tensor dialect.		/// Collects a set of patterns to rewrite ops within the tensor dialect.
void populateExpandOpsPatterns(RewritePatternSet &patterns);		void populateExpandOpsPatterns(RewritePatternSet &patterns);

		nicolasvasilacheUnsubmitted Done Reply Inline Actions The footprint in this example can be too high: you'd want to take the ceildiv by %step. nicolasvasilache: The footprint in this example can be too high: you'd want to take the ceildiv by %step.
		springermAuthorUnsubmitted Done Reply Inline Actions I tried to do that but `FlatAffineConstraints` was unable to compute an upper bound in that case. It's unclear why this is happening, I'm looking into this. Maybe the system of inequalities is getting too complex (with various "semi-affine exprs"...). It looks like `FlatAffineConstraints` must be extended. springerm: I tried to do that but `FlatAffineConstraints` was unable to compute an upper bound in that…
		springermAuthorUnsubmitted Done Reply Inline Actions This is indeed due to a shortcoming in `FlatAffineValueConstraints`: // TODO: Whenever there are local variables in the dependence // constraints, we'll conservatively over-approximate, since we don't // always explicitly compute them above (in the while loop). It is worth fixing this? Will probably take a while to understand and rewrite a 200 LOC function. springerm: This is indeed due to a shortcoming in `FlatAffineValueConstraints`: ``` // TODO…
		nicolasvasilacheUnsubmitted Done Reply Inline Actions Yes, this is the same rationale as using `step 1` below. Can we update the doc to make this explicit? I.e. divide by %step and mention that in the case of a symbol, we over-approximate with setting `step` to `1` to circumvent the limitation of the analysis. nicolasvasilache: Yes, this is the same rationale as using `step 1` below. Can we update the doc to make this…
		dcaballeUnsubmitted Done Reply Inline Actions What happened with this? Was it addressed? dcaballe: What happened with this? Was it addressed?
		springermAuthorUnsubmitted Done Reply Inline Actions There is a TODO in `SCF/IR/ValueBoundsOpInterfaceImpl.cpp`. We can't do any better at the moment. springerm: There is a TODO in `SCF/IR/ValueBoundsOpInterfaceImpl.cpp`. We can't do any better at the…
/// Appends patterns for folding tensor aliasing ops into consumer load/store		/// Appends patterns for folding tensor aliasing ops into consumer load/store
/// ops into `patterns`.		/// ops into `patterns`.
void populateFoldTensorSubsetOpPatterns(RewritePatternSet &patterns);		void populateFoldTensorSubsetOpPatterns(RewritePatternSet &patterns);

/// Collects patterns to merge consecutive tensor.insert_slice/extract_slice		/// Collects patterns to merge consecutive tensor.insert_slice/extract_slice
/// into one. These patterns are in in this separate entry point because the		/// into one. These patterns are in in this separate entry point because the
/// bufferization is sensitive over IR structure, particularly those		/// bufferization is sensitive over IR structure, particularly those
/// tensor.extract_slice and tensor.insert_slice ops for creating the slices.		/// tensor.extract_slice and tensor.insert_slice ops for creating the slices.
void populateMergeConsecutiveInsertExtractSlicePatterns(		void populateMergeConsecutiveInsertExtractSlicePatterns(
RewritePatternSet &patterns);		RewritePatternSet &patterns);

/// Populates `patterns` with patterns that fold `tensor.expand_shape` and		/// Populates `patterns` with patterns that fold `tensor.expand_shape` and
/// `tensor.collapse_shape` into other ops.		/// `tensor.collapse_shape` into other ops.
void populateReassociativeReshapeFoldingPatterns(RewritePatternSet &patterns);		void populateReassociativeReshapeFoldingPatterns(RewritePatternSet &patterns);

/// Populates `patterns` with patterns that fold tensor.empty with		/// Populates `patterns` with patterns that fold tensor.empty with
/// tensor.[extract_slice\|cast\|expand_shape\|collapse_shape].		/// tensor.[extract_slice\|cast\|expand_shape\|collapse_shape].
void populateFoldTensorEmptyPatterns(RewritePatternSet &patterns);		void populateFoldTensorEmptyPatterns(RewritePatternSet &patterns);

/// Populates `patterns` with patterns that fold operations like `tensor.pad`		/// Populates `patterns` with patterns that fold operations like `tensor.pad`
/// and `tensor.extract_slice` into `tensor.pack` and `tensor.unpack` operations		/// and `tensor.extract_slice` into `tensor.pack` and `tensor.unpack` operations
/// respectively.		/// respectively.
void populateFoldIntoPackAndUnpackPatterns(RewritePatternSet &patterns);		void populateFoldIntoPackAndUnpackPatterns(RewritePatternSet &patterns);

		//===----------------------------------------------------------------------===//
		// Transform helpers
		//===----------------------------------------------------------------------===//

		/// Build a new tensor::PadOp with low/high padding that is independent of all
		/// given independencies. If the op is already independent of all
		/// independencies, the same PadOp result is returned.
		///
		/// Failure indicates the no suitable upper bound for low/high padding could be
		/// found.
		///
		/// Example:
		/// scf.for %iv = %lb to %ub step %step {
		/// %high = affine.apply affine_map<(d0)[s0] -> (s0 - d0)> (%i)[%ub]
		/// %p = tensor.pad %t low[5] high[%high] ...
		/// ...
		/// }
		///
		/// The function builds IR such as:
		/// %high_new = affine.apply affine_map<()[s0, s1] -> (-s0 + s1)> ()[%lb, %ub]
		/// %p_hoistable = tensor.pad %t low[5] high[%high_new]
		/// %dim = tensor.dim %t, %c0
		/// %size = affine.apply affine_map<(d0)[s0, s1] -> (-d0 + s0 + s1 + 5)>
		/// (%iv)[%ub, %dim]
		/// %slice = tensor.extract_slice %p_hoistable [0] [%size] [1]
		///
		/// The slice is returned.
		FailureOr<Value> buildIndependentOp(OpBuilder &b, tensor::PadOp padOp,
		ValueRange independencies);

		/// Build a new tensor::EmptyOp who's dynamic sizes are independent of all
		/// given independencies. If the op is already independent of all
		/// independencies, the same EmptyOp result is returned.
		///
		/// Failure indicates the no suitable upper bound for the dynamic sizes could be
		/// found.
		FailureOr<Value> buildIndependentOp(OpBuilder &b, tensor::EmptyOp emptyOp,
		ValueRange independencies);

} // namespace tensor		} // namespace tensor
} // namespace mlir		} // namespace mlir

#endif // MLIR_DIALECT_TENSOR_TRANSFORMS_TRANSFORMS_H		#endif // MLIR_DIALECT_TENSOR_TRANSFORMS_TRANSFORMS_H

mlir/include/mlir/InitAllDialects.h

Show First 20 Lines • Show All 65 Lines • ▼ Show 20 Lines
#include "mlir/Dialect/Shape/IR/Shape.h"		#include "mlir/Dialect/Shape/IR/Shape.h"
#include "mlir/Dialect/Shape/Transforms/BufferizableOpInterfaceImpl.h"		#include "mlir/Dialect/Shape/Transforms/BufferizableOpInterfaceImpl.h"
#include "mlir/Dialect/SparseTensor/IR/SparseTensor.h"		#include "mlir/Dialect/SparseTensor/IR/SparseTensor.h"
#include "mlir/Dialect/SparseTensor/Transforms/BufferizableOpInterfaceImpl.h"		#include "mlir/Dialect/SparseTensor/Transforms/BufferizableOpInterfaceImpl.h"
#include "mlir/Dialect/Tensor/IR/Tensor.h"		#include "mlir/Dialect/Tensor/IR/Tensor.h"
#include "mlir/Dialect/Tensor/IR/TensorInferTypeOpInterfaceImpl.h"		#include "mlir/Dialect/Tensor/IR/TensorInferTypeOpInterfaceImpl.h"
#include "mlir/Dialect/Tensor/IR/TensorTilingInterfaceImpl.h"		#include "mlir/Dialect/Tensor/IR/TensorTilingInterfaceImpl.h"
#include "mlir/Dialect/Tensor/IR/ValueBoundsOpInterfaceImpl.h"		#include "mlir/Dialect/Tensor/IR/ValueBoundsOpInterfaceImpl.h"
		#include "mlir/Dialect/Tensor/TransformOps/TensorTransformOps.h"
#include "mlir/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.h"		#include "mlir/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.h"
#include "mlir/Dialect/Tosa/IR/TosaOps.h"		#include "mlir/Dialect/Tosa/IR/TosaOps.h"
#include "mlir/Dialect/Transform/IR/TransformDialect.h"		#include "mlir/Dialect/Transform/IR/TransformDialect.h"
#include "mlir/Dialect/Vector/IR/VectorOps.h"		#include "mlir/Dialect/Vector/IR/VectorOps.h"
#include "mlir/Dialect/Vector/TransformOps/VectorTransformOps.h"		#include "mlir/Dialect/Vector/TransformOps/VectorTransformOps.h"
#include "mlir/Dialect/Vector/Transforms/BufferizableOpInterfaceImpl.h"		#include "mlir/Dialect/Vector/Transforms/BufferizableOpInterfaceImpl.h"
#include "mlir/Dialect/X86Vector/X86VectorDialect.h"		#include "mlir/Dialect/X86Vector/X86VectorDialect.h"
#include "mlir/IR/Dialect.h"		#include "mlir/IR/Dialect.h"
▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines	inline void registerAllDialects(DialectRegistry &registry) {

// Register all dialect extensions.		// Register all dialect extensions.
affine::registerTransformDialectExtension(registry);		affine::registerTransformDialectExtension(registry);
bufferization::registerTransformDialectExtension(registry);		bufferization::registerTransformDialectExtension(registry);
gpu::registerTransformDialectExtension(registry);		gpu::registerTransformDialectExtension(registry);
linalg::registerTransformDialectExtension(registry);		linalg::registerTransformDialectExtension(registry);
memref::registerTransformDialectExtension(registry);		memref::registerTransformDialectExtension(registry);
scf::registerTransformDialectExtension(registry);		scf::registerTransformDialectExtension(registry);
		tensor::registerTransformDialectExtension(registry);
vector::registerTransformDialectExtension(registry);		vector::registerTransformDialectExtension(registry);

// Register all external models.		// Register all external models.
affine::registerValueBoundsOpInterfaceExternalModels(registry);		affine::registerValueBoundsOpInterfaceExternalModels(registry);
arith::registerBufferizableOpInterfaceExternalModels(registry);		arith::registerBufferizableOpInterfaceExternalModels(registry);
arith::registerValueBoundsOpInterfaceExternalModels(registry);		arith::registerValueBoundsOpInterfaceExternalModels(registry);
bufferization::func_ext::registerBufferizableOpInterfaceExternalModels(		bufferization::func_ext::registerBufferizableOpInterfaceExternalModels(
registry);		registry);
Show All 27 Lines

mlir/include/mlir/Interfaces/ValueBoundsOpInterface.h

Show First 20 Lines • Show All 108 Lines • ▼ Show 20 Lines	static LogicalResult computeBound(AffineMap &resultMap,
presburger::BoundType type, Value value,		presburger::BoundType type, Value value,
std::optional<int64_t> dim,		std::optional<int64_t> dim,
StopConditionFn stopCondition,		StopConditionFn stopCondition,
bool closedUB = false);		bool closedUB = false);

/// Compute a bound in terms of the values/dimensions in `dependencies`. The		/// Compute a bound in terms of the values/dimensions in `dependencies`. The
/// computed bound consists of only constant terms and dependent values (or		/// computed bound consists of only constant terms and dependent values (or
/// dimension sizes thereof).		/// dimension sizes thereof).
static LogicalResult computeBound(AffineMap &resultMap,		static LogicalResult
ValueDimList &mapOperands,		computeDependentBound(AffineMap &resultMap, ValueDimList &mapOperands,
presburger::BoundType type, Value value,		presburger::BoundType type, Value value,
std::optional<int64_t> dim,		std::optional<int64_t> dim, ValueDimList dependencies,
ValueDimList dependencies,		bool closedUB = false);

		/// Compute a bound in that is independent of all values in `independencies`.
		///
		/// Independencies are the opposite of dependencies. The computed bound does
		/// not contain any SSA values that are part of `independencies`. E.g., this
		/// function can be used to make ops hoistable from loops. To that end, ops
		/// must be made independent of loop induction variables (in the case of "for"
		/// loops). Loop induction variables are the independencies; they may not
		/// appear in the computed bound.
		static LogicalResult
		computeIndependentBound(AffineMap &resultMap, ValueDimList &mapOperands,
		presburger::BoundType type, Value value,
		std::optional<int64_t> dim, ValueRange independencies,
bool closedUB = false);		bool closedUB = false);

/// Compute a constant bound for the given index-typed value or shape		/// Compute a constant bound for the given index-typed value or shape
/// dimension size.		/// dimension size.
		dcaballeUnsubmitted Done Reply Inline Actions typos Could you please elaborate a bit more on what "independent of the values in independencies" mean? dcaballe: typos Could you please elaborate a bit more on what "independent of the values in…
///		///
/// `dim` must be `nullopt` if and only if `value` is index-typed. This		/// `dim` must be `nullopt` if and only if `value` is index-typed. This
/// function traverses the backward slice of the given value in a		/// function traverses the backward slice of the given value in a
/// worklist-driven manner until `stopCondition` evaluates to "true". The		/// worklist-driven manner until `stopCondition` evaluates to "true". The
/// constraint set is populated according to `ValueBoundsOpInterface` for each		/// constraint set is populated according to `ValueBoundsOpInterface` for each
/// visited value. (No constraints are added for values for which the stop		/// visited value. (No constraints are added for values for which the stop
/// condition evaluates to "true".)		/// condition evaluates to "true".)
///		///
▲ Show 20 Lines • Show All 108 Lines • Show Last 20 Lines

mlir/lib/Dialect/Affine/Transforms/ReifyValueBounds.cpp

	Show All 13 Lines
	#include "mlir/Interfaces/ValueBoundsOpInterface.h"			#include "mlir/Interfaces/ValueBoundsOpInterface.h"

	using namespace mlir;			using namespace mlir;
	using namespace mlir::affine;			using namespace mlir::affine;

	static FailureOr<OpFoldResult>			static FailureOr<OpFoldResult>
	reifyValueBound(OpBuilder &b, Location loc, presburger::BoundType type,			reifyValueBound(OpBuilder &b, Location loc, presburger::BoundType type,
	Value value, std::optional<int64_t> dim,			Value value, std::optional<int64_t> dim,
	function_ref<bool(Value, std::optional<int64_t>)> stopCondition,			ValueBoundsConstraintSet::StopConditionFn stopCondition,
	bool closedUB) {			bool closedUB) {
	// Compute bound.			// Compute bound.
	AffineMap boundMap;			AffineMap boundMap;
	ValueDimList mapOperands;			ValueDimList mapOperands;
	if (failed(ValueBoundsConstraintSet::computeBound(			if (failed(ValueBoundsConstraintSet::computeBound(
	boundMap, mapOperands, type, value, dim, stopCondition, closedUB)))			boundMap, mapOperands, type, value, dim, stopCondition, closedUB)))
	return failure();			return failure();

				// Reify bound.
				return affine::materializeComputedBound(b, loc, boundMap, mapOperands);
				}

				OpFoldResult affine::materializeComputedBound(
				OpBuilder &b, Location loc, AffineMap boundMap,
				ArrayRef<std::pair<Value, std::optional<int64_t>>> mapOperands) {
	// Materialize tensor.dim/memref.dim ops.			// Materialize tensor.dim/memref.dim ops.
	SmallVector<Value> operands;			SmallVector<Value> operands;
	for (auto valueDim : mapOperands) {			for (auto valueDim : mapOperands) {
	Value value = valueDim.first;			Value value = valueDim.first;
	std::optional<int64_t> dim = valueDim.second;			std::optional<int64_t> dim = valueDim.second;

	if (!dim.has_value()) {			if (!dim.has_value()) {
	// This is an index-typed value.			// This is an index-typed value.
	▲ Show 20 Lines • Show All 64 Lines • Show Last 20 Lines

mlir/lib/Dialect/Tensor/TransformOps/CMakeLists.txt

	add_mlir_dialect_library(MLIRTensorTransformOps			add_mlir_dialect_library(MLIRTensorTransformOps
	TensorTransformOps.cpp			TensorTransformOps.cpp

	ADDITIONAL_HEADER_DIRS			ADDITIONAL_HEADER_DIRS
	${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/Tensor/TransformOps			${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/Tensor/TransformOps

				DEPENDS
				MLIRTensorTransformOpsIncGen

	LINK_LIBS PUBLIC			LINK_LIBS PUBLIC
				MLIRAffineDialect
	MLIRIR			MLIRIR
	MLIRPDLDialect			MLIRPDLDialect
				MLIRSCFDialect
	MLIRTensorDialect			MLIRTensorDialect
				MLIRTensorTransforms
	MLIRTransformDialect			MLIRTransformDialect
	)			)

mlir/lib/Dialect/Tensor/TransformOps/TensorTransformOps.cpp

//===- TensorTransformOps.cpp - Implementation of tensor transform ops ----===//		//===- TensorTransformOps.cpp - Implementation of tensor transform ops ----===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "mlir/Dialect/Tensor/TransformOps/TensorTransformOps.h"		#include "mlir/Dialect/Tensor/TransformOps/TensorTransformOps.h"

		#include "mlir/Dialect/Affine/IR/AffineOps.h"
		#include "mlir/Dialect/SCF/IR/SCF.h"
#include "mlir/Dialect/Tensor/IR/Tensor.h"		#include "mlir/Dialect/Tensor/IR/Tensor.h"
		#include "mlir/Dialect/Tensor/Transforms/Transforms.h"
#include "mlir/Dialect/Transform/IR/TransformDialect.h"		#include "mlir/Dialect/Transform/IR/TransformDialect.h"
		#include "mlir/Dialect/Transform/IR/TransformInterfaces.h"
#include "llvm/ADT/TypeSwitch.h"		#include "llvm/ADT/TypeSwitch.h"

using namespace mlir;		using namespace mlir;
using namespace tensor;		using namespace tensor;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// TrackingListener		// TrackingListener
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
Show All 23 Lines	llvm::TypeSwitch<Operation *>(defOp)
[&](ExpandShapeOp op) { values.push_back(op.getSrc()); })		[&](ExpandShapeOp op) { values.push_back(op.getSrc()); })
.Case<ReshapeOp>(		.Case<ReshapeOp>(
[&](ReshapeOp op) { values.push_back(op.getSource()); })		[&](ReshapeOp op) { values.push_back(op.getSource()); })
.Default([](Operation *op) {});		.Default([](Operation *op) {});
} while (!values.empty());		} while (!values.empty());

return nullptr;		return nullptr;
}		}

		//===----------------------------------------------------------------------===//
		// MakeLoopIndependentOp
		//===----------------------------------------------------------------------===//

		DiagnosedSilenceableFailure transform::MakeLoopIndependentOp::applyToOne(
		Operation *target, transform::ApplyToEachResultList &results,
		transform::TransformState &state) {
		// Gather IVs.
		SmallVector<Value> ivs;
		Operation *nextOp = target;
		for (uint64_t i = 0, e = getNumLoops(); i < e; ++i) {
		dcaballeUnsubmitted Done Reply Inline Actions nit: ub to var dcaballe: nit: ub to var
		nextOp = nextOp->getParentOfType<scf::ForOp>();
		if (!nextOp) {
		DiagnosedSilenceableFailure diag = emitSilenceableError()
		<< "could not find " << i
		<< "-th enclosing loop";
		diag.attachNote(target->getLoc()) << "target op";
		return diag;
		}
		ivs.push_back(cast<scf::ForOp>(nextOp).getInductionVar());
		}

		// Rewrite IR.
		IRRewriter rewriter(target->getContext());
		FailureOr<Value> replacement = failure();
		if (auto padOp = dyn_cast<tensor::PadOp>(target)) {
		replacement = tensor::buildIndependentOp(rewriter, padOp, ivs);
		} else if (auto emptyOp = dyn_cast<tensor::EmptyOp>(target)) {
		replacement = tensor::buildIndependentOp(rewriter, emptyOp, ivs);
		} else {
		DiagnosedSilenceableFailure diag = emitSilenceableError()
		<< "unsupported target op";
		diag.attachNote(target->getLoc()) << "target op";
		return diag;
		}
		if (failed(replacement)) {
		DiagnosedSilenceableFailure diag =
		emitSilenceableError() << "could not make target op loop-independent";
		diag.attachNote(target->getLoc()) << "target op";
		return diag;
		}
		rewriter.replaceOp(target, *replacement);
		results.push_back(replacement->getDefiningOp());
		return DiagnosedSilenceableFailure::success();
		}

		//===----------------------------------------------------------------------===//
		// Transform op registration
		//===----------------------------------------------------------------------===//

		namespace {
		class TensorTransformDialectExtension
		: public transform::TransformDialectExtension<
		TensorTransformDialectExtension> {
		public:
		using Base::Base;

		void init() {
		declareGeneratedDialect<affine::AffineDialect>();
		declareGeneratedDialect<tensor::TensorDialect>();

		registerTransformOps<
		#define GET_OP_LIST
		#include "mlir/Dialect/Tensor/TransformOps/TensorTransformOps.cpp.inc"
		>();
		}
		};
		} // namespace

		#define GET_OP_CLASSES
		#include "mlir/Dialect/Tensor/TransformOps/TensorTransformOps.cpp.inc"

		void mlir::tensor::registerTransformDialectExtension(
		DialectRegistry &registry) {
		registry.addExtensions<TensorTransformDialectExtension>();
		}

mlir/lib/Dialect/Tensor/Transforms/CMakeLists.txt

	add_mlir_dialect_library(MLIRTensorTransforms			add_mlir_dialect_library(MLIRTensorTransforms
	BufferizableOpInterfaceImpl.cpp			BufferizableOpInterfaceImpl.cpp
	Bufferize.cpp			Bufferize.cpp
	EmptyOpPatterns.cpp			EmptyOpPatterns.cpp
	ExtractSliceFromReshapeUtils.cpp			ExtractSliceFromReshapeUtils.cpp
	FoldIntoPackAndUnpackPatterns.cpp			FoldIntoPackAndUnpackPatterns.cpp
	FoldTensorSubsetOps.cpp			FoldTensorSubsetOps.cpp
				IndependenceTransforms.cpp
	MergeConsecutiveInsertExtractSlicePatterns.cpp			MergeConsecutiveInsertExtractSlicePatterns.cpp
	ReshapePatterns.cpp			ReshapePatterns.cpp
	SwapExtractSliceWithProducerPatterns.cpp			SwapExtractSliceWithProducerPatterns.cpp

	ADDITIONAL_HEADER_DIRS			ADDITIONAL_HEADER_DIRS
	${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/Tensor/Transforms			${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/Tensor/Transforms

	DEPENDS			DEPENDS
	MLIRTensorTransformsIncGen			MLIRTensorTransformsIncGen

	LINK_LIBS PUBLIC			LINK_LIBS PUBLIC
	MLIRAffineDialect			MLIRAffineDialect
				MLIRAffineTransforms
	MLIRAffineUtils			MLIRAffineUtils
	MLIRArithDialect			MLIRArithDialect
	MLIRBufferizationDialect			MLIRBufferizationDialect
	MLIRBufferizationTransforms			MLIRBufferizationTransforms
	MLIRIR			MLIRIR
	MLIRLinalgDialect			MLIRLinalgDialect
	MLIRMemRefDialect			MLIRMemRefDialect
	MLIRPass			MLIRPass
	MLIRSCFDialect			MLIRSCFDialect
	MLIRTensorDialect			MLIRTensorDialect
	MLIRTilingInterface			MLIRTilingInterface
	MLIRTransforms			MLIRTransforms
	MLIRVectorDialect			MLIRVectorDialect
				MLIRValueBoundsOpInterface
	)			)

mlir/lib/Dialect/Tensor/Transforms/IndependenceTransforms.cpp

This file was added.

				//===- IndependenceTransforms.cpp - Make ops independent of values --------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/Dialect/Tensor/Transforms/Transforms.h"

				#include "mlir/Dialect/Affine/IR/AffineOps.h"
				#include "mlir/Dialect/Affine/Transforms/Transforms.h"
				#include "mlir/Dialect/Tensor/IR/Tensor.h"
				#include "mlir/Dialect/Utils/StaticValueUtils.h"
				#include "mlir/Interfaces/ValueBoundsOpInterface.h"

				using namespace mlir;
				using namespace mlir::tensor;

				/// Make the given OpFoldResult independent of all independencies.
				static FailureOr<OpFoldResult> makeIndependent(OpBuilder &b, Location loc,
				OpFoldResult ofr,
				ValueRange independencies) {
				if (ofr.is<Attribute>())
				return ofr;
				Value value = ofr.get<Value>();
				AffineMap boundMap;
				ValueDimList mapOperands;
				if (failed(ValueBoundsConstraintSet::computeIndependentBound(
				boundMap, mapOperands, presburger::BoundType::UB, value,
				/dim=/std::nullopt, independencies, /closedUB=/true)))
				return failure();
				return mlir::affine::materializeComputedBound(b, loc, boundMap, mapOperands);
				}

				FailureOr<Value> tensor::buildIndependentOp(OpBuilder &b, tensor::PadOp padOp,
				ValueRange independencies) {
				OpBuilder::InsertionGuard g(b);
				b.setInsertionPoint(padOp);
				Location loc = padOp.getLoc();

				// Non-constant padding not supported.
				Value constantPadding = padOp.getConstantPaddingValue();
				if (!constantPadding)
				return failure();

				SmallVector<OpFoldResult> newMixedLow, newMixedHigh;
				for (OpFoldResult ofr : padOp.getMixedLowPad()) {
				auto ub = makeIndependent(b, loc, ofr, independencies);
				if (failed(ub))
				return failure();
				newMixedLow.push_back(*ub);
				}
				for (OpFoldResult ofr : padOp.getMixedHighPad()) {
				auto ub = makeIndependent(b, loc, ofr, independencies);
				if (failed(ub))
				return failure();
				newMixedHigh.push_back(*ub);
				}

				// Return existing tensor::PadOp if nothing has changed.
				if (llvm::equal(padOp.getMixedLowPad(), newMixedLow) &&
				llvm::equal(padOp.getMixedHighPad(), newMixedHigh))
				return padOp.getResult();

				// Create a new tensor::PadOp.
				auto newPadOp = b.create<PadOp>(
				loc, padOp.getResultType(), padOp.getSource(), newMixedLow, newMixedHigh,
				constantPadding, padOp.getNofold(), /attrs=/ArrayRef<NamedAttribute>{});

				// Create a tensor::ExtractSliceOp.
				// Reify the result sizes of the old tensor::PadOp.
				ReifiedRankedShapedTypeDims reifiedSizes;
				ReifyRankedShapedTypeOpInterface reifyShapedTypeInterface =
				dyn_cast<ReifyRankedShapedTypeOpInterface>(padOp.getOperation());
				if (failed(reifyShapedTypeInterface.reifyResultShapes(b, reifiedSizes)))
				return failure();
				SmallVector<OpFoldResult> offsets, sizes, strides;
				for (int64_t i = 0, e = padOp.getResultType().getRank(); i < e; ++i) {
				// offset = ub(low_padding) - low_padding
				dcaballeUnsubmitted Done Reply Inline Actions nit: ub to var dcaballe: nit: ub to var
				OpFoldResult prevLow = padOp.getMixedLowPad()[i];
				if (prevLow.is<Attribute>()) {
				offsets.push_back(b.getIndexAttr(0));
				} else {
				offsets.push_back(
				b.create<affine::AffineApplyOp>(
				loc, b.getAffineDimExpr(0) - b.getAffineDimExpr(1),
				std::initializer_list<Value>{newMixedLow[i].get<Value>(),
				prevLow.get<Value>()})
				.getResult());
				}
				// size = reified result size
				if (!padOp.getResultType().isDynamicDim(i)) {
				sizes.push_back(b.getIndexAttr(padOp.getResultType().getDimSize(i)));
				} else {
				sizes.push_back(reifiedSizes[0][i]);
				}
				// stride = 1
				strides.push_back(b.getIndexAttr(1));
				}

				return b.create<ExtractSliceOp>(loc, newPadOp, offsets, sizes, strides)
				.getResult();
				}

				FailureOr<Value> tensor::buildIndependentOp(OpBuilder &b,
				tensor::EmptyOp emptyOp,
				ValueRange independencies) {
				OpBuilder::InsertionGuard g(b);
				b.setInsertionPoint(emptyOp);
				Location loc = emptyOp.getLoc();

				SmallVector<OpFoldResult> newSizes;
				for (OpFoldResult ofr : emptyOp.getMixedSizes()) {
				auto ub = makeIndependent(b, loc, ofr, independencies);
				if (failed(ub))
				return failure();
				newSizes.push_back(*ub);
				}

				// Return existing tensor::EmptyOp if nothing has changed.
				if (llvm::equal(emptyOp.getMixedSizes(), newSizes))
				return emptyOp.getResult();

				// Create a new tensor::EmptyOp.
				Value newEmptyOp =
				b.create<EmptyOp>(loc, newSizes, emptyOp.getType().getElementType());

				// Create a tensor::ExtractSliceOp.
				SmallVector<OpFoldResult> offsets(newSizes.size(), b.getIndexAttr(0));
				SmallVector<OpFoldResult> strides(newSizes.size(), b.getIndexAttr(1));
				return b
				.create<ExtractSliceOp>(loc, newEmptyOp, offsets, emptyOp.getMixedSizes(),
				strides)
				.getResult();
				}

mlir/lib/Interfaces/ValueBoundsOpInterface.cpp

Show First 20 Lines • Show All 350 Lines • ▼ Show 20 Lines	for (int64_t i = 0; i < cstr.cstr.getNumDimAndSymbolVars(); ++i) {
mapOperands.push_back(std::make_pair(value, dim));		mapOperands.push_back(std::make_pair(value, dim));
}		}

resultMap = bound.replaceDimsAndSymbols(replacementDims, replacementSymbols,		resultMap = bound.replaceDimsAndSymbols(replacementDims, replacementSymbols,
numDims, numSymbols);		numDims, numSymbols);
return success();		return success();
}		}

LogicalResult ValueBoundsConstraintSet::computeBound(		LogicalResult ValueBoundsConstraintSet::computeDependentBound(
AffineMap &resultMap, ValueDimList &mapOperands, presburger::BoundType type,		AffineMap &resultMap, ValueDimList &mapOperands, presburger::BoundType type,
Value value, std::optional<int64_t> dim, ValueDimList dependencies,		Value value, std::optional<int64_t> dim, ValueDimList dependencies,
bool closedUB) {		bool closedUB) {
return computeBound(		return computeBound(
resultMap, mapOperands, type, value, dim,		resultMap, mapOperands, type, value, dim,
[&](Value v, std::optional<int64_t> d) {		[&](Value v, std::optional<int64_t> d) {
return llvm::is_contained(dependencies, std::make_pair(v, d));		return llvm::is_contained(dependencies, std::make_pair(v, d));
},		},
closedUB);		closedUB);
}		}

		LogicalResult ValueBoundsConstraintSet::computeIndependentBound(
		AffineMap &resultMap, ValueDimList &mapOperands, presburger::BoundType type,
		Value value, std::optional<int64_t> dim, ValueRange independencies,
		bool closedUB) {
		// Return "true" if the given value is independent of all values in
		// `independencies`. I.e., neither the value itself nor any value in the
		// backward slice (reverse use-def chain) is contained in `independencies`.
		dcaballeUnsubmitted Done Reply Inline Actions Something like this in the header doc is what I was hoping for :) dcaballe: Something like this in the header doc is what I was hoping for :)
		auto isIndependent = [&](Value v) {
		SmallVector<Value> worklist;
		DenseSet<Value> visited;
		worklist.push_back(v);
		while (!worklist.empty()) {
		Value next = worklist.pop_back_val();
		if (visited.contains(next))
		continue;
		visited.insert(next);
		if (llvm::is_contained(independencies, next))
		return false;
		// TODO: DominanceInfo could be used to stop the traversal early.
		Operation *op = next.getDefiningOp();
		if (!op)
		continue;
		worklist.append(op->getOperands().begin(), op->getOperands().end());
		}
		return true;
		};

		// Reify bounds in terms of any independent values.
		return computeBound(
		resultMap, mapOperands, type, value, dim,
		[&](Value v, std::optional<int64_t> d) { return isIndependent(v); },
		closedUB);
		}

FailureOr<int64_t> ValueBoundsConstraintSet::computeConstantBound(		FailureOr<int64_t> ValueBoundsConstraintSet::computeConstantBound(
presburger::BoundType type, Value value, std::optional<int64_t> dim,		presburger::BoundType type, Value value, std::optional<int64_t> dim,
StopConditionFn stopCondition, bool closedUB) {		StopConditionFn stopCondition, bool closedUB) {
#ifndef NDEBUG		#ifndef NDEBUG
assertValidValueDim(value, dim);		assertValidValueDim(value, dim);
#endif // NDEBUG		#endif // NDEBUG

// Process the backward slice of `value` (i.e., reverse use-def chain) until		// Process the backward slice of `value` (i.e., reverse use-def chain) until
▲ Show 20 Lines • Show All 99 Lines • Show Last 20 Lines

mlir/test/Dialect/Tensor/transform-op-make-loop-independent.mlir

This file was added.

				// RUN: mlir-opt %s -allow-unregistered-dialect \
				// RUN: -test-transform-dialect-interpreter -canonicalize \
				// RUN: -split-input-file -verify-diagnostics \| FileCheck %s

				// This is a test case where "high" padding depends on the IV.

				// CHECK: #[[$map:.*]] = affine_map<()[s0, s1] -> (s0 - s1)>
				// CHECK: #[[$map1:.*]] = affine_map<(d0)[s0, s1] -> (-d0 + s0 + s1 + 5)>
				// CHECK-LABEL: func @make_pad_loop_independent_1(
				// CHECK-SAME: %[[lb:.]]: index, %[[ub:.]]: index, %[[step:.*]]: index,
				// CHECK-SAME: %[[t:.*]]: tensor<?xf32>
				func.func @make_pad_loop_independent_1(%lb: index, %ub: index, %step: index,
				%t: tensor<?xf32>, %f: f32) {
				// CHECK: scf.for %[[iv:.*]] = %[[lb]] to %[[ub]]
				scf.for %i = %lb to %ub step %step {
				// CHECK: %[[high:.*]] = affine.apply #[[$map]]()[%[[ub]], %[[lb]]]
				// CHECK: %[[padded:.*]] = tensor.pad %[[t]] low[5] high[%[[high]]]
				// CHECK: %[[dim:.*]] = tensor.dim %[[t]]
				// CHECK: %[[size:.*]] = affine.apply #[[$map1]](%[[iv]])[%[[ub]], %[[dim]]]
				// CHECK: %[[replacement:.*]] = tensor.extract_slice %[[padded]][0] [%[[size]]] [1]
				%high = affine.apply affine_map<(d0)[s0] -> (s0 - d0)> (%i)[%ub]
				%p = tensor.pad %t low[5] high[%high] {
				^bb0(%arg1: index):
				tensor.yield %f : f32
				} : tensor<?xf32> to tensor<?xf32>
				// CHECK: "dummy.some_use"(%[[replacement]])
				"dummy.some_use"(%p) : (tensor<?xf32>) -> ()
				}
				return
				}

				transform.sequence failures(propagate) {
				^bb1(%arg1: !pdl.operation):
				%0 = transform.structured.match ops{["tensor.pad"]} in %arg1 : (!pdl.operation) -> !pdl.operation
				%1 = transform.tensor.make_loop_independent %0 {num_loops = 1}
				}

				// -----

				// This is a test case where "low" padding depends on the IV.

				// CHECK: #[[$map:.*]] = affine_map<()[s0, s1] -> (s0 - s1)>
				// CHECK: #[[$map1:.*]] = affine_map<(d0)[s0, s1] -> (-d0 + s0 + s1 + 5)>
				// CHECK: #[[$map2:.*]] = affine_map<(d0)[s0] -> (d0 - s0)>
				// CHECK-LABEL: func @make_pad_loop_independent_1(
				// CHECK-SAME: %[[lb:.]]: index, %[[ub:.]]: index, %[[step:.*]]: index,
				// CHECK-SAME: %[[t:.*]]: tensor<?xf32>
				func.func @make_pad_loop_independent_1(%lb: index, %ub: index, %step: index,
				%t: tensor<?xf32>, %f: f32) {
				// CHECK: scf.for %[[iv:.*]] = %[[lb]] to %[[ub]]
				scf.for %i = %lb to %ub step %step {
				// CHECK: %[[low:.*]] = affine.apply #[[$map]]()[%[[ub]], %[[lb]]]
				// CHECK: %[[padded:.*]] = tensor.pad %[[t]] low[%[[low]]] high[5]
				// CHECK: %[[dim:.*]] = tensor.dim %[[t]]
				// CHECK: %[[size:.*]] = affine.apply #[[$map1]](%[[iv]])[%[[ub]], %[[dim]]]
				// CHECK: %[[offset:.*]] = affine.apply #[[$map2]](%[[iv]])[%[[lb]]]
				// CHECK: %[[replacement:.*]] = tensor.extract_slice %[[padded]][%[[offset]]] [%[[size]]] [1]
				%low = affine.apply affine_map<(d0)[s0] -> (s0 - d0)> (%i)[%ub]
				%p = tensor.pad %t low[%low] high[5] {
				^bb0(%arg1: index):
				tensor.yield %f : f32
				} : tensor<?xf32> to tensor<?xf32>
				// CHECK: "dummy.some_use"(%[[replacement]])
				"dummy.some_use"(%p) : (tensor<?xf32>) -> ()
				}
				return
				}

				transform.sequence failures(propagate) {
				^bb1(%arg1: !pdl.operation):
				%0 = transform.structured.match ops{["tensor.pad"]} in %arg1 : (!pdl.operation) -> !pdl.operation
				%1 = transform.tensor.make_loop_independent %0 {num_loops = 1}
				}

				// -----

				// CHECK: #[[$map:.]] = affine_map<()[s0] -> (s0 2 - 2)>
				// CHECK-LABEL: func @two_loops(
				func.func @two_loops(%lb: index, %ub: index, %step: index,
				%t: tensor<?xf32>, %f: f32) {
				scf.for %i = %lb to %ub step %step {
				scf.for %j = %lb to %ub step %step {
				// CHECK: affine.apply #map()[%{{.*}}]
				%low = affine.apply affine_map<(d0, d1)[] -> (d0 + d1)> (%i, %j)[]
				%p = tensor.pad %t low[%low] high[5] {
				^bb0(%arg1: index):
				tensor.yield %f : f32
				} : tensor<?xf32> to tensor<?xf32>
				"dummy.some_use"(%p) : (tensor<?xf32>) -> ()
				}
				}
				return
				}

				transform.sequence failures(propagate) {
				^bb1(%arg1: !pdl.operation):
				%0 = transform.structured.match ops{["tensor.pad"]} in %arg1 : (!pdl.operation) -> !pdl.operation
				%1 = transform.tensor.make_loop_independent %0 {num_loops = 2}
				}

				// -----

				func.func @not_enough_loops(%lb: index, %ub: index, %step: index,
				%t: tensor<?xf32>, %f: f32) {
				scf.for %i = %lb to %ub step %step {
				scf.for %j = %lb to %ub step %step {
				%low = affine.apply affine_map<(d0, d1)[] -> (d0 + d1)> (%i, %j)[]
				// expected-note@below {{target op}}
				%p = tensor.pad %t low[%low] high[5] {
				^bb0(%arg1: index):
				tensor.yield %f : f32
				} : tensor<?xf32> to tensor<?xf32>
				"dummy.some_use"(%p) : (tensor<?xf32>) -> ()
				}
				}
				return
				}

				transform.sequence failures(propagate) {
				^bb1(%arg1: !pdl.operation):
				%0 = transform.structured.match ops{["tensor.pad"]} in %arg1 : (!pdl.operation) -> !pdl.operation
				// expected-error@below {{could not find 2-th enclosing loop}}
				%1 = transform.tensor.make_loop_independent %0 {num_loops = 3}
				}

				// -----

				// CHECK: #[[$map:.*]] = affine_map<(d0)[s0] -> (-d0 + s0)>
				// CHECK: #[[$map1:.*]] = affine_map<()[s0, s1] -> (s0 - s1)>
				// CHECK-LABEL: func @make_empty_loop_independent(
				// CHECK-SAME: %[[lb:.]]: index, %[[ub:.]]: index, %[[step:.*]]: index)
				func.func @make_empty_loop_independent(%lb: index, %ub: index, %step: index) {
				// CHECK: scf.for %[[iv:.*]] = %[[lb]] to %[[ub]]
				scf.for %i = %lb to %ub step %step {
				// CHECK: %[[slice_sz:.*]] = affine.apply #[[$map]](%[[iv]])[%[[ub]]]
				// CHECK: %[[empty_sz:.*]] = affine.apply #[[$map1]]()[%[[ub]], %[[lb]]]
				// CHECK: %[[empty:.*]] = tensor.empty(%[[empty_sz]]) : tensor<?xf32>
				// CHECK: %[[replacement:.*]] = tensor.extract_slice %[[empty]][0] [%[[slice_sz]]] [1]
				%sz = affine.apply affine_map<(d0)[s0] -> (s0 - d0)> (%i)[%ub]
				%empty = tensor.empty(%sz) : tensor<?xf32>
				// CHECK: "dummy.some_use"(%[[replacement]])
				"dummy.some_use"(%empty) : (tensor<?xf32>) -> ()
				}
				return
				}

				transform.sequence failures(propagate) {
				^bb1(%arg1: !pdl.operation):
				%0 = transform.structured.match ops{["tensor.empty"]} in %arg1 : (!pdl.operation) -> !pdl.operation
				%1 = transform.tensor.make_loop_independent %0 {num_loops = 1}
				}

utils/bazel/llvm-project-overlay/mlir/BUILD.bazel

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 5,802 Lines • ▼ Show 20 Lines	hdrs = [
"include/mlir/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.h",		"include/mlir/Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.h",
"include/mlir/Dialect/Tensor/Transforms/Passes.h",		"include/mlir/Dialect/Tensor/Transforms/Passes.h",
"include/mlir/Dialect/Tensor/Transforms/TransformUtils.h",		"include/mlir/Dialect/Tensor/Transforms/TransformUtils.h",
"include/mlir/Dialect/Tensor/Transforms/Transforms.h",		"include/mlir/Dialect/Tensor/Transforms/Transforms.h",
],		],
includes = ["include"],		includes = ["include"],
deps = [		deps = [
":AffineDialect",		":AffineDialect",
		":AffineTransforms",
":AffineUtils",		":AffineUtils",
":ArithDialect",		":ArithDialect",
":ArithUtils",		":ArithUtils",
":BufferizationDialect",		":BufferizationDialect",
":BufferizationTransforms",		":BufferizationTransforms",
":DialectUtils",		":DialectUtils",
":FuncDialect",		":FuncDialect",
":IR",		":IR",
":LinalgDialect",		":LinalgDialect",
":MemRefDialect",		":MemRefDialect",
":Pass",		":Pass",
":SCFDialect",		":SCFDialect",
":TensorDialect",		":TensorDialect",
":TensorPassIncGen",		":TensorPassIncGen",
":TilingInterface",		":TilingInterface",
":Transforms",		":Transforms",
		":ValueBoundsOpInterface",
":VectorDialect",		":VectorDialect",
"//llvm:Support",		"//llvm:Support",
],		],
)		)

		td_library(
		name = "TensorTransformOpsTdFiles",
		srcs = [
		"include/mlir/Dialect/Tensor/TransformOps/TensorTransformOps.td",
		],
		includes = ["include"],
		deps = [
		":PDLDialect",
		":TransformDialectTdFiles",
		],
		)

		gentbl_cc_library(
		name = "TensorTransformOpsIncGen",
		strip_include_prefix = "include",
		tbl_outs = [
		(
		["-gen-op-decls"],
		"include/mlir/Dialect/Tensor/TransformOps/TensorTransformOps.h.inc",
		),
		(
		["-gen-op-defs"],
		"include/mlir/Dialect/Tensor/TransformOps/TensorTransformOps.cpp.inc",
		),
		],
		tblgen = ":mlir-tblgen",
		td_file = "include/mlir/Dialect/Tensor/TransformOps/TensorTransformOps.td",
		deps = [
		":TensorTransformOpsTdFiles",
		],
		)

cc_library(		cc_library(
name = "TensorTransformOps",		name = "TensorTransformOps",
srcs = glob(["lib/Dialect/Tensor/TransformOps/*.cpp"]),		srcs = glob(["lib/Dialect/Tensor/TransformOps/*.cpp"]),
hdrs = glob(["include/mlir/Dialect/Tensor/TransformOps/*.h"]),		hdrs = glob(["include/mlir/Dialect/Tensor/TransformOps/*.h"]),
includes = ["include"],		includes = ["include"],
deps = [		deps = [
		":AffineDialect",
":IR",		":IR",
":PDLDialect",		":PDLDialect",
		":SCFDialect",
":TensorDialect",		":TensorDialect",
		":TensorTransformOpsIncGen",
		":TensorTransforms",
":TransformDialect",		":TransformDialect",
"//llvm:Support",		"//llvm:Support",
],		],
)		)

cc_library(		cc_library(
name = "Rewrite",		name = "Rewrite",
srcs = glob([		srcs = glob([
▲ Show 20 Lines • Show All 5,382 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][tensor] Add transform to make tensor.pad/empty loop-independentClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 517775

mlir/include/mlir/Dialect/Affine/Transforms/Transforms.h

mlir/include/mlir/Dialect/Tensor/CMakeLists.txt

mlir/include/mlir/Dialect/Tensor/TransformOps/CMakeLists.txt

mlir/include/mlir/Dialect/Tensor/TransformOps/TensorTransformOps.h

mlir/include/mlir/Dialect/Tensor/TransformOps/TensorTransformOps.td

mlir/include/mlir/Dialect/Tensor/Transforms/Transforms.h

mlir/include/mlir/InitAllDialects.h

mlir/include/mlir/Interfaces/ValueBoundsOpInterface.h

mlir/lib/Dialect/Affine/Transforms/ReifyValueBounds.cpp

mlir/lib/Dialect/Tensor/TransformOps/CMakeLists.txt

mlir/lib/Dialect/Tensor/TransformOps/TensorTransformOps.cpp

mlir/lib/Dialect/Tensor/Transforms/CMakeLists.txt

mlir/lib/Dialect/Tensor/Transforms/IndependenceTransforms.cpp

mlir/lib/Interfaces/ValueBoundsOpInterface.cpp

mlir/test/Dialect/Tensor/transform-op-make-loop-independent.mlir

utils/bazel/llvm-project-overlay/mlir/BUILD.bazel

[mlir][tensor] Add transform to make tensor.pad/empty loop-independent
ClosedPublic