This is an archive of the discontinued LLVM Phabricator instance.

Differential D138915

[mlir][bufferization][WIP] Make `TensorCopyInsertionPass` a test pass
ClosedPublic

Authored by springerm on Nov 29 2022, 5:55 AM.

Details

Reviewers

aartbik
Peiming
jreiffers
nicolasvasilache

Commits

rGc1fef4e88ad4: [mlir][bufferization] Make `TensorCopyInsertionPass` a test pass

Summary

TensorCopyInsertion should not have been exposed as a pass; this was a flaw in the original design. It is a preparation step for bufferization, and certain transforms that would otherwise be legal are illegal between TensorCopyInsertion and the actual rewrite to MemRef ops. Therefore, even if they are implemented as two separate steps internally, they should be exposed as a single pass.

This change affects the sparse compiler, which uses TensorCopyInsertionPass. A new SparseBufferizationPass is added to replace all passes in the sparse tensor pipeline from TensorCopyInsertionPass up to the actual bufferization (the rewrite to memref/non-tensor ops). It is generally unsafe to run arbitrary passes in between, in particular passes that hoist tensor ops out of loops or change SSA use-def chains along tensor ops.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

springerm created this revision. Nov 29 2022, 5:55 AM

Herald added a project: Restricted Project. Nov 29 2022, 5:55 AM

Herald added subscribers: jsetoain, Moerafaat, zero9178 and 22 others.

springerm requested review of this revision. Nov 29 2022, 5:55 AM

Herald added a reviewer: nicolasvasilache. Nov 29 2022, 5:55 AM

Herald added a project: Restricted Project.

Herald added subscribers: stephenneuendorffer, nicolasvasilache.

Harbormaster completed remote builds in B199997: Diff 478547. Nov 29 2022, 6:02 AM

Thanks for the fix!

This revision is now accepted and ready to land. Nov 29 2022, 6:22 AM

(Will wait for Aart's comments.)

aartbik added inline comments. Nov 30 2022, 9:23 AM

mlir/lib/Dialect/SparseTensor/Pipelines/SparseTensorPipelines.cpp
60	At very first glance, I am a bit -1 on moving these passes out of the pipeline into a bufferization pass. I understand why you are doing this but is there any way you can still do this while not breaking this flow here?
mlir/lib/Dialect/SparseTensor/Transforms/SparseBufferizationPass.cpp
1 ↗	(On Diff #478547)	name seems copy paste

Thanks Matthias, I believe that this is the correct thing to do!

mlir/lib/Dialect/SparseTensor/Transforms/SparseBufferizationPass.cpp
51 ↗	(On Diff #478547)	Is there any reason that we need to do insertTensorCopy before Sparsification in particular? If we can move all dense-related bufferization after sparsification, wouldn't that be clearer? Or am I missing anything? It seems that it could be dangerous to do optimization based on simple SSA def-use chains after InsertTensorCopies; I am a little worried that it might cause other issues in the future (or that there are other remaining issues in the current pipeline). If we somehow must perform insertTensorCopy here, what types of optimization are sound afterwards (i.e., what assumptions can we hold for the dense tensor SSA values)? Are there any general rules that we should follow?
60 ↗	(On Diff #478547)	nit: If we want to wrap these passes into the same class, I'd prefer including `createPreSparsificationRewritePass` in this pass as well, so that Pre- and Post-rewriting passes are in the same place.

The requirements are:

TensorCopyInsertion must run before sparsification, sparsetensorconversion, densebufferization (otherwise we may be missing copies).
The same pass that runs TensorCopyInsertion should also rewrite all tensor type ops to memref type ops. I.e. all tensor type ops are gone afterwards.

I wanted to keep the passes in the original pipeline but couldn’t find a way. Do you see a better layering?
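The first requirement above is an ordering constraint, and it can be made concrete with a small self-contained model. This is only a sketch: the step names are illustrative labels (not real MLIR pass arguments), and `indexOf` and `copiesInsertedFirst` are hypothetical helpers that do not exist in MLIR.

```cpp
#include <algorithm>
#include <cstddef>
#include <initializer_list>
#include <string>
#include <vector>

// A pipeline is modeled as an ordered list of step labels. The labels are
// illustrative only; they are not real MLIR pass names.
using Pipeline = std::vector<std::string>;

// Returns the position of `step` in `pipeline`, or pipeline.size() if absent.
std::size_t indexOf(const Pipeline &pipeline, const std::string &step) {
  return static_cast<std::size_t>(
      std::find(pipeline.begin(), pipeline.end(), step) - pipeline.begin());
}

// Requirement 1 from the discussion: tensor copy insertion must run before
// sparsification, sparse tensor conversion and dense bufferization, because
// otherwise copies may be missing.
bool copiesInsertedFirst(const Pipeline &pipeline) {
  const std::size_t copy = indexOf(pipeline, "tensor-copy-insertion");
  if (copy == pipeline.size())
    return false; // Copy insertion never runs at all.
  for (const char *later : {"sparsification", "sparse-tensor-conversion",
                            "dense-bufferization"}) {
    const std::size_t pos = indexOf(pipeline, later);
    if (pos != pipeline.size() && pos < copy)
      return false; // A rewriting step ran before the copies were inserted.
  }
  return true;
}
```

The second requirement (no tensor ops may survive the pass that inserted the copies) is not captured by this check; it is the reason the steps end up inside a single pass rather than being validated from the outside.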

Is there any reason that we need to do insertTensorCopy before Sparsification

Sparsification replaces the linalg.generic ops with loops, right? Maybe even loops on memrefs? We cannot analyze these anymore. TensorCopyInsertion must run before any memrefs are introduced.

It seems that it could be dangerous to do optimization based on simple SSA def-use chain after InsertTensorCopies

No more analyses happen after TensorCopyInsertion; in particular, no bufferization analysis.

If we somehow must perform insertTensorCopy here, what types of optimization are sound afterwards (i.e., what assumptions can we hold for the dense tensor SSA values)? Are there any general rules that we should follow?

The short answer is none. Ideally, do not touch the IR apart from rewriting tensor ops to memref ops. For that reason, the dense bufferization does an Operation::walk over the IR instead of using the greedy pattern rewriter: the greedy pattern rewriter could call folding patterns, which are unsafe. Basically, only BufferizableOpInterface::bufferize (and its sparse equivalent) is allowed.
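To illustrate the "walk once, rewrite in place, fold nothing" idea from this answer, here is a minimal toy model. It is a sketch under loud assumptions: `ToyOp`, `bufferizeOne`, `bufferizeByWalk` and `countTensorOps` are hypothetical names, and none of this is the MLIR `Operation` or `BufferizableOpInterface` API.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Toy stand-in for an operation: a name plus a flag saying whether it still
// operates on tensors. This is NOT the MLIR Operation class.
struct ToyOp {
  std::string name;
  bool onTensors;
};

// Hypothetical analogue of a per-op bufferize hook: rewrite one tensor op to
// its memref form and touch nothing else.
void bufferizeOne(ToyOp &op) {
  if (!op.onTensors)
    return;
  op.name += " [bufferized]";
  op.onTensors = false;
}

// Single walk over the IR: each op is visited exactly once. No folding, no
// erasure, no fixed-point iteration, so copies inserted earlier stay put.
void bufferizeByWalk(std::vector<ToyOp> &ops) {
  for (ToyOp &op : ops)
    bufferizeOne(op);
}

// Counts the ops that still operate on tensors.
std::size_t countTensorOps(const std::vector<ToyOp> &ops) {
  std::size_t count = 0;
  for (const ToyOp &op : ops)
    if (op.onTensors)
      ++count;
  return count;
}
```

A fixed-point driver that also applies folding could, in contrast, remove or forward an op that exists only because the analysis asked for a copy, which is the hazard described above.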

Thanks for finding a solution, Matthias. Some feedback below, but assuming no two-pass solution can be made, I would at least like to see this documented and using a better "pass" name for the compound pass.

mlir/lib/Dialect/SparseTensor/Transforms/SparseBufferizationPass.cpp
1 ↗	(On Diff #478547)	That was perhaps a bit vague, but I mean that after the renaming that should be "SparseBufferizationPass.cpp"
34 ↗	(On Diff #478547)	I gave this another good look, and grouping this all together is probably a good way to make sure things happen in the right order and with the right information. I guess I objected a little to creating a "mini" pipeline as part of the larger pipeline (or perhaps a "super" pass on top of our other passes). If you still see a way of putting the two parts you need into two passes and keeping the original pipeline, that would have my preference. But guessing that is not feasible, I think my other objection came from the name SparseBufferizationPass. I am okay with this change if (1) we give this a better name, like SparsificationPass (which is heavily overloaded by now, so I am very open to better names), and (2) we document really well that this is grouping stuff together to form a logically consistent unit (one that does all the rewriting and bufferization "atomically" and cannot be split into clean passes).
60 ↗	(On Diff #478547)	+1 on keeping the pre/post together

springerm added inline comments. Dec 1 2022, 6:11 AM

mlir/lib/Dialect/SparseTensor/Transforms/SparseBufferizationPass.cpp
60 ↗	(On Diff #478547)	`PreSparsificationRewriting` contains a pattern that should not run after `TensorCopyInsertion`: `FoldInvariantYield`. This was the reason for one of the miscompiles that we were seeing. Can we rename `PreSparsificationRewriting` and `PostSparsificationRewriting` into something more descriptive (that ideally describes what the pass is actually doing)?

springerm added inline comments. Dec 1 2022, 6:25 AM

mlir/lib/Dialect/SparseTensor/Transforms/SparseBufferizationPass.cpp
34 ↗	(On Diff #478547)	Imo `SparsificationPass` does not describe it very accurately because the pass is doing more than that. This pass basically converts a tensor program into a memref program. In particular, it also handles dense tensor IR. But this is just naming and for you to decide. Some ideas that came to mind: (a) `sparse_tensor::SparseAndDenseBufferizationPass`; (b) `sparse_tensor::BufferizationPass`; (c) `sparse_tensor::ConvertTensorToMemrefPass`; (d) `sparse_tensor::SparsificationPass`, where we mention in the pass description that it also does other stuff. If we go with `SparsificationPass`: there is already a `SparsificationPass`, so we would have to rename that one. I always thought of it as `ConvertSparseLinalgToLoopsPass`, because there is also a `ConvertLinalgToLoopsPass` which does the same thing, just for the dense case.
92–96 ↗	(On Diff #478547)	Do you see a better way than having all these options here as a field? I wanted to use `SparseCompilerOptions`, but it is defined in `Pipelines/Passes.h` (not in `Transforms`). I guess I could create a new `SparseBufferizationOptions` (or whatever we call the pass).
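The options-struct alternative raised in this comment could look roughly like the sketch below. Everything here is hypothetical: the `Toy*` types are empty stand-ins for the real MLIR option classes, `ToySparseBufferizationOptions` is an invented name, and the default values are arbitrary. Only the field names `enableRuntimeLibrary`, `enableBufferInitialization` and `testAnalysisOnly` come from the diff.

```cpp
// Hypothetical stand-ins; the real option types live in MLIR headers.
struct ToyBufferizationOptions {
  bool testAnalysisOnly = false;
};
struct ToySparsificationOptions {};
struct ToySparseTensorConversionOptions {};

// One bundle instead of five separate constructor parameters and fields.
struct ToySparseBufferizationOptions {
  ToyBufferizationOptions bufferization;
  ToySparsificationOptions sparsification;
  ToySparseTensorConversionOptions conversion;
  bool enableRuntimeLibrary = true;        // default chosen arbitrarily here
  bool enableBufferInitialization = false; // default chosen arbitrarily here
};

// Toy compound pass that stores the whole bundle as a single field.
class ToyCompoundPass {
public:
  explicit ToyCompoundPass(const ToySparseBufferizationOptions &options)
      : options(options) {}

  bool usesRuntimeLibrary() const { return options.enableRuntimeLibrary; }
  bool stopsAfterAnalysis() const {
    return options.bufferization.testAnalysisOnly;
  }

private:
  ToySparseBufferizationOptions options;
};
```

The trade-off is the usual one: a bundle keeps the constructor signature stable when options are added, at the cost of one more type to define and keep in the right header.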

Peiming added inline comments. Dec 1 2022, 10:02 AM

mlir/lib/Dialect/SparseTensor/Transforms/SparseBufferizationPass.cpp
60 ↗	(On Diff #478547)	It should still be okay to put `PreSparsificationRewriting` into the class (before `TensorCopyInsertion`)? But I think you are right about renaming; that is probably a better way to make things clearer. I think `FoldInvariantYield` is there to enable `FuseSparseMultiplyOverAdd`, so maybe we can just call it `FuseSparseOperations`? I do not have a good name for `PostSparsificationRewriting` though. @aartbik What do you think?

Okay, this is LGTM provided that
(1) we name the new "mini"pipeline, sparse_tensor::SparsificationAndBufferizationPass
(2) we move PreSparsificationRewriting into that pass, running first, but just so they are all logically contained
(3) we keep the Pre/PostSparsificationRewriting names; nothing is perfect but these names reflect the current function reasonably well

wrengr added inline comments. Dec 1 2022, 6:58 PM

mlir/lib/Dialect/SparseTensor/Pipelines/SparseTensorPipelines.cpp
59–61	I wonder if this is still relevant here? Before this differential this conditional bailed out before doing all our various sparse passes, but now it's only doing so after all of `SparseBufferizationPass`. So I'm thinking the conditional should either be moved into `SparseBufferizationPass` or else removed entirely
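The point about where the analysis-only bail-out belongs can be sketched with a toy model of the compound pass. This is an illustration under assumptions: `runCompoundPass` is a hypothetical function, the step labels are not real pass names, and placing dense bufferization last is an assumption (the order of the earlier steps follows the old pipeline shown in the diff).

```cpp
#include <string>
#include <vector>

// Returns the labels of the steps that would run, in order. When
// `testAnalysisOnly` is set, the pass stops right after copy insertion so
// that the annotated analysis results are left in the IR untouched.
std::vector<std::string> runCompoundPass(bool testAnalysisOnly,
                                         bool enableRuntimeLibrary) {
  std::vector<std::string> executed;
  executed.push_back("pre-sparsification-rewrite");
  executed.push_back("insert-tensor-copies");
  if (testAnalysisOnly)
    return executed; // Early exit lives inside the compound pass.

  executed.push_back("sparsification");
  executed.push_back("post-sparsification-rewrite");
  if (enableRuntimeLibrary) {
    executed.push_back("sparse-tensor-conversion");
  } else {
    executed.push_back("sparse-tensor-codegen");
    executed.push_back("sparse-buffer-rewrite");
  }
  // Assumed to run last; see the lead-in above.
  executed.push_back("dense-bufferization");
  return executed;
}
```

With the check inside the pass, an outer pipeline that also tests the flag only skips the passes that come after the compound pass.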

address comments

Harbormaster completed remote builds in B200732: Diff 479569. Dec 2 2022, 4:17 AM

This revision was landed with ongoing or failed builds. Dec 2 2022, 6:39 AM

Closed by commit rGc1fef4e88ad4: [mlir][bufferization] Make `TensorCopyInsertionPass` a test pass (authored by springerm).

This revision was automatically updated to reflect the committed changes.

springerm added a commit: rGc1fef4e88ad4: [mlir][bufferization] Make `TensorCopyInsertionPass` a test pass.

Revision Contents

Path	Size
mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.h	5 lines
mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.td	31 lines
mlir/include/mlir/Dialect/SparseTensor/Transforms/Passes.h	7 lines
mlir/lib/Dialect/Bufferization/Transforms/TensorCopyInsertion.cpp	42 lines
mlir/lib/Dialect/SparseTensor/Pipelines/SparseTensorPipelines.cpp	21 lines
mlir/lib/Dialect/SparseTensor/Transforms/CMakeLists.txt	4 lines
mlir/lib/Dialect/SparseTensor/Transforms/DenseBufferizationPass.cpp	(no size listed)
mlir/lib/Dialect/SparseTensor/Transforms/SparsificationAndBufferizationPass.cpp	155 lines
mlir/test/Dialect/Bufferization/Transforms/tensor-copy-insertion-memory-space-invalid.mlir	2 lines
mlir/test/Dialect/Bufferization/Transforms/tensor-copy-insertion-memory-space.mlir	2 lines
mlir/test/Dialect/Bufferization/Transforms/tensor-copy-insertion.mlir	6 lines
mlir/test/Dialect/SCF/one-shot-bufferize-tensor-copy-insertion.mlir	4 lines
mlir/test/Dialect/SparseTensor/one_shot_bufferize_tensor_copy_insertion.mlir	4 lines
mlir/test/Dialect/SparseTensor/sparse_sddmm.mlir	2 lines
mlir/test/Dialect/Tensor/one-shot-bufferize-tensor-copy-insertion.mlir	4 lines
mlir/test/lib/Dialect/Bufferization/CMakeLists.txt	12 lines
mlir/test/lib/Dialect/Bufferization/TestTensorCopyInsertion.cpp	78 lines
mlir/test/lib/Dialect/CMakeLists.txt	1 line
mlir/tools/mlir-opt/CMakeLists.txt	1 line
mlir/tools/mlir-opt/mlir-opt.cpp	2 lines
utils/bazel/llvm-project-overlay/mlir/BUILD.bazel	1 line
utils/bazel/llvm-project-overlay/mlir/test/BUILD.bazel	13 lines

Diff 479618

mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.h

[91 lines not shown]

 /// Create a pass that tries to eliminate tensor.empty ops that are anchored on
 /// insert_slice ops.
 std::unique_ptr<Pass> createEmptyTensorEliminationPass();

 /// Create a pass that bufferizes ops from the bufferization dialect.
 std::unique_ptr<Pass> createBufferizationBufferizePass();

-/// Create a pass that resolves out-of-place tensor OpOperands with copies.
-std::unique_ptr<Pass> createTensorCopyInsertionPass();
-std::unique_ptr<Pass>
-createTensorCopyInsertionPass(const OneShotBufferizationOptions &options);
-
 //===----------------------------------------------------------------------===//
 // Registration
 //===----------------------------------------------------------------------===//

 /// Register external models for AllocationOpInterface.
 void registerAllocationOpInterfaceExternalModels(DialectRegistry &registry);

 /// Generate the code for registering passes.
 #define GEN_PASS_REGISTRATION
 #include "mlir/Dialect/Bufferization/Transforms/Passes.h.inc"

 } // namespace bufferization
 } // namespace mlir

 #endif // MLIR_DIALECT_BUFFERIZATION_TRANSFORMS_PASSES_H

mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.td

[334 lines not shown; context: `Option<"maxAllocSizeInBytes", "max-alloc-size-in-bytes", "unsigned",`]
            /*default=*/"1024",
            "Maximal size in bytes to promote allocations to stack.">,
     Option<"maxRankOfAllocatedMemRef", "max-rank-of-allocated-memref", "unsigned",
            /*default=*/"1",
            "Maximal memref rank to promote dynamic buffers.">,
   ];
 }

-def TensorCopyInsertion : Pass<"tensor-copy-insertion"> {
-  let summary = "Make all tensor IR inplaceable by inserting copies";
-  let description = [{
-    This pass runs One-Shot Analysis and inserts copies for all OpOperands that
-    were decided to bufferize out-of-place. After running this pass, a
-    bufferization can write to buffers directly (without making copies) and no
-    longer has to care about potential read-after-write conflicts.
-
-    Note: By default, all newly inserted tensor copies/allocs (i.e., newly
-    created `bufferization.alloc_tensor` ops) that do not escape block are
-    annotated with `escape = false`. If `create-allocs` is unset, all newly
-    inserted tensor copies/allocs are annotated with `escape = true`. In that
-    case, they are not getting deallocated when bufferizing the IR.
-  }];
-  let options = [
-    Option<"allowReturnAllocs", "allow-return-allocs", "bool",
-           /*default=*/"false",
-           "Allows returning/yielding new allocations from a block.">,
-    Option<"bufferizeFunctionBoundaries", "bufferize-function-boundaries",
-           "bool", /*default=*/"0",
-           "Bufferize function boundaries (experimental).">,
-    Option<"createDeallocs", "create-deallocs", "bool", /*default=*/"true",
-           "Specify if new allocations should be deallocated.">,
-    Option<"mustInferMemorySpace", "must-infer-memory-space", "bool",
-           /*default=*/"false",
-           "The memory space of an memref types must always be inferred. If "
-           "unset, a default memory space of 0 is used otherwise.">,
-  ];
-  let constructor = "mlir::bufferization::createTensorCopyInsertionPass()";
-}
-
 def EmptyTensorElimination : Pass<"eliminate-empty-tensors"> {
   let summary = "Try to eliminate all tensor.empty ops.";
   let description = [{
     This pass tries to eliminate all insert_slice op-anchored tensor.empty ops.
     I.e., when a value that is equivalent to an tensor.empty op is inserted into
     another tensor, this pass tries to rewrite the IR in such a way that the
     destination tensor of the insert_slice op is used directly instead of the
     tensor.empty result.
   }];
   let constructor = "mlir::bufferization::createEmptyTensorEliminationPass()";
 }

 #endif // MLIR_DIALECT_BUFFERIZATION_TRANSFORMS_PASSES

mlir/include/mlir/Dialect/SparseTensor/Transforms/Passes.h

[156 lines not shown]
 std::unique_ptr<Pass>
 createPostSparsificationRewritePass(bool enableRT, bool enableForeach = true,
                                     bool enableConvert = true);

 //===----------------------------------------------------------------------===//
 // Other rewriting rules and passes.
 //===----------------------------------------------------------------------===//

-std::unique_ptr<Pass> createDenseBufferizationPass(
-    const bufferization::OneShotBufferizationOptions &options);
+std::unique_ptr<Pass> createSparsificationAndBufferizationPass(
+    const bufferization::OneShotBufferizationOptions &bufferizationOptions,
+    const SparsificationOptions &sparsificationOptions,
+    const SparseTensorConversionOptions &sparseTensorConversionOptions,
+    bool enableRuntimeLibrary, bool enableBufferInitialization);

 void populateSparseBufferRewriting(RewritePatternSet &patterns,
                                    bool enableBufferInitialization);

 std::unique_ptr<Pass> createSparseBufferRewritePass();
 std::unique_ptr<Pass>
 createSparseBufferRewritePass(bool enableBufferInitialization);

[21 lines not shown]

mlir/lib/Dialect/Bufferization/Transforms/TensorCopyInsertion.cpp

[155 lines not shown; context: `WalkResult result = op->walk([&](Operation *op) {`]
     if (failed(bufferizableOp.resolveConflicts(rewriter, state)))
       return WalkResult::interrupt();

     return WalkResult::advance();
   });

   return failure(result.wasInterrupted());
 }
-
-namespace {
-struct TensorCopyInsertionPass
-    : public bufferization::impl::TensorCopyInsertionBase<
-          TensorCopyInsertionPass> {
-  TensorCopyInsertionPass() : options(llvm::None) {}
-  TensorCopyInsertionPass(const OneShotBufferizationOptions &options)
-      : options(options) {}
-
-  void getDependentDialects(DialectRegistry &registry) const override {
-    registry.insert<bufferization::BufferizationDialect>();
-  }
-
-  void runOnOperation() override {
-    if (options) {
-      if (failed(insertTensorCopies(getOperation(), *options)))
-        signalPassFailure();
-    } else {
-      OneShotBufferizationOptions options;
-      options.allowReturnAllocs = allowReturnAllocs;
-      options.bufferizeFunctionBoundaries = bufferizeFunctionBoundaries;
-      options.createDeallocs = createDeallocs;
-      if (mustInferMemorySpace)
-        options.defaultMemorySpace = None;
-      if (failed(insertTensorCopies(getOperation(), options)))
-        signalPassFailure();
-    }
-  }
-
-private:
-  Optional<OneShotBufferizationOptions> options;
-};
-} // namespace
-
-std::unique_ptr<Pass> mlir::bufferization::createTensorCopyInsertionPass() {
-  return std::make_unique<TensorCopyInsertionPass>();
-}
-
-std::unique_ptr<Pass> mlir::bufferization::createTensorCopyInsertionPass(
-    const OneShotBufferizationOptions &options) {
-  return std::make_unique<TensorCopyInsertionPass>(options);
-}

mlir/lib/Dialect/SparseTensor/Pipelines/SparseTensorPipelines.cpp

[46 lines not shown]

 //===----------------------------------------------------------------------===//
 // Pipeline implementation.
 //===----------------------------------------------------------------------===//

 void mlir::sparse_tensor::buildSparseCompiler(
     OpPassManager &pm, const SparseCompilerOptions &options) {
   pm.addNestedPass<func::FuncOp>(createLinalgGeneralizationPass());
-  pm.addPass(
-      bufferization::createTensorCopyInsertionPass(getBufferizationOptions(
-          /*analysisOnly=*/options.testBufferizationAnalysisOnly)));
+  pm.addPass(createSparsificationAndBufferizationPass(
+      getBufferizationOptions(options.testBufferizationAnalysisOnly),
+      options.sparsificationOptions(), options.sparseTensorConversionOptions(),
+      options.enableRuntimeLibrary, options.enableBufferInitialization));
   if (options.testBufferizationAnalysisOnly)
     return;
-  pm.addPass(createPreSparsificationRewritePass());
-  pm.addPass(createSparsificationPass(options.sparsificationOptions()));
-  pm.addPass(createPostSparsificationRewritePass(options.enableRuntimeLibrary));
-  if (options.enableRuntimeLibrary) {
-    pm.addPass(createSparseTensorConversionPass(
-        options.sparseTensorConversionOptions()));
-  } else {
-    pm.addPass(
-        createSparseTensorCodegenPass(options.enableBufferInitialization));
-    pm.addPass(
-        createSparseBufferRewritePass(options.enableBufferInitialization));
-  }
-  pm.addPass(createDenseBufferizationPass(
-      getBufferizationOptions(/*analysisOnly=*/false)));
   pm.addNestedPass<func::FuncOp>(createCanonicalizerPass());
   pm.addNestedPass<func::FuncOp>(
       mlir::bufferization::createFinalizingBufferizePass());
   // TODO(springerm): Add sparse support to the BufferDeallocation pass and add
   // it to this pipeline.
   pm.addNestedPass<func::FuncOp>(createConvertLinalgToLoopsPass());
   pm.addNestedPass<func::FuncOp>(createConvertVectorToSCFPass());
   pm.addNestedPass<func::FuncOp>(createConvertSCFToCFPass());
   pm.addPass(createLowerAffinePass());
[24 lines not shown]

mlir/lib/Dialect/SparseTensor/Transforms/CMakeLists.txt

 add_mlir_dialect_library(MLIRSparseTensorTransforms
   BufferizableOpInterfaceImpl.cpp
   CodegenUtils.cpp
-  DenseBufferizationPass.cpp
-  Sparsification.cpp
   SparseBufferRewriting.cpp
   SparseTensorCodegen.cpp
   SparseTensorConversion.cpp
   SparseTensorPasses.cpp
   SparseTensorRewriting.cpp
   SparseVectorization.cpp
+  Sparsification.cpp
+  SparsificationAndBufferizationPass.cpp

   ADDITIONAL_HEADER_DIRS
   ${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/SparseTensor

   DEPENDS
   MLIRSparseTensorPassIncGen

   LINK_LIBS PUBLIC
[21 lines not shown]

mlir/lib/Dialect/SparseTensor/Transforms/DenseBufferizationPass.cpp

This file was deleted.

	//===- DenseBufferizationPass.cpp - Dense bufferization pass --------------===//
	//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//
	//===----------------------------------------------------------------------===//

	#include "mlir/Dialect/SparseTensor/Transforms/Passes.h"

	#include "mlir/Dialect/Bufferization/IR/BufferizableOpInterface.h"
	#include "mlir/Dialect/Bufferization/Transforms/Bufferize.h"
	#include "mlir/Dialect/Bufferization/Transforms/OneShotAnalysis.h"
	#include "mlir/Dialect/Func/IR/FuncOps.h"
	#include "mlir/Dialect/SparseTensor/IR/SparseTensor.h"

	using namespace mlir;
	using namespace mlir::func;

	namespace mlir {
	namespace sparse_tensor {

	/// Return `true` if one of the given types is a sparse tensor type.
	static bool containsSparseTensor(TypeRange types) {
	for (Type t : types)
	if (getSparseTensorEncoding(t))
	return true;
	return false;
	}

	/// A pass that bufferizes only dense tensor ops and ignores all sparse tensor
	/// ops. No buffer copies are inserted. All tensor OpOperands must be
	/// inplacable.
	class BufferizeDenseOpsPass
	: public PassWrapper<BufferizeDenseOpsPass, OperationPass<ModuleOp>> {
	public:
	BufferizeDenseOpsPass(
	const bufferization::OneShotBufferizationOptions &options)
	: options(options) {}

	void runOnOperation() override {
	// Disallow all sparse tensor ops, so that only dense tensor ops are
	// bufferized.
	bufferization::OpFilter opFilter;
	opFilter.allowOperation([&](Operation *op) {
	if (containsSparseTensor(TypeRange(op->getResults())) ||
	containsSparseTensor(TypeRange(op->getOperands())))
	return false;
	if (auto funcOp = dyn_cast<func::FuncOp>(op)) {
	FunctionType funcType = funcOp.getFunctionType();
	if (containsSparseTensor(funcType.getInputs()) ||
	containsSparseTensor(funcType.getResults()))
	return false;
	}
	return true;
	});

	if (failed(bufferization::bufferizeOp(getOperation(), options,
	/*copyBeforeWrite=*/false,
	&opFilter)))
	signalPassFailure();
	}

	private:
	bufferization::OneShotBufferizationOptions options;
	};
	} // namespace sparse_tensor
	} // namespace mlir

	std::unique_ptr<Pass> mlir::createDenseBufferizationPass(
	const bufferization::OneShotBufferizationOptions &options) {
	return std::make_unique<mlir::sparse_tensor::BufferizeDenseOpsPass>(options);
	}

mlir/lib/Dialect/SparseTensor/Transforms/SparsificationAndBufferizationPass.cpp

This file was added.

				//===- SparsificationAndBufferizationPass.cpp - Tensor to Memref Lowering -===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/Dialect/SparseTensor/Transforms/Passes.h"

				#include "mlir/Dialect/Bufferization/IR/BufferizableOpInterface.h"
				#include "mlir/Dialect/Bufferization/Transforms/Bufferize.h"
				#include "mlir/Dialect/Bufferization/Transforms/OneShotAnalysis.h"
				#include "mlir/Dialect/Bufferization/Transforms/Transforms.h"
				#include "mlir/Dialect/Func/IR/FuncOps.h"
				#include "mlir/Dialect/SparseTensor/IR/SparseTensor.h"
				#include "mlir/Dialect/SparseTensor/Transforms/Passes.h"
				#include "mlir/Pass/PassManager.h"

				using namespace mlir;
				using namespace mlir::func;

				namespace mlir {
				namespace sparse_tensor {

				/// Return `true` if one of the given types is a sparse tensor type.
				static bool containsSparseTensor(TypeRange types) {
				for (Type t : types)
				if (getSparseTensorEncoding(t))
				return true;
				return false;
				}

				/// A pass that lowers tensor ops to memref ops, regardless of whether they are
				/// dense or sparse.
				///
				/// One-Shot Analysis is used to detect RaW conflicts and to insert buffer
				/// copies of the tensor level (`insertTensorCopies`). Afterwards, the lowering
				/// of tensor ops to memref ops follows a different code path depending on
				/// whether the op is sparse or dense:
				///
				/// * Sparse tensor ops are lowered through Sparsification and follow-up pass
				/// that lowers sparse_tensor dialect ops.
				/// * Dense tensor ops are lowered through BufferizableOpInterface
				/// implementations.
				class SparsificationAndBufferizationPass
				: public PassWrapper<SparsificationAndBufferizationPass,
				OperationPass<ModuleOp>> {
				public:
				SparsificationAndBufferizationPass(
				const bufferization::OneShotBufferizationOptions &bufferizationOptions,
				const SparsificationOptions &sparsificationOptions,
				const SparseTensorConversionOptions &sparseTensorConversionOptions,
				bool enableRuntimeLibrary, bool enableBufferInitialization)
				: bufferizationOptions(bufferizationOptions),
				sparsificationOptions(sparsificationOptions),
				sparseTensorConversionOptions(sparseTensorConversionOptions),
				enableRuntimeLibrary(enableRuntimeLibrary),
				enableBufferInitialization(enableBufferInitialization) {}

				/// Bufferize all dense ops. This assumes that no further analysis is needed
				/// and that all required buffer copies were already inserted by
				/// `insertTensorCopies` in the form of `bufferization.alloc_tensor` ops.
				LogicalResult runDenseBufferization() {
				bufferization::OpFilter denseOpFilter;
				denseOpFilter.allowOperation([&](Operation *op) {
				if (containsSparseTensor(TypeRange(op->getResults())) ||
				containsSparseTensor(TypeRange(op->getOperands())))
				return false;
				if (auto funcOp = dyn_cast<func::FuncOp>(op)) {
				FunctionType funcType = funcOp.getFunctionType();
				if (containsSparseTensor(funcType.getInputs()) ||
				containsSparseTensor(funcType.getResults()))
				return false;
				}
				return true;
				});
				return bufferization::bufferizeOp(getOperation(), bufferizationOptions,
				/*copyBeforeWrite=*/false,
				&denseOpFilter);
				}

				void runOnOperation() override {
				{
				// Run enabling transformations.
				OpPassManager pm("builtin.module");
				pm.addPass(createPreSparsificationRewritePass());
				if (failed(runPipeline(pm, getOperation())))
				return signalPassFailure();
				}

				// Insert tensor copies. This step runs One-Shot Analysis (which analyzes
				// SSA use-def chains of tensor IR) and decides where buffer copies are
				// needed and where buffers can be written to in-place. These decisions are
				// materialized in the IR in the form of `bufferization.alloc_tensor` ops.
				//
				// Note: All following steps in this pass must be careful not to modify the
				// structure of the IR (i.e., tensor use-def chains), as that could
				// invalidate the results of the analysis. From now on, only small and
				// localized rewrites are allowed, such as replacing a tensor op with its
				// memref equivalent.
				if (failed(bufferization::insertTensorCopies(getOperation(),
				bufferizationOptions)))
				return signalPassFailure();

				// `testAnalysisOnly` is a debug/testing flag. If set, the results of
				// OneShotAnalysis are added to the IR via attributes. In that case, do not
				// continue with the remaining pipeline.
				if (bufferizationOptions.testAnalysisOnly)
				return;

				// Bufferize all sparse ops. No further analysis is needed. All required
				// buffer copies were already inserted by `insertTensorCopies` in the form
				// of `bufferization.alloc_tensor` ops.
				{
				OpPassManager pm("builtin.module");
				pm.addPass(createSparsificationPass(sparsificationOptions));
				pm.addPass(createPostSparsificationRewritePass(enableRuntimeLibrary));
				if (enableRuntimeLibrary) {
				pm.addPass(
				createSparseTensorConversionPass(sparseTensorConversionOptions));
				} else {
				pm.addPass(createSparseTensorCodegenPass(enableBufferInitialization));
				pm.addPass(createSparseBufferRewritePass(enableBufferInitialization));
				}
				if (failed(runPipeline(pm, getOperation())))
				return signalPassFailure();
				}

				// Bufferize all dense ops.
				if (failed(runDenseBufferization()))
				signalPassFailure();
				}

				private:
				bufferization::OneShotBufferizationOptions bufferizationOptions;
				SparsificationOptions sparsificationOptions;
				SparseTensorConversionOptions sparseTensorConversionOptions;
				bool enableRuntimeLibrary;
				bool enableBufferInitialization;
				};
				} // namespace sparse_tensor
				} // namespace mlir

				std::unique_ptr<Pass> mlir::createSparsificationAndBufferizationPass(
				const bufferization::OneShotBufferizationOptions &bufferizationOptions,
				const SparsificationOptions &sparsificationOptions,
				const SparseTensorConversionOptions &sparseTensorConversionOptions,
				bool enableRuntimeLibrary, bool enableBufferInitialization) {
				return std::make_unique<
				mlir::sparse_tensor::SparsificationAndBufferizationPass>(
				bufferizationOptions, sparsificationOptions,
				sparseTensorConversionOptions, enableRuntimeLibrary,
				enableBufferInitialization);
				}
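
The dense op filter in `runDenseBufferization` can be read as a plain predicate: an op is left to the dense path only if no sparse tensor appears among its operands, its results, or (for functions) its signature. The following is a minimal standalone sketch of that predicate, not MLIR code. `ToyType`, `ToyFuncSignature`, `ToyOp`, `containsSparse`, `allowedByDenseFilter` and `demoDenseFilter` are hypothetical names introduced only for illustration.

```cpp
#include <cassert>
#include <optional>
#include <vector>

// Hypothetical stand-ins for MLIR types and ops; this is not the MLIR API.
struct ToyType {
  bool isSparseTensor = false;
};

struct ToyFuncSignature {
  std::vector<ToyType> inputs;
  std::vector<ToyType> results;
};

struct ToyOp {
  std::vector<ToyType> operands;
  std::vector<ToyType> results;
  // Set only when the op models a function definition.
  std::optional<ToyFuncSignature> funcSignature;
};

// Plays the role that `containsSparseTensor` plays in the pass.
bool containsSparse(const std::vector<ToyType> &types) {
  for (const ToyType &type : types)
    if (type.isSparseTensor)
      return true;
  return false;
}

// Returns true if the op would be handled by the dense bufferization step.
bool allowedByDenseFilter(const ToyOp &op) {
  if (containsSparse(op.results) || containsSparse(op.operands))
    return false;
  if (op.funcSignature && (containsSparse(op.funcSignature->inputs) ||
                           containsSparse(op.funcSignature->results)))
    return false;
  return true;
}

// Small usage example covering the three cases of the predicate.
void demoDenseFilter() {
  ToyType dense{false};
  ToyType sparse{true};
  ToyOp denseOp{{dense}, {dense}, std::nullopt};
  ToyOp mixedOp{{dense, sparse}, {dense}, std::nullopt};
  ToyOp sparseFunc{{}, {}, ToyFuncSignature{{sparse}, {dense}}};
  assert(allowedByDenseFilter(denseOp));
  assert(!allowedByDenseFilter(mixedOp));
  assert(!allowedByDenseFilter(sparseFunc));
}
```

Ops rejected by this predicate are the ones the sparse pipeline is expected to have rewritten before the dense step runs.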

mlir/test/Dialect/Bufferization/Transforms/tensor-copy-insertion-memory-space-invalid.mlir

	// RUN: mlir-opt %s -tensor-copy-insertion="must-infer-memory-space" -split-input-file -verify-diagnostics			// RUN: mlir-opt %s -test-tensor-copy-insertion="must-infer-memory-space" -split-input-file -verify-diagnostics

	// An alloc is inserted but the copy is emitted. Therefore, the memory space			// An alloc is inserted but the copy is emitted. Therefore, the memory space
	// should be specified on the alloc_tensor op.			// should be specified on the alloc_tensor op.
	func.func @memory_space_of_unknown_op() -> (tensor<10xf32>, tensor<10xf32>) {			func.func @memory_space_of_unknown_op() -> (tensor<10xf32>, tensor<10xf32>) {
	%c0 = arith.constant 0 : index			%c0 = arith.constant 0 : index
	%cst = arith.constant 0.0 : f32			%cst = arith.constant 0.0 : f32
	// expected-error @+1 {{could not infer memory space}}			// expected-error @+1 {{could not infer memory space}}
	%t = bufferization.alloc_tensor() : tensor<10xf32>			%t = bufferization.alloc_tensor() : tensor<10xf32>
	%s = tensor.insert %cst into %t[%c0] : tensor<10xf32>			%s = tensor.insert %cst into %t[%c0] : tensor<10xf32>
	return %s, %t : tensor<10xf32>, tensor<10xf32>			return %s, %t : tensor<10xf32>, tensor<10xf32>
	}			}
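
The error expected by this test follows from how the test pass handles `must-infer-memory-space`: it clears the default memory space, so an allocation with nothing to infer from has no memory space at all. The following is a rough standalone model of that behaviour, assuming the default is an optional value. `ToyOptions`, `makeOptions`, `resolveMemorySpace` and `demoMemorySpace` are hypothetical names and do not reflect the real options struct.

```cpp
#include <cassert>
#include <optional>

// Hypothetical model of the relevant option; not the real MLIR options struct.
struct ToyOptions {
  // A default memory space of 0 is used unless it is cleared.
  std::optional<unsigned> defaultMemorySpace = 0;
};

// Models the `must-infer-memory-space` handling of the test pass: clearing
// the default means every allocation needs an inferable memory space.
ToyOptions makeOptions(bool mustInferMemorySpace) {
  ToyOptions options;
  if (mustInferMemorySpace)
    options.defaultMemorySpace = std::nullopt;
  return options;
}

// Returns the memory space for a new allocation, or std::nullopt when it can
// be neither inferred nor defaulted (the "could not infer" case).
std::optional<unsigned> resolveMemorySpace(std::optional<unsigned> inferred,
                                           const ToyOptions &options) {
  if (inferred)
    return inferred;
  return options.defaultMemorySpace;
}

// Small usage example: nothing to infer from, with and without the flag.
void demoMemorySpace() {
  assert(resolveMemorySpace(std::nullopt, makeOptions(false)) == 0u);
  assert(!resolveMemorySpace(std::nullopt, makeOptions(true)).has_value());
}
```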

mlir/test/Dialect/Bufferization/Transforms/tensor-copy-insertion-memory-space.mlir

	// RUN: mlir-opt %s -tensor-copy-insertion="must-infer-memory-space" -split-input-file | FileCheck %s			// RUN: mlir-opt %s -test-tensor-copy-insertion="must-infer-memory-space" -split-input-file | FileCheck %s

	// CHECK-LABEL: func @unknown_op_copy			// CHECK-LABEL: func @unknown_op_copy
	func.func @unknown_op_copy() -> (tensor<10xf32>, tensor<10xf32>) {			func.func @unknown_op_copy() -> (tensor<10xf32>, tensor<10xf32>) {
	%c0 = arith.constant 0 : index			%c0 = arith.constant 0 : index
	%cst = arith.constant 0.0 : f32			%cst = arith.constant 0.0 : f32
	// CHECK: %[[dummy:.*]] = "test.dummy_op"() : () -> tensor<10xf32>			// CHECK: %[[dummy:.*]] = "test.dummy_op"() : () -> tensor<10xf32>
	%t = "test.dummy_op"() : () -> tensor<10xf32>			%t = "test.dummy_op"() : () -> tensor<10xf32>
	// CHECK: %[[copy:.*]] = bufferization.alloc_tensor() copy(%[[dummy]]) {bufferization.escape = [false]} : tensor<10xf32>			// CHECK: %[[copy:.*]] = bufferization.alloc_tensor() copy(%[[dummy]]) {bufferization.escape = [false]} : tensor<10xf32>

mlir/test/Dialect/Bufferization/Transforms/tensor-copy-insertion.mlir

	// RUN: mlir-opt %s -tensor-copy-insertion -split-input-file | FileCheck %s			// RUN: mlir-opt %s -test-tensor-copy-insertion -split-input-file | FileCheck %s
	// RUN: mlir-opt %s -tensor-copy-insertion="bufferize-function-boundaries allow-return-allocs" -split-input-file | FileCheck %s --check-prefix=CHECK-FUNC			// RUN: mlir-opt %s -test-tensor-copy-insertion="bufferize-function-boundaries allow-return-allocs" -split-input-file | FileCheck %s --check-prefix=CHECK-FUNC
	// RUN: mlir-opt %s -tensor-copy-insertion="create-deallocs=0" -split-input-file | FileCheck %s --check-prefix=CHECK-NO-DEALLOC			// RUN: mlir-opt %s -test-tensor-copy-insertion="create-deallocs=0" -split-input-file | FileCheck %s --check-prefix=CHECK-NO-DEALLOC

	// CHECK-LABEL: func @read_after_write_conflict(			// CHECK-LABEL: func @read_after_write_conflict(
	// CHECK-SAME: %[[t:.*]]: tensor<?xf32>			// CHECK-SAME: %[[t:.*]]: tensor<?xf32>
	// CHECK-FUNC-LABEL: func @read_after_write_conflict(			// CHECK-FUNC-LABEL: func @read_after_write_conflict(
	// CHECK-NO-DEALLOC-LABEL: func @read_after_write_conflict(			// CHECK-NO-DEALLOC-LABEL: func @read_after_write_conflict(
	func.func @read_after_write_conflict(%t: tensor<?xf32>, %idx: index, %f: f32)			func.func @read_after_write_conflict(%t: tensor<?xf32>, %idx: index, %f: f32)
	-> (tensor<?xf32>, tensor<?xf32>)			-> (tensor<?xf32>, tensor<?xf32>)
	{			{

mlir/test/Dialect/SCF/one-shot-bufferize-tensor-copy-insertion.mlir

	// RUN: mlir-opt %s -tensor-copy-insertion="allow-return-allocs" -allow-unregistered-dialect -split-input-file | FileCheck %s			// RUN: mlir-opt %s -test-tensor-copy-insertion="allow-return-allocs" -allow-unregistered-dialect -split-input-file | FileCheck %s
	// RUN: mlir-opt %s -tensor-copy-insertion="bufferize-function-boundaries allow-return-allocs" -split-input-file | FileCheck %s --check-prefix=CHECK-FUNC			// RUN: mlir-opt %s -test-tensor-copy-insertion="bufferize-function-boundaries allow-return-allocs" -split-input-file | FileCheck %s --check-prefix=CHECK-FUNC

	// CHECK-LABEL: func @scf_for(			// CHECK-LABEL: func @scf_for(
	// CHECK-SAME: %[[A:.*]]: tensor<?xf32>, %[[B:.*]]: tensor<?xf32>			// CHECK-SAME: %[[A:.*]]: tensor<?xf32>, %[[B:.*]]: tensor<?xf32>
	func.func @scf_for(%A : tensor<?xf32>, %B : tensor<?xf32>,			func.func @scf_for(%A : tensor<?xf32>, %B : tensor<?xf32>,
	%lb : index, %ub : index, %step : index)			%lb : index, %ub : index, %step : index)
	-> (tensor<?xf32>, tensor<?xf32>)			-> (tensor<?xf32>, tensor<?xf32>)
	{			{
	// CHECK: %[[A_copy:.*]] = bufferization.alloc_tensor() copy(%[[A]]) {bufferization.escape = [false]} : tensor<?xf32>			// CHECK: %[[A_copy:.*]] = bufferization.alloc_tensor() copy(%[[A]]) {bufferization.escape = [false]} : tensor<?xf32>

mlir/test/Dialect/SparseTensor/one_shot_bufferize_tensor_copy_insertion.mlir

	// RUN: mlir-opt %s -tensor-copy-insertion="allow-return-allocs" | FileCheck %s			// RUN: mlir-opt %s -test-tensor-copy-insertion="allow-return-allocs" | FileCheck %s
	// RUN: mlir-opt %s -tensor-copy-insertion="bufferize-function-boundaries allow-return-allocs" | FileCheck %s --check-prefix=CHECK-FUNC			// RUN: mlir-opt %s -test-tensor-copy-insertion="bufferize-function-boundaries allow-return-allocs" | FileCheck %s --check-prefix=CHECK-FUNC

	#DCSR = #sparse_tensor.encoding<{			#DCSR = #sparse_tensor.encoding<{
	dimLevelType = [ "compressed", "compressed" ],			dimLevelType = [ "compressed", "compressed" ],
	dimOrdering = affine_map<(i,j) -> (i,j)>			dimOrdering = affine_map<(i,j) -> (i,j)>
	}>			}>

	// CHECK-LABEL: func @bufferization_alloc_tensor			// CHECK-LABEL: func @bufferization_alloc_tensor
	// CHECK-FUNC-LABEL: func @bufferization_alloc_tensor			// CHECK-FUNC-LABEL: func @bufferization_alloc_tensor

mlir/test/Dialect/SparseTensor/sparse_sddmm.mlir

	// RUN: mlir-opt %s --tensor-copy-insertion --pre-sparsification-rewrite --sparsification --cse | FileCheck %s			// RUN: mlir-opt %s --test-tensor-copy-insertion --pre-sparsification-rewrite --sparsification --cse | FileCheck %s

	#SM = #sparse_tensor.encoding<{ dimLevelType = [ "compressed", "compressed" ] }>			#SM = #sparse_tensor.encoding<{ dimLevelType = [ "compressed", "compressed" ] }>

	#trait_matmul = {			#trait_matmul = {
	indexing_maps = [			indexing_maps = [
	affine_map<(d0, d1, d2) -> (d1, d0)>,			affine_map<(d0, d1, d2) -> (d1, d0)>,
	affine_map<(d0, d1, d2) -> (d0, d2)>,			affine_map<(d0, d1, d2) -> (d0, d2)>,
	affine_map<(d0, d1, d2) -> (d1, d2)>			affine_map<(d0, d1, d2) -> (d1, d2)>

mlir/test/Dialect/Tensor/one-shot-bufferize-tensor-copy-insertion.mlir

	// RUN: mlir-opt %s -tensor-copy-insertion -split-input-file | FileCheck %s			// RUN: mlir-opt %s -test-tensor-copy-insertion -split-input-file | FileCheck %s
	// RUN: mlir-opt %s -tensor-copy-insertion="bufferize-function-boundaries allow-return-allocs" -split-input-file | FileCheck %s --check-prefix=CHECK-FUNC			// RUN: mlir-opt %s -test-tensor-copy-insertion="bufferize-function-boundaries allow-return-allocs" -split-input-file | FileCheck %s --check-prefix=CHECK-FUNC

	// CHECK-LABEL: func @extract_slice(			// CHECK-LABEL: func @extract_slice(
	// CHECK-SAME: %[[t:.*]]: tensor<?xf32>			// CHECK-SAME: %[[t:.*]]: tensor<?xf32>
	// CHECK-FUNC-LABEL: func @extract_slice(			// CHECK-FUNC-LABEL: func @extract_slice(
	func.func @extract_slice(%t: tensor<?xf32>, %idx: index, %f: f32)			func.func @extract_slice(%t: tensor<?xf32>, %idx: index, %f: f32)
	-> (tensor<5xf32>, tensor<?xf32>)			-> (tensor<5xf32>, tensor<?xf32>)
	{			{
	// CHECK: %[[extract_slice:.*]] = tensor.extract_slice %[[t]][10] [5] [1]			// CHECK: %[[extract_slice:.*]] = tensor.extract_slice %[[t]][10] [5] [1]
	%0 = tensor.extract_slice %t[10][5][1] : tensor<?xf32> to tensor<5xf32>			%0 = tensor.extract_slice %t[10][5][1] : tensor<?xf32> to tensor<5xf32>
	// CHECK: %[[alloc:.*]] = bufferization.alloc_tensor() copy(%[[extract_slice]]) {bufferization.escape = [false]} : tensor<5xf32>			// CHECK: %[[alloc:.*]] = bufferization.alloc_tensor() copy(%[[extract_slice]]) {bufferization.escape = [false]} : tensor<5xf32>
	// CHECK-FUNC: bufferization.alloc_tensor() copy(%{{.*}}) {bufferization.escape = [true]} : tensor<5xf32>			// CHECK-FUNC: bufferization.alloc_tensor() copy(%{{.*}}) {bufferization.escape = [true]} : tensor<5xf32>
	// CHECK: %[[insert:.*]] = tensor.insert %{{.*}} into %[[alloc]]			// CHECK: %[[insert:.*]] = tensor.insert %{{.*}} into %[[alloc]]
	%1 = tensor.insert %f into %0[%idx] : tensor<5xf32>			%1 = tensor.insert %f into %0[%idx] : tensor<5xf32>
	// CHECK: return %[[insert]], %[[t]]			// CHECK: return %[[insert]], %[[t]]
	return %1, %t : tensor<5xf32>, tensor<?xf32>			return %1, %t : tensor<5xf32>, tensor<?xf32>
	}			}
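
In this test, `%t` is returned after the `tensor.insert`, so a write through the extracted slice must not land in the storage of `%t`. That is why the checks expect a `bufferization.alloc_tensor` with `copy(...)`. The following is a minimal standalone sketch of the same read-after-write situation using plain vectors instead of MLIR buffers. `Buffer`, `insertInPlace`, `insertWithCopy` and `demoReadAfterWrite` are hypothetical names introduced only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical toy model: a buffer is storage that several tensor values may
// alias once the IR has been bufferized.
using Buffer = std::vector<float>;

// In-place: the slice aliases `buffer`, so the write is visible through it.
void insertInPlace(Buffer &buffer, std::size_t sliceOffset, std::size_t idx,
                   float value) {
  buffer[sliceOffset + idx] = value;
}

// Out-of-place: copy the slice first, then write into the copy. The copy
// stands in for the `bufferization.alloc_tensor` with `copy(...)`.
Buffer insertWithCopy(const Buffer &buffer, std::size_t sliceOffset,
                      std::size_t sliceSize, std::size_t idx, float value) {
  auto first = buffer.begin() + static_cast<std::ptrdiff_t>(sliceOffset);
  Buffer copy(first, first + static_cast<std::ptrdiff_t>(sliceSize));
  copy[idx] = value;
  return copy;
}

// Small usage example mirroring `extract_slice %t[10][5][1]` plus an insert.
void demoReadAfterWrite() {
  Buffer t(16, 1.0f);
  Buffer result = insertWithCopy(t, 10, 5, 2, 7.0f);
  assert(result[2] == 7.0f);
  assert(t[12] == 1.0f); // The later read of `t` still sees the old value.
  insertInPlace(t, 10, 2, 7.0f);
  assert(t[12] == 7.0f); // In place, the later read of `t` would be clobbered.
}
```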

mlir/test/lib/Dialect/Bufferization/CMakeLists.txt

This file was added.

				# Exclude tests from libMLIR.so
				add_mlir_library(MLIRBufferizationTestPasses
				TestTensorCopyInsertion.cpp

				EXCLUDE_FROM_LIBMLIR

				LINK_LIBS PUBLIC
				MLIRBufferizationDialect
				MLIRBufferizationTransforms
				MLIRIR
				MLIRPass
				)

mlir/test/lib/Dialect/Bufferization/TestTensorCopyInsertion.cpp

This file was added.

				//===- TestTensorCopyInsertion.cpp - Bufferization Analysis -----*- c++ -*-===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/Dialect/Bufferization/IR/Bufferization.h"
				#include "mlir/Dialect/Bufferization/Transforms/OneShotAnalysis.h"
				#include "mlir/Dialect/Bufferization/Transforms/Transforms.h"
				#include "mlir/Pass/Pass.h"

				using namespace mlir;

				namespace {
				/// This pass runs One-Shot Analysis and inserts copies for all OpOperands that
				/// were decided to bufferize out-of-place. After running this pass, a
				/// bufferization can write to buffers directly (without making copies) and no
				/// longer has to care about potential read-after-write conflicts.
				///
				/// Note: By default, all newly inserted tensor copies/allocs (i.e., newly
				/// created `bufferization.alloc_tensor` ops) that do not escape the block are
				/// annotated with `escape = false`. If `create-deallocs` is unset, all newly
				/// inserted tensor copies/allocs are annotated with `escape = true`. In that
				/// case, they are not deallocated when bufferizing the IR.
				struct TestTensorCopyInsertionPass
				: public PassWrapper<TestTensorCopyInsertionPass, OperationPass<ModuleOp>> {
				MLIR_DEFINE_EXPLICIT_INTERNAL_INLINE_TYPE_ID(TestTensorCopyInsertionPass)

				TestTensorCopyInsertionPass() = default;
				TestTensorCopyInsertionPass(const TestTensorCopyInsertionPass &pass)
				: PassWrapper(pass) {}

				void getDependentDialects(DialectRegistry &registry) const override {
				registry.insert<bufferization::BufferizationDialect>();
				}
				StringRef getArgument() const final { return "test-tensor-copy-insertion"; }
				StringRef getDescription() const final {
				return "Module pass to test Tensor Copy Insertion";
				}

				void runOnOperation() override {
				bufferization::OneShotBufferizationOptions options;
				options.allowReturnAllocs = allowReturnAllocs;
				options.bufferizeFunctionBoundaries = bufferizeFunctionBoundaries;
				options.createDeallocs = createDeallocs;
				if (mustInferMemorySpace)
				options.defaultMemorySpace = None;
				if (failed(bufferization::insertTensorCopies(getOperation(), options)))
				signalPassFailure();
				}

				Option<bool> allowReturnAllocs{
				*this, "allow-return-allocs",
				llvm::cl::desc("Allows returning/yielding new allocations from a block."),
				llvm::cl::init(false)};
				Option<bool> bufferizeFunctionBoundaries{
				*this, "bufferize-function-boundaries",
				llvm::cl::desc("Bufferize function boundaries."), llvm::cl::init(false)};
				Option<bool> createDeallocs{
				*this, "create-deallocs",
				llvm::cl::desc("Specify if new allocations should be deallocated."),
				llvm::cl::init(true)};
				Option<bool> mustInferMemorySpace{
				*this, "must-infer-memory-space",
				llvm::cl::desc(
				"The memory space of a memref type must always be inferred. If "
				"unset, a default memory space of 0 is used."),
				llvm::cl::init(false)};
				};
				} // namespace

				namespace mlir::test {
				void registerTestTensorCopyInsertionPass() {
				PassRegistration<TestTensorCopyInsertionPass>();
				}
				} // namespace mlir::test

mlir/test/lib/Dialect/CMakeLists.txt

	add_subdirectory(Affine)			add_subdirectory(Affine)
	add_subdirectory(Arith)			add_subdirectory(Arith)
				add_subdirectory(Bufferization)
	add_subdirectory(DLTI)			add_subdirectory(DLTI)
	add_subdirectory(Func)			add_subdirectory(Func)
	add_subdirectory(GPU)			add_subdirectory(GPU)
	add_subdirectory(Linalg)			add_subdirectory(Linalg)
	add_subdirectory(LLVM)			add_subdirectory(LLVM)
	add_subdirectory(Math)			add_subdirectory(Math)
	add_subdirectory(MemRef)			add_subdirectory(MemRef)
	add_subdirectory(NVGPU)			add_subdirectory(NVGPU)

mlir/tools/mlir-opt/CMakeLists.txt

set(LLVM_LINK_COMPONENTS		set(LLVM_LINK_COMPONENTS
AsmParser		AsmParser
)		)

if(MLIR_INCLUDE_TESTS)		if(MLIR_INCLUDE_TESTS)
set(test_libs		set(test_libs
MLIRTestFuncToLLVM		MLIRTestFuncToLLVM
MLIRAffineTransformsTestPasses		MLIRAffineTransformsTestPasses
MLIRArithTestPasses		MLIRArithTestPasses
		MLIRBufferizationTestPasses
MLIRDLTITestPasses		MLIRDLTITestPasses
MLIRFuncTestPasses		MLIRFuncTestPasses
MLIRGPUTestPasses		MLIRGPUTestPasses
MLIRLinalgTestPasses		MLIRLinalgTestPasses
MLIRMathTestPasses		MLIRMathTestPasses
MLIRMemRefTestPasses		MLIRMemRefTestPasses
MLIRNVGPUTestPasses		MLIRNVGPUTestPasses
MLIRSCFTestPasses		MLIRSCFTestPasses

mlir/tools/mlir-opt/mlir-opt.cpp

void registerTestPadFusion();		void registerTestPadFusion();
void registerTestPDLByteCodePass();		void registerTestPDLByteCodePass();
void registerTestPDLLPasses();		void registerTestPDLLPasses();
void registerTestPreparationPassWithAllowedMemrefResults();		void registerTestPreparationPassWithAllowedMemrefResults();
void registerTestRecursiveTypesPass();		void registerTestRecursiveTypesPass();
void registerTestSCFUtilsPass();		void registerTestSCFUtilsPass();
void registerTestShapeMappingPass();		void registerTestShapeMappingPass();
void registerTestSliceAnalysisPass();		void registerTestSliceAnalysisPass();
		void registerTestTensorCopyInsertionPass();
void registerTestTensorTransforms();		void registerTestTensorTransforms();
void registerTestTilingInterface();		void registerTestTilingInterface();
void registerTestTopologicalSortAnalysisPass();		void registerTestTopologicalSortAnalysisPass();
void registerTestTransformDialectEraseSchedulePass();		void registerTestTransformDialectEraseSchedulePass();
void registerTestTransformDialectInterpreterPass();		void registerTestTransformDialectInterpreterPass();
void registerTestVectorLowerings();		void registerTestVectorLowerings();
void registerTestNvgpuLowerings();		void registerTestNvgpuLowerings();
} // namespace test		} // namespace test
#endif		#endif
mlir::test::registerTestOpaqueLoc();		mlir::test::registerTestOpaqueLoc();
mlir::test::registerTestPadFusion();		mlir::test::registerTestPadFusion();
mlir::test::registerTestPDLByteCodePass();		mlir::test::registerTestPDLByteCodePass();
mlir::test::registerTestPDLLPasses();		mlir::test::registerTestPDLLPasses();
mlir::test::registerTestRecursiveTypesPass();		mlir::test::registerTestRecursiveTypesPass();
mlir::test::registerTestSCFUtilsPass();		mlir::test::registerTestSCFUtilsPass();
mlir::test::registerTestShapeMappingPass();		mlir::test::registerTestShapeMappingPass();
mlir::test::registerTestSliceAnalysisPass();		mlir::test::registerTestSliceAnalysisPass();
		mlir::test::registerTestTensorCopyInsertionPass();
mlir::test::registerTestTensorTransforms();		mlir::test::registerTestTensorTransforms();
mlir::test::registerTestTilingInterface();		mlir::test::registerTestTilingInterface();
mlir::test::registerTestTopologicalSortAnalysisPass();		mlir::test::registerTestTopologicalSortAnalysisPass();
mlir::test::registerTestTransformDialectEraseSchedulePass();		mlir::test::registerTestTransformDialectEraseSchedulePass();
mlir::test::registerTestTransformDialectInterpreterPass();		mlir::test::registerTestTransformDialectInterpreterPass();
mlir::test::registerTestVectorLowerings();		mlir::test::registerTestVectorLowerings();
mlir::test::registerTestNvgpuLowerings();		mlir::test::registerTestNvgpuLowerings();
}		}

utils/bazel/llvm-project-overlay/mlir/BUILD.bazel

deps = [		deps = [
":SCFToGPU",		":SCFToGPU",
":Support",		":Support",
":Transforms",		":Transforms",
"//llvm:AllTargetsCodeGens",		"//llvm:AllTargetsCodeGens",
"//llvm:Support",		"//llvm:Support",
"//mlir/test:TestAffine",		"//mlir/test:TestAffine",
"//mlir/test:TestAnalysis",		"//mlir/test:TestAnalysis",
"//mlir/test:TestArith",		"//mlir/test:TestArith",
		"//mlir/test:TestBufferization",
"//mlir/test:TestDLTI",		"//mlir/test:TestDLTI",
"//mlir/test:TestDialect",		"//mlir/test:TestDialect",
"//mlir/test:TestFunc",		"//mlir/test:TestFunc",
"//mlir/test:TestFuncToLLVM",		"//mlir/test:TestFuncToLLVM",
"//mlir/test:TestGPU",		"//mlir/test:TestGPU",
"//mlir/test:TestIR",		"//mlir/test:TestIR",
"//mlir/test:TestLinalg",		"//mlir/test:TestLinalg",
"//mlir/test:TestLLVM",		"//mlir/test:TestLLVM",

utils/bazel/llvm-project-overlay/mlir/test/BUILD.bazel

deps = [		deps = [
"//mlir:MathDialect",		"//mlir:MathDialect",
"//mlir:Pass",		"//mlir:Pass",
"//mlir:Transforms",		"//mlir:Transforms",
"//mlir:VectorDialect",		"//mlir:VectorDialect",
],		],
)		)

cc_library(		cc_library(
		name = "TestBufferization",
		srcs = glob(["lib/Dialect/Bufferization/*.cpp"]),
		defines = ["MLIR_CUDA_CONVERSIONS_ENABLED"],
		includes = ["lib/Dialect/Test"],
		deps = [
		"//mlir:BufferizationDialect",
		"//mlir:BufferizationTransforms",
		"//mlir:IR",
		"//mlir:Pass",
		],
		)

		cc_library(
name = "TestShapeDialect",		name = "TestShapeDialect",
srcs = [		srcs = [
"lib/Dialect/Shape/TestShapeFunctions.cpp",		"lib/Dialect/Shape/TestShapeFunctions.cpp",
"lib/Dialect/Shape/TestShapeMappingAnalysis.cpp",		"lib/Dialect/Shape/TestShapeMappingAnalysis.cpp",
],		],
deps = [		deps = [
"//llvm:Support",		"//llvm:Support",
"//mlir:FuncDialect",		"//mlir:FuncDialect",

Revision Contents

Diff 479618

mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.h

mlir/include/mlir/Dialect/Bufferization/Transforms/Passes.td

mlir/include/mlir/Dialect/SparseTensor/Transforms/Passes.h

mlir/lib/Dialect/Bufferization/Transforms/TensorCopyInsertion.cpp

mlir/lib/Dialect/SparseTensor/Pipelines/SparseTensorPipelines.cpp

mlir/lib/Dialect/SparseTensor/Transforms/CMakeLists.txt

mlir/lib/Dialect/SparseTensor/Transforms/DenseBufferizationPass.cpp

mlir/lib/Dialect/SparseTensor/Transforms/SparsificationAndBufferizationPass.cpp

mlir/test/Dialect/Bufferization/Transforms/tensor-copy-insertion-memory-space-invalid.mlir

mlir/test/Dialect/Bufferization/Transforms/tensor-copy-insertion-memory-space.mlir

mlir/test/Dialect/Bufferization/Transforms/tensor-copy-insertion.mlir

mlir/test/Dialect/SCF/one-shot-bufferize-tensor-copy-insertion.mlir

mlir/test/Dialect/SparseTensor/one_shot_bufferize_tensor_copy_insertion.mlir

mlir/test/Dialect/SparseTensor/sparse_sddmm.mlir

mlir/test/Dialect/Tensor/one-shot-bufferize-tensor-copy-insertion.mlir

mlir/test/lib/Dialect/Bufferization/CMakeLists.txt

mlir/test/lib/Dialect/Bufferization/TestTensorCopyInsertion.cpp

mlir/test/lib/Dialect/CMakeLists.txt

mlir/tools/mlir-opt/CMakeLists.txt

mlir/tools/mlir-opt/mlir-opt.cpp

utils/bazel/llvm-project-overlay/mlir/BUILD.bazel

utils/bazel/llvm-project-overlay/mlir/test/BUILD.bazel
