This is an archive of the discontinued LLVM Phabricator instance.

[mlir][sparse] Factoring out SparseTensorType class
ClosedPublic

Authored by wrengr on Feb 10 2023, 6:27 PM.

Details

Summary

This change adds a new SparseTensorType class that makes the "dim" vs "lvl" distinction more overt and abstracts over the differences between sparse tensors and dense tensors. It also adds new type aliases Dimension, Level, and FieldIndex to make code more self-documenting.
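
A minimal sketch of what the summary describes, for illustration only; the member names and bodies below are simplified and are not necessarily the exact API that landed in the patch:

```cpp
// Sketch only: the dim/lvl aliases and the wrapper described in the summary.
using Dimension = uint64_t;  // indexes the tensor's semantic dimensions
using Level = uint64_t;      // indexes the levels of the sparse storage scheme
using FieldIndex = unsigned; // indexes the fields of the storage layout

class SparseTensorType {
public:
  explicit SparseTensorType(RankedTensorType rtp)
      : rtp(rtp), enc(getSparseTensorEncoding(rtp)) {}

  // Dense tensors simply have a null encoding; callers need not special-case them.
  bool hasEncoding() const { return static_cast<bool>(enc); }
  Dimension getDimRank() const { return static_cast<Dimension>(rtp.getRank()); }

private:
  const RankedTensorType rtp;
  const SparseTensorEncodingAttr enc;
};
```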

Although the diff is very large, the majority of the changes are mechanical (e.g., changing types to use the new aliases, updating variable names to match, etc.). Along the way I also made many variables const where possible; most of those changes required only adding the keyword. A few places had conditional definitions of these variables and required actual code changes; however, that was only done when the overall change was extremely local and easy to extract. All these changes are included in the current patch only because it would be too onerous to split them off into a separate patch.

Diff Detail

Event Timeline

wrengr created this revision.Feb 10 2023, 6:27 PM
Herald added a project: Restricted Project. · View Herald Transcript
wrengr requested review of this revision.Feb 10 2023, 6:27 PM

@stella.stamenova This patch introduces some deprecation warnings, which are intended for other folks on our team to help track down the offending use-sites so they can fix them. Is there anything I need to do to ensure the LLVM buildbot doesn't reject this patch because of the new warnings?

It will fail because it enforces warnings as errors. Are there a lot of warnings that will be generated, and will it take long to fix them? I can see us living with a red bot for a day, assuming that people are working on applying fixes as necessary.

It will fail because it enforces warnings as errors

Yeah, that's what I was guessing. I'm currently defining the deprecation annotation with a macro, so it'd be easy enough to (conditionally) disable. Since the buildbot enforces warnings as errors, I'm guessing there's no standard LLVM idiom for "only enable this macro when explicitly requested"?

Are there a lot of warnings that will be generated, and will it take long to fix them? I can see us living with a red bot for a day, assuming that people are working on applying fixes as necessary.

I forget how many callsites remain for the deprecated functions, but there are a dozen or so, IIRC. Unfortunately they're not terribly easy to fix. (The functions are deprecated because they rely on some faulty assumptions, so the deprecation is there to help track down all the places we made those faulty assumptions; but fixing them might require nontrivial code restructuring.) The team's working on it, but it might take more than a day.

I'll un-deprecate them for now

One thing that we could do is disable -Werror on the buildbot for the moment and then re-enable it later. It's not terribly difficult, but it relies on the buildbot master being restarted to take effect, so it might not even take effect before the warnings are fixed.

I don't want to cause more work for y'all. I just wanted to double-check before landing, to avoid getting auto-reverted :)

I've already added FIXME comments at all the callsites I could find, so I'm fine with wrapping the deprecation macro in #if 0 so it's easy to re-enable locally as needed.
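
(For illustration, a minimal sketch of the kind of #if 0-guarded macro being discussed; the macro name here is hypothetical, not necessarily the one used in the patch.)

```cpp
// Hypothetical macro name; the point is only the #if 0 guard, which keeps the
// warning off by default while making it trivial to re-enable locally.
#if 0
#define SPARSE_TENSOR_DEPRECATED(MSG) [[deprecated(MSG)]]
#else
#define SPARSE_TENSOR_DEPRECATED(MSG)
#endif

// Example use on an accessor that relies on the faulty dim == lvl assumption:
SPARSE_TENSOR_DEPRECATED("update this callsite to use the dim/lvl-aware API")
int64_t getDimSize(uint64_t d);
```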

wrengr updated this revision to Diff 497167.Feb 13 2023, 6:13 PM

Disabling the deprecation warnings, and adding a few static_casts to equality assertions between different types.
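
(Illustrative only: the kind of cast the update note refers to, e.g. comparing a Level against an int64_t rank inside an assertion without a signed/unsigned mismatch; the variable names here are made up.)

```cpp
// Sketch: cast one side so both operands of the equality have the same type.
assert(static_cast<int64_t>(lvlRank) == rtp.getRank() &&
       "level-rank does not match the tensor rank");
```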

Peiming added inline comments.Feb 14 2023, 9:48 AM
mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensor.h
52

Are we going to use Ship for real? How about DynSize or something else?

mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensorType.h
102

Might be helpful to provide operator== so we can use pointer comparison between rtp1==rtp2 for SparseTensorType just like RankedTensorType.
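
(A rough sketch of what such an operator could look like, assuming the wrapper exposes an accessor like getRankedTensorType(); equality then delegates to the underlying RankedTensorType, which already compares as a uniqued pointer.)

```cpp
bool operator==(const SparseTensorType &lhs, const SparseTensorType &rhs) {
  return lhs.getRankedTensorType() == rhs.getRankedTensorType();
}
bool operator!=(const SparseTensorType &lhs, const SparseTensorType &rhs) {
  return !(lhs == rhs);
}
```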

aartbik added inline comments.Feb 14 2023, 10:36 AM
mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensor.h
52

We could remove the TODO and use DynSize for now?

107–108

update doc too to say level?

122–123

stored dim, i.e. level
(or just level)?

123–145

stored dimension -> level?

EDIT: I will not be commenting on these further, since you also plan a doc cleanup after this.

124

If this is just to avoid the error on Windows, can we make it a Linux-only guard or so?
(As you know, commented-out code is usually frowned upon, even when it's there for a good reason ;-)

mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensorAttrDefs.td
264

Although true in general, these are methods *on* the encoding, right?
So I am not sure we want to say this?

EDIT: Ah, later I see this is related to the getImpl() change you added, still very subtle ;-)

mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensorType.h
49

In this case, it is more of a pre-compute than a memoization, right? I know that it does not matter much, but I have a slightly different interpretation when I see memoization ;-)
[from my Prolog days ;-)]

mlir/lib/Dialect/SparseTensor/IR/SparseTensorDialect.cpp
412–413

can we just else-if this?

wrengr updated this revision to Diff 497441.Feb 14 2023, 2:21 PM
wrengr marked 7 inline comments as done.

Addressing comments

mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensor.h
52

I'm not a big fan of DynSize personally, since I feel there's a confusion between (1) the type where the kDynamic token itself lives, vs. (2) the type where actual dynamic sizes live (i.e., the compiler's type for the runtime values; hence, a particular subset of Value). This distinction becomes particularly salient in a different CL I'm working on, which defines a variant of OpFoldResult to encompass both static and dynamic types without losing track of which is which. One of the motivations for that CL is to clean up a bunch of places where we currently have ad-hoc versions of mlir::getMixedValues and similar stuff from Dialect/Utils/StaticValueUtils.h.

Also, that name obfuscates that the alias is the singular of "shape" rather than of "sizes" ;)

But I'll make the change for now, for the sake of not blocking the rest of this patch.

124

Yeah, this is to avoid getting rejected by the LLVM buildbot. I could enable it for non-Windows, but I'm guessing their Linux bot is also configured with -Werror...

I'd love to have this be triggered by a "for developers" flag, so that our team automatically sees it but end-users don't have to. Alas, AFAIK there's no such flag already in our toolchains. (Of course, even if there were, Blaze disables deprecation warnings by default, so folks would only see it when building via CMake.)

mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensorAttrDefs.td
264

Adding this comment for the sake of posterity (since you already figured it out :)

In the implementations of isAllDense, isAllOrdered, hasIdDimOrdering, etc., I check getImpl() before checking the rest of the condition. Adding that check does three things:

(1) it avoids segfaults from calling these methods on the null-attr,
(2) it avoids the boilerplate of needing to use the condition (enc && enc.isAllDense()) at every callsite,
(3) and it unifies handling of sparse-tensors and dense-tensors, since the condition in the second point is the one we actually care about.

Since the semantic unification of the third point is a specific and intentional goal of the implementation, it makes sense to document that fact.
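
(A sketch of the pattern being described, with the predicate body simplified; the helper names getDimLevelType and isDenseDLT are what the dialect used around this time, but the exact body here is illustrative.)

```cpp
// The getImpl() check makes a null encoding (i.e., a dense tensor) answer the
// predicate directly instead of segfaulting, so callers no longer need the
// `enc && enc.isAllDense()` boilerplate.
bool SparseTensorEncodingAttr::isAllDense() const {
  return !getImpl() || llvm::all_of(getDimLevelType(), isDenseDLT);
}
```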

mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensorType.h
49

True, this isn't tabulation like you get in Prolog/Datalog/etc.; but it is still caching a memo in order to avoid recomputing the value every time it's needed. [Owing to my experience with Haskell, Dyna, AliceML, etc., I think of "memoization" as more than just dynamic programming; but AFAIK my use of the term isn't at odds with how it's used elsewhere :)]

Whereas I interpret "precompute" to mean multi-pass algorithms that build up some intermediate/auxiliary data structure (e.g., to make queries faster than going to the original data, or like our SparseTensorNNZ class).

102

It looks like operator== was one of the things I split off into a separate CL. Did you want me to merge it back into this CL, or is a separate CL fine?

wrengr updated this revision to Diff 497444.Feb 14 2023, 2:22 PM

git-clang-format

wrengr added inline comments.Feb 14 2023, 3:17 PM
mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensorType.h
102
wrengr marked an inline comment as done.Feb 14 2023, 3:22 PM
aartbik accepted this revision.Feb 14 2023, 6:03 PM
aartbik added inline comments.
mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensor.h
52

Ah, yeah, I remember the discussion on singular vs. plural. Yeah, finding the right names can be hard ;-(

124

Agreed that would be nice. But this is fine now as far as I am concerned.

mlir/include/mlir/Dialect/SparseTensor/IR/SparseTensorType.h
49

Fair enough!

This revision is now accepted and ready to land.Feb 14 2023, 6:03 PM
This revision was automatically updated to reflect the committed changes.