This is an archive of the discontinued LLVM Phabricator instance.

[mlir][sparse] add asCOO() functionality to sparse tensor object
ClosedPublic

Authored by aartbik on Aug 24 2021, 6:59 PM.

Download Raw Diff

Details

Reviewers

penpornk
bixia
gussmith23
wrengr

Commits

rG6b26857dbfc1: [mlir][sparse] add asCOO() functionality to sparse tensor object

Summary

This prepares general sparse to sparse conversions. The code that
needs to be generated using this new feature is now simply:

(1) coo = sparse_tensor_1->asCOO(); source format1
(2) sparse_tensor_2 = newSparseTensor(coo); destination format2

By using COO as an intermediate, we can do *all* conversions without
having to implement the full O(N^2) conversion matrix. Note that we
can always improve particular conversions individually if a faster
solution is required.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

aartbik created this revision.Aug 24 2021, 6:59 PM

Herald added subscribers: wrengr, Chia-hungDuan, dcaballe and 17 others. · View Herald TranscriptAug 24 2021, 7:00 PM

aartbik requested review of this revision.Aug 24 2021, 7:00 PM

Herald added a project: Restricted Project. · View Herald TranscriptAug 24 2021, 7:00 PM

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald Transcript

aartbik added reviewers: penpornk, bixia, gussmith23, wrengr.Aug 24 2021, 7:01 PM

Harbormaster completed remote builds in B121093: Diff 368533.Aug 24 2021, 7:37 PM

aartbik added inline comments.Aug 24 2021, 8:21 PM

mlir/lib/ExecutionEngine/SparseUtils.cpp
297	Please note that this implementation does not take the permutation of either source or destination into account yet (i.e. this version is for the identity permutations). A follow-up CL will dot all the i's and cross the t's, followed by actual lowering code support in SparseTensorConversion.

cross the ts and dot the is

mlir/lib/ExecutionEngine/SparseUtils.cpp
297	I added the required permutation requirements in this CL also. The next revision that will build on this will finalize the sparse conversions (with just a few lines of code).

aartbik added a child revision: D108721: [mlir][sparse] fully implement sparse tensor to sparse tensor conversions.Aug 25 2021, 11:55 AM

Harbormaster completed remote builds in B121201: Diff 368689.Aug 25 2021, 12:20 PM

bixia accepted this revision.Aug 25 2021, 2:56 PM

bixia added inline comments.

mlir/lib/ExecutionEngine/SparseUtils.cpp
224–230	It could be nice if we can have a more specially names or/and documentation to help understanding of the two usages of "tmp". the first tmp: Apply the reversed permutation to the storage sizes to get the sizes of the tensor. the second tmp: Apply the reversed permutation to the permutation of the coo tensor to get the accumulated permutation from the tensor to the coo tensor.

This revision is now accepted and ready to land.Aug 25 2021, 2:56 PM

aartbik marked an inline comment as done.Aug 25 2021, 8:43 PM

aartbik added inline comments.

mlir/lib/ExecutionEngine/SparseUtils.cpp
224–230	Yeah, probably premature optimization keeping the same vector for both. I changed this back into two vectors with better names, leaving the optimization for another time (not that this one will help much anyway).

renamed tmp vector, used two different ones, added comments

Harbormaster completed remote builds in B121292: Diff 368800.Aug 25 2021, 9:38 PM

Closed by commit rG6b26857dbfc1: [mlir][sparse] add asCOO() functionality to sparse tensor object (authored by aartbik). · Explain WhyAug 25 2021, 9:51 PM

This revision was automatically updated to reflect the committed changes.

aartbik added a commit: rG6b26857dbfc1: [mlir][sparse] add asCOO() functionality to sparse tensor object.

Revision Contents

Path

Size

mlir/

lib/

ExecutionEngine/

SparseUtils.cpp

107 lines

Diff 368804

mlir/lib/ExecutionEngine/SparseUtils.cpp

Show First 20 Lines • Show All 88 Lines • ▼ Show 20 Lines	public:
const Element<V> &next() { return elements[pos++]; }		const Element<V> &next() { return elements[pos++]; }
/// Returns rank.		/// Returns rank.
uint64_t getRank() const { return sizes.size(); }		uint64_t getRank() const { return sizes.size(); }
/// Getter for sizes array.		/// Getter for sizes array.
const std::vector<uint64_t> &getSizes() const { return sizes; }		const std::vector<uint64_t> &getSizes() const { return sizes; }
/// Getter for elements array.		/// Getter for elements array.
const std::vector<Element<V>> &getElements() const { return elements; }		const std::vector<Element<V>> &getElements() const { return elements; }

/// Factory method.		/// Factory method. Permutes the original dimensions according to
		/// the given ordering and expects subsequent add() calls to honor
		/// that same ordering for the given indices. The result is a
		/// fully permuted coordinate scheme.
static SparseTensor<V> newSparseTensor(uint64_t size, uint64_t sizes,		static SparseTensor<V> newSparseTensor(uint64_t size, uint64_t sizes,
uint64_t *perm,		uint64_t *perm,
uint64_t capacity = 0) {		uint64_t capacity = 0) {
std::vector<uint64_t> indices(size);		std::vector<uint64_t> permsz(size);
for (uint64_t r = 0; r < size; r++)		for (uint64_t r = 0; r < size; r++)
indices[perm[r]] = sizes[r];		permsz[perm[r]] = sizes[r];
return new SparseTensor<V>(indices, capacity);		return new SparseTensor<V>(permsz, capacity);
}		}

private:		private:
/// Returns true if indices of e1 < indices of e2.		/// Returns true if indices of e1 < indices of e2.
static bool lexOrder(const Element<V> &e1, const Element<V> &e2) {		static bool lexOrder(const Element<V> &e1, const Element<V> &e2) {
assert(e1.indices.size() == e2.indices.size());		assert(e1.indices.size() == e2.indices.size());
for (uint64_t r = 0, rank = e1.indices.size(); r < rank; r++) {		for (uint64_t r = 0, rank = e1.indices.size(); r < rank; r++) {
if (e1.indices[r] == e2.indices[r])		if (e1.indices[r] == e2.indices[r])
▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines
/// "one-size-fits-all" solution that simply takes an input tensor and		/// "one-size-fits-all" solution that simply takes an input tensor and
/// annotations to implement all required setup in a general manner.		/// annotations to implement all required setup in a general manner.
template <typename P, typename I, typename V>		template <typename P, typename I, typename V>
class SparseTensorStorage : public SparseTensorStorageBase {		class SparseTensorStorage : public SparseTensorStorageBase {
public:		public:
/// Constructs a sparse tensor storage scheme from the given sparse		/// Constructs a sparse tensor storage scheme from the given sparse
/// tensor in coordinate scheme following the given per-rank dimension		/// tensor in coordinate scheme following the given per-rank dimension
/// dense/sparse annotations.		/// dense/sparse annotations.
SparseTensorStorage(SparseTensor<V> tensor, uint8_t sparsity)		SparseTensorStorage(SparseTensor<V> tensor, uint8_t sparsity,
: sizes(tensor->getSizes()), pointers(getRank()), indices(getRank()) {		uint64_t *perm)
		: sizes(tensor->getSizes()), rev(getRank()), pointers(getRank()),
		indices(getRank()) {
		// Store "reverse" permutation.
		for (uint64_t d = 0, rank = getRank(); d < rank; d++)
		rev[perm[d]] = d;
// Provide hints on capacity.		// Provide hints on capacity.
// TODO: needs fine-tuning based on sparsity		// TODO: needs fine-tuning based on sparsity
uint64_t nnz = tensor->getElements().size();		uint64_t nnz = tensor->getElements().size();
values.reserve(nnz);		values.reserve(nnz);
for (uint64_t d = 0, s = 1, rank = getRank(); d < rank; d++) {		for (uint64_t d = 0, s = 1, rank = getRank(); d < rank; d++) {
s *= sizes[d];		s *= sizes[d];
if (sparsity[d] == kCompressed) {		if (sparsity[d] == kCompressed) {
pointers[d].reserve(s + 1);		pointers[d].reserve(s + 1);
indices[d].reserve(s);		indices[d].reserve(s);
s = 1;		s = 1;
} else {		} else {
assert(sparsity[d] == kDense && "singleton not yet supported");		assert(sparsity[d] == kDense && "singleton not yet supported");
}		}
}		}
		// Prepare sparse pointer structures for all dimensions.
		for (uint64_t d = 0, rank = getRank(); d < rank; d++)
		if (sparsity[d] == kCompressed)
		pointers[d].push_back(0);
// Then setup the tensor.		// Then setup the tensor.
traverse(tensor, sparsity, 0, nnz, 0);		fromCOO(tensor, sparsity, 0, nnz, 0);
}		}

virtual ~SparseTensorStorage() {}		virtual ~SparseTensorStorage() {}

uint64_t getRank() const { return sizes.size(); }		uint64_t getRank() const { return sizes.size(); }

uint64_t getDimSize(uint64_t d) override { return sizes[d]; }		uint64_t getDimSize(uint64_t d) override { return sizes[d]; }

// Partially specialize these three methods based on template types.		// Partially specialize these three methods based on template types.
void getPointers(std::vector<P> **out, uint64_t d) override {		void getPointers(std::vector<P> **out, uint64_t d) override {
*out = &pointers[d];		*out = &pointers[d];
}		}
void getIndices(std::vector<I> **out, uint64_t d) override {		void getIndices(std::vector<I> **out, uint64_t d) override {
*out = &indices[d];		*out = &indices[d];
}		}
void getValues(std::vector<V> *out) override { out = &values; }		void getValues(std::vector<V> *out) override { out = &values; }

/// Factory method.		/// Returns this sparse tensor storage scheme as a new memory-resident
static SparseTensorStorage<P, I, V> newSparseTensor(SparseTensor<V> t,		/// sparse tensor in coordinate scheme with the given dimension order.
uint8_t *s) {		SparseTensor<V> asCOO(uint64_t perm) {
		// Restore original order of the dimension sizes and allocate coordinate
		// scheme with desired new ordering specified in perm.
		uint64_t size = getRank();
		std::vector<uint64_t> orgsz(size);
		for (uint64_t r = 0; r < size; r++)
		orgsz[rev[r]] = sizes[r];
		SparseTensor<V> *tensor = SparseTensor<V>::newSparseTensor(
		size, orgsz.data(), perm, values.size());
		// Populate coordinate scheme restored from old ordering and changed with
		// new ordering. Rather than applying both reorderings during the recursion,
		bixiaUnsubmitted Done Reply Inline Actions It could be nice if we can have a more specially names or/and documentation to help understanding of the two usages of "tmp". the first tmp: Apply the reversed permutation to the storage sizes to get the sizes of the tensor. the second tmp: Apply the reversed permutation to the permutation of the coo tensor to get the accumulated permutation from the tensor to the coo tensor. bixia: It could be nice if we can have a more specially names or/and documentation to help…
		aartbikAuthorUnsubmitted Done Reply Inline Actions Yeah, probably premature optimization keeping the same vector for both. I changed this back into two vectors with better names, leaving the optimization for another time (not that this one will help much anyway). aartbik: Yeah, probably premature optimization keeping the same vector for both. I changed this back…
		// we compute the combine permutation in advance.
		std::vector<uint64_t> reord(size);
		for (uint64_t r = 0; r < size; r++)
		reord[r] = perm[rev[r]];
		std::vector<uint64_t> idx(size);
		toCOO(tensor, reord, idx, 0, 0);
		return tensor;
		}

		/// Factory method. Expects a coordinate scheme that respects the same
		/// permutation as is desired for the new sparse storage scheme.
		static SparseTensorStorage<P, I, V> *
		newSparseTensor(SparseTensor<V> t, uint8_t sparsity, uint64_t *perm) {
t->sort(); // sort lexicographically		t->sort(); // sort lexicographically
SparseTensorStorage<P, I, V> *n = new SparseTensorStorage<P, I, V>(t, s);		SparseTensorStorage<P, I, V> *n =
		new SparseTensorStorage<P, I, V>(t, sparsity, perm);
delete t;		delete t;
return n;		return n;
}		}

private:		private:
/// Initializes sparse tensor storage scheme from a memory-resident sparse		/// Initializes sparse tensor storage scheme from a memory-resident sparse
/// tensor in coordinate scheme. This method prepares the pointers and indices		/// tensor in coordinate scheme. This method prepares the pointers and indices
/// arrays under the given per-rank dimension dense/sparse annotations.		/// arrays under the given per-rank dimension dense/sparse annotations.
void traverse(SparseTensor<V> tensor, uint8_t sparsity, uint64_t lo,		void fromCOO(SparseTensor<V> tensor, uint8_t sparsity, uint64_t lo,
uint64_t hi, uint64_t d) {		uint64_t hi, uint64_t d) {
const std::vector<Element<V>> &elements = tensor->getElements();		const std::vector<Element<V>> &elements = tensor->getElements();
// Once dimensions are exhausted, insert the numerical values.		// Once dimensions are exhausted, insert the numerical values.
if (d == getRank()) {		if (d == getRank()) {
values.push_back(lo < hi ? elements[lo].value : 0);		values.push_back(lo < hi ? elements[lo].value : 0);
return;		return;
}		}
// Prepare a sparse pointer structure at this dimension.
if (sparsity[d] == kCompressed && pointers[d].empty())
pointers[d].push_back(0);
// Visit all elements in this interval.		// Visit all elements in this interval.
uint64_t full = 0;		uint64_t full = 0;
while (lo < hi) {		while (lo < hi) {
// Find segment in interval with same index elements in this dimension.		// Find segment in interval with same index elements in this dimension.
unsigned idx = elements[lo].indices[d];		unsigned idx = elements[lo].indices[d];
unsigned seg = lo + 1;		unsigned seg = lo + 1;
while (seg < hi && elements[seg].indices[d] == idx)		while (seg < hi && elements[seg].indices[d] == idx)
seg++;		seg++;
// Handle segment in interval for sparse or dense dimension.		// Handle segment in interval for sparse or dense dimension.
if (sparsity[d] == kCompressed) {		if (sparsity[d] == kCompressed) {
indices[d].push_back(idx);		indices[d].push_back(idx);
} else {		} else {
for (; full < idx; full++)		for (; full < idx; full++)
traverse(tensor, sparsity, 0, 0, d + 1); // pass empty		fromCOO(tensor, sparsity, 0, 0, d + 1); // pass empty
full++;		full++;
}		}
traverse(tensor, sparsity, lo, seg, d + 1);		fromCOO(tensor, sparsity, lo, seg, d + 1);
// And move on to next segment in interval.		// And move on to next segment in interval.
lo = seg;		lo = seg;
}		}
// Finalize the sparse pointer structure at this dimension.		// Finalize the sparse pointer structure at this dimension.
if (sparsity[d] == kCompressed) {		if (sparsity[d] == kCompressed) {
pointers[d].push_back(indices[d].size());		pointers[d].push_back(indices[d].size());
} else {		} else {
for (uint64_t sz = tensor->getSizes()[d]; full < sz; full++)		for (uint64_t sz = tensor->getSizes()[d]; full < sz; full++)
traverse(tensor, sparsity, 0, 0, d + 1); // pass empty		fromCOO(tensor, sparsity, 0, 0, d + 1); // pass empty
		}
		}

		/// Stores the sparse tensor storage scheme into a memory-resident sparse
		/// tensor in coordinate scheme.
		void toCOO(SparseTensor<V> *tensor, std::vector<uint64_t> &reord,
		std::vector<uint64_t> &idx, uint64_t pos, uint64_t d) {
		if (d == getRank()) {
		tensor->add(idx, values[pos]);
		aartbikAuthorUnsubmitted Done Reply Inline Actions Please note that this implementation does not take the permutation of either source or destination into account yet (i.e. this version is for the identity permutations). A follow-up CL will dot all the i's and cross the t's, followed by actual lowering code support in SparseTensorConversion. aartbik: Please note that this implementation does not take the permutation of either source or…
		aartbikAuthorUnsubmitted Done Reply Inline Actions I added the required permutation requirements in this CL also. The next revision that will build on this will finalize the sparse conversions (with just a few lines of code). aartbik: I added the required permutation requirements in this CL also. The next revision that will…
		} else if (pointers[d].empty()) {
		// Dense dimension.
		for (uint64_t i = 0; i < sizes[d]; i++) {
		idx[reord[d]] = i;
		toCOO(tensor, reord, idx, pos * sizes[d] + i, d + 1);
		}
		} else {
		// Sparse dimension.
		for (uint64_t ii = pointers[d][pos]; ii < pointers[d][pos + 1]; ii++) {
		idx[reord[d]] = indices[d][ii];
		toCOO(tensor, reord, idx, ii, d + 1);
		}
}		}
}		}

private:		private:
std::vector<uint64_t> sizes; // per-rank dimension sizes		std::vector<uint64_t> sizes; // per-rank dimension sizes
		std::vector<uint64_t> rev; // "reverse" permutation
std::vector<std::vector<P>> pointers;		std::vector<std::vector<P>> pointers;
std::vector<std::vector<I>> indices;		std::vector<std::vector<I>> indices;
std::vector<V> values;		std::vector<V> values;
};		};

/// Helper to convert string to lower case.		/// Helper to convert string to lower case.
static char toLower(char token) {		static char toLower(char token) {
for (char c = token; c; c++)		for (char c = token; c; c++)
▲ Show 20 Lines • Show All 163 Lines • ▼ Show 20 Lines

#define CASE(p, i, v, P, I, V) \		#define CASE(p, i, v, P, I, V) \
if (ptrTp == (p) && indTp == (i) && valTp == (v)) { \		if (ptrTp == (p) && indTp == (i) && valTp == (v)) { \
SparseTensor<V> *tensor; \		SparseTensor<V> *tensor; \
if (action == 0) \		if (action == 0) \
tensor = openTensor<V>(static_cast<char *>(ptr), asize, sizes, perm); \		tensor = openTensor<V>(static_cast<char *>(ptr), asize, sizes, perm); \
else if (action == 1) \		else if (action == 1) \
tensor = static_cast<SparseTensor<V> *>(ptr); \		tensor = static_cast<SparseTensor<V> *>(ptr); \
else \		else if (action == 2) \
return SparseTensor<V>::newSparseTensor(asize, sizes, perm); \		return SparseTensor<V>::newSparseTensor(asize, sizes, perm); \
return SparseTensorStorage<P, I, V>::newSparseTensor(tensor, sparsity); \		else \
		return static_cast<SparseTensorStorage<P, I, V> *>(ptr)->asCOO(perm); \
		return SparseTensorStorage<P, I, V>::newSparseTensor(tensor, sparsity, \
		perm); \
}		}

#define IMPL1(RET, NAME, TYPE, LIB) \		#define IMPL1(RET, NAME, TYPE, LIB) \
RET NAME(void *tensor) { \		RET NAME(void *tensor) { \
std::vector<TYPE> *v; \		std::vector<TYPE> *v; \
static_cast<SparseTensorStorageBase *>(tensor)->LIB(&v); \		static_cast<SparseTensorStorageBase *>(tensor)->LIB(&v); \
return {v->data(), v->data(), 0, {v->size()}, {1}}; \		return {v->data(), v->data(), 0, {v->size()}, {1}}; \
}		}
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	enum PrimaryTypeEnum : uint64_t {
kI32 = 4,		kI32 = 4,
kI16 = 5,		kI16 = 5,
kI8 = 6		kI8 = 6
};		};

/// Constructs a new sparse tensor. This is the "swiss army knife"		/// Constructs a new sparse tensor. This is the "swiss army knife"
/// method for materializing sparse tensors into the computation.		/// method for materializing sparse tensors into the computation.
/// action		/// action
/// 0 : ptr contains filename to read into storage		/// 0 : ptr contains filename to read into storage
/// 1 : ptr contains coordinate scheme to assign to storage		/// 1 : ptr contains coordinate scheme to assign to new storage
/// 2 : returns coordinate scheme to fill (call back later with 1)		/// 2 : returns empty coordinate scheme to fill (call back 1 to setup)
		/// 3 : returns coordinate scheme from storage in ptr (call back 1 to convert)
void newSparseTensor(uint8_t abase, uint8_t *adata, uint64_t aoff,		void newSparseTensor(uint8_t abase, uint8_t *adata, uint64_t aoff,
uint64_t asize, uint64_t astride, uint64_t *sbase,		uint64_t asize, uint64_t astride, uint64_t *sbase,
uint64_t *sdata, uint64_t soff, uint64_t ssize,		uint64_t *sdata, uint64_t soff, uint64_t ssize,
uint64_t sstride, uint64_t pbase, uint64_t pdata,		uint64_t sstride, uint64_t pbase, uint64_t pdata,
uint64_t poff, uint64_t psize, uint64_t pstride,		uint64_t poff, uint64_t psize, uint64_t pstride,
uint64_t ptrTp, uint64_t indTp, uint64_t valTp,		uint64_t ptrTp, uint64_t indTp, uint64_t valTp,
uint32_t action, void *ptr) {		uint32_t action, void *ptr) {
assert(astride == 1 && sstride == 1 && pstride == 1);		assert(astride == 1 && sstride == 1 && pstride == 1);
▲ Show 20 Lines • Show All 106 Lines • Show Last 20 Lines