This is an archive of the discontinued LLVM Phabricator instance.

[mlir][sparse] Sparse reduction in lex order no longer produces dense output
ClosedPublic

Authored by jim22k on Jan 18 2023, 12:02 PM.

Details

Summary

Previously, when performing a reduction on a sparse tensor, the result could
differ depending on the iteration order. With the expanded access pattern, an
empty row contributed no entry to the output, whereas with lex ordering the
reduction identity ended up in the output.

This change keeps track of whether any entries were actually reduced during
lex ordering, making the output consistent between the two iteration styles.

Diff Detail

Event Timeline

jim22k created this revision. Jan 18 2023, 12:02 PM
Herald added a project: Restricted Project. Jan 18 2023, 12:02 PM
jim22k requested review of this revision. Jan 18 2023, 12:02 PM

To show this in action, consider this function from test/Integration/Dialect/SparseTensor/CPU/sparse_reduce_custom.mlir:

func.func @redProdLex(%arga: tensor<?x?xf64, #CSR>) -> tensor<?xf64, #SparseVector> {
  %c0 = arith.constant 0 : index
  %cf1 = arith.constant 1.0 : f64
  %d0 = tensor.dim %arga, %c0 : tensor<?x?xf64, #CSR>
  %xv = bufferization.alloc_tensor(%d0): tensor<?xf64, #SparseVector>
  %0 = linalg.generic #trait_mat_reduce_rowwise
    ins(%arga: tensor<?x?xf64, #CSR>)
    outs(%xv: tensor<?xf64, #SparseVector>) {
      ^bb(%a: f64, %b: f64):
        %1 = sparse_tensor.reduce %a, %b, %cf1 : f64 {
            ^bb0(%x: f64, %y: f64):
              %2 = arith.mulf %x, %y : f64
              sparse_tensor.yield %2 : f64
          }
        linalg.yield %1 : f64
  } -> tensor<?xf64, #SparseVector>
  return %0 : tensor<?xf64, #SparseVector>
}

This will be converted to:

func.func @redProdLex(%arg0: tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>) -> tensor<?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "compressed" ] }>> {
  %c0 = arith.constant 0 : index
  %c1 = arith.constant 1 : index
  %false = arith.constant false
  %true = arith.constant true
  %cst = arith.constant 1.000000e+00 : f64
  %dim = tensor.dim %arg0, %c0 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
  %0 = bufferization.alloc_tensor(%dim) : tensor<?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "compressed" ] }>>
  %dim_0 = tensor.dim %arg0, %c0 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
  %1 = sparse_tensor.pointers %arg0 {dimension = 1 : index} : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>> to memref<?xindex>
  %2 = sparse_tensor.values %arg0 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>> to memref<?xf64>
  %3 = scf.for %arg1 = %c0 to %dim_0 step %c1 iter_args(%arg2 = %0) -> (tensor<?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "compressed" ] }>>) {
    %5 = memref.load %1[%arg1] : memref<?xindex>
    %6 = arith.addi %arg1, %c1 : index
    %7 = memref.load %1[%6] : memref<?xindex>
    %8:2 = scf.for %arg3 = %5 to %7 step %c1 iter_args(%arg4 = %cst, %arg5 = %false) -> (f64, i1) {
      %10 = memref.load %2[%arg3] : memref<?xf64>
      %11 = arith.mulf %10, %arg4 : f64
      scf.yield %11, %true : f64, i1
    } {"Emitted from" = "linalg.generic"}
    %9 = scf.if %8#1 -> (tensor<?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "compressed" ] }>>) {
      %10 = sparse_tensor.insert %8#0 into %arg2[%arg1] : tensor<?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "compressed" ] }>>
      scf.yield %10 : tensor<?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "compressed" ] }>>
    } else {
      scf.yield %arg2 : tensor<?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "compressed" ] }>>
    }
    scf.yield %9 : tensor<?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "compressed" ] }>>
  } {"Emitted from" = "linalg.generic"}
  %4 = sparse_tensor.load %3 hasInserts : tensor<?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "compressed" ] }>>
  return %4 : tensor<?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "compressed" ] }>>
}

The tracking begins at %8:2. Notice that the reduction identity %cst is carried through the loop together with %false. As soon as the for-loop executes at least one iteration, the second result becomes %true. This flag, recording that at least one stored entry was reduced, lets us either insert the reduction value (true branch of %9) or skip the insert (false branch of %9).
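In scalar form, the lowered loops above amount to the following C++ sketch (an illustration of the semantics only, not code from this patch; redProdLexSketch, rowPtr, values, and the result container are placeholder names for the CSR buffers and the sparse insertion). Without the flag, the old lex-order lowering inserted unconditionally, so a row with no stored entries ended up storing the identity 1.0.

#include <cstdint>
#include <utility>
#include <vector>

// Illustrative only: row-wise product reduction over a CSR matrix, mirroring
// the lowered scf.for loops above. An entry is added to the sparse result
// only if at least one stored value was actually reduced.
void redProdLexSketch(const std::vector<int64_t> &rowPtr,  // CSR row pointers
                      const std::vector<double> &values,   // CSR stored values
                      int64_t numRows,
                      std::vector<std::pair<int64_t, double>> &result) {
  for (int64_t i = 0; i < numRows; ++i) {
    double acc = 1.0;      // reduction identity (%cst)
    bool reduced = false;  // the carried i1 flag, starts as %false
    for (int64_t p = rowPtr[i]; p < rowPtr[i + 1]; ++p) {
      acc *= values[p];    // arith.mulf
      reduced = true;      // becomes %true after the first iteration
    }
    if (reduced)                    // scf.if on the flag (%9)
      result.emplace_back(i, acc);  // sparse_tensor.insert
    // Otherwise the row is skipped, so empty rows no longer produce 1.0.
  }
}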

The other use case is CSR @ CSC matrix multiplication, which is lowered using while-loops.

func.func @min_plus_csrcsc(%arga: tensor<?x?xf64, #CSR>,
                           %argb: tensor<?x?xf64, #CSC>) -> tensor<?x?xf64, #CSR> {
  %c0 = arith.constant 0 : index
  %c1 = arith.constant 1 : index
  %maxf = arith.constant 1.0e999 : f64
  %d0 = tensor.dim %arga, %c0 : tensor<?x?xf64, #CSR>
  %d1 = tensor.dim %argb, %c1 : tensor<?x?xf64, #CSC>
  %xm = bufferization.alloc_tensor(%d0, %d1) : tensor<?x?xf64, #CSR>
  %0 = linalg.generic #trait_matmul
     ins(%arga, %argb: tensor<?x?xf64, #CSR>, tensor<?x?xf64, #CSC>)
      outs(%xm: tensor<?x?xf64, #CSR>) {
      ^bb(%a: f64, %b: f64, %output: f64):
        %1 = sparse_tensor.binary %a, %b : f64, f64 to f64
          overlap = {
            ^bb0(%x: f64, %y: f64):
              %3 = arith.addf %x, %y : f64
              sparse_tensor.yield %3 : f64
          }
          left={}
          right={}
        %2 = sparse_tensor.reduce %1, %output, %maxf : f64 {
            ^bb0(%x: f64, %y: f64):
              %cmp = arith.cmpf "olt", %x, %y : f64
              %3 = arith.select %cmp, %x, %y : f64
              sparse_tensor.yield %3 : f64
          }
        linalg.yield %2 : f64
  } -> tensor<?x?xf64, #CSR>
  return %0 : tensor<?x?xf64, #CSR>
}

This is converted to:

func.func @min_plus_csrcsc(%arg0: tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>, %arg1: tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ], dimOrdering = affine_map<(d0, d1) -> (d1, d0)> }>>) -> tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>> {
  %c0 = arith.constant 0 : index
  %c1 = arith.constant 1 : index
  %false = arith.constant false
  %true = arith.constant true
  %cst = arith.constant 0x7FF0000000000000 : f64
  %dim = tensor.dim %arg0, %c0 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
  %dim_0 = tensor.dim %arg1, %c1 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ], dimOrdering = affine_map<(d0, d1) -> (d1, d0)> }>>
  %0 = bufferization.alloc_tensor(%dim, %dim_0) : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
  %dim_1 = tensor.dim %arg0, %c0 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
  %1 = sparse_tensor.pointers %arg0 {dimension = 1 : index} : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>> to memref<?xindex>
  %2 = sparse_tensor.indices %arg0 {dimension = 1 : index} : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>> to memref<?xindex>
  %3 = sparse_tensor.values %arg0 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>> to memref<?xf64>
  %dim_2 = tensor.dim %arg1, %c1 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ], dimOrdering = affine_map<(d0, d1) -> (d1, d0)> }>>
  %4 = sparse_tensor.pointers %arg1 {dimension = 1 : index} : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ], dimOrdering = affine_map<(d0, d1) -> (d1, d0)> }>> to memref<?xindex>
  %5 = sparse_tensor.indices %arg1 {dimension = 1 : index} : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ], dimOrdering = affine_map<(d0, d1) -> (d1, d0)> }>> to memref<?xindex>
  %6 = sparse_tensor.values %arg1 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ], dimOrdering = affine_map<(d0, d1) -> (d1, d0)> }>> to memref<?xf64>
  %7 = scf.for %arg2 = %c0 to %dim_1 step %c1 iter_args(%arg3 = %0) -> (tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>) {
    %9 = scf.for %arg4 = %c0 to %dim_2 step %c1 iter_args(%arg5 = %arg3) -> (tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>) {
      %10 = memref.load %1[%arg2] : memref<?xindex>
      %11 = arith.addi %arg2, %c1 : index
      %12 = memref.load %1[%11] : memref<?xindex>
      %13 = memref.load %4[%arg4] : memref<?xindex>
      %14 = arith.addi %arg4, %c1 : index
      %15 = memref.load %4[%14] : memref<?xindex>
      %16:5 = scf.while (%arg6 = %10, %arg7 = %13, %arg8 = %cst, %arg9 = %false, %arg10 = %arg5) : (index, index, f64, i1, tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>) -> (index, index, f64, i1, tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>) {
        %18 = arith.cmpi ult, %arg6, %12 : index
        %19 = arith.cmpi ult, %arg7, %15 : index
        %20 = arith.andi %18, %19 : i1
        scf.condition(%20) %arg6, %arg7, %arg8, %arg9, %arg10 : index, index, f64, i1, tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
      } do {
      ^bb0(%arg6: index, %arg7: index, %arg8: f64, %arg9: i1, %arg10: tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>):
        %18 = memref.load %2[%arg6] : memref<?xindex>
        %19 = memref.load %5[%arg7] : memref<?xindex>
        %20 = arith.cmpi ult, %19, %18 : index
        %21 = arith.select %20, %19, %18 : index
        %22 = arith.cmpi eq, %18, %21 : index
        %23 = arith.cmpi eq, %19, %21 : index
        %24 = arith.andi %22, %23 : i1
        %25:3 = scf.if %24 -> (f64, i1, tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>) {
          %32 = memref.load %3[%arg6] : memref<?xf64>
          %33 = memref.load %6[%arg7] : memref<?xf64>
          %34 = arith.addf %32, %33 : f64
          %35 = arith.cmpf olt, %34, %arg8 : f64
          %36 = arith.select %35, %34, %arg8 : f64
          scf.yield %36, %true, %arg10 : f64, i1, tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
        } else {
          scf.yield %arg8, %arg9, %arg10 : f64, i1, tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
        }
        %26 = arith.cmpi eq, %18, %21 : index
        %27 = arith.addi %arg6, %c1 : index
        %28 = arith.select %26, %27, %arg6 : index
        %29 = arith.cmpi eq, %19, %21 : index
        %30 = arith.addi %arg7, %c1 : index
        %31 = arith.select %29, %30, %arg7 : index
        scf.yield %28, %31, %25#0, %25#1, %25#2 : index, index, f64, i1, tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
      } attributes {"Emitted from" = "linalg.generic"}
      %17 = scf.if %16#3 -> (tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>) {
        %18 = sparse_tensor.insert %16#2 into %16#4[%arg2, %arg4] : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
        scf.yield %18 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
      } else {
        scf.yield %16#4 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
      }
      scf.yield %17 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
    } {"Emitted from" = "linalg.generic"}
    scf.yield %9 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
  } {"Emitted from" = "linalg.generic"}
  %8 = sparse_tensor.load %7 hasInserts : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
  return %8 : tensor<?x?xf64, #sparse_tensor.encoding<{ dimLevelType = [ "dense", "compressed" ] }>>
}

The important change is in %16:5, where %false is added to the while-loop's argument list. Later, in %25:3, the true branch represents an overlap in the dot product, and therefore a non-empty result, so it yields %true. The false branch simply forwards the previous "valid lex insert" flag. As long as at least one overlap has occurred, we perform the insert in %17; otherwise we skip it.
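In scalar form, the co-iteration corresponds roughly to this C++ sketch (again only an illustration of the lowering's semantics, not code from the patch; the function name and the pointer/index/value arrays are placeholders for the CSR/CSC buffers):

#include <cstdint>
#include <limits>

// Illustrative only: min-plus dot product of one row of A (CSR) with one
// column of B (CSC), mirroring the lowered scf.while loop above. The caller
// inserts the value into the output only when `overlapped` is true, matching
// the scf.if on %16#3.
struct DotResult {
  double value;
  bool overlapped;  // the "valid lex insert" flag carried by the while-loop
};

DotResult minPlusDotSketch(const int64_t *aIdx, const double *aVal,
                           int64_t aPos, int64_t aEnd,
                           const int64_t *bIdx, const double *bVal,
                           int64_t bPos, int64_t bEnd) {
  double acc = std::numeric_limits<double>::infinity();  // identity (%cst)
  bool overlapped = false;                               // starts as %false
  while (aPos < aEnd && bPos < bEnd) {     // scf.condition
    int64_t ia = aIdx[aPos], ib = bIdx[bPos];
    if (ia == ib) {                        // indices coincide: an overlap
      double sum = aVal[aPos] + bVal[bPos];  // the "plus" (arith.addf)
      acc = sum < acc ? sum : acc;           // the "min" (cmpf olt + select)
      overlapped = true;                     // flag flips to %true
    }
    if (ia <= ib) ++aPos;                  // advance whichever index is
    if (ib <= ia) ++bPos;                  // smaller (or both when equal)
  }
  return {acc, overlapped};
}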

This is good stuff! Thanks for adding, Jim, and sorry for taking a bit longer on my review.

mlir/lib/Dialect/SparseTensor/Transforms/CodegenEnv.cpp
56

Part of the CodegenEnv's job is also to verify consistency,

so perhaps an else assert(!redValidInsert) here?

151

This is good, and in line with the bookkeeping validation.
But can it be even stricter?
i.e. if isReduc(), do we need val?

EDIT: later, WDYT of a clearValidLexInsert()?

mlir/lib/Dialect/SparseTensor/Transforms/CodegenEnv.h
164

"to determine if" at first reading implies a boolean value, so please be a bit more specific (when not set ... when set ....)

mlir/lib/Dialect/SparseTensor/Transforms/Sparsification.cpp
733

Can you put more documentation here, i.e. pseudo-code? Just saying "true/false branch" is not very informative.

(and I agree that genInsertionStore could have used a lot more to start with, but we are now growing beyond what is easy to read, so bonus points for documenting some of the other branches too ;-)

968

perhaps we even want a clearValidLexInsert() for this? Also better error detection when the assert in codegenenv fails
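For illustration, the kind of bookkeeping a helper like clearValidLexInsert() suggests might look roughly like the following hypothetical sketch (the class name and the update/get method names are assumptions, not the actual CodegenEnv interface landed with this patch):

#include <cassert>
#include "mlir/IR/Value.h"

// Hypothetical sketch of "valid lex insert" bookkeeping; the checks and
// names here may not match the real CodegenEnv in this revision.
class CodegenEnvSketch {
public:
  // Record the i1 SSA value that tracks whether a lex insert is valid.
  void updateValidLexInsert(mlir::Value val) { validLexInsert = val; }
  // Explicitly drop the flag once the reduction that owned it has ended,
  // so a stale value is never reused for an unrelated insertion.
  void clearValidLexInsert() { validLexInsert = mlir::Value(); }
  mlir::Value getValidLexInsert() const {
    assert(validLexInsert && "valid-lex-insert flag requested but never set");
    return validLexInsert;
  }

private:
  mlir::Value validLexInsert;  // empty when no insert is being tracked
};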

jim22k updated this revision to Diff 495872. Feb 8 2023, 9:13 AM
jim22k marked 5 inline comments as done.
Add clearValidLexInsert() and improve comments
aartbik accepted this revision. Feb 9 2023, 2:04 PM

Solid work. Thanks!

mlir/lib/Dialect/SparseTensor/Transforms/CodegenEnv.cpp
156

yeah, very nice!

This revision is now accepted and ready to land. Feb 9 2023, 2:04 PM
This revision was landed with ongoing or failed builds. Feb 10 2023, 11:09 AM
This revision was automatically updated to reflect the committed changes.