This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/Dialect/Affine/
-
mlir/
-
Dialect/
-
Affine/
-
Analysis/
2/2
AffineStructures.h
-
IR/
1/1
AffineOps.h
-
lib/Dialect/Affine/
-
Dialect/
-
Affine/
-
Analysis/
10/10
AffineAnalysis.cpp
8/8
AffineStructures.cpp
-
IR/
7/7
AffineOps.cpp
-
test/Transforms/
-
Transforms/
2/2
memref-dependence-check.mlir

Differential D136056

[mlir][affine] Support affine.parallel in the index set analysis
ClosedPublic

Authored by Lewuathe on Oct 16 2022, 11:29 PM.

Download Raw Diff

Details

Reviewers

ftynse
dcaballe
bondhugula
nicolasvasilache
arjunp
Groverkss

Commits

rG1d541bd92044: [mlir][affine] Support affine.parallel in the index set analysis

Summary

Support affine.parallel in the index set analysis. It allows us to do dependence analysis containing affine.parallel in addition to affine.for and affine.if. This change only supports the constant lower/upper bound in affine.parallel. Other complicated affine map bounds will be supported in further commits.

See https://github.com/llvm/llvm-project/issues/57327

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

Lewuathe created this revision.Oct 16 2022, 11:29 PM

Herald added a reviewer: bondhugula. · View Herald TranscriptOct 16 2022, 11:29 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: bzcheeseman, arjunp, sdasgup3 and 21 others. · View Herald Transcript

Lewuathe requested review of this revision.Oct 16 2022, 11:29 PM

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptOct 16 2022, 11:29 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald Transcript

Lewuathe edited the summary of this revision. (Show Details)Oct 16 2022, 11:30 PM

Lewuathe edited the summary of this revision. (Show Details)Oct 16 2022, 11:33 PM

Harbormaster completed remote builds in B192436: Diff 468123.Oct 16 2022, 11:49 PM

Groverkss added reviewers: arjunp, Groverkss.Oct 17 2022, 2:42 AM

Some general code-style comments to start the review.

mlir/lib/Dialect/Affine/Analysis/AffineAnalysis.cpp
246–248	Please update this message.
254–255	Any reason for not using `else if`?
271–273	nit: remove braces here.
mlir/lib/Dialect/Affine/Analysis/AffineStructures.cpp
654	Is there a reason for using `auto` here and not the actual type? I think generally the actual type is prefered.
655–660	nit: remove braces here.
mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2330–2331	nit: there is no need for a newline before the if.
2332–2334	nit: remove braces.

Groverkss added inline comments.Oct 17 2022, 3:33 AM

mlir/lib/Dialect/Affine/Analysis/AffineAnalysis.cpp
647–649	I don't think this ordering is valid for affine.parallel since it does not define any lexicographic order. For example: affine.parallel (%i) = (0) to (10) { %1 = affine.load %m[9 - %i] affine.store %1, [%i] } There can be a dependence from %i = 9 to %i = 0, since no lexicographic order is defined.

Apply format

Lewuathe added inline comments.Oct 17 2022, 9:51 PM

mlir/lib/Dialect/Affine/Analysis/AffineAnalysis.cpp
647–649	I see. We may need to skip this constraint in the case of `affine.parallel` in the middle.

Harbormaster completed remote builds in B192651: Diff 468412.Oct 17 2022, 10:14 PM

Fix format issues

Harbormaster completed remote builds in B192908: Diff 468781.Oct 18 2022, 10:17 PM

Add test for checking lexicographical order

Lewuathe added inline comments.Oct 28 2022, 12:07 AM

mlir/lib/Dialect/Affine/Analysis/AffineAnalysis.cpp

647–649

I think there should be a dependency from the store to write (e.g. store %m[0] (%i=0) to load %m[0] (%i = 9). But this case is not properly checked.

func.func @no_lexicographic_order() {
  %m = memref.alloc() : memref<10xf32>
  %c7 = arith.constant 7.0 : f32
  affine.parallel (%i) = (0) to (10) {
    %1 = affine.load %m[9 - %i] : memref<10xf32>
    // expected-remark@above {{dependence from 0 to 0 at depth 1 = false}}
    // expected-remark@above {{dependence from 0 to 1 at depth 1 = true}}
    affine.store %1, %m[%i] : memref<10xf32>
    // expected-remark@above {{dependence from 1 to 0 at depth 1 = false}}
    // expected-remark@above {{dependence from 1 to 1 at depth 1 = false}}
  }
  return
}

Harbormaster completed remote builds in B194840: Diff 471416.Oct 28 2022, 12:23 AM

Thanks for contributing this support. This was long overdue. While this is fine, I feel just directly supporting the general (arbitrary bound case) is better -- since the affine.for support already supports that, and we may have to again throw away most of this code when handling the general bound case. Just supporting the constant bound case to start with isn't an incremental step.

mlir/lib/Dialect/Affine/Analysis/AffineAnalysis.cpp
253	Use comments on top (LLVM/MLIR style), not trailing.
261	May be good to fix the comment while on this: associated -> associating
mlir/lib/Dialect/Affine/Analysis/AffineStructures.cpp
650	Improve the assertion message here: `variable expected for the IV value`?
655–658	Negate condition and unguard `addBound`.
661–664	Likewise.

Herald added a subscriber: Moerafaat. · View Herald TranscriptOct 31 2022, 11:28 PM

@bondhugula Thank you for the comments. Sorry for taking the time to work on this. I was struggling with how to treat the ordering constraint in the affine.parallel. (Just removing the ordering constraint does not work as expected).

So for now, should we discard this patch completely and restart working on supporting general case?

In D136056#3910972, @Lewuathe wrote:

@bondhugula Thank you for the comments. Sorry for taking the time to work on this. I was struggling with how to treat the ordering constraint in the affine.parallel. (Just removing the ordering constraint does not work as expected).

So for now, should we discard this patch completely and restart working on supporting general case?

It would be good to support the generic case. Parts of this patch are still useful towards that. I didn't understand the issue related to "ordering constraint". You would order the dimensions in the order in which they appear in the affine.parallel.

In D136056#3911760, @bondhugula wrote:

In D136056#3910972, @Lewuathe wrote:

@bondhugula Thank you for the comments. Sorry for taking the time to work on this. I was struggling with how to treat the ordering constraint in the affine.parallel. (Just removing the ordering constraint does not work as expected).

So for now, should we discard this patch completely and restart working on supporting general case?

It would be good to support the generic case. Parts of this patch are still useful towards that. I didn't understand the issue related to "ordering constraint". You would order the dimensions in the order in which they appear in the affine.parallel.

I think the "ordering constraint" refers to the dependence analysis, since affine.parallel does not define a schedule for iterations, we cannot impose ordering constraints based on lexicographic order as it currently does for every induction variable.

I think the patch is good other than the dependence analysis part since that needs more discussion. For example, checkMemRefDependence takes a loop depth. Currently, loop depth implies the induction variable at that position. What would that mean after introducing affine.paralllel to the dependence analysis? Would it mean after affine.parallel is allowed? Would it mean the loop or the induction variable? I think some things need to be discussed more before we support it.

I would be happy to review the patch if you send the generic case without dependence analysis.

@bondhugula @Groverkss Thanks again. I'd like to work on the general cases for supporting affine.parallel in the getIndexSet.

But I have one question about the test for that case. Where should we write the test if we do not support the dependence analysis for affine.parallel for the time being? Is there any existing test checking the getIndexSet functionality?

In D136056#3914145, @Lewuathe wrote:

@bondhugula @Groverkss Thanks again. I'd like to work on the general cases for supporting affine.parallel in the getIndexSet.

But I have one question about the test for that case. Where should we write the test if we do not support the dependence analysis for affine.parallel for the time being? Is there any existing test checking the getIndexSet functionality?

If we don't support it temporarily, you can always return failure() from getIndexSet which will cleanly fail the dependence analysis check. There is a status for "dependence analysis failed" that gets returned already.

@bondhugula Okay, I'll return failure() to unsupport the dependence analysis with affine.parallel for the time being. Thanks!

Support non-constant constraints.

Post review follow-up.

Harbormaster completed remote builds in B197675: Diff 475346.Nov 14 2022, 11:07 PM

Apply format.

Harbormaster completed remote builds in B197888: Diff 475650.Nov 15 2022, 6:18 PM

Groverkss added inline comments.Nov 16 2022, 2:56 AM

mlir/include/mlir/Dialect/Affine/Analysis/AffineStructures.h
148	Is this stil lvalid?
149	Is it required that the variable exists in the constraint system? The documentation does not make this clear.
mlir/lib/Dialect/Affine/Analysis/AffineAnalysis.cpp
617	This statement is misleading. Dependence analysis isn't implemented for affine.parallel. This statement claims it's not possible. Please add a TODO instead explaining why we did not implement it right now, which is affine.parallel does not specify an ordering.
mlir/lib/Dialect/Affine/Analysis/AffineStructures.cpp
654	Is there any reason we specialize for the constant case? If there is, could you write a comment explaining why we specialize? It is not obvious to me.

Lewuathe added inline comments.Nov 16 2022, 9:09 PM

mlir/lib/Dialect/Affine/Analysis/AffineStructures.cpp
654	The `addBound` method depends on whether the bound is constant. In the case of non-constant, we need to give operands explicitly. We use this pattern for `affine.for` and it's also the case for `affine.parallel` too.

Refine TODO comments.

Harbormaster completed remote builds in B198120: Diff 476000.Nov 16 2022, 9:35 PM

The added code isn't exercised by any test case, right? Is this ready for review?

mlir/include/mlir/Dialect/Affine/IR/AffineOps.h
455	What can `operations` contain? `operations` -> `affineOps`?
mlir/lib/Dialect/Affine/Analysis/AffineAnalysis.cpp
273	Can use `auto` here since the type is on the RHS.
mlir/lib/Dialect/Affine/Analysis/AffineStructures.cpp
646	The debug message isn't helpful here. Suggest adding more text to it.
mlir/lib/Dialect/Affine/IR/AffineOps.cpp
259–264	This is nothing but checking `getParentOpOfType<AffineParallelOp>()` is null. We don't need to add a public method!
2325	Pass output argument by reference (MLIR convention).
2329	Use `auto` here.
2331	Use braces here per new LLVM/MLIR style.

Herald added a subscriber: jsetoain. · View Herald TranscriptNov 23 2022, 5:31 AM

Thank you for taking a look again.

I'll try to add a test case covering getIndexSet somehow. But there seems to be no dedicated test case for getIndexSet. Test cases for dependent analysis only check the logic of getIndexSet. Do you know whether there is already any test code covering getIndexSet usage?

In D136056#3923319, @bondhugula wrote:

In D136056#3914145, @Lewuathe wrote:

@bondhugula @Groverkss Thanks again. I'd like to work on the general cases for supporting affine.parallel in the getIndexSet.

But I have one question about the test for that case. Where should we write the test if we do not support the dependence analysis for affine.parallel for the time being? Is there any existing test checking the getIndexSet functionality?

If we don't support it temporarily, you can always return failure() from getIndexSet which will cleanly fail the dependence analysis check. There is a status for "dependence analysis failed" that gets returned already.

In D136056#3948381, @Lewuathe wrote:

Thank you for taking a look again.

I'll try to add a test case covering getIndexSet somehow. But there seems to be no dedicated test case for getIndexSet. Test cases for dependent analysis only check the logic of getIndexSet. Do you know whether there is already any test code covering getIndexSet usage?

I didn't understand - you don't need to add test cases for the utility. Just test dependence analysis in the presence of affine.parallel. This can be done via the affine-scalrep pass or test-memref-dependence-check.

Add test cases and reflect the latest reviews.

@bondhugula

Sorry for the confusion. I just wanted to know how/where to add test cases for the utility functions like this. But I've found a way to test the case including affine.parallel somehow for dependence check and scalrep. I'm so sorry for bothering you many times but could you review this change again when you get a chance?

Harbormaster completed remote builds in B199472: Diff 477855.Nov 24 2022, 9:51 PM

Can you please resolve the other remaining comments and mark them done before a round of review? For eg. all comments in AffineStructures.cpp many of which are trivial are still pending.

mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2332–2334	Drop trivial braces.
mlir/test/Transforms/memref-dependence-check.mlir
1078–1080	There seems to be something incorrect here. Both these "expected" checks are pointing to the same line (@above from below is the same as +1 here).
mlir/test/lib/Analysis/TestMemRefDependenceCheck.cpp
90 ↗	(On Diff #477855)	Thank you for improving/fixing this.

Validate affine.parallel existence strictly in memref dependence check.

Lewuathe marked 27 inline comments as done.Nov 29 2022, 10:29 PM

Lewuathe added inline comments.

mlir/test/Transforms/memref-dependence-check.mlir
1078–1080	We should have checked the `affine.parallel` existence before returning no dependence when there is no `AffineWriteOpInterface`.

Lewuathe marked an inline comment as done.Nov 29 2022, 10:30 PM

Harbormaster completed remote builds in B200173: Diff 478805.Nov 29 2022, 10:47 PM

This looks great to me - thanks! A comment on some missing test cases below.

mlir/test/Dialect/Affine/scalrep.mlir
796 ↗	(On Diff #478805)	Missing test case for non-constant loop bound case and mixed affine.parallel + affine.for case. (For eg., affine.for surrounded by affine.parallel or vice versa) -- either here or in the dependence check test case file.

This revision is now accepted and ready to land.Dec 3 2022, 6:46 AM

Post review follow-up

Lewuathe marked an inline comment as done.Dec 4 2022, 3:36 AM

This revision was landed with ongoing or failed builds.Dec 4 2022, 3:51 AM

Closed by commit rG1d541bd92044: [mlir][affine] Support affine.parallel in the index set analysis (authored by Lewuathe). · Explain Why

This revision was automatically updated to reflect the committed changes.

Lewuathe added a commit: rG1d541bd92044: [mlir][affine] Support affine.parallel in the index set analysis.

Harbormaster completed remote builds in B200965: Diff 479910.Dec 4 2022, 4:11 AM

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

Affine/

Analysis/

AffineStructures.h

7 lines

IR/

AffineOps.h

5 lines

lib/

Dialect/

Affine/

Analysis/

AffineAnalysis.cpp

26 lines

AffineStructures.cpp

27 lines

IR/

AffineOps.cpp

14 lines

test/

Transforms/

memref-dependence-check.mlir

96 lines

Diff 468781

mlir/include/mlir/Dialect/Affine/Analysis/AffineStructures.h

Show All 18 Lines
#include "mlir/IR/OpDefinition.h"		#include "mlir/IR/OpDefinition.h"
#include "mlir/Support/LogicalResult.h"		#include "mlir/Support/LogicalResult.h"

namespace mlir {		namespace mlir {

class AffineCondition;		class AffineCondition;
class AffineForOp;		class AffineForOp;
class AffineIfOp;		class AffineIfOp;
		class AffineParallelOp;
class AffineMap;		class AffineMap;
class AffineValueMap;		class AffineValueMap;
class IntegerSet;		class IntegerSet;
class MLIRContext;		class MLIRContext;
class Value;		class Value;
class MemRefType;		class MemRefType;
struct MutableAffineMap;		struct MutableAffineMap;

▲ Show 20 Lines • Show All 101 Lines • ▼ Show 20 Lines	public:
/// constraint system. Returns failure for the yet unimplemented/unsupported		/// constraint system. Returns failure for the yet unimplemented/unsupported
/// cases. Any new variables that are found in the bound operands of the		/// cases. Any new variables that are found in the bound operands of the
/// 'affine.for' operation are added as trailing variables (either		/// 'affine.for' operation are added as trailing variables (either
/// dimensional or symbolic depending on whether the operand is a valid		/// dimensional or symbolic depending on whether the operand is a valid
/// symbol).		/// symbol).
// TODO: add support for non-unit strides.		// TODO: add support for non-unit strides.
LogicalResult addAffineForOpDomain(AffineForOp forOp);		LogicalResult addAffineForOpDomain(AffineForOp forOp);

		/// Add constraints (lower and upper bounds) for the specified
		/// 'affine.parallel' operation's Value using IR information stored in its
		/// bound maps. Returns failure for the yet unimplemented/unsupported cases.
		/// TODO: Support non-constant lower/upper bounds.
		GroverkssUnsubmitted Done Reply Inline Actions Is this stil lvalid? Groverkss: Is this stil lvalid?
		LogicalResult addAffineParallelOpDomain(AffineParallelOp parallelOp);
		GroverkssUnsubmitted Done Reply Inline Actions Is it required that the variable exists in the constraint system? The documentation does not make this clear. Groverkss: Is it required that the variable exists in the constraint system? The documentation does not…

/// Adds constraints (lower and upper bounds) for each loop in the loop nest		/// Adds constraints (lower and upper bounds) for each loop in the loop nest
/// described by the bound maps `lbMaps` and `ubMaps` of a computation slice.		/// described by the bound maps `lbMaps` and `ubMaps` of a computation slice.
/// Every pair (`lbMaps[i]`, `ubMaps[i]`) describes the bounds of a loop in		/// Every pair (`lbMaps[i]`, `ubMaps[i]`) describes the bounds of a loop in
/// the nest, sorted outer-to-inner. `operands` contains the bound operands		/// the nest, sorted outer-to-inner. `operands` contains the bound operands
/// for a single bound map. All the bound maps will use the same bound		/// for a single bound map. All the bound maps will use the same bound
/// operands. Note that some loops described by a computation slice might not		/// operands. Note that some loops described by a computation slice might not
/// exist yet in the IR so the Value attached to those dimension variables		/// exist yet in the IR so the Value attached to those dimension variables
/// might be empty. For that reason, this method doesn't perform Value		/// might be empty. For that reason, this method doesn't perform Value
▲ Show 20 Lines • Show All 522 Lines • Show Last 20 Lines

mlir/include/mlir/Dialect/Affine/IR/AffineOps.h

	Show First 20 Lines • Show All 444 Lines • ▼ Show 20 Lines
	/// not an induction variable, then return nullptr.			/// not an induction variable, then return nullptr.
	AffineForOp getForInductionVarOwner(Value val);			AffineForOp getForInductionVarOwner(Value val);

	/// Extracts the induction variables from a list of AffineForOps and places them			/// Extracts the induction variables from a list of AffineForOps and places them
	/// in the output argument `ivs`.			/// in the output argument `ivs`.
	void extractForInductionVars(ArrayRef<AffineForOp> forInsts,			void extractForInductionVars(ArrayRef<AffineForOp> forInsts,
	SmallVectorImpl<Value> *ivs);			SmallVectorImpl<Value> *ivs);

				/// Extracts the induction variables from a list of either AffineForOp or
				/// AffineParallelOp and places them in the output argument `ivs`.
				void extractInductionVars(ArrayRef<Operation *> operations,
				bondhugulaUnsubmitted Done Reply Inline Actions What can `operations` contain? `operations` -> `affineOps`? bondhugula: What can `operations` contain? `operations` -> `affineOps`?
				SmallVectorImpl<Value> *ivs);

	/// Builds a perfect nest of affine.for loops, i.e., each loop except the			/// Builds a perfect nest of affine.for loops, i.e., each loop except the
	/// innermost one contains only another loop and a terminator. The loops iterate			/// innermost one contains only another loop and a terminator. The loops iterate
	/// from "lbs" to "ubs" with "steps". The body of the innermost loop is			/// from "lbs" to "ubs" with "steps". The body of the innermost loop is
	/// populated by calling "bodyBuilderFn" and providing it with an OpBuilder, a			/// populated by calling "bodyBuilderFn" and providing it with an OpBuilder, a
	/// Location and a list of loop induction variables.			/// Location and a list of loop induction variables.
	void buildAffineLoopNest(OpBuilder &builder, Location loc,			void buildAffineLoopNest(OpBuilder &builder, Location loc,
	ArrayRef<int64_t> lbs, ArrayRef<int64_t> ubs,			ArrayRef<int64_t> lbs, ArrayRef<int64_t> ubs,
	ArrayRef<int64_t> steps,			ArrayRef<int64_t> steps,
	▲ Show 20 Lines • Show All 58 Lines • Show Last 20 Lines

mlir/lib/Dialect/Affine/Analysis/AffineAnalysis.cpp

Show First 20 Lines • Show All 234 Lines • ▼ Show 20 Lines
// the yet unimplemented cases.		// the yet unimplemented cases.
// TODO: Handle non-unit steps through local variables or stride information in		// TODO: Handle non-unit steps through local variables or stride information in
// FlatAffineValueConstraints. (For eg., by using iv - lb % step = 0 and/or by		// FlatAffineValueConstraints. (For eg., by using iv - lb % step = 0 and/or by
// introducing a method in FlatAffineValueConstraints		// introducing a method in FlatAffineValueConstraints
// setExprStride(ArrayRef<int64_t> expr, int64_t stride)		// setExprStride(ArrayRef<int64_t> expr, int64_t stride)
LogicalResult mlir::getIndexSet(MutableArrayRef<Operation *> ops,		LogicalResult mlir::getIndexSet(MutableArrayRef<Operation *> ops,
FlatAffineValueConstraints *domain) {		FlatAffineValueConstraints *domain) {
SmallVector<Value, 4> indices;		SmallVector<Value, 4> indices;
SmallVector<AffineForOp, 8> forOps;		SmallVector<Operation *, 8> loopOps;
		size_t numDims = 0;
for (Operation *op : ops) {		for (Operation *op : ops) {
if (!isa<AffineForOp, AffineIfOp>(op)) {		if (!isa<AffineForOp, AffineIfOp, AffineParallelOp>(op)) {
// TODO: Support affine.parallel ops.		LLVM_DEBUG(llvm::dbgs() << "getIndexSet only handles affine.for/if/"
LLVM_DEBUG(llvm::dbgs() << "getIndexSet only handles affine.for/if ops");		"parallel ops");
		GroverkssUnsubmitted Done Reply Inline Actions Please update this message. Groverkss: Please update this message.
return failure();		return failure();
}		}
if (AffineForOp forOp = dyn_cast<AffineForOp>(op))		if (AffineForOp forOp = dyn_cast<AffineForOp>(op)) {
forOps.push_back(forOp);		loopOps.push_back(forOp);
		numDims += 1; // An AffineForOp retains only 1 induction variable.
		bondhugulaUnsubmitted Done Reply Inline Actions Use comments on top (LLVM/MLIR style), not trailing. bondhugula: Use comments on top (LLVM/MLIR style), not trailing.
		}
		else if (AffineParallelOp parallelOp = dyn_cast<AffineParallelOp>(op)) {
		GroverkssUnsubmitted Done Reply Inline Actions Any reason for not using `else if`? Groverkss: Any reason for not using `else if`?
		loopOps.push_back(parallelOp);
		numDims += parallelOp.getNumDims();
}		}
extractForInductionVars(forOps, &indices);		}
		extractInductionVars(loopOps, &indices);
// Reset while associated Values in 'indices' to the domain.		// Reset while associated Values in 'indices' to the domain.
		bondhugulaUnsubmitted Done Reply Inline Actions May be good to fix the comment while on this: associated -> associating bondhugula: May be good to fix the comment while on this: associated -> associating
domain->reset(forOps.size(), /numSymbols=/0, /numLocals=/0, indices);		domain->reset(numDims, /numSymbols=/0, /numLocals=/0, indices);
for (Operation *op : ops) {		for (Operation *op : ops) {
// Add constraints from forOp's bounds.		// Add constraints from forOp's bounds.
if (AffineForOp forOp = dyn_cast<AffineForOp>(op)) {		if (AffineForOp forOp = dyn_cast<AffineForOp>(op)) {
if (failed(domain->addAffineForOpDomain(forOp)))		if (failed(domain->addAffineForOpDomain(forOp)))
return failure();		return failure();
} else if (AffineIfOp ifOp = dyn_cast<AffineIfOp>(op)) {		} else if (AffineIfOp ifOp = dyn_cast<AffineIfOp>(op)) {
domain->addAffineIfOpDomain(ifOp);		domain->addAffineIfOpDomain(ifOp);
		} else if (AffineParallelOp parallelOp = dyn_cast<AffineParallelOp>(op)) {
		if (failed(domain->addAffineParallelOpDomain(parallelOp)))
		return failure();
}		}
		GroverkssUnsubmitted Done Reply Inline Actions nit: remove braces here. Groverkss: nit: remove braces here.
		bondhugulaUnsubmitted Done Reply Inline Actions Can use `auto` here since the type is on the RHS. bondhugula: Can use `auto` here since the type is on the RHS.
}		}
return success();		return success();
}		}

/// Computes the iteration domain for 'op' and populates 'indexSet', which		/// Computes the iteration domain for 'op' and populates 'indexSet', which
/// encapsulates the constraints involving loops surrounding 'op' and		/// encapsulates the constraints involving loops surrounding 'op' and
/// potentially involving any Function symbols. The dimensional variables in		/// potentially involving any Function symbols. The dimensional variables in
/// 'indexSet' correspond to the loops surrounding 'op' from outermost to		/// 'indexSet' correspond to the loops surrounding 'op' from outermost to
▲ Show 20 Lines • Show All 327 Lines • ▼ Show 20 Lines	DependenceResult mlir::checkMemrefAccessDependence(
if (!allowRAR && !isa<AffineWriteOpInterface>(srcAccess.opInst) &&		if (!allowRAR && !isa<AffineWriteOpInterface>(srcAccess.opInst) &&
!isa<AffineWriteOpInterface>(dstAccess.opInst))		!isa<AffineWriteOpInterface>(dstAccess.opInst))
return DependenceResult::NoDependence;		return DependenceResult::NoDependence;

// We can't analyze further if the ops lie in different affine scopes.		// We can't analyze further if the ops lie in different affine scopes.
if (getAffineScope(srcAccess.opInst) != getAffineScope(dstAccess.opInst))		if (getAffineScope(srcAccess.opInst) != getAffineScope(dstAccess.opInst))
return DependenceResult::Failure;		return DependenceResult::Failure;

// Create access relation from each MemRefAccess.		// Create access relation from each MemRefAccess.
		GroverkssUnsubmitted Done Reply Inline Actions This statement is misleading. Dependence analysis isn't implemented for affine.parallel. This statement claims it's not possible. Please add a TODO instead explaining why we did not implement it right now, which is affine.parallel does not specify an ordering. Groverkss: This statement is misleading. Dependence analysis isn't implemented for affine.parallel. This…
FlatAffineRelation srcRel, dstRel;		FlatAffineRelation srcRel, dstRel;
if (failed(srcAccess.getAccessRelation(srcRel)))		if (failed(srcAccess.getAccessRelation(srcRel)))
return DependenceResult::Failure;		return DependenceResult::Failure;
if (failed(dstAccess.getAccessRelation(dstRel)))		if (failed(dstAccess.getAccessRelation(dstRel)))
return DependenceResult::Failure;		return DependenceResult::Failure;

FlatAffineValueConstraints srcDomain = srcRel.getDomainSet();		FlatAffineValueConstraints srcDomain = srcRel.getDomainSet();
FlatAffineValueConstraints dstDomain = dstRel.getDomainSet();		FlatAffineValueConstraints dstDomain = dstRel.getDomainSet();
Show All 13 Lines	DependenceResult mlir::checkMemrefAccessDependence(
// Compute the dependence relation by composing `srcRel` with the inverse of		// Compute the dependence relation by composing `srcRel` with the inverse of
// `dstRel`. Doing this builds a relation between iteration domain of		// `dstRel`. Doing this builds a relation between iteration domain of
// `srcAccess` to the iteration domain of `dstAccess` which access the same		// `srcAccess` to the iteration domain of `dstAccess` which access the same
// memory locations.		// memory locations.
dstRel.inverse();		dstRel.inverse();
dstRel.compose(srcRel);		dstRel.compose(srcRel);
*dependenceConstraints = dstRel;		*dependenceConstraints = dstRel;

// Add 'src' happens before 'dst' ordering constraints.		// Add 'src' happens before 'dst' ordering constraints.
addOrderingConstraints(srcDomain, dstDomain, loopDepth,		addOrderingConstraints(srcDomain, dstDomain, loopDepth,
dependenceConstraints);		dependenceConstraints);
		GroverkssUnsubmitted Done Reply Inline Actions I don't think this ordering is valid for affine.parallel since it does not define any lexicographic order. For example: affine.parallel (%i) = (0) to (10) { %1 = affine.load %m[9 - %i] affine.store %1, [%i] } There can be a dependence from %i = 9 to %i = 0, since no lexicographic order is defined. Groverkss: I don't think this ordering is valid for affine.parallel since it does not define any…
		LewuatheAuthorUnsubmitted Done Reply Inline Actions I see. We may need to skip this constraint in the case of `affine.parallel` in the middle. Lewuathe: I see. We may need to skip this constraint in the case of `affine.parallel` in the middle.
		LewuatheAuthorUnsubmitted Done Reply Inline Actions I think there should be a dependency from the store to write (e.g. store %m[0] (%i=0) to load %m[0] (%i = 9). But this case is not properly checked. func.func @no_lexicographic_order() { %m = memref.alloc() : memref<10xf32> %c7 = arith.constant 7.0 : f32 affine.parallel (%i) = (0) to (10) { %1 = affine.load %m[9 - %i] : memref<10xf32> // expected-remark@above {{dependence from 0 to 0 at depth 1 = false}} // expected-remark@above {{dependence from 0 to 1 at depth 1 = true}} affine.store %1, %m[%i] : memref<10xf32> // expected-remark@above {{dependence from 1 to 0 at depth 1 = false}} // expected-remark@above {{dependence from 1 to 1 at depth 1 = false}} } return } Lewuathe: I think there should be a dependency from the store to write (e.g. store %m[0] (%i=0) to load…

// Return 'NoDependence' if the solution space is empty: no dependence.		// Return 'NoDependence' if the solution space is empty: no dependence.
if (dependenceConstraints->isEmpty())		if (dependenceConstraints->isEmpty())
return DependenceResult::NoDependence;		return DependenceResult::NoDependence;

// Compute dependence direction vector and return true.		// Compute dependence direction vector and return true.
if (dependenceComponents != nullptr)		if (dependenceComponents != nullptr)
computeDirectionVector(srcDomain, dstDomain, loopDepth,		computeDirectionVector(srcDomain, dstDomain, loopDepth,
Show All 40 Lines

mlir/lib/Dialect/Affine/Analysis/AffineStructures.cpp

Show First 20 Lines • Show All 633 Lines • ▼ Show 20 Lines	if (forOp.hasConstantUpperBound()) {
addBound(BoundType::UB, pos, forOp.getConstantUpperBound() - 1);		addBound(BoundType::UB, pos, forOp.getConstantUpperBound() - 1);
return success();		return success();
}		}
// Non-constant upper bound case.		// Non-constant upper bound case.
return addBound(BoundType::UB, pos, forOp.getUpperBoundMap(),		return addBound(BoundType::UB, pos, forOp.getUpperBoundMap(),
forOp.getUpperBoundOperands());		forOp.getUpperBoundOperands());
}		}

		// TODO: Support non-constant upper/lower bounds.
		LogicalResult FlatAffineValueConstraints::addAffineParallelOpDomain(
		AffineParallelOp parallelOp) {
		size_t ivPos = 0;
		for (auto iv : parallelOp.getIVs()) {
		bondhugulaUnsubmitted Done Reply Inline Actions The debug message isn't helpful here. Suggest adding more text to it. bondhugula: The debug message isn't helpful here. Suggest adding more text to it.
		LLVM_DEBUG(iv.dump(););
		unsigned pos;
		if (!findVar(iv, &pos)) {
		assert(false && "Value not found");
		bondhugulaUnsubmitted Done Reply Inline Actions Improve the assertion message here: `variable expected for the IV value`? bondhugula: Improve the assertion message here: `variable expected for the IV value`?
		return failure();
		}

		AffineMap lowerBound = parallelOp.getLowerBoundMap(ivPos);
		GroverkssUnsubmitted Done Reply Inline Actions Is there a reason for using `auto` here and not the actual type? I think generally the actual type is prefered. Groverkss: Is there a reason for using `auto` here and not the actual type? I think generally the actual…
		GroverkssUnsubmitted Done Reply Inline Actions Is there any reason we specialize for the constant case? If there is, could you write a comment explaining why we specialize? It is not obvious to me. Groverkss: Is there any reason we specialize for the constant case? If there is, could you write a comment…
		LewuatheAuthorUnsubmitted Done Reply Inline Actions The `addBound` method depends on whether the bound is constant. In the case of non-constant, we need to give operands explicitly. We use this pattern for `affine.for` and it's also the case for `affine.parallel` too. Lewuathe: The `addBound` method depends on whether the bound is constant. In the case of non-constant, we…
		if (lowerBound.isConstant())
		addBound(BoundType::LB, pos, lowerBound.getSingleConstantResult());
		else
		return failure();
		bondhugulaUnsubmitted Done Reply Inline Actions Negate condition and unguard `addBound`. bondhugula: Negate condition and unguard `addBound`.

		auto upperBound = parallelOp.getUpperBoundMap(ivPos);
		GroverkssUnsubmitted Done Reply Inline Actions nit: remove braces here. Groverkss: nit: remove braces here.
		if (upperBound.isConstant())
		addBound(BoundType::UB, pos, upperBound.getSingleConstantResult());
		else
		return failure();
		bondhugulaUnsubmitted Done Reply Inline Actions Likewise. bondhugula: Likewise.
		}
		return success();
		}

LogicalResult		LogicalResult
FlatAffineValueConstraints::addDomainFromSliceMaps(ArrayRef<AffineMap> lbMaps,		FlatAffineValueConstraints::addDomainFromSliceMaps(ArrayRef<AffineMap> lbMaps,
ArrayRef<AffineMap> ubMaps,		ArrayRef<AffineMap> ubMaps,
ArrayRef<Value> operands) {		ArrayRef<Value> operands) {
assert(lbMaps.size() == ubMaps.size());		assert(lbMaps.size() == ubMaps.size());
assert(lbMaps.size() <= getNumDimVars());		assert(lbMaps.size() <= getNumDimVars());

for (unsigned i = 0, e = lbMaps.size(); i < e; ++i) {		for (unsigned i = 0, e = lbMaps.size(); i < e; ++i) {
▲ Show 20 Lines • Show All 1,182 Lines • Show Last 20 Lines

mlir/lib/Dialect/Affine/IR/AffineOps.cpp

Show First 20 Lines • Show All 250 Lines • ▼ Show 20 Lines	if (parentOp->hasTrait<OpTrait::AffineScope>())
return curOp->getParentRegion();		return curOp->getParentRegion();
curOp = parentOp;		curOp = parentOp;
}		}
return nullptr;		return nullptr;
}		}

// A Value can be used as a dimension id iff it meets one of the following		// A Value can be used as a dimension id iff it meets one of the following
// conditions:		// conditions:
// *) It is valid as a symbol.		// *) It is valid as a symbol.
// *) It is an induction variable.		// *) It is an induction variable.
// *) It is the result of affine apply operation with dimension id arguments.		// *) It is the result of affine apply operation with dimension id arguments.
bool mlir::isValidDim(Value value) {		bool mlir::isValidDim(Value value) {
// The value must be an index type.		// The value must be an index type.
if (!value.getType().isIndex())		if (!value.getType().isIndex())
		bondhugulaUnsubmitted Done Reply Inline Actions This is nothing but checking `getParentOpOfType<AffineParallelOp>()` is null. We don't need to add a public method! bondhugula: This is nothing but checking `getParentOpOfType<AffineParallelOp>()` is null. We don't need to…
return false;		return false;

if (auto *defOp = value.getDefiningOp())		if (auto *defOp = value.getDefiningOp())
return isValidDim(value, getAffineScope(defOp));		return isValidDim(value, getAffineScope(defOp));

// This value has to be a block argument for an op that has the		// This value has to be a block argument for an op that has the
// `AffineScope` trait or for an affine.for or affine.parallel.		// `AffineScope` trait or for an affine.for or affine.parallel.
auto *parentOp = value.cast<BlockArgument>().getOwner()->getParentOp();		auto *parentOp = value.cast<BlockArgument>().getOwner()->getParentOp();
▲ Show 20 Lines • Show All 2,043 Lines • ▼ Show 20 Lines
/// them.		/// them.
void mlir::extractForInductionVars(ArrayRef<AffineForOp> forInsts,		void mlir::extractForInductionVars(ArrayRef<AffineForOp> forInsts,
SmallVectorImpl<Value> *ivs) {		SmallVectorImpl<Value> *ivs) {
ivs->reserve(forInsts.size());		ivs->reserve(forInsts.size());
for (auto forInst : forInsts)		for (auto forInst : forInsts)
ivs->push_back(forInst.getInductionVar());		ivs->push_back(forInst.getInductionVar());
}		}

		void mlir::extractInductionVars(ArrayRef<mlir::Operation *> operations,
		SmallVectorImpl<mlir::Value> *ivs) {
		bondhugulaUnsubmitted Done Reply Inline Actions Pass output argument by reference (MLIR convention). bondhugula: Pass output argument by reference (MLIR convention).
		ivs->reserve(operations.size());
		for (Operation *op : operations) {
		// Add constraints from forOp's bounds.
		if (AffineForOp forOp = dyn_cast<AffineForOp>(op))
		bondhugulaUnsubmitted Done Reply Inline Actions Use `auto` here. bondhugula: Use `auto` here.
		ivs->push_back(forOp.getInductionVar());
		else if (AffineParallelOp parallelOp = dyn_cast<AffineParallelOp>(op)) {
		GroverkssUnsubmitted Done Reply Inline Actions nit: there is no need for a newline before the if. Groverkss: nit: there is no need for a newline before the if.
		bondhugulaUnsubmitted Done Reply Inline Actions Use braces here per new LLVM/MLIR style. bondhugula: Use braces here per new LLVM/MLIR style.
		for (size_t i = 0; i < parallelOp.getBody()->getNumArguments(); i++)
		ivs->push_back(parallelOp.getBody()->getArgument(i));
		}
		GroverkssUnsubmitted Done Reply Inline Actions nit: remove braces. Groverkss: nit: remove braces.
		bondhugulaUnsubmitted Done Reply Inline Actions Drop trivial braces. bondhugula: Drop trivial braces.
		}
		}

/// Builds an affine loop nest, using "loopCreatorFn" to create individual loop		/// Builds an affine loop nest, using "loopCreatorFn" to create individual loop
/// operations.		/// operations.
template <typename BoundListTy, typename LoopCreatorTy>		template <typename BoundListTy, typename LoopCreatorTy>
static void buildAffineLoopNestImpl(		static void buildAffineLoopNestImpl(
OpBuilder &builder, Location loc, BoundListTy lbs, BoundListTy ubs,		OpBuilder &builder, Location loc, BoundListTy lbs, BoundListTy ubs,
ArrayRef<int64_t> steps,		ArrayRef<int64_t> steps,
function_ref<void(OpBuilder &, Location, ValueRange)> bodyBuilderFn,		function_ref<void(OpBuilder &, Location, ValueRange)> bodyBuilderFn,
LoopCreatorTy &&loopCreatorFn) {		LoopCreatorTy &&loopCreatorFn) {
▲ Show 20 Lines • Show All 1,887 Lines • Show Last 20 Lines

mlir/test/Transforms/memref-dependence-check.mlir

Show First 20 Lines • Show All 1,058 Lines • ▼ Show 20 Lines	affine.if #set2(%i0) {
// expected-remark@above {{dependence from 1 to 0 at depth 2 = false}}		// expected-remark@above {{dependence from 1 to 0 at depth 2 = false}}
// expected-remark@above {{dependence from 1 to 1 at depth 1 = false}}		// expected-remark@above {{dependence from 1 to 1 at depth 1 = false}}
// expected-remark@above {{dependence from 1 to 1 at depth 2 = false}}		// expected-remark@above {{dependence from 1 to 1 at depth 2 = false}}
}		}
}		}

return		return
}		}

		// -----

		// CHECK-LABEL: func @dependent_store_load_in_parallel() {
		func.func @dependent_store_load_in_parallel() {
		%0 = memref.alloc() : memref<10xf32>
		%cst = arith.constant 7.000000e+00 : f32
		affine.parallel (%i0) = (0) to (10) {
		affine.store %cst, %0[%i0] : memref<10xf32>
		// expected-remark@above {{dependence from 0 to 0 at depth 1 = false}}
		// expected-remark@above {{dependence from 0 to 1 at depth 1 = true}}
		%1 = affine.load %0[%i0] : memref<10xf32>
		// expected-remark@above {{dependence from 1 to 0 at depth 1 = false}}
		// expected-remark@above {{dependence from 1 to 1 at depth 1 = false}}
		bondhugulaUnsubmitted Done Reply Inline Actions There seems to be something incorrect here. Both these "expected" checks are pointing to the same line (@above from below is the same as +1 here). bondhugula: There seems to be something incorrect here. Both these "expected" checks are pointing to the…
		LewuatheAuthorUnsubmitted Done Reply Inline Actions We should have checked the `affine.parallel` existence before returning no dependence when there is no `AffineWriteOpInterface`. Lewuathe: We should have checked the `affine.parallel` existence before returning no dependence when…
		}
		return
		}

		// -----

		// CHECK-LABEL: func @different_memref_in_parallel() {
		func.func @different_memref_in_parallel() {
		%m0 = memref.alloc() : memref<10xf32>
		%m1 = memref.alloc() : memref<10xf32>
		%cst = arith.constant 7.000000e+00 : f32
		affine.parallel (%i0) = (0) to (10) {
		affine.store %cst, %m0[%i0] : memref<10xf32>
		// expected-remark@above {{dependence from 0 to 0 at depth 1 = false}}
		// expected-remark@above {{dependence from 0 to 1 at depth 1 = false}}
		%1 = affine.load %m1[%i0] : memref<10xf32>
		// expected-remark@above {{dependence from 1 to 0 at depth 1 = false}}
		// expected-remark@above {{dependence from 1 to 1 at depth 1 = false}}
		}
		return
		}

		// -----

		// CHECK-LABEL: func @dependent_parallels() {
		func.func @dependent_parallels() {
		%0 = memref.alloc() : memref<10xf32>
		%cst = arith.constant 7.000000e+00 : f32
		// No dependence from 0 to 1 because the first parallel dominates the second one.
		affine.parallel (%i0) = (0) to (10) {
		affine.store %cst, %0[%i0] : memref<10xf32>
		// expected-remark@above {{dependence from 0 to 0 at depth 1 = false}}
		// expected-remark@above {{dependence from 0 to 1 at depth 1 = true}}
		}
		affine.parallel (%i1) = (0) to (10) {
		%1 = affine.load %0[%i1] : memref<10xf32>
		// expected-remark@above {{dependence from 1 to 1 at depth 1 = false}}
		// expected-remark@above {{dependence from 1 to 0 at depth 1 = false}}
		}
		return
		}

		// -----

		// CHECK-LABEL: func @parallel_store_load_func_symbol(%arg0: index) {
		func.func @parallel_store_load_func_symbol(%arg0: index) {
		%m = memref.alloc() : memref<100xf32>
		%c7 = arith.constant 7.0 : f32
		%c10 = arith.constant 10 : index
		affine.parallel (%i0) = (0) to (10) {
		%a0 = affine.apply affine_map<(d0) -> (d0)> (%arg0)
		affine.store %c7, %m[%a0] : memref<100xf32>
		// expected-remark@above {{dependence from 0 to 0 at depth 1 = false}}
		// expected-remark@above {{dependence from 0 to 1 at depth 1 = true}}
		%a1 = affine.apply affine_map<(d0) -> (d0)> (%arg0)
		%v0 = affine.load %m[%a1] : memref<100xf32>
		// expected-remark@above {{dependence from 1 to 0 at depth 1 = false}}
		// expected-remark@above {{dependence from 1 to 1 at depth 1 = false}}
		}
		return
		}

		// -----

		// CHECK-LABEL: func @two_dim_parallel() {
		func.func @two_dim_parallel() {
		%m = memref.alloc() : memref<10x10xf32>
		%c7 = arith.constant 7.0 : f32
		affine.parallel (%i0, %i1) = (0, 0) to (10, 10) {
		%a00 = affine.apply affine_map<(d0, d1) -> (d0)> (%i0, %i1)
		%a01 = affine.apply affine_map<(d0, d1) -> (d1)> (%i0, %i1)
		affine.store %c7, %m[%a00, %a01] : memref<10x10xf32>
		// expected-remark@above {{dependence from 0 to 0 at depth 1 = false}}
		// expected-remark@above {{dependence from 0 to 1 at depth 1 = true}}
		%a10 = affine.apply affine_map<(d0, d1) -> (d0 - 2)> (%i0, %i1)
		%a11 = affine.apply affine_map<(d0, d1) -> (d1)> (%i0, %i1)
		%v0 = affine.load %m[%a10, %a11] : memref<10x10xf32>
		// expected-remark@above {{dependence from 1 to 0 at depth 1 = false}}
		// expected-remark@above {{dependence from 1 to 1 at depth 1 = false}}
		}
		return
		}
		No newline at end of file

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][affine] Support affine.parallel in the index set analysisClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 468781

mlir/include/mlir/Dialect/Affine/Analysis/AffineStructures.h

mlir/include/mlir/Dialect/Affine/IR/AffineOps.h

mlir/lib/Dialect/Affine/Analysis/AffineAnalysis.cpp

mlir/lib/Dialect/Affine/Analysis/AffineStructures.cpp

mlir/lib/Dialect/Affine/IR/AffineOps.cpp

mlir/test/Transforms/memref-dependence-check.mlir

[mlir][affine] Support affine.parallel in the index set analysis
ClosedPublic