This is an archive of the discontinued LLVM Phabricator instance.

Differential D79829

[mlir][Affine] Introduce affine memory interfaces
ClosedPublic

Authored by dcaballe on May 12 2020, 5:15 PM.

Details

Reviewers

bondhugula
nicolasvasilache
mehdi_amini
ftynse
andydavis1

Commits

rGa45fb1942fc5: [mlir][Affine] Introduce affine memory interfaces

Summary

This patch introduces interfaces for read and write ops with affine
restrictions. I used read/write instead of load/store for the
interfaces so that they can also be implemented by dma ops.
For now, they are only implemented by affine.load, affine.store,
affine.vector_load and affine.vector_store.

For testing purposes, this patch also migrates affine loop fusion and
required analysis to use the new interfaces. No other changes are made
beyond that.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

dcaballe created this revision. May 12 2020, 5:15 PM

Herald added a project: Restricted Project. May 12 2020, 5:15 PM

Herald added subscribers: llvm-commits, Kayjukh, frgossen and 13 others.

Harbormaster failed remote builds in B56518: Diff 263574! May 12 2020, 5:48 PM

Thanks for doing this Diego! Will be very nice to have this. Especially for affine loop fusion and dma generation....

LGTM in general.

I am wondering whether AffineMemoryOpInterfaces belongs in lib/Interfaces since it is specific to the Affine dialect. I'd rather put it together with the dialect, even though the transformations live in lib/Transforms .

mlir/include/mlir/Dialect/Affine/IR/CMakeLists.txt
2	Please do :)
mlir/lib/Analysis/AffineAnalysis.cpp
665–666	Nit: please make braces symmetric
mlir/lib/Transforms/LoopFusion.cpp
327	Nit: extract `cast<AffineWriteLikeOpInterface>(storeOpInst).getMemRef()` into a named variable for better formatting here

This looks really great! Thanks. CMake dependencies and linking can be tricky; if you are using LLD, please make sure to also build/test without it, since a bunch of dependencies / CMakeLists.txt files need an update.

mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2564–2581 ↗	(On Diff #263574)	I missed why we need to introduce these here? Shouldn't they be shared via the op interface method?

This commit can't be marked NFC! It is adding functionality - generalizing several utilities to work on both the scalar and vector versions of load/store ops. You can drop the NFC from the title but mention in the commit summary that it isn't changing other functionality beyond migrating passes/utilities to the interface.

This revision now requires changes to proceed. May 13 2020, 3:20 AM

For the names AffineReadLikeOpInterface, AffineWriteLikeOpInterface, we could even consider dropping "Like" from it. It has become standard to use "Like" in interfaces because the prefixes are often the names of the ops. But here, "read" and "write" are already capturing the "like" part unlike say LoopLikeOp where you have a LoopOp and so it has to be LoopLikeOp. So, if you/others are fine with it, you could go for AffineReadOpInterface, AffineWriteOpInterface. DMAs, load/stores and vector load/stores - all read/write data.

dcaballe retitled this revision from [mlir][Affine][NFCI]: Introduce affine memory interfaces to [mlir][Affine]: Introduce affine memory interfaces. May 13 2020, 6:23 PM

dcaballe edited the summary of this revision.

Herald added a subscriber: jurahul. May 13 2020, 6:23 PM

dcaballe retitled this revision from [mlir][Affine]: Introduce affine memory interfaces to [mlir][Affine] Introduce affine memory interfaces. May 13 2020, 6:24 PM

Addressing feedback.

Thanks for the feedback!

I am wondering whether AffineMemoryOpInterfaces belongs in lib/Interfaces since it is specific to the Affine dialect. I'd rather put it together with the dialect, even though the transformations live in lib/Transforms

Good point. That makes sense. Done.

For the names AffineReadLikeOpInterface, AffineWriteLikeOpInterface, we could even consider dropping "Like" from it. It has become standard to use "Like" in interfaces because the prefixes are often the names of the ops. But here, "read" and "write" are already capturing the "like" part unlike say LoopLikeOp where you have a LoopOp and so it has to be LoopLikeOp. So, if you/others are fine with it, you could go for AffineReadOpInterface, AffineWriteOpInterface. DMAs, load/stores and vector load/stores - all read/write data.

Agreed! Done.

mlir/include/mlir/Dialect/Affine/IR/CMakeLists.txt
2	Sorry, I wanted to ask about it since I didn't see any other tablegen->tablegen dependencies like this one so I'm not sure how to do it. For example, we should also have a dependency with the loop-like interface here, right? I added a dependency between both targets but this is not enough. I think we need to add a dependency between the dialect target and the interface .td file. Not sure how to do that cleanly.
mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2564–2581 ↗	(On Diff #263574)	Are you suggesting using the interface also as a base class? I assumed that interfaces shouldn't have any implementation. That could work for some methods but not all of them have the same implementation (e.g., `getMapOperands`). I could move those with the same implementation.

Harbormaster failed remote builds in B56694: Diff 263908! May 13 2020, 7:08 PM

bondhugula accepted this revision. May 13 2020, 8:43 PM

bondhugula added inline comments.

mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2564–2581 ↗	(On Diff #263574)	You are right that some of the ops that use this interface will not have the same implementation. But for my information, @ftynse, @rriddle, can Op Interfaces have implementations for things that are guaranteed to be shared among all ops using them? I remember seeing a default implementation being provided somewhere but I may be wrong. Anyway, can these be moved into AffineLoadOpBase, AffineStoreOpBase? (extra class declarations)

This revision is now accepted and ready to land. May 13 2020, 8:43 PM

mehdi_amini added inline comments. May 13 2020, 9:00 PM

mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2564–2581 ↗	(On Diff #263574)	I believe OpInterface methods are type-erased virtual functions; they can have a default implementation, in which case an op may or may not override it.

ftynse accepted this revision. May 14 2020, 1:14 AM

ftynse added inline comments.

mlir/lib/Dialect/Affine/IR/AffineOps.cpp
2564–2581 ↗	(On Diff #263574)	But for my information, @ftynse, @rriddle, can Op Interfaces have implementations for things that are guaranteed to be shared among all ops using them?

If an interface function declared in ODS has a body, and the ops don't redeclare it with `DeclareOpInterfaceMethods`, the default implementation is used in all ops. https://mlir.llvm.org/docs/OpDefinitions/#operation-interfaces has some examples.

Provide a default implementation for interface methods (not working, see next message)

I tried to change all the interface methods to have a default implementation but I'm hitting some problems. After reading again the documentation about interfaces, it's not clear to me what is the difference between providing a methodBody or a defaultImplementation. I had to dig into the generated code to have a bit better understanding but still, I couldn't make it work. I uploaded the patch with both approaches: methods in AffineWriteOpInterface has a methodBody, methods in AffineReadOpInterface has a defaultImplementation.

The problem that I'm hitting with methodBody is that the interface methods are not visible from the concrete op. For example, I cannot call getMemRef() using an AffineStoreOp object. Looking at the generated code, I see that the method getMemRef() is autogenerated for the class AffineWriteOpInterface, which is implemented like this:

Value AffineReadOpInterface::getMemRef() {
      return getImpl()->getMemRef(getOperation());
  }

However, there is no getMemRef() declared/defined in the same way for AffineStoreOp or AffineVectorStoreOp. Shouldn't they have similar autogenerated code? Am I missing something?

For defaultImplementation, I followed the doc example so interface methods are defined like this:

InterfaceMethod<
  /*desc=*/[{ Returns the memref operand to read from. }],
  /*retTy=*/"Value",
  /*methodName=*/"getMemRef",
  /*args=*/(ins),
  /*methodBody*/[{}],
  /*defaultImplementation=*/ [{
    ConcreteOp op = cast<ConcreteOp>(getOperation());
    return op.getOperand(op.getMemRefOperandIndex());
 }]
>,

However, the compiler complains about invoking getOperation() without object:

error: cannot call member function ‘mlir::Operation* mlir::Op<ConcreteType, Traits>::getOperation() [with ConcreteType = mlir::AffineReadOpInterface; Traits = {}]’ without object

I get a bit lost in the internal details but ConcreteType = mlir::AffineReadOpInterface looks suspicious. Shouldn't it be AffineLoadOp/AffineVectorLoadOp?

I would appreciate some help, @ftynse @mehdi_amini @rriddle.

Thanks,
Diego

Harbormaster failed remote builds in B56812: Diff 264136! May 14 2020, 7:35 PM

TL;DR: "read" part is mostly correct. Use explicit this->getOperation() because class templates+inheritance. Also, cast<MemRefType>(....getType()) is incorrect, use ....getType().cast<MemRefType>() instead.

In D79829#2037697, @dcaballe wrote:

I tried to change all the interface methods to have a default implementation but I'm hitting some problems. After reading again the documentation about interfaces, it's not clear to me what is the difference between providing a methodBody or a defaultImplementation. I had to dig into the generated code to have a bit better understanding but still, I couldn't make it work. I uploaded the patch with both approaches: methods in AffineWriteOpInterface has a methodBody, methods in AffineReadOpInterface has a defaultImplementation.

Let's make a detour to try and untangle this (maybe we can update the doc afterwards). This is probably the most C++-intense part of the code base. I believe the OpInterfaces implementation was inspired by the concept-based polymorphism from the "Inheritance Is the Base Class of Evil" talk, transposed to LLVM and MLIR infrastructure (in particular LLVM-style casts and MLIR traits). The code example from the talk implements a concept-based polymorphic object that stores a pointer to the underlying object and dispatches polymorphic function calls to free functions differentiated by the type (stored as a template parameter) of their first argument.

#include <memory>
#include <utility>

// This is the "interface" with pure virtual methods.
struct Concept {
  virtual ~Concept() = default;
  virtual void interfaceMethod() = 0;
};
// Model adapts any Derived type to Concept; Derived itself inherits nothing.
template <typename Derived>
struct Model : public Concept {
  explicit Model(Derived d) : data(std::move(d)) {}
  void interfaceMethod() override {
    // Dispatches to a free function selected by the type of its argument.
    interfaceMethodImpl(data);
  }
  Derived data;
};

// This is a concrete implementation that does not need inheritance.
struct Concrete {};
// And this is the implementation of interfaceMethod for Concrete.
void interfaceMethodImpl(Concrete) {}

// This class has polymorphic behavior without having virtual functions itself.
// We can see it as actual user-visible interface for our purposes, users
// will get instances of this class and will be able to work with them opaquely.
struct Polymorphic {
  template <typename Actual>
  Polymorphic(Actual a) : container(new Model<Actual>(std::move(a))) {}
  void interfaceMethod() { container->interfaceMethod(); }
  std::unique_ptr<Concept> container;
};

Now, in MLIR, we don't actually need to store the underlying object because, as far as interfaces are concerned, the underlying object is an Operation that belongs to a block->region->parent-op->...->top-level-module-op that the caller owns. It would suffice to store a bare non-owning Operation *. Furthermore, we would like interfaces to integrate with the LLVM-style casting mechanisms. We're in luck, template <ConcreteOp, ...> class Op<ConcreteOp, ...> does exactly that: store an Operation * and provide isa/dyn_cast support. Now we start shifting a bit from the original concept-based idea with more inheritance and into the following:

struct Concept {
  // We will always work on some operation
  virtual void interfaceMethod(Operation *) = 0;
};
template <typename Derived>
struct Model : public Concept {
  void interfaceMethod(Operation *op) override {
    // Call the implementation from the Derived class, which is expected
    // to have a method with the compatible signature. (We could also
    // stick with free functions).
    cast<Derived>(op).interfaceMethod();
  }
  // No need to store the data anymore.
};

struct Polymorphic : public Op<Polymorphic, ...> {
  Polymorphic(Operation *op) : Op<Polymorphic, ...>(op), impl(...) {}
  void interfaceMethod() {
    impl->interfaceMethod(getOperation());
  }
  Concept *impl;
};

The remaining questions are how do we construct an instance of Model<Derived> that we could store in impl and how could we partially share the implementation. That's where the Traits mechanism comes into play. MLIR implements the Traits pattern that allows one to add generalized functions into specific Op classes. The scheme is roughly the following:

template <typename Concrete>
struct Trait {
  void traitMethod() {
    // This has access to actual operation and its type through CRTP.
    // cast<Concrete>(getOperation());
  }

  Operation *getOperation() {
    // In practice, this is a bit more complex because it's provided by the base class
    // TraitBase through another layer of CRTP instead of requiring every Trait to
    // reimplement this from scratch.
    return static_cast<Concrete *>(this)->getOperation();
  }
};

// Again, the actual implementation is more involved because of several layers
// of CRTP and a class variadic template Traits<...> that we use.
struct ConcreteOp : public Trait<ConcreteOp> {
  // This will have "traitMethod" available by inheritance from Trait.
};
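
To make the Trait scheme above concrete, here is a minimal standalone sketch of the same CRTP idea. It is not MLIR code: `Operation`, `HasName`, `MyOp`, and `describeMyOp` are hypothetical stand-ins invented for illustration, and the real TraitBase layering is omitted.

```cpp
#include <string>

// Hypothetical stand-in for mlir::Operation; it only carries a name here.
struct Operation {
  std::string name;
};

// CRTP trait: adds describe() to any class that exposes getOperation().
template <typename Concrete>
struct HasName {
  std::string describe() {
    // Reach the derived class through CRTP, then use its Operation pointer.
    Operation *op = static_cast<Concrete *>(this)->getOperation();
    return "op:" + op->name;
  }
};

// A concrete op wrapper gets describe() purely by inheriting the trait.
struct MyOp : public HasName<MyOp> {
  explicit MyOp(Operation *op) : state(op) {}
  Operation *getOperation() { return state; }
  Operation *state;
};

// Small helper so the behaviour is easy to check.
std::string describeMyOp(const std::string &name) {
  Operation raw{name};
  MyOp op(&raw);
  return op.describe();
}
```

The trait never names `MyOp` directly; everything it needs arrives through the `Concrete` template parameter, which is what lets one trait be shared across unrelated op classes.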

The Trait mechanism is also used in different places like verifiers and various precondition checkers, but for our purposes here it's basically a collection of templated base class _members_. The key difference with interfaces is that traits are allowed to have (static) data. We can then use this capability to store our instance of Model<Derived> in the trait. As an additional bonus, we get to reuse the Traits<...> machinery already implemented in operations. Now, since we are already giving all Ops that implement the interface a trait to support the interface, we might also use the properties of the trait itself -- in particular provide generic implementations of the methods that will be automatically available in all ops that have the trait (or the interface from which the trait is derived). The final scheme resembles the following:

struct Concept {
  virtual void interfaceMethod(Operation *) = 0;
};
template <typename Derived>
struct Model : public Concept {
  void interfaceMethod(Operation *op) override {
    cast<Derived>(op).interfaceMethod();
  };
};
template <typename Derived>
struct Trait {
  static Concept &instance() {
    // This is where the per-class instance of Model lives.
    static Model<Derived> singleton;
    return singleton;
  }

  // We can define (or not!) the interface method implementation that
  // will be available in concrete ops by inheritance.
  void interfaceMethod() {
  }
};
struct Interface : public Op<Interface, ...> {
  Interface(Operation *op) : Op<Interface, ...>(op) {
    // We leverage Op registration to enable looking up Trait::instance in
    // an opaque way.
    impl = op->getAbstractOperation()->getInterfaceFor<Interface>();
  }

  void interfaceMethod() {
    impl->interfaceMethod(getOperation());
  }
  Concept *impl;
};
struct ConcreteOp : public Op<ConcreteOp, Trait, ...> {
};
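
For readers who want something that compiles, the final scheme can be approximated outside MLIR roughly as follows. This is only a sketch under simplifying assumptions: `Operation`, `ReadInterface`, `ReadTrait`, `LoadOp`, and `queryThroughInterface` are hypothetical names, and the `Concept` pointer is stored directly on the fake `Operation` instead of being looked up through the registered abstract operation.

```cpp
#include <string>

struct Operation;

// Type-erased "vtable" of the interface; every method takes the Operation.
struct Concept {
  virtual ~Concept() = default;
  virtual std::string memrefName(Operation *op) = 0;
};

// Hypothetical stand-in for mlir::Operation. The real lookup goes through op
// registration; here the Concept pointer is simply stored on the operation.
struct Operation {
  std::string operand;
  Concept *readInterface; // null when the op does not implement the interface
};

// Model forwards a type-erased call to the concrete op wrapper.
template <typename ConcreteOp>
struct Model : public Concept {
  std::string memrefName(Operation *op) override {
    return ConcreteOp(op).memrefName();
  }
};

// Trait owns the per-class Model singleton.
template <typename ConcreteOp>
struct ReadTrait {
  static Concept *instance() {
    static Model<ConcreteOp> singleton;
    return &singleton;
  }
};

// The user-facing interface: a thin wrapper around Operation *.
struct ReadInterface {
  explicit ReadInterface(Operation *op) : op(op), impl(op->readInterface) {}
  std::string memrefName() { return impl->memrefName(op); }
  Operation *op;
  Concept *impl;
};

// A concrete op implementing the method the interface dispatches to.
struct LoadOp : public ReadTrait<LoadOp> {
  explicit LoadOp(Operation *op) : op(op) {}
  std::string memrefName() { return "memref:" + op->operand; }
  Operation *op;
};

// Builds a fake operation and queries it only through the interface.
std::string queryThroughInterface(const std::string &operand) {
  Operation raw{operand, LoadOp::instance()};
  return ReadInterface(&raw).memrefName();
}
```

Note that `ReadInterface` never mentions `LoadOp`; the only link between them is the `Concept` pointer obtained from the trait's singleton.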

As usual, the actual implementation is more involved for practical purposes (e.g., Concept and Model are both nested inside the detail::InterfaceTraits class [not to be confused with Trait], Trait containing the instance is nested inside OpInterface to share the implementation and each specific Interface has a nested Trait that inherits OpInterface::Trait [both are class templates btw] with additional implementations, all Traits inherit from CRTP-base TraitBase, etc). Some of this complexity is hidden by table-genning Concept, Model, Interface and nested Trait classes. Unfortunately, sometimes it bites you back.

Two sources of confusion here are (1) the presence of identically-named functions that may or may not be inheritance-related and (2) the relation between Interfaces and Traits. For (1), concrete Ops are ultimately expected to implement exactly the signature described in Tablegen. These functions will be called by casting Operation * to the specific subclass of Op. Functions with identical signature will be provided in the Interface class, the rest is the dispatching mechanism letting them call similarly-named functions in the specific subclass of Op without directly knowing about them. For (2), Interfaces are extension of Op and are therefore wrappers around Operation * which can be stored or passed around; Traits are a way to share type-parameterized implementations across different Op subclasses. Interfaces rely on Traits to operate. Traits can be used to share the implementation of functions Interfaces require from Ops.

Finally, I can answer the original question about the difference between "methodBody" and "defaultImplementation". "methodBody" is the body of Model::interfaceMethod, if not provided in tablegen, it will just dispatch to ConcreteOp::interfaceMethod. Since there is no inheritance between Model<ConcreteOp> and ConcreteOp, these methods are not automatically available in ConcreteOp; on the contrary, they must be implemented for the entire mechanism to work. "defaultImplementation" is the body of "Trait::interfaceMethod" and will be automatically available in ConcreteOp. Combined with empty "methodBody", it will be the implementation that the interface will call into by default, hence the name. It can, however, be overridden by ConcreteOp and the interface will then call into that because it casts to ConcreteOp.
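
The default-versus-override behaviour described above can be sketched in plain C++ as follows. The names (`WriteTrait`, `StoreOp`, `DmaLikeOp`, `dispatchOperandIndex`) and the index values 1 and 3 are made up for illustration; the free function only plays the role of a generated Model method with an empty "methodBody".

```cpp
// Trait supplies the default; it plays the role of "defaultImplementation".
template <typename ConcreteOp>
struct WriteTrait {
  int getMemRefOperandIndex() { return 1; }
};

// Plays the role of the generated Model method with an empty "methodBody":
// it simply calls the method on the concrete op type.
template <typename ConcreteOp>
int dispatchOperandIndex(ConcreteOp &op) {
  return op.getMemRefOperandIndex();
}

// Uses the default inherited from the trait.
struct StoreOp : public WriteTrait<StoreOp> {};

// Shadows the default with its own definition.
struct DmaLikeOp : public WriteTrait<DmaLikeOp> {
  int getMemRefOperandIndex() { return 3; }
};

int defaultIndex() {
  StoreOp op;
  return dispatchOperandIndex(op);
}

int overriddenIndex() {
  DmaLikeOp op;
  return dispatchOperandIndex(op);
}
```

Because the dispatch goes through the concrete type, the op's own definition wins whenever it exists, and the trait's version is used otherwise; no virtual function is involved at this level.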

After this long detour, answers to your questions are relatively straightforward.

The problem that I'm hitting with methodBody is that the interface methods are not visible from the concrete op.

Interface methods are not visible because there is no inheritance from interface. Trait methods (referred to as default implementations) _are_ visible.

For example, I cannot call getMemRef() using an AffineStoreOp object. Looking at the generated code, I see that the method getMemRef() is autogenerated for the class AffineWriteOpInterface, which is implemented like this:
Value AffineReadOpInterface::getMemRef() {
      return getImpl()->getMemRef(getOperation());
  }
However, there is no getMemRef() declared/defined in the same way for AffineStoreOp or AffineVectorStoreOp. Shouldn't they have similar autogenerated code? Am I missing something?

An interface is like OOP interface in e.g. Java, it expects all classes that implement it to define the interface methods themselves. We have the possibility of providing a default implementation for them.

For defaultImplementation, I followed the doc example so interface methods are defined like this:

InterfaceMethod<
  /*desc=*/[{ Returns the memref operand to read from. }],
  /*retTy=*/"Value",
  /*methodName=*/"getMemRef",
  /*args=*/(ins),
  /*methodBody*/[{}],
  /*defaultImplementation=*/ [{
    ConcreteOp op = cast<ConcreteOp>(getOperation());
    return op.getOperand(op.getMemRefOperandIndex());
 }]
>,

However, the compiler complains about invoking getOperation() without object:

error: cannot call member function ‘mlir::Operation* mlir::Op<ConcreteType, Traits>::getOperation() [with ConcreteType = mlir::AffineReadOpInterface; Traits = {}]’ without object

Because the default implementation is placed in the Trait class template, which inherits from another class template (template <typename ConcreteOp> struct AffineReadOpInterfaceTrait : public OpInterface<AffineReadOpInterface, detail::AffineReadOpInterfaceInterfaceTraits>::Trait<ConcreteOp>), we need to explicitly disambiguate the call to the parent method (getOperation is defined in OpState, inherited by Op, inherited by OpInterface). this->getOperation() will work smoothly.
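
The underlying C++ rule can be reproduced without MLIR. In a class template, unqualified names are not looked up in base classes that depend on a template parameter, so the compiler either fails to find the name or binds it to some other visible declaration (presumably what happened in the error above). The sketch below uses hypothetical names (`TraitBase`, `MyTrait`, `SomeOp`) and an `int` return value in place of `Operation *`.

```cpp
// Base class template, standing in for OpInterface<...>::Trait<ConcreteOp>.
template <typename ConcreteOp>
struct TraitBase {
  int getOperation() { return 42; }
};

template <typename ConcreteOp>
struct MyTrait : public TraitBase<ConcreteOp> {
  int viaThis() {
    // An unqualified `getOperation()` would not be looked up in the dependent
    // base TraitBase<ConcreteOp>; `this->` defers lookup to instantiation.
    return this->getOperation();
  }
};

struct SomeOp {};

int callViaThis() {
  MyTrait<SomeOp> t;
  return t.viaThis();
}
```

Writing `TraitBase<ConcreteOp>::getOperation()` would also work, but `this->` is shorter and does not repeat the base class name.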

I get a bit lost in the internal details but ConcreteType = mlir::AffineReadOpInterface looks suspicious. Shouldn't it be AffineLoadOp/AffineVectorLoadOp?

No, it's correct. ConcreteType is a template parameter that points to the derived interface class for downcasting. If it were AffineLoadOp, we could erroneously assume any Op that implements AffineReadOpInterface _isa_ AffineLoadOp.

I would appreciate some help, @ftynse @mehdi_amini @rriddle.

The following diff makes everything compile and pass tests.

diff --git a/mlir/include/mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.td b/mlir/include/mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.td
index d9e8789a07a..8738000d8d5 100644
--- a/mlir/include/mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.td
+++ b/mlir/include/mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.td
@@ -29,7 +29,7 @@ def AffineReadOpInterface : OpInterface<"AffineReadOpInterface"> {
       /*args=*/(ins),
       /*methodBody*/[{}],
       /*defaultImplementation=*/ [{
-        ConcreteOp op = cast<ConcreteOp>(getOperation());
+        ConcreteOp op = cast<ConcreteOp>(this->getOperation());
         return op.getOperand(op.getMemRefOperandIndex());
       }]
     >,
@@ -40,8 +40,8 @@ def AffineReadOpInterface : OpInterface<"AffineReadOpInterface"> {
       /*args=*/(ins),
       /*methodBody=*/[{}],
       /*defaultImplementation=*/[{
-        ConcreteOp op = cast<ConcreteOp>(getOperation());
-        return llvm::cast<MemRefType>(op.getMemRef().getType());
+        ConcreteOp op = cast<ConcreteOp>(this->getOperation());
+        return op.getMemRef().getType().template cast<MemRefType>();
       }]
     >,
     InterfaceMethod<
@@ -51,7 +51,7 @@ def AffineReadOpInterface : OpInterface<"AffineReadOpInterface"> {
       /*args=*/(ins),
       /*methodBody=*/[{}],
       /*defaultImplementation=*/[{
-        ConcreteOp op = cast<ConcreteOp>(getOperation());
+        ConcreteOp op = cast<ConcreteOp>(this->getOperation());
         return llvm::drop_begin(op.getOperands(), 1);
       }]
     >,
@@ -63,7 +63,7 @@ def AffineReadOpInterface : OpInterface<"AffineReadOpInterface"> {
       /*args=*/(ins),
       /*methodBody=*/[{}],
       /*defaultImplementation=*/[{
-        ConcreteOp op = cast<ConcreteOp>(getOperation());
+        ConcreteOp op = cast<ConcreteOp>(this->getOperation());
         return op.getAffineMapAttr().getValue();
       }]
     >,
@@ -82,21 +82,33 @@ def AffineWriteOpInterface : OpInterface<"AffineWriteOpInterface"> {
       /*retTy=*/"Value",
       /*methodName=*/"getMemRef",
       /*args=*/(ins),
-      /*methodBody=*/ [{ return op.getOperand(op.getMemRefOperandIndex()); }]
+      /*methodBody=*/[{}],
+      /*defaultImplementation=*/[{
+        ConcreteOp op = cast<ConcreteOp>(this->getOperation());
+        return op.getOperand(op.getMemRefOperandIndex());
+      }]
     >,
     InterfaceMethod<
       /*desc=*/[{ Returns the type of the memref operand. }],
       /*retTy=*/"MemRefType",
       /*methodName=*/"getMemRefType",
       /*args=*/(ins),
-      /*methodBody=*/[{ return llvm::cast<MemRefType>(getMemRef().getType()); }]
+      /*methodBody=*/[{}],
+      /*defaultImplementation=*/[{
+        ConcreteOp op = cast<ConcreteOp>(this->getOperation());
+        return op.getMemRef().getType().template cast<MemRefType>();
+      }]
     >,
     InterfaceMethod<
       /*desc=*/[{ Returns affine map operands. }],
       /*retTy=*/"Operation::operand_range",
       /*methodName=*/"getMapOperands",
       /*args=*/(ins),
-      /*methodBody=*/[{ return llvm::drop_begin(op.getOperands(), 2); }]
+      /*methodBody=*/[{}],
+      /*defaultImplementation=*/[{
+        ConcreteOp op = cast<ConcreteOp>(this->getOperation());
+        return llvm::drop_begin(op.getOperands(), 2);
+      }]
     >,
     InterfaceMethod<
       /*desc=*/[{ Returns the affine map used to index the memref for this
@@ -104,7 +116,11 @@ def AffineWriteOpInterface : OpInterface<"AffineWriteOpInterface"> {
       /*retTy=*/"AffineMap",
       /*methodName=*/"getAffineMap",
       /*args=*/(ins),
-      /*methodName=*/[{ return op.getAffineMapAttr().getValue(); }]
+      /*methodName=*/[{}],
+      /*defaultImplementation=*/[{
+        ConcreteOp op = cast<ConcreteOp>(this->getOperation());
+        return op.getAffineMapAttr().getValue();
+      }]
     >,
   ];
 }
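
The `.template cast<MemRefType>()` spelling in the diff follows from the same family of rules: when the object expression has a type that depends on a template parameter, C++ requires the `template` keyword before a member template that is called with explicit template arguments. A minimal standalone sketch, where `Type`, `Value`, `MemRefType`, and `FakeLoadOp` are simplified stand-ins and not the MLIR classes:

```cpp
// Simplified stand-in with a member template, loosely mirroring a
// Type::cast<U>() style API.
struct Type {
  template <typename U>
  U cast() const { return U{id}; }
  int id;
};

struct MemRefType {
  int id;
};

struct Value {
  Type getType() const { return Type{7}; }
};

template <typename ConcreteOp>
int memrefTypeId(ConcreteOp op) {
  // op.getMemRef().getType() has a dependent type here, so `template` is
  // required before the member template name; without it `<` would be
  // parsed as a less-than operator.
  return op.getMemRef().getType().template cast<MemRefType>().id;
}

struct FakeLoadOp {
  Value getMemRef() const { return Value{}; }
};

int castDemo() { return memrefTypeId(FakeLoadOp{}); }
```

Outside of a template (for example in a hand-written method on a concrete op) the plain `.cast<MemRefType>()` form is enough.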

Thanks,
Diego

Thank you so much for such a detailed explanation! It's really good! I think we should move this to somewhere in the documentation because it's really elucidating!

Interfaces are extension of Op and are therefore wrappers around Operation * which can be stored or passed around;

Got it! I was missing this key detail.

Interface methods are not visible because there is no inheritance from interface. Trait methods (referred to as default implementations) _are_ visible.
An interface is like OOP interface in e.g. Java, it expects all classes that implement it to define the interface methods themselves. We have the possibility of providing a default implementation for them.

I think the only remaining question for my understanding is why we need both methodBody and defaultImplementation. Wouldn't the latter suffice? I'm trying to think of a use case for the former that can't be implemented with the latter (other than having a default implementation without visibility in the concrete op).

Adding Alex's changes. Thanks for the patch!
I'll proceed with the commit if no more comments.

Harbormaster failed remote builds in B57231: Diff 264963! May 19 2020, 11:28 AM

Closed by commit rGa45fb1942fc5: [mlir][Affine] Introduce affine memory interfaces (authored by dcaballe). May 19 2020, 6:11 PM

This revision was automatically updated to reflect the committed changes.

In D79829#2044547, @dcaballe wrote:

Interface methods are not visible because there is no inheritance from interface. Trait methods (referred to as default implementations) _are_ visible.
An interface is like OOP interface in e.g. Java, it expects all classes that implement it to define the interface methods themselves. We have the possibility of providing a default implementation for them.

I think the only remaining question for my understanding is why we need both methodBody and defaultImplementation. Wouldn't the latter suffice? I'm trying to think of a use case for the former that can't be implemented with the latter (other than having a default implementation without visibility in the concrete op).

I am afraid I don't have an answer for that. Generally, there is some redundancy between interfaces and traits that may be eventually reduced. There may be cases where you want to dispatch the interface function differently than the default approach (wrap/unwrap arguments, call different functions in different classes, etc.); this dispatch may be more expensive and you may not want to pay for it unless you use interfaces. That's the underlying idea of the concept approach: you don't pay the virtual dispatch cost unless you need it.

Sorry, I've been OOO but it looks like Alex was able to give an amazing run down. Thanks Alex!

mlir/include/mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.h
13	nit: This is wrong.
mlir/include/mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.td
48	nit: You can use a normal string literal for one line descriptions: "..."
59	nit: These are generally formatted as: ... [{ ... }]
mlir/include/mlir/Dialect/Affine/IR/CMakeLists.txt
4	Seems like this should be using add_mlir_interface?
mlir/lib/Analysis/AffineAnalysis.cpp
666	nit: I would have just inlined the use of the storeOp
mlir/lib/Dialect/Affine/IR/AffineMemoryOpInterfaces.cpp
1	Loop-Like?

Thanks @rriddle. I'll take care of this.

dcaballe mentioned this in D80814: [mlir][Affine] Minor clean-up of D79829. May 29 2020, 9:29 AM

dcaballe mentioned this in rGe75325cfc397: [mlir][Affine] Minor clean-up of D79829. May 29 2020, 2:49 PM

Revision Contents

Path	Size
mlir/include/mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.h	24 lines
mlir/include/mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.td	128 lines
mlir/include/mlir/Dialect/Affine/IR/AffineOps.h	1 line
mlir/include/mlir/Dialect/Affine/IR/AffineOps.td	35 lines
mlir/include/mlir/Dialect/Affine/IR/CMakeLists.txt	8 lines
mlir/lib/Analysis/AffineAnalysis.cpp	16 lines
mlir/lib/Analysis/Utils.cpp	43 lines
mlir/lib/Dialect/Affine/IR/AffineMemoryOpInterfaces.cpp	18 lines
mlir/lib/Dialect/Affine/IR/CMakeLists.txt	2 lines
mlir/lib/Transforms/LoopFusion.cpp	74 lines
mlir/test/lib/Transforms/TestMemRefBoundCheck.cpp	5 lines

Diff 265109

mlir/include/mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.h

This file was added.

				//===- AffineMemoryOpInterfaces.h -------------------------------*- C++ -*-===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file contains a set of interfaces for affine memory ops.
				//
				//===----------------------------------------------------------------------===//

				#ifndef MLIR_INTERFACES_AFFINEMEMORYOPINTERFACES_H_
				#define MLIR_INTERFACES_AFFINEMEMORYOPINTERFACES_H_

				#include "mlir/IR/AffineMap.h"
				#include "mlir/IR/OpDefinition.h"
				#include "mlir/IR/StandardTypes.h"

				namespace mlir {
				#include "mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.h.inc"
				} // namespace mlir

				#endif // MLIR_INTERFACES_AFFINEMEMORYOPINTERFACES_H_

mlir/include/mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.td

This file was added.

				//===- AffineMemoryOpInterfaces.td -------------------------*- tablegen -*-===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file contains a set of interfaces for affine memory ops.
				//
				//===----------------------------------------------------------------------===//

				#ifndef MLIR_AFFINEMEMORYOPINTERFACES
				#define MLIR_AFFINEMEMORYOPINTERFACES

				include "mlir/IR/OpBase.td"

				def AffineReadOpInterface : OpInterface<"AffineReadOpInterface"> {
				  let description = [{
				    Interface to query characteristics of read-like ops with affine
				    restrictions.
				  }];

				  let methods = [
				    InterfaceMethod<
				      /*desc=*/[{ Returns the memref operand to read from. }],
				      /*retTy=*/"Value",
				      /*methodName=*/"getMemRef",
				      /*args=*/(ins),
				      /*methodBody*/[{}],
				      /*defaultImplementation=*/ [{
				        ConcreteOp op = cast<ConcreteOp>(this->getOperation());
				        return op.getOperand(op.getMemRefOperandIndex());
				      }]
				    >,
				    InterfaceMethod<
				      /*desc=*/[{ Returns the type of the memref operand. }],
				      /*retTy=*/"MemRefType",
				      /*methodName=*/"getMemRefType",
				      /*args=*/(ins),
				      /*methodBody=*/[{}],
				      /*defaultImplementation=*/[{
				        ConcreteOp op = cast<ConcreteOp>(this->getOperation());
				        return op.getMemRef().getType().template cast<MemRefType>();
				      }]
				    >,
				    InterfaceMethod<
				      /*desc=*/[{ Returns affine map operands. }],
				rriddle: nit: You can use a normal string literal for one line descriptions: "..."
				/*retTy=*/"Operation::operand_range",
				/*methodName=*/"getMapOperands",
				/*args=*/(ins),
				/*methodBody=*/[{}],
				/*defaultImplementation=*/[{
				ConcreteOp op = cast<ConcreteOp>(this->getOperation());
				return llvm::drop_begin(op.getOperands(), 1);
				}]
				>,
				InterfaceMethod<
				/*desc=*/[{ Returns the affine map used to index the memref for this
				rriddle: nit: These are generally formatted as: ... [{ ... }]
				operation. }],
				/*retTy=*/"AffineMap",
				/*methodName=*/"getAffineMap",
				/*args=*/(ins),
				/*methodBody=*/[{}],
				/*defaultImplementation=*/[{
				ConcreteOp op = cast<ConcreteOp>(this->getOperation());
				return op.getAffineMapAttr().getValue();
				}]
				>,
				];
				}

				def AffineWriteOpInterface : OpInterface<"AffineWriteOpInterface"> {
				let description = [{
				Interface to query characteristics of write-like ops with affine
				restrictions.
				}];

				let methods = [
				InterfaceMethod<
				/*desc=*/[{ Returns the memref operand to write to. }],
				/*retTy=*/"Value",
				/*methodName=*/"getMemRef",
				/*args=*/(ins),
				/*methodBody=*/[{}],
				/*defaultImplementation=*/[{
				ConcreteOp op = cast<ConcreteOp>(this->getOperation());
				return op.getOperand(op.getMemRefOperandIndex());
				}]
				>,
				InterfaceMethod<
				/*desc=*/[{ Returns the type of the memref operand. }],
				/*retTy=*/"MemRefType",
				/*methodName=*/"getMemRefType",
				/*args=*/(ins),
				/*methodBody=*/[{}],
				/*defaultImplementation=*/[{
				ConcreteOp op = cast<ConcreteOp>(this->getOperation());
				return op.getMemRef().getType().template cast<MemRefType>();
				}]
				>,
				InterfaceMethod<
				/*desc=*/[{ Returns affine map operands. }],
				/*retTy=*/"Operation::operand_range",
				/*methodName=*/"getMapOperands",
				/*args=*/(ins),
				/*methodBody=*/[{}],
				/*defaultImplementation=*/[{
				ConcreteOp op = cast<ConcreteOp>(this->getOperation());
				return llvm::drop_begin(op.getOperands(), 2);
				}]
				>,
				InterfaceMethod<
				/*desc=*/[{ Returns the affine map used to index the memref for this
				operation. }],
				/*retTy=*/"AffineMap",
				/*methodName=*/"getAffineMap",
				/*args=*/(ins),
				/*methodBody=*/[{}],
				/*defaultImplementation=*/[{
				ConcreteOp op = cast<ConcreteOp>(this->getOperation());
				return op.getAffineMapAttr().getValue();
				}]
				>,
				];
				}

				#endif // MLIR_AFFINEMEMORYOPINTERFACES
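
The two `getMapOperands` default implementations in this file differ only in how many leading operands they skip: a read-like op carries the memref as operand 0, while a write-like op carries the value to store as operand 0 and the memref as operand 1. A minimal standalone sketch of that layout arithmetic follows. It does not use MLIR; `OperandLayout`, `firstMapOperand` and `numMapOperands` are hypothetical names introduced only for illustration.

```cpp
#include <cstddef>

// Hypothetical description of an op's operand list: where the memref sits
// and how many operands the op has in total.
struct OperandLayout {
  std::size_t memrefIndex;  // 0 for read-like ops, 1 for write-like ops
  std::size_t numOperands;  // memref + (stored value, if any) + map operands
};

// Index of the first affine map operand: everything after the memref.
constexpr std::size_t firstMapOperand(OperandLayout layout) {
  return layout.memrefIndex + 1;
}

// Number of affine map operands left once the leading operands are dropped.
constexpr std::size_t numMapOperands(OperandLayout layout) {
  return layout.numOperands - firstMapOperand(layout);
}

// A load with operands (memref, i, j) skips 1 operand and keeps 2.
static_assert(firstMapOperand({0, 3}) == 1, "load skips the memref");
static_assert(numMapOperands({0, 3}) == 2, "load keeps two map operands");

// A store with operands (value, memref, i, j) skips 2 operands and keeps 2.
static_assert(firstMapOperand({1, 4}) == 2, "store skips value and memref");
static_assert(numMapOperands({1, 4}) == 2, "store keeps two map operands");
```

In this model the skip count is always `memrefIndex + 1`, which matches the literal `1` and `2` passed to `llvm::drop_begin` in the read and write interfaces respectively.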

mlir/include/mlir/Dialect/Affine/IR/AffineOps.h

	//===- AffineOps.h - MLIR Affine Operations -------------------------------===//			//===- AffineOps.h - MLIR Affine Operations -------------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file defines convenience types for working with Affine operations			// This file defines convenience types for working with Affine operations
	// in the MLIR operation set.			// in the MLIR operation set.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef MLIR_DIALECT_AFFINE_IR_AFFINEOPS_H			#ifndef MLIR_DIALECT_AFFINE_IR_AFFINEOPS_H
	#define MLIR_DIALECT_AFFINE_IR_AFFINEOPS_H			#define MLIR_DIALECT_AFFINE_IR_AFFINEOPS_H

				#include "mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.h"
	#include "mlir/IR/AffineMap.h"			#include "mlir/IR/AffineMap.h"
	#include "mlir/IR/Builders.h"			#include "mlir/IR/Builders.h"
	#include "mlir/IR/Dialect.h"			#include "mlir/IR/Dialect.h"
	#include "mlir/IR/OpDefinition.h"			#include "mlir/IR/OpDefinition.h"
	#include "mlir/IR/StandardTypes.h"			#include "mlir/IR/StandardTypes.h"
	#include "mlir/Interfaces/LoopLikeInterface.h"			#include "mlir/Interfaces/LoopLikeInterface.h"
	#include "mlir/Interfaces/SideEffectInterfaces.h"			#include "mlir/Interfaces/SideEffectInterfaces.h"

	[470 lines not shown]

mlir/include/mlir/Dialect/Affine/IR/AffineOps.td

//===- AffineOps.td - Affine operation definitions ---------- tablegen --===//		//===- AffineOps.td - Affine operation definitions ---------- tablegen --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// Defines MLIR affine operations.		// Defines MLIR affine operations.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef AFFINE_OPS		#ifndef AFFINE_OPS
#define AFFINE_OPS		#define AFFINE_OPS

include "mlir/Dialect/Affine/IR/AffineOpsBase.td"		include "mlir/Dialect/Affine/IR/AffineOpsBase.td"
		include "mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.td"
include "mlir/Interfaces/LoopLikeInterface.td"		include "mlir/Interfaces/LoopLikeInterface.td"
include "mlir/Interfaces/SideEffectInterfaces.td"		include "mlir/Interfaces/SideEffectInterfaces.td"

def Affine_Dialect : Dialect {		def Affine_Dialect : Dialect {
let name = "affine";		let name = "affine";
let cppNamespace = "";		let cppNamespace = "";
let hasConstantMaterializer = 1;		let hasConstantMaterializer = 1;
}		}
[341 lines not shown]	let extraClassDeclaration = [{
}		}
}];		}];

let hasCanonicalizer = 1;		let hasCanonicalizer = 1;
let hasFolder = 1;		let hasFolder = 1;
}		}

class AffineLoadOpBase<string mnemonic, list<OpTrait> traits = []> :		class AffineLoadOpBase<string mnemonic, list<OpTrait> traits = []> :
Affine_Op<mnemonic, traits> {		Affine_Op<mnemonic, !listconcat(traits,
		[DeclareOpInterfaceMethods<AffineReadOpInterface>])> {
let arguments = (ins Arg<AnyMemRef, "the reference to load from",		let arguments = (ins Arg<AnyMemRef, "the reference to load from",
[MemRead]>:$memref,		[MemRead]>:$memref,
Variadic<Index>:$indices);		Variadic<Index>:$indices);

code extraClassDeclarationBase = [{		code extraClassDeclarationBase = [{
/// Returns the operand index of the memref.		/// Returns the operand index of the memref.
unsigned getMemRefOperandIndex() { return 0; }		unsigned getMemRefOperandIndex() { return 0; }

/// Get memref operand.
Value getMemRef() { return getOperand(getMemRefOperandIndex()); }
void setMemRef(Value value) { setOperand(getMemRefOperandIndex(), value); }		void setMemRef(Value value) { setOperand(getMemRefOperandIndex(), value); }
MemRefType getMemRefType() {
return getMemRef().getType().cast<MemRefType>();
}

/// Get affine map operands.
operand_range getMapOperands() { return llvm::drop_begin(getOperands(), 1); }

/// Returns the affine map used to index the memref for this operation.		/// Returns the affine map used to index the memref for this operation.
AffineMap getAffineMap() { return getAffineMapAttr().getValue(); }
AffineMapAttr getAffineMapAttr() {		AffineMapAttr getAffineMapAttr() {
return getAttr(getMapAttrName()).cast<AffineMapAttr>();		return getAttr(getMapAttrName()).cast<AffineMapAttr>();
}		}

/// Returns the AffineMapAttr associated with 'memref'.		/// Returns the AffineMapAttr associated with 'memref'.
NamedAttribute getAffineMapAttrForMemRef(Value memref) {		NamedAttribute getAffineMapAttrForMemRef(Value memref) {
assert(memref == getMemRef());		assert(memref == getMemRef());
return {Identifier::get(getMapAttrName(), getContext()),		return {Identifier::get(getMapAttrName(), getContext()),
getAffineMapAttr()};		getAffineMapAttr()};
}		}

static StringRef getMapAttrName() { return "map"; }		static StringRef getMapAttrName() { return "map"; }
}];		}];
}		}

def AffineLoadOp : AffineLoadOpBase<"load", []> {		def AffineLoadOp : AffineLoadOpBase<"load"> {
let summary = "affine load operation";		let summary = "affine load operation";
let description = [{		let description = [{
The "affine.load" op reads an element from a memref, where the index		The "affine.load" op reads an element from a memref, where the index
for each memref dimension is an affine expression of loop induction		for each memref dimension is an affine expression of loop induction
variables and symbols. The output of 'affine.load' is a new value with the		variables and symbols. The output of 'affine.load' is a new value with the
same type as the elements of the memref. An affine expression of loop IVs		same type as the elements of the memref. An affine expression of loop IVs
and symbols must be specified for each dimension of the memref. The keyword		and symbols must be specified for each dimension of the memref. The keyword
'symbol' can be used to indicate SSA identifiers which are symbolic.		'symbol' can be used to indicate SSA identifiers which are symbolic.
[242 lines not shown]	let extraClassDeclaration = [{
static StringRef getIsDataCacheAttrName() { return "isDataCache"; }		static StringRef getIsDataCacheAttrName() { return "isDataCache"; }
}];		}];

let hasCanonicalizer = 1;		let hasCanonicalizer = 1;
let hasFolder = 1;		let hasFolder = 1;
}		}

class AffineStoreOpBase<string mnemonic, list<OpTrait> traits = []> :		class AffineStoreOpBase<string mnemonic, list<OpTrait> traits = []> :
Affine_Op<mnemonic, traits> {		Affine_Op<mnemonic, !listconcat(traits,
		[DeclareOpInterfaceMethods<AffineWriteOpInterface>])> {
code extraClassDeclarationBase = [{		code extraClassDeclarationBase = [{
/// Get value to be stored by store operation.		/// Get value to be stored by store operation.
Value getValueToStore() { return getOperand(0); }		Value getValueToStore() { return getOperand(0); }

/// Returns the operand index of the memref.		/// Returns the operand index of the memref.
unsigned getMemRefOperandIndex() { return 1; }		unsigned getMemRefOperandIndex() { return 1; }

/// Get memref operand.
Value getMemRef() { return getOperand(getMemRefOperandIndex()); }
void setMemRef(Value value) { setOperand(getMemRefOperandIndex(), value); }		void setMemRef(Value value) { setOperand(getMemRefOperandIndex(), value); }

MemRefType getMemRefType() {
return getMemRef().getType().cast<MemRefType>();
}

/// Get affine map operands.
operand_range getMapOperands() { return llvm::drop_begin(getOperands(), 2); }

/// Returns the affine map used to index the memref for this operation.		/// Returns the affine map used to index the memref for this operation.
AffineMap getAffineMap() { return getAffineMapAttr().getValue(); }
AffineMapAttr getAffineMapAttr() {		AffineMapAttr getAffineMapAttr() {
return getAttr(getMapAttrName()).cast<AffineMapAttr>();		return getAttr(getMapAttrName()).cast<AffineMapAttr>();
}		}

/// Returns the AffineMapAttr associated with 'memref'.		/// Returns the AffineMapAttr associated with 'memref'.
NamedAttribute getAffineMapAttrForMemRef(Value memref) {		NamedAttribute getAffineMapAttrForMemRef(Value memref) {
assert(memref == getMemRef());		assert(memref == getMemRef());
return {Identifier::get(getMapAttrName(), getContext()),		return {Identifier::get(getMapAttrName(), getContext()),
getAffineMapAttr()};		getAffineMapAttr()};
}		}

static StringRef getMapAttrName() { return "map"; }		static StringRef getMapAttrName() { return "map"; }
}];		}];
}		}

def AffineStoreOp : AffineStoreOpBase<"store", []> {		def AffineStoreOp : AffineStoreOpBase<"store"> {
let summary = "affine store operation";		let summary = "affine store operation";
let description = [{		let description = [{
The "affine.store" op writes an element to a memref, where the index		The "affine.store" op writes an element to a memref, where the index
for each memref dimension is an affine expression of loop induction		for each memref dimension is an affine expression of loop induction
variables and symbols. The 'affine.store' op stores a new value which is the		variables and symbols. The 'affine.store' op stores a new value which is the
same type as the elements of the memref. An affine expression of loop IVs		same type as the elements of the memref. An affine expression of loop IVs
and symbols must be specified for each dimension of the memref. The keyword		and symbols must be specified for each dimension of the memref. The keyword
'symbol' can be used to indicate SSA identifiers which are symbolic.		'symbol' can be used to indicate SSA identifiers which are symbolic.
[56 lines not shown]	def AffineTerminatorOp :
// No custom parsing/printing form.		// No custom parsing/printing form.
let parser = ?;		let parser = ?;
let printer = ?;		let printer = ?;

// Fully specified by traits.		// Fully specified by traits.
let verifier = ?;		let verifier = ?;
}		}

def AffineVectorLoadOp : AffineLoadOpBase<"vector_load", []> {		def AffineVectorLoadOp : AffineLoadOpBase<"vector_load"> {
let summary = "affine vector load operation";		let summary = "affine vector load operation";
let description = [{		let description = [{
The "affine.vector_load" is the vector counterpart of		The "affine.vector_load" is the vector counterpart of
[affine.load](#affineload-operation). It reads a slice from a		[affine.load](#affineload-operation). It reads a slice from a
[MemRef](../LangRef.md#memref-type), supplied as its first operand,		[MemRef](../LangRef.md#memref-type), supplied as its first operand,
into a [vector](../LangRef.md#vector-type) of the same base elemental type.		into a [vector](../LangRef.md#vector-type) of the same base elemental type.
The index for each memref dimension is an affine expression of loop induction		The index for each memref dimension is an affine expression of loop induction
variables and symbols. These indices determine the start position of the read		variables and symbols. These indices determine the start position of the read
[32 lines not shown]	def AffineVectorLoadOp : AffineLoadOpBase<"vector_load"> {

let extraClassDeclaration = extraClassDeclarationBase # [{		let extraClassDeclaration = extraClassDeclarationBase # [{
VectorType getVectorType() {		VectorType getVectorType() {
return result().getType().cast<VectorType>();		return result().getType().cast<VectorType>();
}		}
}];		}];
}		}

def AffineVectorStoreOp : AffineStoreOpBase<"vector_store", []> {		def AffineVectorStoreOp : AffineStoreOpBase<"vector_store"> {
let summary = "affine vector store operation";		let summary = "affine vector store operation";
let description = [{		let description = [{
The "affine.vector_store" is the vector counterpart of		The "affine.vector_store" is the vector counterpart of
[affine.store](#affinestore-affinestoreop). It writes a		[affine.store](#affinestore-affinestoreop). It writes a
[vector](../LangRef.md#vector-type), supplied as its first operand,		[vector](../LangRef.md#vector-type), supplied as its first operand,
into a slice within a [MemRef](../LangRef.md#memref-type) of the same base		into a slice within a [MemRef](../LangRef.md#memref-type) of the same base
elemental type, supplied as its second operand.		elemental type, supplied as its second operand.
The index for each memref dimension is an affine expression of loop		The index for each memref dimension is an affine expression of loop
[46 lines not shown]

mlir/include/mlir/Dialect/Affine/IR/CMakeLists.txt

	add_mlir_dialect(AffineOps affine)			add_mlir_dialect(AffineOps affine)
	add_mlir_doc(AffineOps -gen-op-doc AffineOps Dialects/)			add_mlir_doc(AffineOps -gen-op-doc AffineOps Dialects/)
				ftynse: Please do :)
				dcaballe (author): Sorry, I wanted to ask about it since I didn't see any other tablegen->tablegen dependencies like this one so I'm not sure how to do it. For example, we should also have a dependency with the loop-like interface here, right? I added a dependency between both targets but this is not enough. I think we need to add a dependency between the dialect target and the interface .td file. Not sure how to do that cleanly.

				set(LLVM_TARGET_DEFINITIONS AffineMemoryOpInterfaces.td)
				rriddle: Seems like this should be using add_mlir_interface?
				mlir_tablegen(AffineMemoryOpInterfaces.h.inc -gen-op-interface-decls)
				mlir_tablegen(AffineMemoryOpInterfaces.cpp.inc -gen-op-interface-defs)
				add_public_tablegen_target(MLIRAffineMemoryOpInterfacesIncGen)
				add_dependencies(mlir-generic-headers MLIRAffineMemoryOpInterfacesIncGen)

				add_dependencies(MLIRAffineOpsIncGen MLIRAffineMemoryOpInterfacesIncGen)

mlir/lib/Analysis/AffineAnalysis.cpp

[654 lines not shown]	static void computeDirectionVector(
}		}
}		}

// Populates 'accessMap' with composition of AffineApplyOps reachable from		// Populates 'accessMap' with composition of AffineApplyOps reachable from
// indices of MemRefAccess.		// indices of MemRefAccess.
void MemRefAccess::getAccessMap(AffineValueMap *accessMap) const {		void MemRefAccess::getAccessMap(AffineValueMap *accessMap) const {
// Get affine map from AffineLoad/Store.		// Get affine map from AffineLoad/Store.
AffineMap map;		AffineMap map;
if (auto loadOp = dyn_cast<AffineLoadOp>(opInst))		if (auto loadOp = dyn_cast<AffineReadOpInterface>(opInst)) {
map = loadOp.getAffineMap();		map = loadOp.getAffineMap();
else if (auto storeOp = dyn_cast<AffineStoreOp>(opInst))		} else {
		auto storeOp = cast<AffineWriteOpInterface>(opInst);
		ftynse: Nit: please make braces symmetric
		rriddle: nit: I would have just inlined the use of the storeOp
map = storeOp.getAffineMap();		map = storeOp.getAffineMap();
		}
SmallVector<Value, 8> operands(indices.begin(), indices.end());		SmallVector<Value, 8> operands(indices.begin(), indices.end());
fullyComposeAffineMapAndOperands(&map, &operands);		fullyComposeAffineMapAndOperands(&map, &operands);
map = simplifyAffineMap(map);		map = simplifyAffineMap(map);
canonicalizeMapAndOperands(&map, &operands);		canonicalizeMapAndOperands(&map, &operands);
accessMap->reset(map, operands);		accessMap->reset(map, operands);
}		}

// Builds a flat affine constraint system to check if there exists a dependence		// Builds a flat affine constraint system to check if there exists a dependence
[91 lines not shown]	LLVM_DEBUG(llvm::dbgs() << "Checking for dependence at depth: "
<< Twine(loopDepth) << " between:\n";);		<< Twine(loopDepth) << " between:\n";);
LLVM_DEBUG(srcAccess.opInst->dump(););		LLVM_DEBUG(srcAccess.opInst->dump(););
LLVM_DEBUG(dstAccess.opInst->dump(););		LLVM_DEBUG(dstAccess.opInst->dump(););

// Return 'NoDependence' if these accesses do not access the same memref.		// Return 'NoDependence' if these accesses do not access the same memref.
if (srcAccess.memref != dstAccess.memref)		if (srcAccess.memref != dstAccess.memref)
return DependenceResult::NoDependence;		return DependenceResult::NoDependence;

// Return 'NoDependence' if one of these accesses is not an AffineStoreOp.		// Return 'NoDependence' if one of these accesses is not an
if (!allowRAR && !isa<AffineStoreOp>(srcAccess.opInst) &&		// AffineWriteOpInterface.
!isa<AffineStoreOp>(dstAccess.opInst))		if (!allowRAR && !isa<AffineWriteOpInterface>(srcAccess.opInst) &&
		!isa<AffineWriteOpInterface>(dstAccess.opInst))
return DependenceResult::NoDependence;		return DependenceResult::NoDependence;

// Get composed access function for 'srcAccess'.		// Get composed access function for 'srcAccess'.
AffineValueMap srcAccessMap;		AffineValueMap srcAccessMap;
srcAccess.getAccessMap(&srcAccessMap);		srcAccess.getAccessMap(&srcAccessMap);

// Get composed access function for 'dstAccess'.		// Get composed access function for 'dstAccess'.
AffineValueMap dstAccessMap;		AffineValueMap dstAccessMap;
[67 lines not shown]
/// Gathers dependence components for dependences between all ops in loop nest		/// Gathers dependence components for dependences between all ops in loop nest
/// rooted at 'forOp' at loop depths in range [1, maxLoopDepth].		/// rooted at 'forOp' at loop depths in range [1, maxLoopDepth].
void mlir::getDependenceComponents(		void mlir::getDependenceComponents(
AffineForOp forOp, unsigned maxLoopDepth,		AffineForOp forOp, unsigned maxLoopDepth,
std::vector<SmallVector<DependenceComponent, 2>> *depCompsVec) {		std::vector<SmallVector<DependenceComponent, 2>> *depCompsVec) {
// Collect all load and store ops in loop nest rooted at 'forOp'.		// Collect all load and store ops in loop nest rooted at 'forOp'.
SmallVector<Operation *, 8> loadAndStoreOpInsts;		SmallVector<Operation *, 8> loadAndStoreOpInsts;
forOp.getOperation()->walk([&](Operation *opInst) {		forOp.getOperation()->walk([&](Operation *opInst) {
if (isa<AffineLoadOp>(opInst) || isa<AffineStoreOp>(opInst))		if (isa<AffineReadOpInterface>(opInst) ||
		isa<AffineWriteOpInterface>(opInst))
loadAndStoreOpInsts.push_back(opInst);		loadAndStoreOpInsts.push_back(opInst);
});		});

unsigned numOps = loadAndStoreOpInsts.size();		unsigned numOps = loadAndStoreOpInsts.size();
for (unsigned d = 1; d <= maxLoopDepth; ++d) {		for (unsigned d = 1; d <= maxLoopDepth; ++d) {
for (unsigned i = 0; i < numOps; ++i) {		for (unsigned i = 0; i < numOps; ++i) {
auto *srcOpInst = loadAndStoreOpInsts[i];		auto *srcOpInst = loadAndStoreOpInsts[i];
MemRefAccess srcAccess(srcOpInst);		MemRefAccess srcAccess(srcOpInst);
[16 lines not shown]
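
The early exits in `checkMemrefAccessDependence` shown in this file report no dependence when the two accesses touch different memrefs, or when read-after-read pairs are not allowed and neither access is write-like. A minimal sketch of that filter on plain values follows. It is a simplified model rather than the MLIR code: `Access` and `needsDependenceCheck` are hypothetical names, and memrefs are modeled as integer ids.

```cpp
// Simplified model of one memory access; memrefs are modeled as integer ids.
struct Access {
  int memrefId;
  bool isWrite;  // true for write-like ops, false for read-like ops
};

// Returns true when the pair survives both early exits and the full
// dependence analysis would have to run.
constexpr bool needsDependenceCheck(Access src, Access dst, bool allowRAR) {
  return src.memrefId == dst.memrefId &&
         (allowRAR || src.isWrite || dst.isWrite);
}

// Two reads of the same memref are skipped unless RAR pairs are requested.
static_assert(!needsDependenceCheck({0, false}, {0, false}, false),
              "read/read pair is filtered out");
static_assert(needsDependenceCheck({0, false}, {0, false}, true),
              "read/read pair is kept when allowRAR is set");

// A single write on either side is enough to require the check.
static_assert(needsDependenceCheck({0, false}, {0, true}, false),
              "read/write pair is kept");
```

Because the patch tests `isa<AffineWriteOpInterface>` instead of `isa<AffineStoreOp>`, any op that implements the write interface, such as `affine.vector_store`, plays the role of `isWrite == true` in this model.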

mlir/lib/Analysis/Utils.cpp

[190 lines not shown]
// region: {memref = %A, write = false, {%i <= m0 <= %i + 7} }		// region: {memref = %A, write = false, {%i <= m0 <= %i + 7} }
// The last field is a 2-d FlatAffineConstraints symbolic in %i.		// The last field is a 2-d FlatAffineConstraints symbolic in %i.
//		//
// TODO(bondhugula): extend this to any other memref dereferencing ops		// TODO(bondhugula): extend this to any other memref dereferencing ops
// (dma_start, dma_wait).		// (dma_start, dma_wait).
LogicalResult MemRefRegion::compute(Operation *op, unsigned loopDepth,		LogicalResult MemRefRegion::compute(Operation *op, unsigned loopDepth,
ComputationSliceState *sliceState,		ComputationSliceState *sliceState,
bool addMemRefDimBounds) {		bool addMemRefDimBounds) {
assert((isa<AffineLoadOp>(op) || isa<AffineStoreOp>(op)) &&		assert((isa<AffineReadOpInterface>(op) || isa<AffineWriteOpInterface>(op)) &&
"affine load/store op expected");		"affine read/write op expected");

MemRefAccess access(op);		MemRefAccess access(op);
memref = access.memref;		memref = access.memref;
write = access.isStore();		write = access.isStore();

unsigned rank = access.getRank();		unsigned rank = access.getRank();

LLVM_DEBUG(llvm::dbgs() << "MemRefRegion::compute: " << *op		LLVM_DEBUG(llvm::dbgs() << "MemRefRegion::compute: " << *op
[190 lines not shown]	for (unsigned i = 0, e = memRefType.getRank(); i < e; i++) {
sizeInBytes = sizeInBytes * memRefType.getDimSize(i);		sizeInBytes = sizeInBytes * memRefType.getDimSize(i);
}		}
return sizeInBytes;		return sizeInBytes;
}		}

template <typename LoadOrStoreOp>		template <typename LoadOrStoreOp>
LogicalResult mlir::boundCheckLoadOrStoreOp(LoadOrStoreOp loadOrStoreOp,		LogicalResult mlir::boundCheckLoadOrStoreOp(LoadOrStoreOp loadOrStoreOp,
bool emitError) {		bool emitError) {
static_assert(		static_assert(llvm::is_one_of<LoadOrStoreOp, AffineReadOpInterface,
llvm::is_one_of<LoadOrStoreOp, AffineLoadOp, AffineStoreOp>::value,		AffineWriteOpInterface>::value,
"argument should be either a AffineLoadOp or a AffineStoreOp");		"argument should be either a AffineReadOpInterface or a "
		"AffineWriteOpInterface");

Operation *op = loadOrStoreOp.getOperation();		Operation *op = loadOrStoreOp.getOperation();
MemRefRegion region(op->getLoc());		MemRefRegion region(op->getLoc());
if (failed(region.compute(op, /*loopDepth=*/0, /*sliceState=*/nullptr,		if (failed(region.compute(op, /*loopDepth=*/0, /*sliceState=*/nullptr,
/*addMemRefDimBounds=*/false)))		/*addMemRefDimBounds=*/false)))
return success();		return success();

LLVM_DEBUG(llvm::dbgs() << "Memory region");		LLVM_DEBUG(llvm::dbgs() << "Memory region");
[33 lines not shown]	if (outOfBounds && emitError) {
loadOrStoreOp.emitOpError()		loadOrStoreOp.emitOpError()
<< "memref out of lower bound access along dimension #" << (r + 1);		<< "memref out of lower bound access along dimension #" << (r + 1);
}		}
}		}
return failure(outOfBounds);		return failure(outOfBounds);
}		}

// Explicitly instantiate the template so that the compiler knows we need them!		// Explicitly instantiate the template so that the compiler knows we need them!
template LogicalResult mlir::boundCheckLoadOrStoreOp(AffineLoadOp loadOp,		template LogicalResult
bool emitError);		mlir::boundCheckLoadOrStoreOp(AffineReadOpInterface loadOp, bool emitError);
template LogicalResult mlir::boundCheckLoadOrStoreOp(AffineStoreOp storeOp,		template LogicalResult
bool emitError);		mlir::boundCheckLoadOrStoreOp(AffineWriteOpInterface storeOp, bool emitError);

// Returns in 'positions' the Block positions of 'op' in each ancestor		// Returns in 'positions' the Block positions of 'op' in each ancestor
// Block from the Block containing operation, stopping at 'limitBlock'.		// Block from the Block containing operation, stopping at 'limitBlock'.
static void findInstPosition(Operation *op, Block *limitBlock,		static void findInstPosition(Operation *op, Block *limitBlock,
SmallVectorImpl<unsigned> *positions) {		SmallVectorImpl<unsigned> *positions) {
Block *block = op->getBlock();		Block *block = op->getBlock();
while (block != limitBlock) {		while (block != limitBlock) {
// FIXME: This algorithm is unnecessarily O(n) and should be improved to not		// FIXME: This algorithm is unnecessarily O(n) and should be improved to not
[99 lines not shown]	for (unsigned j = 0, numOpsB = opsB.size(); j < numOpsB; ++j) {
continue;		continue;
// Check if 'loopDepth' exceeds nesting depth of src/dst ops.		// Check if 'loopDepth' exceeds nesting depth of src/dst ops.
if ((!isBackwardSlice && loopDepth > getNestingDepth(opsA[i])) ||		if ((!isBackwardSlice && loopDepth > getNestingDepth(opsA[i])) ||
(isBackwardSlice && loopDepth > getNestingDepth(opsB[j]))) {		(isBackwardSlice && loopDepth > getNestingDepth(opsB[j]))) {
LLVM_DEBUG(llvm::dbgs() << "Invalid loop depth\n");		LLVM_DEBUG(llvm::dbgs() << "Invalid loop depth\n");
return failure();		return failure();
}		}

bool readReadAccesses = isa<AffineLoadOp>(srcAccess.opInst) &&		bool readReadAccesses = isa<AffineReadOpInterface>(srcAccess.opInst) &&
isa<AffineLoadOp>(dstAccess.opInst);		isa<AffineReadOpInterface>(dstAccess.opInst);
FlatAffineConstraints dependenceConstraints;		FlatAffineConstraints dependenceConstraints;
// Check dependence between 'srcAccess' and 'dstAccess'.		// Check dependence between 'srcAccess' and 'dstAccess'.
DependenceResult result = checkMemrefAccessDependence(		DependenceResult result = checkMemrefAccessDependence(
srcAccess, dstAccess, /*loopDepth=*/numCommonLoops + 1,		srcAccess, dstAccess, /*loopDepth=*/numCommonLoops + 1,
&dependenceConstraints, /*dependenceComponents=*/nullptr,		&dependenceConstraints, /*dependenceComponents=*/nullptr,
/*allowRAR=*/readReadAccesses);		/*allowRAR=*/readReadAccesses);
if (result.value == DependenceResult::Failure) {		if (result.value == DependenceResult::Failure) {
LLVM_DEBUG(llvm::dbgs() << "Dependence check failed\n");		LLVM_DEBUG(llvm::dbgs() << "Dependence check failed\n");
[175 lines not shown]	void mlir::getComputationSliceState(
sliceState->ubOperands.resize(numSliceLoopIVs, sliceBoundOperands);		sliceState->ubOperands.resize(numSliceLoopIVs, sliceBoundOperands);

// Set destination loop nest insertion point to block start at 'dstLoopDepth'.		// Set destination loop nest insertion point to block start at 'dstLoopDepth'.
sliceState->insertPoint =		sliceState->insertPoint =
isBackwardSlice ? dstLoopIVs[loopDepth - 1].getBody()->begin()		isBackwardSlice ? dstLoopIVs[loopDepth - 1].getBody()->begin()
: std::prev(srcLoopIVs[loopDepth - 1].getBody()->end());		: std::prev(srcLoopIVs[loopDepth - 1].getBody()->end());

llvm::SmallDenseSet<Value, 8> sequentialLoops;		llvm::SmallDenseSet<Value, 8> sequentialLoops;
if (isa<AffineLoadOp>(depSourceOp) && isa<AffineLoadOp>(depSinkOp)) {		if (isa<AffineReadOpInterface>(depSourceOp) &&
		isa<AffineReadOpInterface>(depSinkOp)) {
// For read-read access pairs, clear any slice bounds on sequential loops.		// For read-read access pairs, clear any slice bounds on sequential loops.
// Get sequential loops in loop nest rooted at 'srcLoopIVs[0]'.		// Get sequential loops in loop nest rooted at 'srcLoopIVs[0]'.
getSequentialLoops(isBackwardSlice ? srcLoopIVs[0] : dstLoopIVs[0],		getSequentialLoops(isBackwardSlice ? srcLoopIVs[0] : dstLoopIVs[0],
&sequentialLoops);		&sequentialLoops);
}		}
// Clear all sliced loop bounds beginning at the first sequential loop, or		// Clear all sliced loop bounds beginning at the first sequential loop, or
// first loop with a slice fusion barrier attribute..		// first loop with a slice fusion barrier attribute..
// TODO(andydavis, bondhugula) Use MemRef read/write regions instead of		// TODO(andydavis, bondhugula) Use MemRef read/write regions instead of
[80 lines not shown]	if (AffineMap ubMap = sliceState->ubs[i])
forOp.setUpperBound(sliceState->ubOperands[i], ubMap);		forOp.setUpperBound(sliceState->ubOperands[i], ubMap);
}		}
return sliceLoopNest;		return sliceLoopNest;
}		}

// Constructs MemRefAccess populating it with the memref, its indices and		// Constructs MemRefAccess populating it with the memref, its indices and
// opinst from 'loadOrStoreOpInst'.		// opinst from 'loadOrStoreOpInst'.
MemRefAccess::MemRefAccess(Operation *loadOrStoreOpInst) {		MemRefAccess::MemRefAccess(Operation *loadOrStoreOpInst) {
if (auto loadOp = dyn_cast<AffineLoadOp>(loadOrStoreOpInst)) {		if (auto loadOp = dyn_cast<AffineReadOpInterface>(loadOrStoreOpInst)) {
memref = loadOp.getMemRef();		memref = loadOp.getMemRef();
opInst = loadOrStoreOpInst;		opInst = loadOrStoreOpInst;
auto loadMemrefType = loadOp.getMemRefType();		auto loadMemrefType = loadOp.getMemRefType();
indices.reserve(loadMemrefType.getRank());		indices.reserve(loadMemrefType.getRank());
for (auto index : loadOp.getMapOperands()) {		for (auto index : loadOp.getMapOperands()) {
indices.push_back(index);		indices.push_back(index);
}		}
} else {		} else {
assert(isa<AffineStoreOp>(loadOrStoreOpInst) && "load/store op expected");		assert(isa<AffineWriteOpInterface>(loadOrStoreOpInst) &&
auto storeOp = dyn_cast<AffineStoreOp>(loadOrStoreOpInst);		"Affine read/write op expected");
		auto storeOp = cast<AffineWriteOpInterface>(loadOrStoreOpInst);
opInst = loadOrStoreOpInst;		opInst = loadOrStoreOpInst;
memref = storeOp.getMemRef();		memref = storeOp.getMemRef();
auto storeMemrefType = storeOp.getMemRefType();		auto storeMemrefType = storeOp.getMemRefType();
indices.reserve(storeMemrefType.getRank());		indices.reserve(storeMemrefType.getRank());
for (auto index : storeOp.getMapOperands()) {		for (auto index : storeOp.getMapOperands()) {
indices.push_back(index);		indices.push_back(index);
}		}
}		}
}		}

unsigned MemRefAccess::getRank() const {		unsigned MemRefAccess::getRank() const {
return memref.getType().cast<MemRefType>().getRank();		return memref.getType().cast<MemRefType>().getRank();
}		}

bool MemRefAccess::isStore() const { return isa<AffineStoreOp>(opInst); }		bool MemRefAccess::isStore() const {
		return isa<AffineWriteOpInterface>(opInst);
		}

/// Returns the nesting depth of this statement, i.e., the number of loops		/// Returns the nesting depth of this statement, i.e., the number of loops
/// surrounding this statement.		/// surrounding this statement.
unsigned mlir::getNestingDepth(Operation *op) {		unsigned mlir::getNestingDepth(Operation *op) {
Operation *currOp = op;		Operation *currOp = op;
unsigned depth = 0;		unsigned depth = 0;
while ((currOp = currOp->getParentOp())) {		while ((currOp = currOp->getParentOp())) {
if (isa<AffineForOp>(currOp))		if (isa<AffineForOp>(currOp))
[40 lines not shown]
static Optional<int64_t> getMemoryFootprintBytes(Block &block,		static Optional<int64_t> getMemoryFootprintBytes(Block &block,
Block::iterator start,		Block::iterator start,
Block::iterator end,		Block::iterator end,
int memorySpace) {		int memorySpace) {
SmallDenseMap<Value, std::unique_ptr<MemRefRegion>, 4> regions;		SmallDenseMap<Value, std::unique_ptr<MemRefRegion>, 4> regions;

// Walk this 'affine.for' operation to gather all memory regions.		// Walk this 'affine.for' operation to gather all memory regions.
auto result = block.walk(start, end, [&](Operation *opInst) -> WalkResult {		auto result = block.walk(start, end, [&](Operation *opInst) -> WalkResult {
if (!isa<AffineLoadOp>(opInst) && !isa<AffineStoreOp>(opInst)) {		if (!isa<AffineReadOpInterface>(opInst) &&
		!isa<AffineWriteOpInterface>(opInst)) {
// Neither load nor a store op.		// Neither load nor a store op.
return WalkResult::advance();		return WalkResult::advance();
}		}

// Compute the memref region symbolic in any IVs enclosing this block.		// Compute the memref region symbolic in any IVs enclosing this block.
auto region = std::make_unique<MemRefRegion>(opInst->getLoc());		auto region = std::make_unique<MemRefRegion>(opInst->getLoc());
if (failed(		if (failed(
region->compute(opInst,		region->compute(opInst,
[43 lines not shown]	void mlir::getSequentialLoops(AffineForOp forOp,
});		});
}		}

/// Returns true if 'forOp' is parallel.		/// Returns true if 'forOp' is parallel.
bool mlir::isLoopParallel(AffineForOp forOp) {		bool mlir::isLoopParallel(AffineForOp forOp) {
// Collect all load and store ops in loop nest rooted at 'forOp'.		// Collect all load and store ops in loop nest rooted at 'forOp'.
SmallVector<Operation *, 8> loadAndStoreOpInsts;		SmallVector<Operation *, 8> loadAndStoreOpInsts;
auto walkResult = forOp.walk([&](Operation *opInst) -> WalkResult {		auto walkResult = forOp.walk([&](Operation *opInst) -> WalkResult {
if (isa<AffineLoadOp>(opInst) || isa<AffineStoreOp>(opInst))		if (isa<AffineReadOpInterface>(opInst) ||
		isa<AffineWriteOpInterface>(opInst))
loadAndStoreOpInsts.push_back(opInst);		loadAndStoreOpInsts.push_back(opInst);
else if (!isa<AffineForOp>(opInst) && !isa<AffineTerminatorOp>(opInst) &&		else if (!isa<AffineForOp>(opInst) && !isa<AffineTerminatorOp>(opInst) &&
!isa<AffineIfOp>(opInst) &&		!isa<AffineIfOp>(opInst) &&
!MemoryEffectOpInterface::hasNoEffect(opInst))		!MemoryEffectOpInterface::hasNoEffect(opInst))
return WalkResult::interrupt();		return WalkResult::interrupt();

return WalkResult::advance();		return WalkResult::advance();
});		});
[35 lines not shown]

mlir/lib/Dialect/Affine/IR/AffineMemoryOpInterfaces.cpp

This file was added.

				//===- AffineMemoryOpInterfaces.cpp - Loop-like operations in MLIR --------===//
				rriddle (inline comment, Not Done): Loop-Like?
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.h"

				using namespace mlir;

				//===----------------------------------------------------------------------===//
				// Affine Memory Op Interfaces
				//===----------------------------------------------------------------------===//

				/// Include the definitions of the affine memory op interfaces.
				#include "mlir/Dialect/Affine/IR/AffineMemoryOpInterfaces.cpp.inc"
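
The pattern this patch applies throughout the analysis and fusion code (query an interface instead of a concrete op class) can be sketched outside MLIR as follows. This is only an illustration under stated assumptions: `Op`, `ReadLike`, `WriteLike`, `ScalarLoad`, `VectorLoad`, `ScalarStore`, `isStore`, `countReads` and `accessedMemRef` are hypothetical names, memrefs are modeled as strings, and `dynamic_cast` stands in for MLIR's `isa`/`dyn_cast`/`cast`. MLIR op interfaces are not implemented with C++ inheritance or RTTI.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a generic operation handle (not an MLIR class).
struct Op {
  virtual ~Op() = default;
};

// Plays the role of AffineReadOpInterface: any op that reads from a memref.
struct ReadLike {
  virtual ~ReadLike() = default;
  virtual const std::string &getMemRef() const = 0;
};

// Plays the role of AffineWriteOpInterface: any op that writes to a memref.
struct WriteLike {
  virtual ~WriteLike() = default;
  virtual const std::string &getMemRef() const = 0;
};

// Two different read ops and one write op, each implementing an interface.
struct ScalarLoad : Op, ReadLike {
  std::string memref;
  explicit ScalarLoad(std::string m) : memref(std::move(m)) {}
  const std::string &getMemRef() const override { return memref; }
};

struct VectorLoad : Op, ReadLike {
  std::string memref;
  explicit VectorLoad(std::string m) : memref(std::move(m)) {}
  const std::string &getMemRef() const override { return memref; }
};

struct ScalarStore : Op, WriteLike {
  std::string memref;
  explicit ScalarStore(std::string m) : memref(std::move(m)) {}
  const std::string &getMemRef() const override { return memref; }
};

// Analogous to MemRefAccess::isStore() after the patch: the check is against
// the write interface, so every write-like op qualifies.
bool isStore(Op *op) { return dynamic_cast<WriteLike *>(op) != nullptr; }

// Analogous to Node::getLoadOpCount(): counts reads of 'memref' regardless of
// which concrete read op performs them.
unsigned countReads(const std::vector<Op *> &ops, const std::string &memref) {
  unsigned count = 0;
  for (Op *op : ops)
    if (auto *read = dynamic_cast<ReadLike *>(op))
      if (read->getMemRef() == memref)
        ++count;
  return count;
}

// Analogous to the MemRefAccess constructor: try the read interface first,
// otherwise require the write interface.
std::string accessedMemRef(Op *op) {
  if (auto *read = dynamic_cast<ReadLike *>(op))
    return read->getMemRef();
  auto *write = dynamic_cast<WriteLike *>(op);
  assert(write && "read/write op expected");
  return write->getMemRef();
}
```

With checks written this way, a new read or write op only has to implement the interface to be picked up by existing analyses, which is why the summary notes that DMA ops could implement the same interfaces later.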

mlir/lib/Dialect/Affine/IR/CMakeLists.txt

	add_mlir_dialect_library(MLIRAffineOps			add_mlir_dialect_library(MLIRAffineOps
				AffineMemoryOpInterfaces.cpp
	AffineOps.cpp			AffineOps.cpp
	AffineValueMap.cpp			AffineValueMap.cpp

	ADDITIONAL_HEADER_DIRS			ADDITIONAL_HEADER_DIRS
	${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/Affine			${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/Affine

	DEPENDS			DEPENDS
				MLIRAffineMemoryOpInterfacesIncGen
	MLIRAffineOpsIncGen			MLIRAffineOpsIncGen

	LINK_LIBS PUBLIC			LINK_LIBS PUBLIC
	MLIREDSC			MLIREDSC
	MLIRIR			MLIRIR
	MLIRLoopLikeInterface			MLIRLoopLikeInterface
	MLIRSideEffectInterfaces			MLIRSideEffectInterfaces
	MLIRStandardOps			MLIRStandardOps
	)			)

mlir/lib/Transforms/LoopFusion.cpp

[64 lines not shown]
mlir::createLoopFusionPass(unsigned fastMemorySpace,		mlir::createLoopFusionPass(unsigned fastMemorySpace,
uint64_t localBufSizeThreshold, bool maximalFusion) {		uint64_t localBufSizeThreshold, bool maximalFusion) {
return std::make_unique<LoopFusion>(fastMemorySpace, localBufSizeThreshold,		return std::make_unique<LoopFusion>(fastMemorySpace, localBufSizeThreshold,
maximalFusion);		maximalFusion);
}		}

// TODO(b/117228571) Replace when this is modeled through side-effects/op traits		// TODO(b/117228571) Replace when this is modeled through side-effects/op traits
static bool isMemRefDereferencingOp(Operation &op) {		static bool isMemRefDereferencingOp(Operation &op) {
if (isa<AffineLoadOp>(op) || isa<AffineStoreOp>(op) ||		if (isa<AffineReadOpInterface>(op) || isa<AffineWriteOpInterface>(op) ||
isa<AffineDmaStartOp>(op) || isa<AffineDmaWaitOp>(op))		isa<AffineDmaStartOp>(op) || isa<AffineDmaWaitOp>(op))
return true;		return true;
return false;		return false;
}		}

namespace {		namespace {

// LoopNestStateCollector walks loop nests and collects load and store		// LoopNestStateCollector walks loop nests and collects load and store
// operations, and whether or not an IfInst was encountered in the loop nest.		// operations, and whether or not an IfInst was encountered in the loop nest.
struct LoopNestStateCollector {		struct LoopNestStateCollector {
SmallVector<AffineForOp, 4> forOps;		SmallVector<AffineForOp, 4> forOps;
SmallVector<Operation *, 4> loadOpInsts;		SmallVector<Operation *, 4> loadOpInsts;
SmallVector<Operation *, 4> storeOpInsts;		SmallVector<Operation *, 4> storeOpInsts;
bool hasNonForRegion = false;		bool hasNonForRegion = false;

void collect(Operation *opToWalk) {		void collect(Operation *opToWalk) {
opToWalk->walk([&](Operation *op) {		opToWalk->walk([&](Operation *op) {
if (isa<AffineForOp>(op))		if (isa<AffineForOp>(op))
forOps.push_back(cast<AffineForOp>(op));		forOps.push_back(cast<AffineForOp>(op));
else if (op->getNumRegions() != 0)		else if (op->getNumRegions() != 0)
hasNonForRegion = true;		hasNonForRegion = true;
else if (isa<AffineLoadOp>(op))		else if (isa<AffineReadOpInterface>(op))
loadOpInsts.push_back(op);		loadOpInsts.push_back(op);
else if (isa<AffineStoreOp>(op))		else if (isa<AffineWriteOpInterface>(op))
storeOpInsts.push_back(op);		storeOpInsts.push_back(op);
});		});
}		}
};		};

// MemRefDependenceGraph is a graph data structure where graph nodes are		// MemRefDependenceGraph is a graph data structure where graph nodes are
// top-level operations in a FuncOp which contain load/store ops, and edges		// top-level operations in a FuncOp which contain load/store ops, and edges
// are memref dependences between the nodes.		// are memref dependences between the nodes.
[14 lines not shown]	struct Node {
// List of store op insts.		// List of store op insts.
SmallVector<Operation *, 4> stores;		SmallVector<Operation *, 4> stores;
Node(unsigned id, Operation *op) : id(id), op(op) {}		Node(unsigned id, Operation *op) : id(id), op(op) {}

// Returns the load op count for 'memref'.		// Returns the load op count for 'memref'.
unsigned getLoadOpCount(Value memref) {		unsigned getLoadOpCount(Value memref) {
unsigned loadOpCount = 0;		unsigned loadOpCount = 0;
for (auto *loadOpInst : loads) {		for (auto *loadOpInst : loads) {
if (memref == cast<AffineLoadOp>(loadOpInst).getMemRef())		if (memref == cast<AffineReadOpInterface>(loadOpInst).getMemRef())
++loadOpCount;		++loadOpCount;
}		}
return loadOpCount;		return loadOpCount;
}		}

// Returns the store op count for 'memref'.		// Returns the store op count for 'memref'.
unsigned getStoreOpCount(Value memref) {		unsigned getStoreOpCount(Value memref) {
unsigned storeOpCount = 0;		unsigned storeOpCount = 0;
for (auto *storeOpInst : stores) {		for (auto *storeOpInst : stores) {
if (memref == cast<AffineStoreOp>(storeOpInst).getMemRef())		if (memref == cast<AffineWriteOpInterface>(storeOpInst).getMemRef())
++storeOpCount;		++storeOpCount;
}		}
return storeOpCount;		return storeOpCount;
}		}

// Returns all store ops in 'storeOps' which access 'memref'.		// Returns all store ops in 'storeOps' which access 'memref'.
void getStoreOpsForMemref(Value memref,		void getStoreOpsForMemref(Value memref,
SmallVectorImpl<Operation *> *storeOps) {		SmallVectorImpl<Operation *> *storeOps) {
for (auto *storeOpInst : stores) {		for (auto *storeOpInst : stores) {
if (memref == cast<AffineStoreOp>(storeOpInst).getMemRef())		if (memref == cast<AffineWriteOpInterface>(storeOpInst).getMemRef())
storeOps->push_back(storeOpInst);		storeOps->push_back(storeOpInst);
}		}
}		}

// Returns all load ops in 'loadOps' which access 'memref'.		// Returns all load ops in 'loadOps' which access 'memref'.
void getLoadOpsForMemref(Value memref,		void getLoadOpsForMemref(Value memref,
SmallVectorImpl<Operation *> *loadOps) {		SmallVectorImpl<Operation *> *loadOps) {
for (auto *loadOpInst : loads) {		for (auto *loadOpInst : loads) {
if (memref == cast<AffineLoadOp>(loadOpInst).getMemRef())		if (memref == cast<AffineReadOpInterface>(loadOpInst).getMemRef())
loadOps->push_back(loadOpInst);		loadOps->push_back(loadOpInst);
}		}
}		}

// Returns all memrefs in 'loadAndStoreMemrefSet' for which this node		// Returns all memrefs in 'loadAndStoreMemrefSet' for which this node
// has at least one load and store operation.		// has at least one load and store operation.
void getLoadAndStoreMemrefSet(DenseSet<Value> *loadAndStoreMemrefSet) {		void getLoadAndStoreMemrefSet(DenseSet<Value> *loadAndStoreMemrefSet) {
llvm::SmallDenseSet<Value, 2> loadMemrefs;		llvm::SmallDenseSet<Value, 2> loadMemrefs;
for (auto *loadOpInst : loads) {		for (auto *loadOpInst : loads) {
loadMemrefs.insert(cast<AffineLoadOp>(loadOpInst).getMemRef());		loadMemrefs.insert(cast<AffineReadOpInterface>(loadOpInst).getMemRef());
}		}
for (auto *storeOpInst : stores) {		for (auto *storeOpInst : stores) {
auto memref = cast<AffineStoreOp>(storeOpInst).getMemRef();		auto memref = cast<AffineWriteOpInterface>(storeOpInst).getMemRef();
if (loadMemrefs.count(memref) > 0)		if (loadMemrefs.count(memref) > 0)
loadAndStoreMemrefSet->insert(memref);		loadAndStoreMemrefSet->insert(memref);
}		}
}		}
};		};

// Edge represents a data dependence between nodes in the graph.		// Edge represents a data dependence between nodes in the graph.
struct Edge {		struct Edge {
[75 lines not shown]	void removeNode(unsigned id) {
nodes.erase(id);		nodes.erase(id);
}		}

// Returns true if node 'id' writes to any memref which escapes (or is an		// Returns true if node 'id' writes to any memref which escapes (or is an
// argument to) the function/block. Returns false otherwise.		// argument to) the function/block. Returns false otherwise.
bool writesToLiveInOrEscapingMemrefs(unsigned id) {		bool writesToLiveInOrEscapingMemrefs(unsigned id) {
Node *node = getNode(id);		Node *node = getNode(id);
for (auto *storeOpInst : node->stores) {		for (auto *storeOpInst : node->stores) {
auto memref = cast<AffineStoreOp>(storeOpInst).getMemRef();		auto memref = cast<AffineWriteOpInterface>(storeOpInst).getMemRef();
auto *op = memref.getDefiningOp();		auto *op = memref.getDefiningOp();
// Return true if 'memref' is a block argument.		// Return true if 'memref' is a block argument.
if (!op)		if (!op)
return true;		return true;
// Return true if any use of 'memref' escapes the function.		// Return true if any use of 'memref' escapes the function.
for (auto *user : memref.getUsers())		for (auto *user : memref.getUsers())
if (!isMemRefDereferencingOp(*user))		if (!isMemRefDereferencingOp(*user))
return true;		return true;
}		}
return false;		return false;
}		}

// Returns the unique AffineStoreOp in `node` that meets all the following:		// Returns the unique AffineWriteOpInterface in `node` that meets all the
		// following:
// *) store is the only one that writes to a function-local memref live out		// *) store is the only one that writes to a function-local memref live out
// of `node`,		// of `node`,
// *) store is not the source of a self-dependence on `node`.		// *) store is not the source of a self-dependence on `node`.
// Otherwise, returns a null AffineStoreOp.		// Otherwise, returns a null AffineWriteOpInterface.
AffineStoreOp getUniqueOutgoingStore(Node *node) {		AffineWriteOpInterface getUniqueOutgoingStore(Node *node) {
AffineStoreOp uniqueStore;		AffineWriteOpInterface uniqueStore;

// Return null if `node` doesn't have any outgoing edges.		// Return null if `node` doesn't have any outgoing edges.
auto outEdgeIt = outEdges.find(node->id);		auto outEdgeIt = outEdges.find(node->id);
if (outEdgeIt == outEdges.end())		if (outEdgeIt == outEdges.end())
return nullptr;		return nullptr;

const auto &nodeOutEdges = outEdgeIt->second;		const auto &nodeOutEdges = outEdgeIt->second;
for (auto *op : node->stores) {		for (auto *op : node->stores) {
auto storeOp = cast<AffineStoreOp>(op);		auto storeOp = cast<AffineWriteOpInterface>(op);
auto memref = storeOp.getMemRef();		auto memref = storeOp.getMemRef();
// Skip this store if there are no dependences on its memref. This means		// Skip this store if there are no dependences on its memref. This means
// that store either:		// that store either:
// *) writes to a memref that is only read within the same loop nest		// *) writes to a memref that is only read within the same loop nest
// (self-dependence edges are not represented in graph at the moment),		// (self-dependence edges are not represented in graph at the moment),
// *) writes to a function live out memref (function parameter), or		// *) writes to a function live out memref (function parameter), or
// *) is dead.		// *) is dead.
if (llvm::all_of(nodeOutEdges, [=](const Edge &edge) {		if (llvm::all_of(nodeOutEdges, [=](const Edge &edge) {
[18 lines not shown]	public:
// function/block argument).		// function/block argument).
// *) The node has no successors in the dependence graph.		// *) The node has no successors in the dependence graph.
bool canRemoveNode(unsigned id) {		bool canRemoveNode(unsigned id) {
if (writesToLiveInOrEscapingMemrefs(id))		if (writesToLiveInOrEscapingMemrefs(id))
return false;		return false;
Node *node = getNode(id);		Node *node = getNode(id);
for (auto *storeOpInst : node->stores) {		for (auto *storeOpInst : node->stores) {
// Return false if there exist out edges from 'id' on 'memref'.		// Return false if there exist out edges from 'id' on 'memref'.
if (getOutEdgeCount(id, cast<AffineStoreOp>(storeOpInst).getMemRef()) > 0)		auto storeMemref = cast<AffineWriteOpInterface>(storeOpInst).getMemRef();
		if (getOutEdgeCount(id, storeMemref) > 0)
		ftynse (inline comment, Done): Nit: extract `cast<AffineWriteLikeOpInterface>(storeOpInst).getMemRef()` into a named variable for better formatting here
return false;		return false;
}		}
return true;		return true;
}		}

// Returns true iff there is an edge from node 'srcId' to node 'dstId' which		// Returns true iff there is an edge from node 'srcId' to node 'dstId' which
// is for 'value' if non-null, or for any value otherwise. Returns false		// is for 'value' if non-null, or for any value otherwise. Returns false
// otherwise.		// otherwise.
[312 lines not shown]	if (auto forOp = dyn_cast<AffineForOp>(op)) {
collector.collect(&op);		collector.collect(&op);
// Return false if a non 'affine.for' region was found (not currently		// Return false if a non 'affine.for' region was found (not currently
// supported).		// supported).
if (collector.hasNonForRegion)		if (collector.hasNonForRegion)
return false;		return false;
Node node(nextNodeId++, &op);		Node node(nextNodeId++, &op);
for (auto *opInst : collector.loadOpInsts) {		for (auto *opInst : collector.loadOpInsts) {
node.loads.push_back(opInst);		node.loads.push_back(opInst);
auto memref = cast<AffineLoadOp>(opInst).getMemRef();		auto memref = cast<AffineReadOpInterface>(opInst).getMemRef();
memrefAccesses[memref].insert(node.id);		memrefAccesses[memref].insert(node.id);
}		}
for (auto *opInst : collector.storeOpInsts) {		for (auto *opInst : collector.storeOpInsts) {
node.stores.push_back(opInst);		node.stores.push_back(opInst);
auto memref = cast<AffineStoreOp>(opInst).getMemRef();		auto memref = cast<AffineWriteOpInterface>(opInst).getMemRef();
memrefAccesses[memref].insert(node.id);		memrefAccesses[memref].insert(node.id);
}		}
forToNodeMap[&op] = node.id;		forToNodeMap[&op] = node.id;
nodes.insert({node.id, node});		nodes.insert({node.id, node});
} else if (auto loadOp = dyn_cast<AffineLoadOp>(op)) {		} else if (auto loadOp = dyn_cast<AffineReadOpInterface>(op)) {
// Create graph node for top-level load op.		// Create graph node for top-level load op.
Node node(nextNodeId++, &op);		Node node(nextNodeId++, &op);
node.loads.push_back(&op);		node.loads.push_back(&op);
auto memref = cast<AffineLoadOp>(op).getMemRef();		auto memref = cast<AffineReadOpInterface>(op).getMemRef();
memrefAccesses[memref].insert(node.id);		memrefAccesses[memref].insert(node.id);
nodes.insert({node.id, node});		nodes.insert({node.id, node});
} else if (auto storeOp = dyn_cast<AffineStoreOp>(op)) {		} else if (auto storeOp = dyn_cast<AffineWriteOpInterface>(op)) {
// Create graph node for top-level store op.		// Create graph node for top-level store op.
Node node(nextNodeId++, &op);		Node node(nextNodeId++, &op);
node.stores.push_back(&op);		node.stores.push_back(&op);
auto memref = cast<AffineStoreOp>(op).getMemRef();		auto memref = cast<AffineWriteOpInterface>(op).getMemRef();
memrefAccesses[memref].insert(node.id);		memrefAccesses[memref].insert(node.id);
nodes.insert({node.id, node});		nodes.insert({node.id, node});
} else if (op.getNumRegions() != 0) {		} else if (op.getNumRegions() != 0) {
// Return false if another region is found (not currently supported).		// Return false if another region is found (not currently supported).
return false;		return false;
} else if (op.getNumResults() > 0 && !op.use_empty()) {		} else if (op.getNumResults() > 0 && !op.use_empty()) {
// Create graph node for top-level producer of SSA values, which		// Create graph node for top-level producer of SSA values, which
// could be used by loop nest nodes.		// could be used by loop nest nodes.
[44 lines not shown]
// Removes load operations from 'srcLoads' which operate on 'memref', and		// Removes load operations from 'srcLoads' which operate on 'memref', and
// adds them to 'dstLoads'.		// adds them to 'dstLoads'.
static void moveLoadsAccessingMemrefTo(Value memref,		static void moveLoadsAccessingMemrefTo(Value memref,
SmallVectorImpl<Operation *> *srcLoads,		SmallVectorImpl<Operation *> *srcLoads,
SmallVectorImpl<Operation *> *dstLoads) {		SmallVectorImpl<Operation *> *dstLoads) {
dstLoads->clear();		dstLoads->clear();
SmallVector<Operation *, 4> srcLoadsToKeep;		SmallVector<Operation *, 4> srcLoadsToKeep;
for (auto *load : *srcLoads) {		for (auto *load : *srcLoads) {
if (cast<AffineLoadOp>(load).getMemRef() == memref)		if (cast<AffineReadOpInterface>(load).getMemRef() == memref)
dstLoads->push_back(load);		dstLoads->push_back(load);
else		else
srcLoadsToKeep.push_back(load);		srcLoadsToKeep.push_back(load);
}		}
srcLoads->swap(srcLoadsToKeep);		srcLoads->swap(srcLoadsToKeep);
}		}

// Returns the innermost common loop depth for the set of operations in 'ops'.		// Returns the innermost common loop depth for the set of operations in 'ops'.
[104 lines not shown]	static Value createPrivateMemRef(AffineForOp forOp, Operation *srcStoreOpInst,
uint64_t localBufSizeThreshold) {		uint64_t localBufSizeThreshold) {
auto *forInst = forOp.getOperation();		auto *forInst = forOp.getOperation();

// Create builder to insert alloc op just before 'forOp'.		// Create builder to insert alloc op just before 'forOp'.
OpBuilder b(forInst);		OpBuilder b(forInst);
// Builder to create constants at the top level.		// Builder to create constants at the top level.
OpBuilder top(forInst->getParentOfType<FuncOp>().getBody());		OpBuilder top(forInst->getParentOfType<FuncOp>().getBody());
// Create new memref type based on slice bounds.		// Create new memref type based on slice bounds.
auto oldMemRef = cast<AffineStoreOp>(srcStoreOpInst).getMemRef();		auto oldMemRef = cast<AffineWriteOpInterface>(srcStoreOpInst).getMemRef();
auto oldMemRefType = oldMemRef.getType().cast<MemRefType>();		auto oldMemRefType = oldMemRef.getType().cast<MemRefType>();
unsigned rank = oldMemRefType.getRank();		unsigned rank = oldMemRefType.getRank();

// Compute MemRefRegion for 'srcStoreOpInst' at depth 'dstLoopDepth'.		// Compute MemRefRegion for 'srcStoreOpInst' at depth 'dstLoopDepth'.
MemRefRegion region(srcStoreOpInst->getLoc());		MemRefRegion region(srcStoreOpInst->getLoc());
bool validRegion = succeeded(region.compute(srcStoreOpInst, dstLoopDepth));		bool validRegion = succeeded(region.compute(srcStoreOpInst, dstLoopDepth));
(void)validRegion;		(void)validRegion;
assert(validRegion && "unexpected memref region failure");		assert(validRegion && "unexpected memref region failure");
[91 lines not shown]
}		}

// Checks if node 'srcId' can be safely fused into node 'dstId'. Node 'srcId'		// Checks if node 'srcId' can be safely fused into node 'dstId'. Node 'srcId'
// may write to multiple memrefs but it is required that only one of them,		// may write to multiple memrefs but it is required that only one of them,
// 'srcLiveOutStoreOp', has output edges.		// 'srcLiveOutStoreOp', has output edges.
// Returns true if 'dstNode's read/write region to 'memref' is a super set of		// Returns true if 'dstNode's read/write region to 'memref' is a super set of
// 'srcNode's write region to 'memref' and 'srcId' has only one output edge.		// 'srcNode's write region to 'memref' and 'srcId' has only one output edge.
// TODO(andydavis) Generalize this to handle more live in/out cases.		// TODO(andydavis) Generalize this to handle more live in/out cases.
static bool canFuseSrcWhichWritesToLiveOut(unsigned srcId, unsigned dstId,		static bool
AffineStoreOp srcLiveOutStoreOp,		canFuseSrcWhichWritesToLiveOut(unsigned srcId, unsigned dstId,
		AffineWriteOpInterface srcLiveOutStoreOp,
MemRefDependenceGraph *mdg) {		MemRefDependenceGraph *mdg) {
assert(srcLiveOutStoreOp && "Expected a valid store op");		assert(srcLiveOutStoreOp && "Expected a valid store op");
auto *dstNode = mdg->getNode(dstId);		auto *dstNode = mdg->getNode(dstId);
Value memref = srcLiveOutStoreOp.getMemRef();		Value memref = srcLiveOutStoreOp.getMemRef();
// Return false if 'srcNode' has more than one output edge on 'memref'.		// Return false if 'srcNode' has more than one output edge on 'memref'.
if (mdg->getOutEdgeCount(srcId, memref) > 1)		if (mdg->getOutEdgeCount(srcId, memref) > 1)
return false;		return false;

// Compute MemRefRegion 'srcWriteRegion' for 'srcStoreOp' on 'memref'.		// Compute MemRefRegion 'srcWriteRegion' for 'srcStoreOp' on 'memref'.
[469 lines not shown]	while (!worklist.empty()) {
// consumer loop nest.		// consumer loop nest.
sinkSequentialLoops(dstNode);		sinkSequentialLoops(dstNode);

SmallVector<Operation *, 4> loads = dstNode->loads;		SmallVector<Operation *, 4> loads = dstNode->loads;
SmallVector<Operation *, 4> dstLoadOpInsts;		SmallVector<Operation *, 4> dstLoadOpInsts;
DenseSet<Value> visitedMemrefs;		DenseSet<Value> visitedMemrefs;
while (!loads.empty()) {		while (!loads.empty()) {
// Get memref of load on top of the stack.		// Get memref of load on top of the stack.
auto memref = cast<AffineLoadOp>(loads.back()).getMemRef();		auto memref = cast<AffineReadOpInterface>(loads.back()).getMemRef();
if (visitedMemrefs.count(memref) > 0)		if (visitedMemrefs.count(memref) > 0)
continue;		continue;
visitedMemrefs.insert(memref);		visitedMemrefs.insert(memref);
// Move all loads in 'loads' accessing 'memref' to 'dstLoadOpInsts'.		// Move all loads in 'loads' accessing 'memref' to 'dstLoadOpInsts'.
moveLoadsAccessingMemrefTo(memref, &loads, &dstLoadOpInsts);		moveLoadsAccessingMemrefTo(memref, &loads, &dstLoadOpInsts);
// Skip if no input edges along which to fuse.		// Skip if no input edges along which to fuse.
if (mdg->inEdges.count(dstId) == 0)		if (mdg->inEdges.count(dstId) == 0)
continue;		continue;
[21 lines not shown]	while (!worklist.empty()) {
// fusion.		// fusion.
auto srcStoreOp = mdg->getUniqueOutgoingStore(srcNode);		auto srcStoreOp = mdg->getUniqueOutgoingStore(srcNode);
if (!srcStoreOp) {		if (!srcStoreOp) {
// Get the src store op at the deepest loop depth.		// Get the src store op at the deepest loop depth.
// We will use 'LoopFusionUtils::canFuseLoops' to check fusion		// We will use 'LoopFusionUtils::canFuseLoops' to check fusion
// feasibility for loops with multiple stores.		// feasibility for loops with multiple stores.
unsigned maxLoopDepth = 0;		unsigned maxLoopDepth = 0;
for (auto *op : srcNode->stores) {		for (auto *op : srcNode->stores) {
auto storeOp = cast<AffineStoreOp>(op);		auto storeOp = cast<AffineWriteOpInterface>(op);
if (storeOp.getMemRef() != memref) {		if (storeOp.getMemRef() != memref) {
srcStoreOp = nullptr;		srcStoreOp = nullptr;
break;		break;
}		}
unsigned loopDepth = getNestingDepth(storeOp);		unsigned loopDepth = getNestingDepth(storeOp);
if (loopDepth > maxLoopDepth) {		if (loopDepth > maxLoopDepth) {
maxLoopDepth = loopDepth;		maxLoopDepth = loopDepth;
srcStoreOp = storeOp;		srcStoreOp = storeOp;
[58 lines not shown]	while (!worklist.empty()) {

// Skip if fusion is not feasible at all loop depths.		// Skip if fusion is not feasible at all loop depths.
if (!canFuse)		if (!canFuse)
continue;		continue;

// Gather 'dstNode' store ops to 'memref'.		// Gather 'dstNode' store ops to 'memref'.
SmallVector<Operation *, 2> dstStoreOpInsts;		SmallVector<Operation *, 2> dstStoreOpInsts;
for (auto *storeOpInst : dstNode->stores)		for (auto *storeOpInst : dstNode->stores)
if (cast<AffineStoreOp>(storeOpInst).getMemRef() == memref)		if (cast<AffineWriteOpInterface>(storeOpInst).getMemRef() == memref)
dstStoreOpInsts.push_back(storeOpInst);		dstStoreOpInsts.push_back(storeOpInst);

unsigned bestDstLoopDepth;		unsigned bestDstLoopDepth;
mlir::ComputationSliceState sliceState;		mlir::ComputationSliceState sliceState;
// Check if fusion would be profitable.		// Check if fusion would be profitable.
if (!isFusionProfitable(srcStoreOp, srcStoreOp, dstLoadOpInsts,		if (!isFusionProfitable(srcStoreOp, srcStoreOp, dstLoadOpInsts,
dstStoreOpInsts, &sliceState,		dstStoreOpInsts, &sliceState,
&bestDstLoopDepth, maximalFusion,		&bestDstLoopDepth, maximalFusion,
[21 lines not shown]	while (!worklist.empty()) {
// Promote single iteration slice loops to single IV value.		// Promote single iteration slice loops to single IV value.
for (auto forOp : sliceCollector.forOps) {		for (auto forOp : sliceCollector.forOps) {
promoteIfSingleIteration(forOp);		promoteIfSingleIteration(forOp);
}		}
if (createPrivateMemref) {		if (createPrivateMemref) {
// Create private memref for 'memref' in 'dstAffineForOp'.		// Create private memref for 'memref' in 'dstAffineForOp'.
SmallVector<Operation *, 4> storesForMemref;		SmallVector<Operation *, 4> storesForMemref;
for (auto *storeOpInst : sliceCollector.storeOpInsts) {		for (auto *storeOpInst : sliceCollector.storeOpInsts) {
if (cast<AffineStoreOp>(storeOpInst).getMemRef() == memref)		if (cast<AffineWriteOpInterface>(storeOpInst).getMemRef() ==
		memref)
storesForMemref.push_back(storeOpInst);		storesForMemref.push_back(storeOpInst);
}		}
// TODO(andydavis) Use union of memref write regions to compute		// TODO(andydavis) Use union of memref write regions to compute
// private memref footprint.		// private memref footprint.
auto newMemRef = createPrivateMemRef(		auto newMemRef = createPrivateMemRef(
dstAffineForOp, storesForMemref[0], bestDstLoopDepth,		dstAffineForOp, storesForMemref[0], bestDstLoopDepth,
fastMemorySpace, localBufSizeThreshold);		fastMemorySpace, localBufSizeThreshold);
visitedMemrefs.insert(newMemRef);		visitedMemrefs.insert(newMemRef);
// Create new node in dependence graph for 'newMemRef' alloc op.		// Create new node in dependence graph for 'newMemRef' alloc op.
unsigned newMemRefNodeId =		unsigned newMemRefNodeId =
mdg->addNode(newMemRef.getDefiningOp());		mdg->addNode(newMemRef.getDefiningOp());
// Add edge from 'newMemRef' node to dstNode.		// Add edge from 'newMemRef' node to dstNode.
mdg->addEdge(newMemRefNodeId, dstId, newMemRef);		mdg->addEdge(newMemRefNodeId, dstId, newMemRef);
}		}

// Collect dst loop stats after memref privatization transformation.		// Collect dst loop stats after memref privatization transformation.
LoopNestStateCollector dstLoopCollector;		LoopNestStateCollector dstLoopCollector;
dstLoopCollector.collect(dstAffineForOp.getOperation());		dstLoopCollector.collect(dstAffineForOp.getOperation());

// Add new load ops to current Node load op list 'loads' to		// Add new load ops to current Node load op list 'loads' to
// continue fusing based on new operands.		// continue fusing based on new operands.
for (auto *loadOpInst : dstLoopCollector.loadOpInsts) {		for (auto *loadOpInst : dstLoopCollector.loadOpInsts) {
auto loadMemRef = cast<AffineLoadOp>(loadOpInst).getMemRef();		auto loadMemRef =
		cast<AffineReadOpInterface>(loadOpInst).getMemRef();
// NOTE: Change 'loads' to a hash set in case efficiency is an		// NOTE: Change 'loads' to a hash set in case efficiency is an
// issue. We still use a vector since it's expected to be small.		// issue. We still use a vector since it's expected to be small.
if (visitedMemrefs.count(loadMemRef) == 0 &&		if (visitedMemrefs.count(loadMemRef) == 0 &&
!llvm::is_contained(loads, loadOpInst))		!llvm::is_contained(loads, loadOpInst))
loads.push_back(loadOpInst);		loads.push_back(loadOpInst);
}		}

// Clear and add back loads and stores.		// Clear and add back loads and stores.
[... 144 unchanged lines not shown ...]
auto canFuseWithSibNode = [&](Node *sibNode, Value memref) {		auto canFuseWithSibNode = [&](Node *sibNode, Value memref) {
if (llvm::any_of(loadAndStoreMemrefSet, [=](Value memref) {		if (llvm::any_of(loadAndStoreMemrefSet, [=](Value memref) {
return mdg->getIncomingMemRefAccesses(sibNode->id, memref) > 0;		return mdg->getIncomingMemRefAccesses(sibNode->id, memref) > 0;
}))		}))
return false;		return false;

// Check that all stores are to the same memref.		// Check that all stores are to the same memref.
DenseSet<Value> storeMemrefs;		DenseSet<Value> storeMemrefs;
for (auto *storeOpInst : sibNode->stores) {		for (auto *storeOpInst : sibNode->stores) {
storeMemrefs.insert(cast<AffineStoreOp>(storeOpInst).getMemRef());		storeMemrefs.insert(
		cast<AffineWriteOpInterface>(storeOpInst).getMemRef());
}		}
if (storeMemrefs.size() != 1)		if (storeMemrefs.size() != 1)
return false;		return false;
return true;		return true;
};		};

// Search for siblings which load the same memref function argument.		// Search for siblings which load the same memref function argument.
auto fn = dstNode->op->getParentOfType<FuncOp>();		auto fn = dstNode->op->getParentOfType<FuncOp>();
for (unsigned i = 0, e = fn.getNumArguments(); i != e; ++i) {		for (unsigned i = 0, e = fn.getNumArguments(); i != e; ++i) {
for (auto *user : fn.getArgument(i).getUsers()) {		for (auto *user : fn.getArgument(i).getUsers()) {
if (auto loadOp = dyn_cast<AffineLoadOp>(user)) {		if (auto loadOp = dyn_cast<AffineReadOpInterface>(user)) {
// Gather loops surrounding 'use'.		// Gather loops surrounding 'use'.
SmallVector<AffineForOp, 4> loops;		SmallVector<AffineForOp, 4> loops;
getLoopIVs(*user, &loops);		getLoopIVs(*user, &loops);
// Skip 'use' if it is not within a loop nest.		// Skip 'use' if it is not within a loop nest.
if (loops.empty())		if (loops.empty())
continue;		continue;
Node *sibNode = mdg->getForOpNode(loops[0]);		Node *sibNode = mdg->getForOpNode(loops[0]);
assert(sibNode != nullptr);		assert(sibNode != nullptr);
[... 128 unchanged lines not shown ...]

mlir/test/lib/Transforms/TestMemRefBoundCheck.cpp

[... 31 unchanged lines not shown ...]
struct TestMemRefBoundCheck		struct TestMemRefBoundCheck
: public PassWrapper<TestMemRefBoundCheck, FunctionPass> {		: public PassWrapper<TestMemRefBoundCheck, FunctionPass> {
void runOnFunction() override;		void runOnFunction() override;
};		};

} // end anonymous namespace		} // end anonymous namespace

void TestMemRefBoundCheck::runOnFunction() {		void TestMemRefBoundCheck::runOnFunction() {
getFunction().walk([](Operation *opInst) {		getFunction().walk([](Operation *opInst) {
TypeSwitch<Operation *>(opInst).Case<AffineLoadOp, AffineStoreOp>(		TypeSwitch<Operation *>(opInst)
		.Case<AffineReadOpInterface, AffineWriteOpInterface>(
[](auto op) { boundCheckLoadOrStoreOp(op); });		[](auto op) { boundCheckLoadOrStoreOp(op); });

// TODO(bondhugula): do this for DMA ops as well.		// TODO(bondhugula): do this for DMA ops as well.
});		});
}		}

namespace mlir {		namespace mlir {
void registerMemRefBoundCheck() {		void registerMemRefBoundCheck() {
PassRegistration<TestMemRefBoundCheck>(		PassRegistration<TestMemRefBoundCheck>(
"test-memref-bound-check", "Check memref access bounds in a Function");		"test-memref-bound-check", "Check memref access bounds in a Function");
}		}
} // namespace mlir		} // namespace mlir