This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/Dialect/Linalg/IR/
-
mlir/
-
Dialect/
-
Linalg/
-
IR/
28/29
LinalgInterfaces.td
-
LinalgStructuredOps.td
-
lib/
-
CAPI/Dialect/
-
Dialect/
-
Linalg.cpp
-
Dialect/
-
Linalg/
-
Analysis/
-
DependenceAnalysis.cpp
-
IR/
4/4
LinalgInterfaces.cpp
-
LinalgOps.cpp
-
Transforms/
-
BubbleUpExtractSlice.cpp
-
BufferizableOpInterfaceImpl.cpp
-
ConstantFold.cpp
-
DecomposeLinalgOps.cpp
-
Detensorize.cpp
-
DropUnitDims.cpp
-
ElementwiseOpFusion.cpp
-
Fusion.cpp
-
FusionOnTensors.cpp
-
Generalization.cpp
-
HoistPadding.cpp
-
InlineScalarOperands.cpp
-
Interchange.cpp
-
Loops.cpp
-
Promotion.cpp
-
SplitReduction.cpp
-
Tiling.cpp
-
TilingInterfaceImpl.cpp
-
Transforms.cpp
-
Vectorization.cpp
-
Utils/
-
Utils.cpp
-
SparseTensor/
-
Transforms/
-
SparseTensorRewriting.cpp
-
Sparsification.cpp
-
Utils/
-
Merger.cpp
-
test/
-
Dialect/Linalg/
-
Linalg/
-
canonicalize.mlir
2/2
roundtrip.mlir
-
lib/Dialect/
-
Dialect/
-
Linalg/
-
TestLinalgElementwiseFusion.cpp
-
TestLinalgFusionTransforms.cpp
-
Test/
-
TestOps.td
-
tools/mlir-linalg-ods-gen/
-
mlir-linalg-ods-gen/
-
mlir-linalg-ods-yaml-gen.cpp

Differential D135348

[mlir] Simplify DestinationStyleOpInterface.
ClosedPublic

Authored by pifon2a on Oct 6 2022, 12:55 AM.

Download Raw Diff

Details

Reviewers

nicolasvasilache
springerm
mravishankar
akuegel
aartbik
dcaballe

Commits

rGa7cccb9cbb2b: [mlir] Simplify DestinationStyleOpInterface.

Summary

Prepare DPS interface for moving it out of Linalg dialect. Remove some
of the methods. Express the structure of the op using the number of outputs only.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

pifon2a created this revision.Oct 6 2022, 12:55 AM

Herald added a reviewer: aartbik. · View Herald TranscriptOct 6 2022, 12:55 AM

Herald added a reviewer: aartbik. · View Herald Transcript

Herald added a reviewer: mravishankar. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: zero9178, anlunx, bzcheeseman and 20 others. · View Herald Transcript

pifon2a requested review of this revision.Oct 6 2022, 12:55 AM

Herald added a reviewer: dcaballe. · View Herald TranscriptOct 6 2022, 12:55 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: limo1996, stephenneuendorffer. · View Herald Transcript

pifon2a edited reviewers, added: akuegel; removed: aartbik, dcaballe.Oct 6 2022, 12:56 AM

Herald added a reviewer: aartbik. · View Herald TranscriptOct 6 2022, 12:56 AM

Herald added a reviewer: dcaballe. · View Herald Transcript

Harbormaster completed remote builds in B190689: Diff 465672.Oct 6 2022, 1:12 AM

springerm added inline comments.Oct 6 2022, 1:33 AM

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td
886–887	Add: All non-output operands are inputs.
886–887	Any particular reason why inputs are limited to these three? Can we support arbitrary types?
887	Add: It is assumed that the inputs of the op are the operands at position `[0; num_operands - getNumOutputs)`. The outputs of the op are the operands at position `[num_operands - getNumOutputs; num_operands)`. In other words, all input operands come first.
891	Add: The i-th output tensor is tied to the i-th OpResult. The op may not have any additional OpResults.
895	nit: Furthermore
924–925	Why is this the case?
924–925	I would make these `unsigned`. There are various arithmetic computation with `getNumOperands` etc. and I think these return `unsigned`, so you may get compiler warnings.
938–939	How does this work? Is the method calling itself? How about returning `$_op.getNumResults()` if the op has tensor semantics. Otherwise `llvm_unreachable("must be implemented")`. There must be as many outputs as there are OpResults (in case of tensor semantics).
mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
794–795	Can we move this limitation to the LinalgOp verifier?
796–797	I think this check does not do anything because `numInputs` is computed as `num_operands - num_outputs`.
805–812	nit: I would move this check before the previous check for better error messages.

akuegel added inline comments.Oct 6 2022, 2:22 AM

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td
898	nit: transformation -> transformations
924–925	It was already int64_t before.
938–939	It is the same as with getLibraryCallName. It would indeed call itself if the op does not implement getNumOutputs(). And unfortunately that does not result in a compiler error in all builds. I had to revert a change because of that because it only showed up on the Windows build. I like the suggestion of Matthias.
mlir/test/Dialect/Linalg/roundtrip.mlir
124	Maybe now we should have a test case in invalid.mlir instead? Or is it not actually invalid?

Address the comments.

pifon2a added a subscriber: ftynse.Oct 6 2022, 7:40 AM

pifon2a added inline comments.

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td
886–887	Right, it can be arbitrary. Fixed the comment.
938–939	Checking for tensor semantics is quite expensive. And we also have to make it work for buffers. So, I am not sure what's best here.
938–939	@ftynse is there some way not to shoot ourselves in the foot with this? I also don't really understand the difference between `methodBody` and `defaultImplementation`.
mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp
794–795	We can, but why? Shouldn't there be at least one destination for DPS ops?
mlir/test/Dialect/Linalg/roundtrip.mlir
124	it has a mixed form with tensors and buffers. Should we still support mixed tensor-memref inputs?

Harbormaster completed remote builds in B190730: Diff 465730.Oct 6 2022, 7:52 AM

Just curious why you

"pifon2a edited reviewers, added: akuegel; removed: aartbik, dcaballe."

when you clearly touch sparse code? Luckily for me, Herald put me back :-)
Having said that, changes in sparse LGTM since they are all mechanical.

In D135348#3840484, @aartbik wrote:

Just curious why you

"pifon2a edited reviewers, added: akuegel; removed: aartbik, dcaballe."

when you clearly touch sparse code? Luckily for me, Herald put me back :-)
Having said that, changes in sparse LGTM since they are all mechanical.

Yes, because the changes were mechanical, I removed you and Diego. :-)

Yes, because the changes were mechanical, I removed you and Diego. :-)

Please still keep me in the loop so I can tell the sparse compiler team that changes are coming.
(as a heads up on rebasing mainly in this case :-)

mravishankar requested changes to this revision.Oct 6 2022, 10:40 AM

mravishankar added inline comments.

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td
889	I think it would be better to not use `getNumOperands` directly. Some operations might have other operands that are neither inputs nor outputs. For example, optional padding value for a fictional pad operation (not the `tensor.pad` that exists. It might be better to have an interface method that expects the range of operands to be used as inputs and range of operands to be used as outputs to be published.

This revision now requires changes to proceed.Oct 6 2022, 10:40 AM

pifon2a added inline comments.Oct 6 2022, 1:09 PM

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td
889	@mravishankar could you elaborate a bit more on what is neither input nor output? I thought that everything that is not output is an input even if it is not a tensor/memref. For example, linalg.fill has one scalar input, insert_slice has a list of offsets-sizes-strides inputs.

@mravishankar could you elaborate a bit more on what is neither input nor output? I thought that everything that is not output is an input even if it is not a tensor/memref. For example, linalg.fill has one scalar input, insert_slice has a list of offsets-sizes-strides inputs.

Thats a pretty big assumption though. As an interface, it would be better to make as little assumptions about the operation as possible. The simplest here might be each operation is expected to return "list of input operands" and "list of output operands" (ideally it doesnt even need the list of input operands for destination passing style since that is irrelevant to the interface).

Thats a pretty big assumption though. As an interface, it would be better to make as little assumptions about the operation as possible. The simplest here might be each operation is expected to return "list of input operands" and "list of output operands" (ideally it doesnt even need the list of input operands for destination passing style since that is irrelevant to the interface).

I think that the list of inputs is relevant, because the DPS ops are allowed to have either tensor semantics or buffer semantics, not the mixed case. I don't think that the assumption is big. I see a bigger problem if we start targeting the case that we have never seen in practice.

In D135348#3841203, @pifon2a wrote:

Thats a pretty big assumption though. As an interface, it would be better to make as little assumptions about the operation as possible. The simplest here might be each operation is expected to return "list of input operands" and "list of output operands" (ideally it doesnt even need the list of input operands for destination passing style since that is irrelevant to the interface).

I think that the list of inputs is relevant, because the DPS ops are allowed to have either tensor semantics or buffer semantics, not the mixed case. I don't think that the assumption is big. I see a bigger problem if we start targeting the case that we have never seen in practice.

Maybe I am over indexing on the wording

/// It is assumed that the inputs of the op are the operands at position [0;
// getNumOperands() - getNumOutputs()). The outputs of the op are the operands
// at position [getNumOperands() - getNumOutputs(); getNumOperands()). In other
// words, all input operands come first.

why is that needed ? Why cant you have an interface method that is just

OpOperandVector getOutputOperands()

that each op that implements the interface overrides. You dont need to make assumptions on specific ordering of the operands? Is there a reason for the interface to make such a strong assumption. For example, we are prototyping a "pack + pad" operation that has the following argument list

let arguments = (ins Variadic<AnyShaped>:$inputs,
    Variadic<AnyShaped>:$outputs,
    DefaultValuedAttr<I64ArrayAttr, "{}">:$dims_pos,
    Variadic<Index>:$inner_tiles,
    I64ArrayAttr:$static_inner_tiles,
    Optional<AnyType>:$padding_value);

Here the "getOutputOperands" returns the $outputs operands. The assumption listed above doesnt hold. Why does the interface need to enforce ordering of operands, and how many operands exist explicitly. If this is an intermediate state before relaxing that requirement, thats fine, but if that is a hard requirement, then thats strange....

springerm added inline comments.Oct 6 2022, 10:36 PM

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td
938–939	Then I would just put a `llvm_unreachable(...)` there. A `defaultImplementation` should be provided only if overriding the InterfaceMethod is optional. I could be wrong but I think `methodBody` should be used if the InterfaceMethod is not supposed to be defined/overridden by ops (https://mlir.llvm.org/docs/Interfaces/). That would actually be the case for the vast majority of the InterfaceMethods here. But before changing anything, try it for one InterfaceMethod first and see if it still compiles...

In D135348#3841280, @mravishankar wrote:
why is that needed ? Why cant you have an interface method that is just
OpOperandVector getOutputOperands()
that each op that implements the interface overrides. You dont need to make assumptions on specific ordering of the operands? Is there a reason for the interface to make such a strong assumption.

Just to add another data point, we have a somewhat similar interface: ViewLikeOpInterface

We could follow a similar implementation strategy. The offsets are defined via the offsets InterfaceMethod (not via operand index ranges):

InterfaceMethod<
  /*desc=*/[{
    Return the dynamic offset operands.
  }],
  /*retTy=*/"::mlir::OperandRange",
  /*methodName=*/"offsets",
  /*args=*/(ins),
  /*methodBody=*/"",
  /*defaultImplementation=*/[{
    return $_op.getOffsets();
  }]
>,

Not sure what happens if the op has neither an offsets operand nor an offsets method (infinite loop?). We could force an implementation by putting an llvm_unreachable.

(This implementation approach still assumes that outputs is a consecutive block of operands.)

why is that needed ? Why cant you have an interface method that is just OpOperandVector getOutputOperands().

I was trying to avoid copying the OpOperands* all the time. In the pack/unpack ops, we can also move destination args to the and of the arg list. On the other hand, the same would need to be done for the already existing tensor.insert_slice. Which is also possible, but probably painful.

If we have $inits not at a fixed position, we have to construct SmallVector<OpOperand*> every time we call getOutputOperands. Also, does it mean that getNumOutputs would need to be expressed as getOutputOperands().size()? Would it make sense to have an interface method std::pair<unsigned, unsigned> getOutputsPositionsRange or smth like that that would return the interval of positions in the operand list that correspond to outputs? In that case, the users would need to override only this method.

@mravishankar

OperandRange does not own the data, does it? I was trying to avoid constructing SmallVector every time.

In D135348#3842139, @springerm wrote:
In D135348#3841280, @mravishankar wrote:
why is that needed ? Why cant you have an interface method that is just
OpOperandVector getOutputOperands()
that each op that implements the interface overrides. You dont need to make assumptions on specific ordering of the operands? Is there a reason for the interface to make such a strong assumption.
Just to add another data point, we have a somewhat similar interface: ViewLikeOpInterface

We could follow a similar implementation strategy. The offsets are defined via the offsets InterfaceMethod (not via operand index ranges):
InterfaceMethod<
  /*desc=*/[{
    Return the dynamic offset operands.
  }],
  /*retTy=*/"::mlir::OperandRange",
  /*methodName=*/"offsets",
  /*args=*/(ins),
  /*methodBody=*/"",
  /*defaultImplementation=*/[{
    return $_op.getOffsets();
  }]
>,
Not sure what happens if the op has neither an offsets operand nor an offsets method (infinite loop?). We could force an implementation by putting an llvm_unreachable.

(This implementation approach still assumes that outputs is a consecutive block of operands.)

In D135348#3843577, @pifon2a wrote:

why is that needed ? Why cant you have an interface method that is just OpOperandVector getOutputOperands().

I was trying to avoid copying the OpOperands* all the time. In the pack/unpack ops, we can also move destination args to the and of the arg list. On the other hand, the same would need to be done for the already existing tensor.insert_slice. Which is also possible, but probably painful.

If we have $inits not at a fixed position, we have to construct SmallVector<OpOperand*> every time we call getOutputOperands. Also, does it mean that getNumOutputs would need to be expressed as getOutputOperands().size()? Would it make sense to have an interface method std::pair<unsigned, unsigned> getOutputsPositionsRange or smth like that that would return the interval of positions in the operand list that correspond to outputs? In that case, the users would need to override only this method.

@mravishankar

Yeah having an getOutputsPositionRange would work. AFAIK OperandRange doesnt own its data. The OpOperands are owned by the operation.

Yep, OperandRange does not, but OpOperandVector does and it is used for DPS interface right now.

In D135348#3843656, @mravishankar wrote:

In D135348#3843577, @pifon2a wrote:

why is that needed ? Why cant you have an interface method that is just OpOperandVector getOutputOperands().

I was trying to avoid copying the OpOperands* all the time. In the pack/unpack ops, we can also move destination args to the and of the arg list. On the other hand, the same would need to be done for the already existing tensor.insert_slice. Which is also possible, but probably painful.

If we have $inits not at a fixed position, we have to construct SmallVector<OpOperand*> every time we call getOutputOperands. Also, does it mean that getNumOutputs would need to be expressed as getOutputOperands().size()? Would it make sense to have an interface method std::pair<unsigned, unsigned> getOutputsPositionsRange or smth like that that would return the interval of positions in the operand list that correspond to outputs? In that case, the users would need to override only this method.

@mravishankar

Yeah having an getOutputsPositionRange would work. AFAIK OperandRange doesnt own its data. The OpOperands are owned by the operation.

Ok, then I will add getOutputsPositionRange func.

nicolasvasilache added inline comments.Oct 11 2022, 3:36 AM

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td
889	@mravishankar I think I see the option to relax this in the future (in particular to support `list(tensor, shape)` that will also be useful for `tensor.expand_shape` and `tensor.pack/unpack`) but I would keep this out of the current commit. The current behavior has the `getNumOperands`-based assumption, let's first evolve to an interface and separately revisit that assumption if you don't mind.
924–925	Almost never use unsigned in C++: exception if doing bit manipulation E.g. https://stackoverflow.com/questions/10168079/why-is-size-t-unsigned
952	I see we now have `MutableOperandRange`, use that insead?
972–974	can this use `getOutputOperands()[i]` and avoid distributing the underlying `this->getOperation()` order assumption to more methods?
984–986	can this use `getOutputOperands()[i]` and avoid distributing the underlying `this->getOperation()` order assumption to more methods?
1028	can this use `getInputOperands()[i]` and avoid distributing the underlying `this->getOperation()` order assumption to more methods?

nicolasvasilache added inline comments.Oct 11 2022, 3:45 AM

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td
892	Does this means all ops that implement this interface must be of the form: results_range = OP(input_operands_range, unqualified_operands_range, output_operands_range) ? Or is there no `unqualified_operands_range` ? Also, is this true for all ops in perpetuity or only for the ops that use the default DPS interface impl? Can the DPS interface be configured differently for other ops? Could you please add these considerations to the doc?

pifon2a added inline comments.Oct 12 2022, 2:07 AM

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td
972–974	It can. We should be careful though. At the moment creating MutableArrayRef<OpOperand> is cheap and doing `getOutputOperands()[i]` is ok-ish, but when/if we make the interface more "flexible" then `getOutputOperands` would construct SmallVector only to use one element of it.

Address the comments.

Update the doc.

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td
938–939	I don't have a default implementation anymore.
952	It allows you to iterate over Values only, unfortunately.

Harbormaster completed remote builds in B192229: Diff 467843.Oct 14 2022, 11:35 AM

Thx for improving the interface!

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td
945–946	Can we use this C++ feature (given that we also have to support some pretty old compiler versions)?

Thank you!

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td
945–946	it is used in SCF.cpp a lot, for example

Even though there are requested changes by Mahesh. I addressed these changes, I discussed it directly with him last week. I will push it.

This revision was not accepted when it landed; it landed in state Needs Review.Oct 17 2022, 3:45 AM

This revision was landed with ongoing or failed builds.

Closed by commit rGa7cccb9cbb2b: [mlir] Simplify DestinationStyleOpInterface. (authored by pifon2a). · Explain Why

This revision was automatically updated to reflect the committed changes.

pifon2a added a commit: rGa7cccb9cbb2b: [mlir] Simplify DestinationStyleOpInterface..

olegshyshkov mentioned this in D135854: [mlir] Add TransposeOp to Linalg structured ops..Oct 18 2022, 5:19 AM

Just leaving a note here that the design of this interface is causing a real headache for downstream users (i.e. IREE). I dont understand why this interface was adding anything w.r.t to getInputs() or getNumInputs(). That seems completely orthogonal to what DestinationStyleOpInterface was supposed to target , i.e. which operands of the op can be used to alias the result of the operation. The assumption that all operands that are not outputs are "input"s is a really strong an unnecessary assumption. In theory they might be "input"s but they might not be called as such in the op definition of downstream operations.
FTR : I did mark this as changes requested.... I am not sure why repeatedly changes get submitted when I have marked them as changes requested. It means I am not done with the review. Submitting it when marked changes requested drops it off my dashboard and I lost track of this. Integrating this into IREE is causing a massive headache, which is why I marked this as changes requested.

In D135348#3890708, @mravishankar wrote:

Just leaving a note here that the design of this interface is causing a real headache for downstream users (i.e. IREE). I dont understand why this interface was adding anything w.r.t to getInputs() or getNumInputs(). That seems completely orthogonal to what DestinationStyleOpInterface was supposed to target , i.e. which operands of the op can be used to alias the result of the operation. The assumption that all operands that are not outputs are "input"s is a really strong an unnecessary assumption. In theory they might be "input"s but they might not be called as such in the op definition of downstream operations.
FTR : I did mark this as changes requested.... I am not sure why repeatedly changes get submitted when I have marked them as changes requested. It means I am not done with the review. Submitting it when marked changes requested drops it off my dashboard and I lost track of this. Integrating this into IREE is causing a massive headache, which is why I marked this as changes requested.

Follow up to this. I see that the inputs were part of the initial design proposal here https://discourse.llvm.org/t/rfc-interface-for-destination-style-ops/64056 . So I missed the issues this can cause as well. So my bad on that.... I will post back on the RFC after I navigate things downstream, maybe changes to the DestinationStyleOpInterface that restrict the scope of the interface from what it is doing today....

With https://reviews.llvm.org/D136943 I was able to handle all the name conflicts. Thanks @pifon2a ! Apologies for the knee-jerk reaction above.
Still think we need to take a further look at the methods added under DestinationStyleOpInterface. I will post a summary of what I found on the relevant discourse thread.

Herald added a subscriber: Moerafaat. · View Herald TranscriptOct 31 2022, 9:40 AM

Revision Contents

Path

Size

mlir/

include/

mlir/

Dialect/

Linalg/

IR/

LinalgInterfaces.td

453 lines

LinalgStructuredOps.td

19 lines

lib/

CAPI/

Dialect/

Linalg.cpp

6 lines

Dialect/

Linalg/

Analysis/

DependenceAnalysis.cpp

22 lines

IR/

LinalgInterfaces.cpp

70 lines

LinalgOps.cpp

140 lines

Transforms/

BubbleUpExtractSlice.cpp

4 lines

BufferizableOpInterfaceImpl.cpp

2 lines

ConstantFold.cpp

24 lines

DecomposeLinalgOps.cpp

6 lines

Detensorize.cpp

7 lines

DropUnitDims.cpp

12 lines

ElementwiseOpFusion.cpp

43 lines

25 lines

2 lines

14 lines

2 lines

InlineScalarOperands.cpp

4 lines

4 lines

6 lines

26 lines

18 lines

2 lines

TilingInterfaceImpl.cpp

22 lines

Transforms.cpp

14 lines

Vectorization.cpp

34 lines

Utils/

Utils.cpp

53 lines

SparseTensor/

Transforms/

SparseTensorRewriting.cpp

7 lines

Sparsification.cpp

74 lines

Utils/

Merger.cpp

6 lines

test/

Dialect/

Linalg/

canonicalize.mlir

2 lines

roundtrip.mlir

32 lines

lib/

Dialect/

Linalg/

TestLinalgElementwiseFusion.cpp

4 lines

TestLinalgFusionTransforms.cpp

12 lines

Test/

TestOps.td

14 lines

tools/

mlir-linalg-ods-gen/

mlir-linalg-ods-yaml-gen.cpp

14 lines

Diff 468150

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td

Show First 20 Lines • Show All 311 Lines • ▼ Show 20 Lines	InterfaceMethod<
Return true if `opOperand` is an init tensor. This is true when it is		Return true if `opOperand` is an init tensor. This is true when it is
an output tensor operand whose value is used in the payload region.		an output tensor operand whose value is used in the payload region.
}],		}],
/retTy=/"bool",		/retTy=/"bool",
/methodName=/"isInitTensor",		/methodName=/"isInitTensor",
/args=/(ins "OpOperand *":$opOperand),		/args=/(ins "OpOperand *":$opOperand),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
if (!$_op.isOutputTensor(opOperand))		if (!$_op.isOutput(opOperand))
return false;		return false;
return payloadUsesValueFromOperand(opOperand);		return payloadUsesValueFromOperand(opOperand);
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Return the `opOperand` rank or zero for scalars.		Return the `opOperand` rank or zero for scalars.
}],		}],
▲ Show 20 Lines • Show All 272 Lines • ▼ Show 20 Lines	InterfaceMethod<
are expection. For example, in `map` output operand isn't used in		are expection. For example, in `map` output operand isn't used in
the block.		the block.
}],		}],
/retTy=/"OpOperandVector",		/retTy=/"OpOperandVector",
/methodName=/"getOpOperandsMatchingBBargs",		/methodName=/"getOpOperandsMatchingBBargs",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
return $_op.getInputAndOutputOperands();		OpOperandVector result;
		result.reserve($_op->getNumOperands());
		llvm::transform(
		this->getOperation()->getOpOperands(),
		std::back_inserter(result),
		[](OpOperand &opOperand) { return &opOperand; });
		return result;
}]		}]
>,		>,
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
// Linalg generalization hooks.		// Linalg generalization hooks.
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Hook to provide a custom AffineMap used to compute all the operand		Hook to provide a custom AffineMap used to compute all the operand
▲ Show 20 Lines • Show All 61 Lines • ▼ Show 20 Lines	InterfaceMethod<
dimension is statically known, or ShapeType::kDynamicSize otherwise.		dimension is statically known, or ShapeType::kDynamicSize otherwise.
}],		}],
/retTy=/"SmallVector<int64_t>",		/retTy=/"SmallVector<int64_t>",
/methodName=/"getStaticShape",		/methodName=/"getStaticShape",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
SmallVector<int64_t> res;		SmallVector<int64_t> res;
// MLIR currently does not support dependent interfaces or interface		for (OpOperand &opOperand : this->getOperation()->getOpOperands())
// inheritance. By construction all ops with StructuredOpInterface must		llvm::append_range(res, getShape(&opOperand));
// implement DestinationStyleOpInterface.
// TODO: reevalute the need for a cast when a better mechanism exists.
auto iface = cast<DestinationStyleOpInterface>(*this->getOperation());
for (OpOperand *opOperand : iface.getInputAndOutputOperands())
llvm::append_range(res, getShape(opOperand));
return res;		return res;
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{
Returns the statically-known loop ranges. Composes		Returns the statically-known loop ranges. Composes
`getShapesToLoopsMap()` with the result of `getStaticShape`.		`getShapesToLoopsMap()` with the result of `getStaticShape`.
Returns ShapeType::kDynamicSize for non-statically-known loop ranges.		Returns ShapeType::kDynamicSize for non-statically-known loop ranges.
▲ Show 20 Lines • Show All 72 Lines • ▼ Show 20 Lines	let extraClassDeclaration = [{
// Forwarding functions to access interface methods from the		// Forwarding functions to access interface methods from the
// DestinationStyleOpInterface.		// DestinationStyleOpInterface.
// MLIR currently does not support dependent interfaces or interface		// MLIR currently does not support dependent interfaces or interface
// inheritance. By construction all ops with StructuredOpInterface must		// inheritance. By construction all ops with StructuredOpInterface must
// implement DestinationStyleOpInterface.		// implement DestinationStyleOpInterface.
// TODO: reevalute the need for a cast when a better mechanism exists.		// TODO: reevalute the need for a cast when a better mechanism exists.
//========================================================================//		//========================================================================//

ValueRange getInputs() {
return cast<DestinationStyleOpInterface>(*this->getOperation())
.getInputs();
}

int64_t getNumInputs() {		int64_t getNumInputs() {
return cast<DestinationStyleOpInterface>(*this->getOperation())		return cast<DestinationStyleOpInterface>(*this->getOperation())
.getNumInputs();		.getNumInputs();
}		}

ValueRange getOutputs() {
return cast<DestinationStyleOpInterface>(*this->getOperation())
.getOutputs();
}

int64_t getNumOutputs() {		int64_t getNumOutputs() {
return cast<DestinationStyleOpInterface>(*this->getOperation())		return cast<DestinationStyleOpInterface>(*this->getOperation())
.getNumOutputs();		.getNumOutputs();
}		}

int64_t getNumInputsAndOutputs() {
return cast<DestinationStyleOpInterface>(*this->getOperation())
.getNumInputsAndOutputs();
}

OpOperandVector getInputOperands() {		OpOperandVector getInputOperands() {
return cast<DestinationStyleOpInterface>(*this->getOperation())		return cast<DestinationStyleOpInterface>(*this->getOperation())
.getInputOperands();		.getInputOperands();
}		}

OpOperand *getInputOperand(int64_t i) {		OpOperand *getInputOperand(int64_t i) {
return cast<DestinationStyleOpInterface>(*this->getOperation())		return cast<DestinationStyleOpInterface>(*this->getOperation())
.getInputOperand(i);		.getInputOperand(i);
}		}

OpOperandVector getInputBufferOperands() {		void setOutputOperand(int64_t i, Value value) {
return cast<DestinationStyleOpInterface>(*this->getOperation())
.getInputBufferOperands();
}

OpOperandVector getInputTensorOperands() {
return cast<DestinationStyleOpInterface>(*this->getOperation())		return cast<DestinationStyleOpInterface>(*this->getOperation())
.getInputTensorOperands();		.setOutputOperand(i, value);
}		}

OpOperandVector getOutputOperands() {		OpOperandVector getOutputOperands() {
return cast<DestinationStyleOpInterface>(*this->getOperation())		return cast<DestinationStyleOpInterface>(*this->getOperation())
.getOutputOperands();		.getOutputOperands();
}		}

OpOperand *getOutputOperand(int64_t i) {		OpOperand *getOutputOperand(int64_t i) {
return cast<DestinationStyleOpInterface>(*this->getOperation())		return cast<DestinationStyleOpInterface>(*this->getOperation())
.getOutputOperand(i);		.getOutputOperand(i);
}		}

void setOutputOperand(int64_t i, Value value) {		bool isInput(OpOperand *opOperand) {
return cast<DestinationStyleOpInterface>(*this->getOperation())		return cast<DestinationStyleOpInterface>(*this->getOperation())
.setOutputOperand(i, value);		.isInput(opOperand);
}		}

OpOperandVector getOutputBufferOperands() {		bool isOutput(OpOperand *opOperand) {
return cast<DestinationStyleOpInterface>(*this->getOperation())		return cast<DestinationStyleOpInterface>(*this->getOperation())
.getOutputBufferOperands();		.isOutput(opOperand);
}

OpOperandVector getOutputTensorOperands() {
return cast<DestinationStyleOpInterface>(*this->getOperation())
.getOutputTensorOperands();
}

SmallVector<MemRefType> getOutputBufferTypes() {
return cast<DestinationStyleOpInterface>(*this->getOperation())
.getOutputBufferTypes();
}

SmallVector<RankedTensorType> getOutputTensorTypes() {
return cast<DestinationStyleOpInterface>(*this->getOperation())
.getOutputTensorTypes();
}

OpOperandVector getInputAndOutputOperands() {
return cast<DestinationStyleOpInterface>(*this->getOperation())
.getInputAndOutputOperands();
}

bool isInputTensor(OpOperand *opOperand) {
return cast<DestinationStyleOpInterface>(*this->getOperation())
.isInputTensor(opOperand);
}

bool isOutputTensor(OpOperand *opOperand) {
return cast<DestinationStyleOpInterface>(*this->getOperation())
.isOutputTensor(opOperand);
}		}

bool isScalar(OpOperand *opOperand) {		bool isScalar(OpOperand *opOperand) {
return cast<DestinationStyleOpInterface>(*this->getOperation())		return cast<DestinationStyleOpInterface>(*this->getOperation())
.isScalar(opOperand);		.isScalar(opOperand);
}		}

OpResult getTiedOpResult(OpOperand *opOperand) {		OpResult getTiedOpResult(OpOperand *opOperand) {
Show All 40 Lines	void setOperandSegmentAt(unsigned idx, unsigned val) {
getOperation()->setAttr("operand_segment_sizes", newAttr);		getOperation()->setAttr("operand_segment_sizes", newAttr);
}		}
}];		}];

let verify = [{ return detail::verifyStructuredOpInterface($_op); }];		let verify = [{ return detail::verifyStructuredOpInterface($_op); }];
let verifyWithRegions = 1;		let verifyWithRegions = 1;
}		}

// The 'DestinationStyleOpInterface' provides access to the methods relevant		// Ops that are in destination style have designated output operands, which act
// for destination-style ops. A destination-style operation has 'n' input		// as initial tensor values for the results of the operation or the output
// arguments and 'm' output arguments. Each op that wants to implement		// buffers to which the results of the op will be written.
// DestinationStyleOpInterface needs to define getInputs() and getOutputs()		//
// methods.		// Output operands must be tensors or memrefs. Input operands can have any
		// type. All non-output operands are inputs.
		springermUnsubmitted Done Reply Inline Actions Add: All non-output operands are inputs. springerm: Add: All non-output operands are inputs.
		springermUnsubmitted Done Reply Inline Actions Any particular reason why inputs are limited to these three? Can we support arbitrary types? springerm: Any particular reason why inputs are limited to these three? Can we support arbitrary types?
		pifon2aAuthorUnsubmitted Done Reply Inline Actions Right, it can be arbitrary. Fixed the comment. pifon2a: Right, it can be arbitrary. Fixed the comment.
		springermUnsubmitted Done Reply Inline Actions Add: It is assumed that the inputs of the op are the operands at position `[0; num_operands - getNumOutputs)`. The outputs of the op are the operands at position `[num_operands - getNumOutputs; num_operands)`. In other words, all input operands come first. springerm: Add: It is assumed that the inputs of the op are the operands at position `[0; num_operands…

		// It is assumed that the output operands of the op are the operands at
		mravishankarUnsubmitted Done Reply Inline Actions I think it would be better to not use `getNumOperands` directly. Some operations might have other operands that are neither inputs nor outputs. For example, optional padding value for a fictional pad operation (not the `tensor.pad` that exists. It might be better to have an interface method that expects the range of operands to be used as inputs and range of operands to be used as outputs to be published. mravishankar: I think it would be better to not use `getNumOperands` directly. Some operations might have…
		pifon2aAuthorUnsubmitted Done Reply Inline Actions @mravishankar could you elaborate a bit more on what is neither input nor output? I thought that everything that is not output is an input even if it is not a tensor/memref. For example, linalg.fill has one scalar input, insert_slice has a list of offsets-sizes-strides inputs. pifon2a: @mravishankar could you elaborate a bit more on what is neither input nor output? I thought…
		nicolasvasilacheUnsubmitted Done Reply Inline Actions @mravishankar I think I see the option to relax this in the future (in particular to support `list(tensor, shape)` that will also be useful for `tensor.expand_shape` and `tensor.pack/unpack`) but I would keep this out of the current commit. The current behavior has the `getNumOperands`-based assumption, let's first evolve to an interface and separately revisit that assumption if you don't mind. nicolasvasilache: @mravishankar I think I see the option to relax this in the future (in particular to support…
		// position [start, end). The positions are defined by getOutputsPositionRange
		// method. All non-output operands are "inputs" of the DPS op.
		springermUnsubmitted Done Reply Inline Actions Add: The i-th output tensor is tied to the i-th OpResult. The op may not have any additional OpResults. springerm: Add: The i-th output tensor is tied to the i-th OpResult. The op may not have any additional…

		nicolasvasilacheUnsubmitted Done Reply Inline Actions Does this means all ops that implement this interface must be of the form: results_range = OP(input_operands_range, unqualified_operands_range, output_operands_range) ? Or is there no `unqualified_operands_range` ? Also, is this true for all ops in perpetuity or only for the ops that use the default DPS interface impl? Can the DPS interface be configured differently for other ops? Could you please add these considerations to the doc? nicolasvasilache: Does this means all ops that implement this interface must be of the form: ``` results_range =…
		// If the op has "tensor semantics", then the input operands are either scalars
		// or tensors. The output operands are tensors and every tensor output is tied
		// to a corresponding tensor OpResult in a 1-to-1 fashion. The i-th output
		springermUnsubmitted Done Reply Inline Actions nit: Furthermore springerm: nit: Furthermore
		// tensor is tied to the i-th OpResult. The op may not have any additional
		// OpResults. Output operands and their tied OpResults have the same type.
		//
		akuegelUnsubmitted Done Reply Inline Actions nit: transformation -> transformations akuegel: nit: transformation -> transformations
		// If the op has "buffer semantics", then the input operands are either memrefs
		// or other non-tensor types, e.g. scalar types. Furthermore, the output
		// operands are memrefs and the op has no results.
		//
		// Destination-passing style abstraction makes certain transformations easier.
		// For example, tiling implementation can extract/insert slices from/into the
		// destination of an op and use the resulting shaped value as an iter_arg in
		// the surrounding loop structure. As another example, bufferization does not
		// have to allocate new buffers for destinations (in case of in-place
		// bufferization) and can directly reuse the existing destination buffer.
		//
		// Example of a destination style op: `%r = tensor.insert_slice %t into %d`,
		// where `%t` is the single input and `%d` is the single output. `%d` is tied
		// to `%r`.
		//
		// Example of an op that is not in destination style: `%r = tensor.pad %t`.
		// This op is not in destination style because `%r` and `%t` have different
		// shape.
		//
		// Each op that wants to implement DestinationStyleOpInterface needs to define
		// the getOutputsPositionRange() method.
def DestinationStyleOpInterface : OpInterface<"DestinationStyleOpInterface"> {		def DestinationStyleOpInterface : OpInterface<"DestinationStyleOpInterface"> {
let cppNamespace = "::mlir::linalg";		let cppNamespace = "::mlir::linalg";
let methods = [		let methods = [
//===------------------------------------------------------------------===//		// This method has to be defined for every DPS op.
// Num input/output arguments handling.
//===------------------------------------------------------------------===//
// `getInputs` must be defined by each op that wants to implement the
// DestinationStyleOpInterface.
InterfaceMethod<
/desc=/[{
Return the input shape operands.
}],
/retTy=/"ValueRange",
/methodName=/"getInputs",
/args=/(ins)
>,
// These special methods rely on `getInputs` and `getOutputs` being defined
// by each op that wants to implement the DestinationStyleOpInterface.
InterfaceMethod<
/desc=/[{
Return the number of inputs.
}],
/retTy=/"int64_t",
/methodName=/"getNumInputs",
/args=/(ins),
/methodBody=/"",
/defaultImplementation=/[{
return $_op.getInputs().size();
}]
>,
// `getOutputs` must be defined by each op that wants to implement the
// DestinationStyleOpInterface.
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Return start and end indices of the output operands range.",
		springermUnsubmitted Done Reply Inline Actions Why is this the case? springerm: Why is this the case?
		springermUnsubmitted Done Reply Inline Actions I would make these `unsigned`. There are various arithmetic computation with `getNumOperands` etc. and I think these return `unsigned`, so you may get compiler warnings. springerm: I would make these `unsigned`. There are various arithmetic computation with `getNumOperands`…
		akuegelUnsubmitted Done Reply Inline Actions It was already int64_t before. akuegel: It was already int64_t before.
		nicolasvasilacheUnsubmitted Done Reply Inline Actions Almost never use unsigned in C++: exception if doing bit manipulation E.g. https://stackoverflow.com/questions/10168079/why-is-size-t-unsigned nicolasvasilache: Almost never use unsigned in C++: exception if doing bit manipulation E.g. https…
Return the output shape operands.		/retTy=/"std::pair<int64_t, int64_t>",
}],		/methodName=/"getOutputsPositionRange",
/retTy=/"ValueRange",
/methodName=/"getOutputs",
/args=/(ins)
>,
InterfaceMethod<
/desc=/[{
Return the number of outputs.
}],
/retTy=/"int64_t",
/methodName=/"getNumOutputs",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/""
return $_op.getOutputs().size();
}]
>,
InterfaceMethod<
/desc=/[{
Return the number of inputs and outputs.
}],
/retTy=/"int64_t",
/methodName=/"getNumInputsAndOutputs",
/args=/(ins),
/methodBody=/"",
/defaultImplementation=/[{
return this->getOperation()->getNumOperands();
}]
>,		>,
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
// Input operands handling.		// Operands handling.
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
		// The operand list is assumed to start with the input operands and end
		// with the output operands. Therefore, all methods to access the inputs
		// and outputs can be expressed if the number of output operands is know.
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Return the number of outputs.",
		springermUnsubmitted Done Reply Inline Actions How does this work? Is the method calling itself? How about returning `$_op.getNumResults()` if the op has tensor semantics. Otherwise `llvm_unreachable("must be implemented")`. There must be as many outputs as there are OpResults (in case of tensor semantics). springerm: How does this work? Is the method calling itself? How about returning `$_op.getNumResults()`…
		akuegelUnsubmitted Done Reply Inline Actions It is the same as with getLibraryCallName. It would indeed call itself if the op does not implement getNumOutputs(). And unfortunately that does not result in a compiler error in all builds. I had to revert a change because of that because it only showed up on the Windows build. I like the suggestion of Matthias. akuegel: It is the same as with getLibraryCallName. It would indeed call itself if the op does not…
		pifon2aAuthorUnsubmitted Done Reply Inline Actions @ftynse is there some way not to shoot ourselves in the foot with this? I also don't really understand the difference between `methodBody` and `defaultImplementation`. pifon2a: @ftynse is there some way not to shoot ourselves in the foot with this? I also don't really…
		pifon2aAuthorUnsubmitted Done Reply Inline Actions Checking for tensor semantics is quite expensive. And we also have to make it work for buffers. So, I am not sure what's best here. pifon2a: Checking for tensor semantics is quite expensive. And we also have to make it work for buffers.
		springermUnsubmitted Done Reply Inline Actions Then I would just put a `llvm_unreachable(...)` there. A `defaultImplementation` should be provided only if overriding the InterfaceMethod is optional. I could be wrong but I think `methodBody` should be used if the InterfaceMethod is not supposed to be defined/overridden by ops (https://mlir.llvm.org/docs/Interfaces/). That would actually be the case for the vast majority of the InterfaceMethods here. But before changing anything, try it for one InterfaceMethod first and see if it still compiles... springerm: Then I would just put a `llvm_unreachable(...)` there. A `defaultImplementation` should be…
		pifon2aAuthorUnsubmitted Done Reply Inline Actions I don't have a default implementation anymore. pifon2a: I don't have a default implementation anymore.
Return the input operands.		/retTy=/"int64_t",
}],		/methodName=/"getNumOutputs",
/retTy=/"OpOperandVector",
/methodName=/"getInputOperands",
/args=/(ins),
/methodBody=/"",
/defaultImplementation=/[{
int64_t numInputs = getNumInputs();
OpOperandVector result;
result.reserve(numInputs);
llvm::transform(
this->getOperation()->getOpOperands().take_front(numInputs),
std::back_inserter(result),
[](OpOperand &opOperand) { return &opOperand; });
return result;
}]
>,
InterfaceMethod<
/desc=/[{
Return the `i`-th input operand.
}],
/retTy=/"OpOperand*",
/methodName=/"getInputOperand",
/args=/(ins "int64_t":$i),
/methodBody=/"",
/defaultImplementation=/[{
assert(i >= 0 && i < getNumInputs());
return &this->getOperation()->getOpOperand(i);
}]
>,
InterfaceMethod<
/desc=/[{
Return the subset of input operands that are of buffer type.
}],
/retTy=/"OpOperandVector",
/methodName=/"getInputBufferOperands",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
OpOperandVector result;		auto [start, end] = $_op.getOutputsPositionRange();
result.reserve(getNumInputs());		return end - start;
		springermUnsubmitted Not Done Reply Inline Actions Can we use this C++ feature (given that we also have to support some pretty old compiler versions)? springerm: Can we use this C++ feature (given that we also have to support some pretty old compiler…
		pifon2aAuthorUnsubmitted Done Reply Inline Actions it is used in SCF.cpp a lot, for example pifon2a: it is used in SCF.cpp a lot, for example
llvm::copy_if(getInputOperands(),
std::back_inserter(result),
[](OpOperand *opOperand) {
return opOperand->get().getType().template isa<MemRefType>();
});
return result;
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Return the output operands.",
Return the subset of input operands that are of tensor type.
}],
/retTy=/"OpOperandVector",
/methodName=/"getInputTensorOperands",
/args=/(ins),
/methodBody=/"",
/defaultImplementation=/[{
OpOperandVector result;
result.reserve(getNumInputs());
llvm::copy_if(getInputOperands(),
std::back_inserter(result),
[](OpOperand *opOperand) {
return opOperand->get().getType().template isa<RankedTensorType>();
});
return result;
}]
>,
//===------------------------------------------------------------------===//
// Output operands handling.
//===------------------------------------------------------------------===//
InterfaceMethod<
/desc=/[{
Return the output operands.
}],
/retTy=/"OpOperandVector",		/retTy=/"OpOperandVector",
/methodName=/"getOutputOperands",		/methodName=/"getOutputOperands",
		nicolasvasilacheUnsubmitted Done Reply Inline Actions I see we now have `MutableOperandRange`, use that insead? nicolasvasilache: I see we now have `MutableOperandRange`, use that insead?
		pifon2aAuthorUnsubmitted Done Reply Inline Actions It allows you to iterate over Values only, unfortunately. pifon2a: It allows you to iterate over Values only, unfortunately.
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
int64_t numOutputs = getNumOutputs();		auto [start, end] = $_op.getOutputsPositionRange();

OpOperandVector result;		OpOperandVector result;
result.reserve(numOutputs);		result.reserve(end - start);
llvm::transform(		for (int i = start; i < end; ++i)
this->getOperation()->getOpOperands()		result.push_back(&$_op->getOpOperand(i));
.take_back(numOutputs),
std::back_inserter(result),
[](OpOperand &opOperand) { return &opOperand; });
return result;		return result;
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Return the `i`-th output operand.",
Return the `i`-th output operand.
}],
/retTy=/"OpOperand*",		/retTy=/"OpOperand*",
/methodName=/"getOutputOperand",		/methodName=/"getOutputOperand",
/args=/(ins "int64_t":$i),		/args=/(ins "int64_t":$i),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
assert(i >= 0 && i < getNumOutputs());		assert(i >= 0 && i < $_op.getNumOutputs());
return &this->getOperation()->getOpOperand(getNumInputs() + i);		auto [start, end] = $_op.getOutputsPositionRange();
		return &$_op->getOpOperand(start + i);
		nicolasvasilacheUnsubmitted Done Reply Inline Actions can this use `getOutputOperands()[i]` and avoid distributing the underlying `this->getOperation()` order assumption to more methods? nicolasvasilache: can this use `getOutputOperands()[i]` and avoid distributing the underlying `this->getOperation…
		pifon2aAuthorUnsubmitted Done Reply Inline Actions It can. We should be careful though. At the moment creating MutableArrayRef<OpOperand> is cheap and doing `getOutputOperands()[i]` is ok-ish, but when/if we make the interface more "flexible" then `getOutputOperands` would construct SmallVector only to use one element of it. pifon2a: It can. We should be careful though. At the moment creating MutableArrayRef<OpOperand> is cheap…
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Set the `i`-th output operand.",
Set the `i`-th output operand.
}],
/retTy=/"void",		/retTy=/"void",
/methodName=/"setOutputOperand",		/methodName=/"setOutputOperand",
/args=/(ins "int64_t":$i, "Value":$value),		/args=/(ins "int64_t":$i, "Value":$value),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
assert(i >= 0 && i < getNumOutputs());		assert(i >= 0 && i < $_op.getNumOutputs());
this->getOperation()->setOperand(getNumInputs() + i, value);		auto [start, end] = $_op.getOutputsPositionRange();
		$_op->setOperand(start + i, value);
		nicolasvasilacheUnsubmitted Done Reply Inline Actions can this use `getOutputOperands()[i]` and avoid distributing the underlying `this->getOperation()` order assumption to more methods? nicolasvasilache: can this use `getOutputOperands()[i]` and avoid distributing the underlying `this->getOperation…
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Return the number of inputs.",
Return the subset of output operands that are of buffer type.		/retTy=/"int64_t",
}],		/methodName=/"getNumInputs",
/retTy=/"OpOperandVector",
/methodName=/"getOutputBufferOperands",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
OpOperandVector result;		return $_op.getNumOperands() - $_op.getNumOutputs();
result.reserve(getNumOutputs());
llvm::copy_if(getOutputOperands(),
std::back_inserter(result),
[](OpOperand *opOperand) {
return opOperand->get().getType().template isa<MemRefType>();
});
return result;
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Return the input operands.",
Return the subset of output operands that are of tensor type.
}],
/retTy=/"OpOperandVector",		/retTy=/"OpOperandVector",
/methodName=/"getOutputTensorOperands",		/methodName=/"getInputOperands",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
		auto [start, end] = $_op.getOutputsPositionRange();
		int64_t numOutputs = end - start;
		int64_t numOperands = $_op.getNumOperands();

OpOperandVector result;		OpOperandVector result;
result.reserve(getNumOutputs());		result.reserve(numOperands - numOutputs);
llvm::copy_if(getOutputOperands(),		for (int i = 0; i < start; ++i)
std::back_inserter(result),		result.push_back(&$_op->getOpOperand(i));
[](OpOperand *opOperand) {		for (int i = end; i < numOperands; ++i)
return opOperand->get().getType().template isa<RankedTensorType>();		result.push_back(&$_op->getOpOperand(end + i));
});
return result;
}]
>,
InterfaceMethod<
/desc=/[{
Return the types of the subset of output operands that are of buffer type.
}],
/retTy=/"SmallVector<MemRefType>",
/methodName=/"getOutputBufferTypes",
/args=/(ins),
/methodBody=/"",
/defaultImplementation=/[{
SmallVector<MemRefType> result;
result.reserve(getNumOutputs());
llvm::transform(getOutputBufferOperands(),
std::back_inserter(result),
[](OpOperand *opOperands) {
return opOperands->get().getType().cast<MemRefType>();
});
return result;		return result;
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/[{ Return the `i`-th input operand. }],
Return the types of the subset of output operands that are of tensor type.		/retTy=/"OpOperand*",
}],		/methodName=/"getInputOperand",
/retTy=/"SmallVector<RankedTensorType>",		/args=/(ins "int64_t":$i),
/methodName=/"getOutputTensorTypes",
/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
SmallVector<RankedTensorType> result;		assert(i >= 0 && i < getNumInputs());
result.reserve(getNumOutputs());		auto [start, end] = $_op.getOutputsPositionRange();
		nicolasvasilacheUnsubmitted Done Reply Inline Actions can this use `getInputOperands()[i]` and avoid distributing the underlying `this->getOperation()` order assumption to more methods? nicolasvasilache: can this use `getInputOperands()[i]` and avoid distributing the underlying `this->getOperation…
llvm::transform(getOutputTensorOperands(),		return &$_op->getOpOperand(i < start ? i : i + end - start) ;
std::back_inserter(result),
[](OpOperand *opOperands) {
return opOperands->get().getType().cast<RankedTensorType>();
});
return result;
}]		}]
>,		>,
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
// Input and Output arguments handling.		// Input and Output arguments handling.
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Return true if `opOperand` is an input.",
Return the range over input and output operands.
}],
/retTy=/"OpOperandVector",
/methodName=/"getInputAndOutputOperands",
/args=/(ins),
/methodBody=/"",
/defaultImplementation=/[{
int64_t numInputsAndOutputs = getNumInputsAndOutputs();
OpOperandVector result;
result.reserve(numInputsAndOutputs);
llvm::transform(
this->getOperation()->getOpOperands(),
std::back_inserter(result),
[](OpOperand &opOperand) { return &opOperand; });
return result;
}]
>,
InterfaceMethod<
/desc=/[{
Return true if `opOperand` is an input tensor.
}],
/retTy=/"bool",		/retTy=/"bool",
/methodName=/"isInputTensor",		/methodName=/"isInput",
/args=/(ins "OpOperand *":$opOperand),		/args=/(ins "OpOperand *":$opOperand),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
if (!opOperand->get().getType().template isa<RankedTensorType>())		auto [start, end] = $_op.getOutputsPositionRange();
return false;		auto operandNumber = opOperand->getOperandNumber();
if (opOperand->getOperandNumber() < $_op.getNumInputs())		return operandNumber < start \|\| operandNumber >= end;
return true;
return false;
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Return true if `opOperand` is an output.",
Return true if `opOperand` is an output tensor.
}],
/retTy=/"bool",		/retTy=/"bool",
/methodName=/"isOutputTensor",		/methodName=/"isOutput",
/args=/(ins "OpOperand *":$opOperand),		/args=/(ins "OpOperand *":$opOperand),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
if (!opOperand->get().getType().template isa<RankedTensorType>())		auto [start, end] = $_op.getOutputsPositionRange();
return false;		auto operandNumber = opOperand->getOperandNumber();
if (opOperand->getOperandNumber() >= $_op.getNumInputs())		return operandNumber >= start && operandNumber < end;
return true;
return false;
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Return true if the `opOperand` is a scalar value.",
Return true if the `opOperand` is a scalar value.
}],
/retTy=/"bool",		/retTy=/"bool",
/methodName=/"isScalar",		/methodName=/"isScalar",
/args=/(ins "OpOperand*":$opOperand),		/args=/(ins "OpOperand*":$opOperand),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
assert(opOperand->getOwner() == this->getOperation());		assert(opOperand->getOwner() == this->getOperation());
return !opOperand->get().getType().template isa<ShapedType>();		return !opOperand->get().getType().template isa<ShapedType>();
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Return the result tied to `opOperand`.",
Return the result tied to `opOperand`.
}],
/retTy=/"OpResult",		/retTy=/"OpResult",
/methodName=/"getTiedOpResult",		/methodName=/"getTiedOpResult",
/args=/(ins "OpOperand*":$opOperand),		/args=/(ins "OpOperand*":$opOperand),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
assert(opOperand->getOwner() == this->getOperation());		assert(opOperand->getOwner() == this->getOperation());
int64_t resultIndex = opOperand->getOperandNumber() - getNumInputs();
		auto [start, end] = $_op.getOutputsPositionRange();
		int64_t resultIndex = opOperand->getOperandNumber() - start;
assert(resultIndex >= 0 &&		assert(resultIndex >= 0 &&
resultIndex < this->getOperation()->getNumResults() );		resultIndex < $_op->getNumResults() );
return this->getOperation()->getResult(resultIndex);		return $_op->getResult(resultIndex);
}]		}]
>,		>,
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
// Other interface methods.		// Other interface methods.
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Return whether the op has only MemRef input and outputs.",
Return whether the op has only MemRef input and outputs.
}],
/retTy=/"bool",		/retTy=/"bool",
/methodName=/"hasBufferSemantics",		/methodName=/"hasBufferSemantics",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
return this->getOperation()->getNumResults() == 0 &&		return $_op->getNumResults() == 0 &&
llvm::all_of(this->getOperation()->getOpOperands(),		llvm::all_of($_op->getOpOperands(),
[&](OpOperand &opOperand) {		[&](OpOperand &opOperand) {
return isScalar(&opOperand) \|\|		return isScalar(&opOperand) \|\|
opOperand.get().getType().template isa<MemRefType>();		opOperand.get().getType().template isa<MemRefType>();
});		});
}]		}]
>,		>,
InterfaceMethod<		InterfaceMethod<
/desc=/[{		/desc=/"Return whether the op has only RankedTensor input and outputs.",
Return whether the op has only RankedTensor input and outputs.
}],
/retTy=/"bool",		/retTy=/"bool",
/methodName=/"hasTensorSemantics",		/methodName=/"hasTensorSemantics",
/args=/(ins),		/args=/(ins),
/methodBody=/"",		/methodBody=/"",
/defaultImplementation=/[{		/defaultImplementation=/[{
return llvm::all_of(this->getOperation()->getOpOperands(),		return llvm::all_of($_op->getOpOperands(),
[&](OpOperand &opOperand) {		[&](OpOperand &opOperand) {
return isScalar(&opOperand) \|\|		return isScalar(&opOperand) \|\|
opOperand.get().getType().template isa<RankedTensorType>();		opOperand.get().getType().template isa<RankedTensorType>();
});		});
}]		}]
>,		>,
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
// Other static interface methods.		// Other static interface methods.
▲ Show 20 Lines • Show All 50 Lines • Show Last 20 Lines

mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOps.td

Show First 20 Lines • Show All 209 Lines • ▼ Show 20 Lines	std::string getLibraryCallName() {
getLibraryCall()->str() : "op_has_no_registered_library_name";		getLibraryCall()->str() : "op_has_no_registered_library_name";
}		}

static std::function<void(ImplicitLocOpBuilder &,		static std::function<void(ImplicitLocOpBuilder &,
Block &, ArrayRef<NamedAttribute>)>		Block &, ArrayRef<NamedAttribute>)>
getRegionBuilder() {		getRegionBuilder() {
return nullptr;		return nullptr;
}		}
		std::pair<int64_t, int64_t> getOutputsPositionRange() {
		int64_t getNumOperands = this->getNumOperands();
		return {getNumOperands - getOutputs().size(), getNumOperands};
		}
}];		}];

let hasCanonicalizer = 1;		let hasCanonicalizer = 1;
let hasCustomAssemblyFormat = 1;		let hasCustomAssemblyFormat = 1;
let hasFolder = 1;		let hasFolder = 1;
let hasVerifier = 1;		let hasVerifier = 1;
}		}

Show All 40 Lines	let extraClassDeclaration = structuredOpsBaseDecls # [{
// Implement functions necessary for LinalgStructuredInterface.		// Implement functions necessary for LinalgStructuredInterface.
SmallVector<StringRef> getIteratorTypesArray();		SmallVector<StringRef> getIteratorTypesArray();
ArrayAttr getIndexingMaps();		ArrayAttr getIndexingMaps();
std::string getLibraryCallName() {		std::string getLibraryCallName() {
return "op_has_no_registered_library_name";		return "op_has_no_registered_library_name";
}		}

// Implement functions necessary for DestinationStyleOpInterface.		// Implement functions necessary for DestinationStyleOpInterface.
unsigned getNumInputs() {		std::pair<int64_t, int64_t> getOutputsPositionRange() {
return this->getOperation()->getNumOperands() - getNumOutputs();		int64_t getNumOperands = this->getNumOperands();
};		return {getNumOperands - 1, getNumOperands};
unsigned getNumOutputs() { return 1; };		}
mlir::ValueRange getOutputs() { return getOperands().take_back(1); }
linalg::OpOperandVector getOpOperandsMatchingBBargs() {		linalg::OpOperandVector getOpOperandsMatchingBBargs() {
return getInputOperands();		return getInputOperands();
}		}

static std::function<void(mlir::ImplicitLocOpBuilder &, mlir::Block &,		static std::function<void(mlir::ImplicitLocOpBuilder &, mlir::Block &,
mlir::ArrayRef<mlir::NamedAttribute>)>		mlir::ArrayRef<mlir::NamedAttribute>)>
getRegionBuilder() {		getRegionBuilder() {
return nullptr;		return nullptr;
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	let extraClassDeclaration = structuredOpsBaseDecls # [{
// Declare functions necessary for LinalgStructuredInterface.		// Declare functions necessary for LinalgStructuredInterface.
SmallVector<StringRef> getIteratorTypesArray();		SmallVector<StringRef> getIteratorTypesArray();
ArrayAttr getIndexingMaps();		ArrayAttr getIndexingMaps();
std::string getLibraryCallName() {		std::string getLibraryCallName() {
return "op_has_no_registered_library_name";		return "op_has_no_registered_library_name";
}		}

// Implement functions necessary for DestinationStyleOpInterface.		// Implement functions necessary for DestinationStyleOpInterface.
mlir::ValueRange getOutputs() { return getInits(); }
unsigned getNumInputs() { return getInputs().size(); };
unsigned getNumOutputs() { return getInits().size(); };
static std::function<void(mlir::ImplicitLocOpBuilder &, mlir::Block &,		static std::function<void(mlir::ImplicitLocOpBuilder &, mlir::Block &,
mlir::ArrayRef<mlir::NamedAttribute>)>		mlir::ArrayRef<mlir::NamedAttribute>)>
getRegionBuilder() {		getRegionBuilder() {
return nullptr;		return nullptr;
}		}
		std::pair<int64_t, int64_t> getOutputsPositionRange() {
		return {getInits().size(), getNumOperands()};
		}
}];		}];

let hasCustomAssemblyFormat = 1;		let hasCustomAssemblyFormat = 1;
let hasVerifier = 1;		let hasVerifier = 1;
}		}


//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Named Linalg ops, implemented as a declarative configurations of generic ops.		// Named Linalg ops, implemented as a declarative configurations of generic ops.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

include "mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yamlgen.td"		include "mlir/Dialect/Linalg/IR/LinalgNamedStructuredOps.yamlgen.td"

#endif // LINALG_STRUCTURED_OPS		#endif // LINALG_STRUCTURED_OPS

mlir/lib/CAPI/Dialect/Linalg.cpp

Show All 23 Lines	void mlirLinalgFillBuiltinNamedOpRegion(MlirOperation mlirOp) {

assert(fun && "Expected a builtin named Linalg op.");		assert(fun && "Expected a builtin named Linalg op.");
assert(op->getNumRegions() == 1 && "Expected Linalg op with 1 region");		assert(op->getNumRegions() == 1 && "Expected Linalg op with 1 region");
assert(op->getRegion(0).getBlocks().empty() &&		assert(op->getRegion(0).getBlocks().empty() &&
"Expected Linalg op with 0 blocks");		"Expected Linalg op with 0 blocks");

SmallVector<Type, 8> argTypes;		SmallVector<Type, 8> argTypes;
SmallVector<Location, 8> argLocs;		SmallVector<Location, 8> argLocs;
for (OpOperand *opOperand : linalgOp.getInputAndOutputOperands()) {		for (OpOperand &opOperand : linalgOp->getOpOperands()) {
argTypes.push_back(getElementTypeOrSelf(opOperand->get().getType()));		argTypes.push_back(getElementTypeOrSelf(opOperand.get().getType()));
argLocs.push_back(opOperand->get().getLoc());		argLocs.push_back(opOperand.get().getLoc());
}		}

ImplicitLocOpBuilder b(op->getLoc(), op->getContext());		ImplicitLocOpBuilder b(op->getLoc(), op->getContext());
Region &region = op->getRegion(0);		Region &region = op->getRegion(0);
Block body = b.createBlock(&region, /insertPt=*/{}, argTypes, argLocs);		Block body = b.createBlock(&region, /insertPt=*/{}, argTypes, argLocs);
b.setInsertionPointToStart(body);		b.setInsertionPointToStart(body);
fun(b, *body, op->getAttrs());		fun(b, *body, op->getAttrs());
}		}

MLIR_DEFINE_CAPI_DIALECT_REGISTRATION(Linalg, linalg, LinalgDialect)		MLIR_DEFINE_CAPI_DIALECT_REGISTRATION(Linalg, linalg, LinalgDialect)

mlir/lib/Dialect/Linalg/Analysis/DependenceAnalysis.cpp

Show First 20 Lines • Show All 160 Lines • ▼ Show 20 Lines	LinalgDependenceGraph::getDependencesInto(
return llvm::make_range(iter->second.begin(), iter->second.end());		return llvm::make_range(iter->second.begin(), iter->second.end());
}		}

void LinalgDependenceGraph::addDependencesBetween(LinalgOp src, LinalgOp dst) {		void LinalgDependenceGraph::addDependencesBetween(LinalgOp src, LinalgOp dst) {
LLVM_DEBUG(dbgs() << "addDependencesBetween " << *src.getOperation()		LLVM_DEBUG(dbgs() << "addDependencesBetween " << *src.getOperation()
<< " and " << *dst.getOperation() << "\n");		<< " and " << *dst.getOperation() << "\n");
if (src.hasTensorSemantics() && dst.hasTensorSemantics()) {		if (src.hasTensorSemantics() && dst.hasTensorSemantics()) {
for (OpOperand *dstOpOperand : dst.getInputOperands()) {		for (OpOperand *dstOpOperand : dst.getInputOperands()) {
		if (!dstOpOperand->get().getType().isa<RankedTensorType>())
		continue;
// Check if the operand is defined by the src.		// Check if the operand is defined by the src.
auto definingOp = dstOpOperand->get().getDefiningOp<LinalgOp>();		auto definingOp = dstOpOperand->get().getDefiningOp<LinalgOp>();
if (definingOp && definingOp == src)		if (definingOp && definingOp == src)
addDependenceElem(DependenceType::RAW, dstOpOperand->get(),		addDependenceElem(DependenceType::RAW, dstOpOperand->get(),
dstOpOperand);		dstOpOperand);
}		}
for (OpOperand *dstOpOperand : dst.getOutputOperands()) {		for (OpOperand *dstOpOperand : dst.getOutputOperands()) {
// Check if the operand is defined by the src.		// Check if the operand is defined by the src.
auto definingOp = dstOpOperand->get().getDefiningOp<LinalgOp>();		auto definingOp = dstOpOperand->get().getDefiningOp<LinalgOp>();
if (definingOp && definingOp == src) {		if (definingOp && definingOp == src) {
if (dst.isInitTensor(dstOpOperand)) {		if (dst.isInitTensor(dstOpOperand)) {
addDependenceElem(DependenceType::RAW, dstOpOperand->get(),		addDependenceElem(DependenceType::RAW, dstOpOperand->get(),
dstOpOperand);		dstOpOperand);
}		}
addDependenceElem(DependenceType::WAW, dstOpOperand->get(),		addDependenceElem(DependenceType::WAW, dstOpOperand->get(),
dstOpOperand);		dstOpOperand);
}		}
}		}
return;		return;
}		}
assert(src.hasBufferSemantics() && dst.hasBufferSemantics() &&		assert(src.hasBufferSemantics() && dst.hasBufferSemantics() &&
"unhandled dependence tracking for mixed buffer/tensor operations");		"unhandled dependence tracking for mixed buffer/tensor operations");
for (OpOperand *srcOpOperand : src.getOutputBufferOperands()) { // W		for (OpOperand *srcOpOperand : src.getOutputOperands()) { // W
// RAW graph		// RAW graph
for (OpOperand *dstOpOperand : dst.getInputBufferOperands()) // R		for (OpOperand *dstOpOperand : dst.getInputOperands()) { // R
		if (!dstOpOperand->get().getType().isa<MemRefType>())
		continue;
if (aliases.alias(srcOpOperand->get(), dstOpOperand->get())) // RAW alias		if (aliases.alias(srcOpOperand->get(), dstOpOperand->get())) // RAW alias
addDependenceElem(DependenceType::RAW, srcOpOperand, dstOpOperand);		addDependenceElem(DependenceType::RAW, srcOpOperand, dstOpOperand);
		}
// WAW graph		// WAW graph
for (OpOperand *dstOpOperand : dst.getOutputBufferOperands()) // W		for (OpOperand *dstOpOperand : dst.getOutputOperands()) // W
if (aliases.alias(srcOpOperand->get(), dstOpOperand->get())) // WAW alias		if (aliases.alias(srcOpOperand->get(), dstOpOperand->get())) // WAW alias
addDependenceElem(DependenceType::WAW, srcOpOperand, dstOpOperand);		addDependenceElem(DependenceType::WAW, srcOpOperand, dstOpOperand);
}		}
for (OpOperand *srcOpOperand : src.getInputBufferOperands()) { // R		for (OpOperand *srcOpOperand : src.getInputOperands()) { // R
		if (!srcOpOperand->get().getType().isa<MemRefType>())
		continue;
// RAR graph		// RAR graph
for (OpOperand *dstOpOperand : dst.getInputBufferOperands()) // R		for (OpOperand *dstOpOperand : dst.getInputOperands()) { // R
		if (!dstOpOperand->get().getType().isa<MemRefType>())
		continue;
if (aliases.alias(srcOpOperand->get(), dstOpOperand->get())) // RAR alias		if (aliases.alias(srcOpOperand->get(), dstOpOperand->get())) // RAR alias
addDependenceElem(DependenceType::RAR, srcOpOperand, dstOpOperand);		addDependenceElem(DependenceType::RAR, srcOpOperand, dstOpOperand);
		}
// WAR graph		// WAR graph
for (OpOperand *dstOpOperand : dst.getOutputBufferOperands()) // W		for (OpOperand *dstOpOperand : dst.getOutputOperands()) // W
if (aliases.alias(srcOpOperand->get(), dstOpOperand->get())) // WAR alias		if (aliases.alias(srcOpOperand->get(), dstOpOperand->get())) // WAR alias
addDependenceElem(DependenceType::WAR, srcOpOperand, dstOpOperand);		addDependenceElem(DependenceType::WAR, srcOpOperand, dstOpOperand);
}		}
}		}

SmallVector<Operation *, 8>		SmallVector<Operation *, 8>
LinalgDependenceGraph::findCoveringDependences(LinalgOp srcLinalgOp,		LinalgDependenceGraph::findCoveringDependences(LinalgOp srcLinalgOp,
LinalgOp dstLinalgOp) const {		LinalgOp dstLinalgOp) const {
▲ Show 20 Lines • Show All 141 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp

Show All 25 Lines
#include "mlir/Dialect/Linalg/IR/LinalgInterfaces.cpp.inc"		#include "mlir/Dialect/Linalg/IR/LinalgInterfaces.cpp.inc"

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Interface utility functions		// Interface utility functions
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
bool linalg::detail::canOpOperandsBeDroppedImpl(		bool linalg::detail::canOpOperandsBeDroppedImpl(
linalg::LinalgOp linalgOp, ArrayRef<OpOperand *> droppedOperands) {		linalg::LinalgOp linalgOp, ArrayRef<OpOperand *> droppedOperands) {
SmallVector<AffineMap> indexingMaps;		SmallVector<AffineMap> indexingMaps;
for (auto *opOperand : linalgOp.getInputAndOutputOperands()) {		for (auto &opOperand : linalgOp->getOpOperands()) {
if (llvm::is_contained(droppedOperands, opOperand))		if (llvm::is_contained(droppedOperands, &opOperand))
continue;		continue;
indexingMaps.push_back(linalgOp.getMatchingIndexingMap(opOperand));		indexingMaps.push_back(linalgOp.getMatchingIndexingMap(&opOperand));
}		}
return inversePermutation(concatAffineMaps(indexingMaps)) != AffineMap();		return inversePermutation(concatAffineMaps(indexingMaps)) != AffineMap();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ContractionOpInterface implementation		// ContractionOpInterface implementation
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

▲ Show 20 Lines • Show All 440 Lines • ▼ Show 20 Lines	static OpFoldResult createFoldedDimOp(OpBuilder &b, Location loc, Value source,
if (!shapedType.hasRank() \|\| shapedType.isDynamicDim(dim))		if (!shapedType.hasRank() \|\| shapedType.isDynamicDim(dim))
return createOrFoldDimOp(b, loc, source, dim);		return createOrFoldDimOp(b, loc, source, dim);
return b.getIndexAttr(shapedType.getDimSize(dim));		return b.getIndexAttr(shapedType.getDimSize(dim));
}		}

SmallVector<OpFoldResult> LinalgOp::createFlatListOfOperandDims(OpBuilder &b,		SmallVector<OpFoldResult> LinalgOp::createFlatListOfOperandDims(OpBuilder &b,
Location loc) {		Location loc) {
SmallVector<OpFoldResult> res;		SmallVector<OpFoldResult> res;
for (OpOperand *opOperand : getInputAndOutputOperands()) {		for (OpOperand &opOperand : getOperation()->getOpOperands()) {
for (int64_t i = 0, e = getRank(opOperand); i < e; ++i)		for (int64_t i = 0, e = getRank(&opOperand); i < e; ++i)
res.push_back(createFoldedDimOp(b, loc, opOperand->get(), i));		res.push_back(createFoldedDimOp(b, loc, opOperand.get(), i));
}		}
return res;		return res;
}		}

SmallVector<int64_t, 4> LinalgOp::createFlatListOfOperandStaticDims() {		SmallVector<int64_t, 4> LinalgOp::createFlatListOfOperandStaticDims() {
SmallVector<int64_t, 4> res;		SmallVector<int64_t, 4> res;
assert(!hasDynamicShape() && "expected operands to have static shapes");		assert(!hasDynamicShape() && "expected operands to have static shapes");
for (OpOperand *opOperand : getInputAndOutputOperands())		for (OpOperand &opOperand : getOperation()->getOpOperands())
llvm::append_range(res, getShape(opOperand));		llvm::append_range(res, getShape(&opOperand));
return res;		return res;
}		}

SmallVector<Range, 4> LinalgOp::createLoopRanges(OpBuilder &b, Location loc) {		SmallVector<Range, 4> LinalgOp::createLoopRanges(OpBuilder &b, Location loc) {
AffineMap map = getLoopsToShapesMap();		AffineMap map = getLoopsToShapesMap();
unsigned numDims = map.getNumDims(), numRes = map.getNumResults();		unsigned numDims = map.getNumDims(), numRes = map.getNumResults();
auto viewSizes = createFlatListOfOperandDims(b, loc);		auto viewSizes = createFlatListOfOperandDims(b, loc);
SmallVector<Range, 4> res(numDims);		SmallVector<Range, 4> res(numDims);
▲ Show 20 Lines • Show All 125 Lines • ▼ Show 20 Lines	LogicalResult mlir::linalg::detail::verifyStructuredOpInterface(Operation *op) {
// Before checking indexing maps, we need to make sure the attributes		// Before checking indexing maps, we need to make sure the attributes
// referenced by it are valid.		// referenced by it are valid.
if (linalgOp.hasDynamicIndexingMaps())		if (linalgOp.hasDynamicIndexingMaps())
if (failed(linalgOp.verifyIndexingMapRequiredAttributes()))		if (failed(linalgOp.verifyIndexingMapRequiredAttributes()))
return failure();		return failure();

// All input/output operands must be indexed.		// All input/output operands must be indexed.
if (static_cast<int64_t>(linalgOp.getIndexingMapsArray().size()) !=		if (static_cast<int64_t>(linalgOp.getIndexingMapsArray().size()) !=
linalgOp.getNumInputsAndOutputs())		linalgOp->getNumOperands())
return op->emitOpError("expected the number of indexing_map (")		return op->emitOpError("expected the number of indexing_map (")
<< linalgOp.getIndexingMapsArray().size()		<< linalgOp.getIndexingMapsArray().size()
<< ") to be equal to the number of input/output operands ("		<< ") to be equal to the number of input/output operands ("
<< linalgOp.getNumInputsAndOutputs() << ")";		<< linalgOp->getNumOperands() << ")";

for (OpOperand *opOperand : linalgOp.getInputAndOutputOperands()) {		for (OpOperand &opOperand : linalgOp->getOpOperands()) {
AffineMap indexingMap = linalgOp.getMatchingIndexingMap(opOperand);		AffineMap indexingMap = linalgOp.getMatchingIndexingMap(&opOperand);

// Symbols disallowed.		// Symbols disallowed.
if (indexingMap.getNumSymbols() != 0)		if (indexingMap.getNumSymbols() != 0)
return op->emitOpError("unexpected symbols in indexing_map #")		return op->emitOpError("unexpected symbols in indexing_map #")
<< opOperand->getOperandNumber();		<< opOperand.getOperandNumber();

// Domain must be consistent.		// Domain must be consistent.
unsigned numLoops = linalgOp.getNumLoops();		unsigned numLoops = linalgOp.getNumLoops();
if (indexingMap.getNumDims() != numLoops)		if (indexingMap.getNumDims() != numLoops)
return op->emitOpError("expected indexing_map #")		return op->emitOpError("expected indexing_map #")
<< opOperand->getOperandNumber() << " to have " << numLoops		<< opOperand.getOperandNumber() << " to have " << numLoops
<< " dim(s) to match the number of loops";		<< " dim(s) to match the number of loops";

int64_t rank = linalgOp.getRank(opOperand);		int64_t rank = linalgOp.getRank(&opOperand);
if (indexingMap.getNumResults() != rank)		if (indexingMap.getNumResults() != rank)
return op->emitOpError("expected operand rank (")		return op->emitOpError("expected operand rank (")
<< rank << ") to match the result rank of indexing_map #"		<< rank << ") to match the result rank of indexing_map #"
<< opOperand->getOperandNumber() << " ("		<< opOperand.getOperandNumber() << " ("
<< indexingMap.getNumResults() << ")";		<< indexingMap.getNumResults() << ")";
}		}

SmallVector<unsigned> redDims;		SmallVector<unsigned> redDims;
linalgOp.getReductionDims(redDims);		linalgOp.getReductionDims(redDims);

if (!linalgOp.getShapesToLoopsMap())		if (!linalgOp.getShapesToLoopsMap())
return op->emitOpError("expected the shape-to-loops map to be non-null");		return op->emitOpError("expected the shape-to-loops map to be non-null");

// Check if given shapes match to inferred shapes.		// Check if given shapes match to inferred shapes.
SmallVector<int64_t, 4> endLoopRangeValues = linalgOp.getStaticLoopRanges();		SmallVector<int64_t, 4> endLoopRangeValues = linalgOp.getStaticLoopRanges();
SmallVector<int64_t, 4> startLoopRangeValues(endLoopRangeValues.size(), 0);		SmallVector<int64_t, 4> startLoopRangeValues(endLoopRangeValues.size(), 0);

// Verify only static cases since we can't get exact dimension sizes and loop		// Verify only static cases since we can't get exact dimension sizes and loop
// ranges for dynamic cases in this stage.		// ranges for dynamic cases in this stage.
if (llvm::none_of(endLoopRangeValues, ShapedType::isDynamic)) {		if (llvm::none_of(endLoopRangeValues, ShapedType::isDynamic)) {
for (int64_t &range : endLoopRangeValues)		for (int64_t &range : endLoopRangeValues)
range -= 1;		range -= 1;
for (OpOperand *opOperand : linalgOp.getInputAndOutputOperands()) {		for (OpOperand &opOperand : linalgOp->getOpOperands()) {
AffineMap indexingMap = linalgOp.getMatchingIndexingMap(opOperand);		AffineMap indexingMap = linalgOp.getMatchingIndexingMap(&opOperand);
SmallVector<int64_t, 4> startIndices =		SmallVector<int64_t, 4> startIndices =
indexingMap.compose(startLoopRangeValues);		indexingMap.compose(startLoopRangeValues);
SmallVector<int64_t, 4> endIndices =		SmallVector<int64_t, 4> endIndices =
indexingMap.compose(endLoopRangeValues);		indexingMap.compose(endLoopRangeValues);
ArrayRef<int64_t> shape = linalgOp.getShape(opOperand);		ArrayRef<int64_t> shape = linalgOp.getShape(&opOperand);
for (auto dim : llvm::seq<int64_t>(0, shape.size())) {		for (auto dim : llvm::seq<int64_t>(0, shape.size())) {
// Ignore dynamic dimension or the case that the dimension size is 0		// Ignore dynamic dimension or the case that the dimension size is 0
if (ShapedType::isDynamic(shape[dim]) \|\| shape[dim] == 0)		if (ShapedType::isDynamic(shape[dim]) \|\| shape[dim] == 0)
continue;		continue;

// The first index or last index should be the maximum or the minimum in		// The first index or last index should be the maximum or the minimum in
// the inferred index ranges since the range is increasing or		// the inferred index ranges since the range is increasing or
// decreasing. The size of dimensions of input/output operands and the		// decreasing. The size of dimensions of input/output operands and the
Show All 14 Lines	for (OpOperand &opOperand : linalgOp->getOpOperands()) {
}		}
return op->emitOpError(		return op->emitOpError(
"unexpected result less than 0 at expression #")		"unexpected result less than 0 at expression #")
<< dim << " in " << mapStr;		<< dim << " in " << mapStr;
}		}
if (indexingMap.getResult(dim).dyn_cast<AffineDimExpr>()) {		if (indexingMap.getResult(dim).dyn_cast<AffineDimExpr>()) {
if (inferredDimSize != shape[dim]) {		if (inferredDimSize != shape[dim]) {
return op->emitOpError("inferred input/output operand #")		return op->emitOpError("inferred input/output operand #")
<< opOperand->getOperandNumber()		<< opOperand.getOperandNumber() << " has shape's dimension #"
<< " has shape's dimension #" << dim << " to be "		<< dim << " to be " << inferredDimSize << ", but found "
<< inferredDimSize << ", but found " << shape[dim];		<< shape[dim];
}		}
} else {		} else {
if (inferredDimSize > shape[dim]) {		if (inferredDimSize > shape[dim]) {
return op->emitOpError("inferred input/output operand #")		return op->emitOpError("inferred input/output operand #")
<< opOperand->getOperandNumber()		<< opOperand.getOperandNumber() << " has shape's dimension #"
<< " has shape's dimension #" << dim		<< dim << " to be greater than or equal to "
<< " to be greater than or equal to " << inferredDimSize		<< inferredDimSize << ", but found " << shape[dim];
<< ", but found " << shape[dim];
}		}
}		}
}		}
}		}
}		}

// Check the region has exactly one block.		// Check the region has exactly one block.
if (linalgOp->getNumRegions() != 1 \|\|		if (linalgOp->getNumRegions() != 1 \|\|
Show All 25 Lines	LogicalResult mlir::linalg::detail::verifyStructuredOpInterface(Operation *op) {
return success();		return success();
}		}

LogicalResult		LogicalResult
mlir::linalg::detail::verifyDestinationStyleOpInterface(Operation *op) {		mlir::linalg::detail::verifyDestinationStyleOpInterface(Operation *op) {
DestinationStyleOpInterface dstStyleOp =		DestinationStyleOpInterface dstStyleOp =
cast<DestinationStyleOpInterface>(op);		cast<DestinationStyleOpInterface>(op);

		SmallVector<OpOperand *> outputBufferOperands, outputTensorOperands;
		for (OpOperand *operand : dstStyleOp.getOutputOperands()) {
		Type type = operand->get().getType();
		if (type.isa<MemRefType>())
		outputBufferOperands.push_back(operand);
		if (type.isa<RankedTensorType>())
		outputTensorOperands.push_back(operand);
		}

// Expect at least one output operand.		// Expect at least one output operand.
// This means an op that constructs a tensor out of indices cannot be a		// This means an op that constructs a tensor out of indices cannot be a
// LinalgOp at the moment. For now this will have to be a special op until we		// LinalgOp at the moment. For now this will have to be a special op until we
// have output shape operands that are not tensors.		// have output shape operands that are not tensors.
int64_t numInputs = dstStyleOp.getNumInputs();		int64_t numInputs = dstStyleOp.getNumInputs();
int64_t numOutputs = dstStyleOp.getNumOutputs();		int64_t numOutputs = dstStyleOp.getNumOutputs();
if (numOutputs == 0)		if (numOutputs == 0)
return op->emitOpError("expected at least one output operand");		return op->emitOpError("expected at least one output operand");
		springermUnsubmitted Done Reply Inline Actions Can we move this limitation to the LinalgOp verifier? springerm: Can we move this limitation to the LinalgOp verifier?
		pifon2aAuthorUnsubmitted Done Reply Inline Actions We can, but why? Shouldn't there be at least one destination for DPS ops? pifon2a: We can, but why? Shouldn't there be at least one destination for DPS ops?
if (failed(OpTrait::impl::verifyNOperands(op, numInputs + numOutputs)))		if (failed(OpTrait::impl::verifyNOperands(op, numInputs + numOutputs)))
return failure();		return failure();
		springermUnsubmitted Done Reply Inline Actions I think this check does not do anything because `numInputs` is computed as `num_operands - num_outputs`. springerm: I think this check does not do anything because `numInputs` is computed as `num_operands…
// Verify the number of results matches the number of output tensors.		// Verify the number of results matches the number of output tensors.
if (op->getNumResults() != dstStyleOp.getOutputTensorOperands().size())		if (op->getNumResults() != outputTensorOperands.size())
return op->emitOpError("expected the number of results (")		return op->emitOpError("expected the number of results (")
<< op->getNumResults()		<< op->getNumResults()
<< ") to be equal to the number of output tensors ("		<< ") to be equal to the number of output tensors ("
<< dstStyleOp.getOutputTensorOperands().size() << ")";		<< outputTensorOperands.size() << ")";

// Simplifying assumption: either full tensor or full buffer mode.		// Simplifying assumption: either full tensor or full buffer mode.
// This allows simpler verification of output operands vs result types		// This allows simpler verification of output operands vs result types
// without premature tracking of which operand is what in mixed-mode.		// without premature tracking of which operand is what in mixed-mode.
// TODO: relax when mixed-mode needs to pass verification.		// TODO: relax when mixed-mode needs to pass verification.
if (!dstStyleOp.getOutputBufferOperands().empty() &&		if (!outputBufferOperands.empty() && !outputTensorOperands.empty())
!dstStyleOp.getOutputTensorOperands().empty())
return op->emitOpError(		return op->emitOpError(
"expected output operands to all have tensor type or "		"expected output operands to all have tensor type or "
"all have buffer type");		"all have buffer type");
		springermUnsubmitted Done Reply Inline Actions nit: I would move this check before the previous check for better error messages. springerm: nit: I would move this check before the previous check for better error messages.

for (OpOperand *opOperand : dstStyleOp.getOutputTensorOperands()) {		for (OpOperand *opOperand : outputTensorOperands) {
OpResult result = dstStyleOp.getTiedOpResult(opOperand);		OpResult result = dstStyleOp.getTiedOpResult(opOperand);
if (result.getType() != opOperand->get().getType())		if (result.getType() != opOperand->get().getType())
return op->emitOpError("expected type of operand #")		return op->emitOpError("expected type of operand #")
<< opOperand->getOperandNumber() << " ("		<< opOperand->getOperandNumber() << " ("
<< opOperand->get().getType() << ")"		<< opOperand->get().getType() << ")"
<< " to match type of corresponding result (" << result.getType()		<< " to match type of corresponding result (" << result.getType()
<< ")";		<< ")";
}		}

return success();		return success();
}		}

mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp

Show First 20 Lines • Show All 761 Lines • ▼ Show 20 Lines	for (auto attr : (*this)->getAttrs())
if (genericAttrNamesSet.count(attr.getName().strref()) > 0)		if (genericAttrNamesSet.count(attr.getName().strref()) > 0)
genericAttrs.push_back(attr);		genericAttrs.push_back(attr);
if (!genericAttrs.empty()) {		if (!genericAttrs.empty()) {
auto genericDictAttr = DictionaryAttr::get(getContext(), genericAttrs);		auto genericDictAttr = DictionaryAttr::get(getContext(), genericAttrs);
p << genericDictAttr;		p << genericDictAttr;
}		}

// Printing is shared with named ops, except for the region and attributes		// Printing is shared with named ops, except for the region and attributes
printCommonStructuredOpParts(p, getInputs(), getOutputs());		printCommonStructuredOpParts(p, SmallVector<Value>(getInputOperands()),
		SmallVector<Value>(getOutputOperands()));

genericAttrNames.push_back("operand_segment_sizes");		genericAttrNames.push_back("operand_segment_sizes");
genericAttrNamesSet.insert(genericAttrNames.back());		genericAttrNamesSet.insert(genericAttrNames.back());

bool hasExtraAttrs = false;		bool hasExtraAttrs = false;
for (NamedAttribute n : (*this)->getAttrs()) {		for (NamedAttribute n : (*this)->getAttrs()) {
if ((hasExtraAttrs = !genericAttrNamesSet.contains(n.getName().strref())))		if ((hasExtraAttrs = !genericAttrNamesSet.contains(n.getName().strref())))
break;		break;
▲ Show 20 Lines • Show All 51 Lines • ▼ Show 20 Lines	ParseResult GenericOp::parse(OpAsmParser &parser, OperationState &result) {
result.addTypes(outputTensorsTypes);		result.addTypes(outputTensorsTypes);

return success();		return success();
}		}

static void getGenericEffectsImpl(		static void getGenericEffectsImpl(
SmallVectorImpl<SideEffects::EffectInstance<MemoryEffects::Effect>>		SmallVectorImpl<SideEffects::EffectInstance<MemoryEffects::Effect>>
&effects,		&effects,
ValueRange results, ValueRange inputBuffers, ValueRange outputs) {		ValueRange results, OpOperandVector inputOperands,
for (Value value : inputBuffers) {		OpOperandVector outputOperands) {
effects.emplace_back(MemoryEffects::Read::get(), value,		for (auto *operand : inputOperands) {
		if (!operand->get().getType().isa<MemRefType>())
		continue;
		effects.emplace_back(MemoryEffects::Read::get(), operand->get(),
SideEffects::DefaultResource::get());		SideEffects::DefaultResource::get());
}		}
for (Value value : outputs) {		for (auto *operand : outputOperands) {
effects.emplace_back(MemoryEffects::Read::get(), value,		if (!operand->get().getType().isa<MemRefType>())
		continue;
		effects.emplace_back(MemoryEffects::Read::get(), operand->get(),
SideEffects::DefaultResource::get());		SideEffects::DefaultResource::get());
effects.emplace_back(MemoryEffects::Write::get(), value,		effects.emplace_back(MemoryEffects::Write::get(), operand->get(),
SideEffects::DefaultResource::get());		SideEffects::DefaultResource::get());
}		}
}		}

void GenericOp::getEffects(		void GenericOp::getEffects(
SmallVectorImpl<SideEffects::EffectInstance<MemoryEffects::Effect>>		SmallVectorImpl<SideEffects::EffectInstance<MemoryEffects::Effect>>
&effects) {		&effects) {
SmallVector<Value> inputBuffers = getInputBufferOperands();		getGenericEffectsImpl(effects, getOperation()->getResults(),
SmallVector<Value> outputBuffers = getOutputBufferOperands();		getInputOperands(), getOutputOperands());
getGenericEffectsImpl(effects, getOperation()->getResults(), inputBuffers,
outputBuffers);
}		}

static bool isResultValueDead(linalg::GenericOp genericOp, OpResult result) {		static bool isResultValueDead(linalg::GenericOp genericOp, OpResult result) {
if (!result.use_empty())		if (!result.use_empty())
return false;		return false;
// If out operand not used in payload, we can drop it.		// If out operand not used in payload, we can drop it.
OpOperand *outputOpOperand =		OpOperand *outputOpOperand =
genericOp.getOutputOperand(result.getResultNumber());		genericOp.getOutputOperand(result.getResultNumber());
▲ Show 20 Lines • Show All 54 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(GenericOp genericOp,

// Gather information about the dropped outputs.		// Gather information about the dropped outputs.
llvm::SmallDenseMap<unsigned, unsigned> origOutsToNewOutsPos =		llvm::SmallDenseMap<unsigned, unsigned> origOutsToNewOutsPos =
deduplicateOutputOperands(genericOp, droppedOpOperands,		deduplicateOutputOperands(genericOp, droppedOpOperands,
newOutputOperands, newIndexingMaps);		newOutputOperands, newIndexingMaps);

// Check if there is any change to operands.		// Check if there is any change to operands.
if (newInputOperands.size() + newOutputOperands.size() ==		if (newInputOperands.size() + newOutputOperands.size() ==
static_cast<size_t>(genericOp.getNumInputsAndOutputs()))		genericOp->getNumOperands())
return failure();		return failure();

// Create the new op with the body being empty.		// Create the new op with the body being empty.
Location loc = genericOp.getLoc();		Location loc = genericOp.getLoc();
SmallVector<Type> newResultTypes;		SmallVector<Type> newResultTypes;
if (genericOp.hasTensorSemantics()) {		if (genericOp.hasTensorSemantics()) {
newResultTypes = llvm::to_vector(llvm::map_range(		newResultTypes = llvm::to_vector(llvm::map_range(
newOutputOperands, [](Value v) { return v.getType(); }));		newOutputOperands, [](Value v) { return v.getType(); }));
Show All 35 Lines	private:
// - The preserved input operands list (by reference).		// - The preserved input operands list (by reference).
llvm::SmallDenseMap<unsigned, unsigned>		llvm::SmallDenseMap<unsigned, unsigned>
deduplicateInputOperands(GenericOp genericOp,		deduplicateInputOperands(GenericOp genericOp,
SmallVector<OpOperand *> &droppedOpOperands,		SmallVector<OpOperand *> &droppedOpOperands,
SmallVector<Value> &newInputOperands,		SmallVector<Value> &newInputOperands,
SmallVector<AffineMap> &newIndexingMaps) const {		SmallVector<AffineMap> &newIndexingMaps) const {
llvm::SmallDenseMap<unsigned, unsigned> origToNewPos;		llvm::SmallDenseMap<unsigned, unsigned> origToNewPos;
llvm::SmallDenseMap<std::pair<Value, AffineMap>, unsigned> dedupedInputs;		llvm::SmallDenseMap<std::pair<Value, AffineMap>, unsigned> dedupedInputs;
for (const auto &inputOpOperand :		for (const auto &en : llvm::enumerate(genericOp.getInputOperands())) {
llvm::enumerate(genericOp.getInputOperands())) {		OpOperand *inputOpOperand = en.value();
// Check if operand is dead and if dropping the indexing map makes the		// Check if operand is dead and if dropping the indexing map makes the
// loops to shape computation invalid.		// loops to shape computation invalid.
if (!genericOp.payloadUsesValueFromOperand(inputOpOperand.value())) {		if (!genericOp.payloadUsesValueFromOperand(inputOpOperand)) {
// Add the current operands to the list of potentially droppable		// Add the current operands to the list of potentially droppable
// operands. If it cannot be dropped, this needs to be popped back.		// operands. If it cannot be dropped, this needs to be popped back.
droppedOpOperands.push_back(inputOpOperand.value());		droppedOpOperands.push_back(inputOpOperand);
if (genericOp.canOpOperandsBeDropped(droppedOpOperands))		if (genericOp.canOpOperandsBeDropped(droppedOpOperands))
continue;		continue;
droppedOpOperands.pop_back();		droppedOpOperands.pop_back();
}		}

// Check if this operand is a duplicate.		// Check if this operand is a duplicate.
AffineMap indexingMap =		AffineMap indexingMap = genericOp.getMatchingIndexingMap(inputOpOperand);
genericOp.getMatchingIndexingMap(inputOpOperand.value());
auto it = dedupedInputs.find(		auto it = dedupedInputs.find(
std::make_pair(inputOpOperand.value()->get(), indexingMap));		std::make_pair(inputOpOperand->get(), indexingMap));
if (it != dedupedInputs.end()) {		if (it != dedupedInputs.end()) {
origToNewPos[inputOpOperand.index()] = it->second;		origToNewPos[en.index()] = it->second;
droppedOpOperands.push_back(inputOpOperand.value());		droppedOpOperands.push_back(inputOpOperand);
continue;		continue;
}		}

// This is a preserved argument.		// This is a preserved argument.
origToNewPos[inputOpOperand.index()] = newInputOperands.size();		origToNewPos[en.index()] = newInputOperands.size();
dedupedInputs[{inputOpOperand.value()->get(), indexingMap}] =		dedupedInputs[{inputOpOperand->get(), indexingMap}] =
newInputOperands.size();		newInputOperands.size();
newInputOperands.push_back(inputOpOperand.value()->get());		newInputOperands.push_back(inputOpOperand->get());
newIndexingMaps.push_back(indexingMap);		newIndexingMaps.push_back(indexingMap);
}		}
return origToNewPos;		return origToNewPos;
}		}

// Deduplicate output operands, and return the		// Deduplicate output operands, and return the
// - Mapping from operand position in the original op, to operand position in		// - Mapping from operand position in the original op, to operand position in
// the canonicalized op.		// the canonicalized op.
// - The preserved output operands list (by reference).		// - The preserved output operands list (by reference).
llvm::SmallDenseMap<unsigned, unsigned>		llvm::SmallDenseMap<unsigned, unsigned>
deduplicateOutputOperands(GenericOp genericOp,		deduplicateOutputOperands(GenericOp genericOp,
SmallVector<OpOperand *> &droppedOpOperands,		SmallVector<OpOperand *> &droppedOpOperands,
SmallVector<Value> &newOutputOperands,		SmallVector<Value> &newOutputOperands,
SmallVector<AffineMap> &newIndexingMaps) const {		SmallVector<AffineMap> &newIndexingMaps) const {
llvm::SmallDenseMap<unsigned, unsigned> origToNewPos;		llvm::SmallDenseMap<unsigned, unsigned> origToNewPos;
llvm::SmallDenseMap<std::tuple<Value, AffineMap, Value>, unsigned>		llvm::SmallDenseMap<std::tuple<Value, AffineMap, Value>, unsigned>
dedupedOutpts;		dedupedOutpts;
// If the op doesnt have tensor semantics, keep all the outputs as		// If the op doesnt have tensor semantics, keep all the outputs as
// preserved.		// preserved.
if (!genericOp.hasTensorSemantics()) {		if (!genericOp.hasTensorSemantics()) {
for (const auto &outputOpOperand :		for (const auto &en : llvm::enumerate(genericOp.getOutputOperands())) {
llvm::enumerate(genericOp.getOutputOperands())) {		origToNewPos[en.index()] = newOutputOperands.size();
origToNewPos[outputOpOperand.index()] = newOutputOperands.size();		newOutputOperands.push_back(en.value()->get());
newOutputOperands.push_back(outputOpOperand.value()->get());		newIndexingMaps.push_back(genericOp.getMatchingIndexingMap(en.value()));
newIndexingMaps.push_back(
genericOp.getMatchingIndexingMap(outputOpOperand.value()));
}		}
return origToNewPos;		return origToNewPos;
}		}
// Output argument can be dropped if the result has		// Output argument can be dropped if the result has
// - no users, and		// - no users, and
// - it is not used in the payload, and		// - it is not used in the payload, and
// - the corresponding indexing maps are not needed for loop bound		// - the corresponding indexing maps are not needed for loop bound
// computation.		// computation.
▲ Show 20 Lines • Show All 299 Lines • ▼ Show 20 Lines	ParseResult MapOp::parse(OpAsmParser &parser, OperationState &result) {
Region *body = result.addRegion();		Region *body = result.addRegion();
if (parser.parseRegion(*body, regionArgs))		if (parser.parseRegion(*body, regionArgs))
return failure();		return failure();

return success();		return success();
}		}

void MapOp::print(OpAsmPrinter &p) {		void MapOp::print(OpAsmPrinter &p) {
printCommonStructuredOpParts(p, getInputs(), getOutputs());		printCommonStructuredOpParts(p, SmallVector<Value>(getInputOperands()),
		SmallVector<Value>(getOutputOperands()));
p.printOptionalAttrDict((*this)->getAttrs());		p.printOptionalAttrDict((*this)->getAttrs());

p << "(";		p << "(";
llvm::interleaveComma(getMapper().getArguments(), p,		llvm::interleaveComma(getMapper().getArguments(), p,
[&](auto arg) { p.printRegionArgument(arg); });		[&](auto arg) { p.printRegionArgument(arg); });
p << ") ";		p << ") ";

p.printRegion(getMapper(), /printEntryBlockArgs=/false);		p.printRegion(getMapper(), /printEntryBlockArgs=/false);
Show All 16 Lines	for (const auto &[bbArgType, inputArg] :
if (bbArgType != inputElemType) {		if (bbArgType != inputElemType) {
return emitOpError() << "expected element type of input " << inputElemType		return emitOpError() << "expected element type of input " << inputElemType
<< " to match bbArg type " << bbArgType;		<< " to match bbArg type " << bbArgType;
}		}
}		}

// The shape of each input must match the shape of the output.		// The shape of each input must match the shape of the output.
auto outputShape =		auto outputShape =
getOutputs().front().getType().cast<ShapedType>().getShape();		getOutputOperand(0)->get().getType().cast<ShapedType>().getShape();
for (Type inputArgType : TypeRange{getInputs()}) {		for (Type inputArgType : TypeRange{getInputs()}) {
auto inputElemShape = inputArgType.cast<ShapedType>().getShape();		auto inputElemShape = inputArgType.cast<ShapedType>().getShape();
if (inputElemShape != outputShape) {		if (inputElemShape != outputShape) {
return emitOpError() << "expected shape of input (" << inputElemShape		return emitOpError() << "expected shape of input (" << inputElemShape
<< ") to match shape of output (" << outputShape		<< ") to match shape of output (" << outputShape
<< ")";		<< ")";
}		}
}		}
Show All 12 Lines	ArrayAttr MapOp::getIndexingMaps() {
int64_t numIndexingMaps = getOperands().size();		int64_t numIndexingMaps = getOperands().size();
return builder.getAffineMapArrayAttr(SmallVector<AffineMap>(		return builder.getAffineMapArrayAttr(SmallVector<AffineMap>(
numIndexingMaps, builder.getMultiDimIdentityMap(rank)));		numIndexingMaps, builder.getMultiDimIdentityMap(rank)));
}		}

void MapOp::getEffects(		void MapOp::getEffects(
SmallVectorImpl<SideEffects::EffectInstance<MemoryEffects::Effect>>		SmallVectorImpl<SideEffects::EffectInstance<MemoryEffects::Effect>>
&effects) {		&effects) {
SmallVector<Value> inputBuffers = getInputBufferOperands();		getGenericEffectsImpl(effects, getOperation()->getResults(),
SmallVector<Value> outputBuffers = getOutputBufferOperands();		getInputOperands(), getOutputOperands());
getGenericEffectsImpl(effects, getOperation()->getResults(), inputBuffers,
outputBuffers);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ReduceOp		// ReduceOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

void ReduceOp::getAsmBlockArgumentNames(Region &region,		void ReduceOp::getAsmBlockArgumentNames(Region &region,
OpAsmSetValueNameFn setNameFn) {		OpAsmSetValueNameFn setNameFn) {
Show All 29 Lines	ArrayAttr ReduceOp::getIndexingMaps() {
for (int64_t i = 0, e = getNumOutputs(); i < e; ++i)		for (int64_t i = 0, e = getNumOutputs(); i < e; ++i)
affineMaps.push_back(resultMap);		affineMaps.push_back(resultMap);
return Builder(getContext()).getAffineMapArrayAttr(affineMaps);		return Builder(getContext()).getAffineMapArrayAttr(affineMaps);
}		}

void ReduceOp::getEffects(		void ReduceOp::getEffects(
SmallVectorImpl<SideEffects::EffectInstance<MemoryEffects::Effect>>		SmallVectorImpl<SideEffects::EffectInstance<MemoryEffects::Effect>>
&effects) {		&effects) {
SmallVector<Value> inputBuffers = getInputBufferOperands();		getGenericEffectsImpl(effects, getOperation()->getResults(),
SmallVector<Value> outputBuffers = getOutputBufferOperands();		getInputOperands(), getOutputOperands());
getGenericEffectsImpl(effects, getOperation()->getResults(), inputBuffers,
outputBuffers);
}		}

static ParseResult parseDenseI64ArrayAttr(OpAsmParser &parser,		static ParseResult parseDenseI64ArrayAttr(OpAsmParser &parser,
NamedAttrList &attributes,		NamedAttrList &attributes,
StringRef attributeName) {		StringRef attributeName) {
if (parser.parseKeyword(attributeName) \|\| parser.parseEqual())		if (parser.parseKeyword(attributeName) \|\| parser.parseEqual())
return failure();		return failure();

Show All 22 Lines
}		}

static void printDenseI64ArrayAttr(OpAsmPrinter &p, StringRef attributeName,		static void printDenseI64ArrayAttr(OpAsmPrinter &p, StringRef attributeName,
ArrayRef<int64_t> attributeValue) {		ArrayRef<int64_t> attributeValue) {
p << " " << attributeName << " = [" << attributeValue << "] ";		p << " " << attributeName << " = [" << attributeValue << "] ";
}		}

void ReduceOp::print(OpAsmPrinter &p) {		void ReduceOp::print(OpAsmPrinter &p) {
printCommonStructuredOpParts(p, getInputs(), getOutputs());		printCommonStructuredOpParts(p, SmallVector<Value>(getInputOperands()),
		SmallVector<Value>(getOutputOperands()));
printDenseI64ArrayAttr(p, getDimensionsAttrName(), getDimensions());		printDenseI64ArrayAttr(p, getDimensionsAttrName(), getDimensions());
p.printOptionalAttrDict((*this)->getAttrs(), {getDimensionsAttrName()});		p.printOptionalAttrDict((*this)->getAttrs(), {getDimensionsAttrName()});

p << "(";		p << "(";
llvm::interleaveComma(getCombiner().getArguments(), p,		llvm::interleaveComma(getCombiner().getArguments(), p,
[&](auto arg) { p.printRegionArgument(arg); });		[&](auto arg) { p.printRegionArgument(arg); });
p << ") ";		p << ") ";

▲ Show 20 Lines • Show All 67 Lines • ▼ Show 20 Lines	for (auto [input, bbArg] : llvm::zip(getInputs(), block->getArguments())) {
if (inputElementType != bbArg.getType())		if (inputElementType != bbArg.getType())
return emitOpError()		return emitOpError()
<< "input element type " << inputElementType		<< "input element type " << inputElementType
<< " does not match corresponding block argument type "		<< " does not match corresponding block argument type "
<< bbArg.getType();		<< bbArg.getType();
}		}

// Check that the last block arguments match the element type of the outputs.		// Check that the last block arguments match the element type of the outputs.
for (auto [output, bbArg] : llvm::zip(		for (auto [output, bbArg] :
getOutputs(), block->getArguments().take_back(getNumOutputs()))) {		llvm::zip(getOutputOperands(),
		block->getArguments().take_back(getNumOutputs()))) {
auto outputElementType =		auto outputElementType =
output.getType().cast<ShapedType>().getElementType();		output->get().getType().cast<ShapedType>().getElementType();
if (outputElementType != bbArg.getType())		if (outputElementType != bbArg.getType())
return emitOpError()		return emitOpError()
<< "output element type " << outputElementType		<< "output element type " << outputElementType
<< " does not match corresponding block argument type "		<< " does not match corresponding block argument type "
<< bbArg.getType();		<< bbArg.getType();
}		}
return success();		return success();
}		}
▲ Show 20 Lines • Show All 147 Lines • ▼ Show 20 Lines
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

namespace {		namespace {
struct EraseDeadLinalgOp : public OpInterfaceRewritePattern<LinalgOp> {		struct EraseDeadLinalgOp : public OpInterfaceRewritePattern<LinalgOp> {
using OpInterfaceRewritePattern<LinalgOp>::OpInterfaceRewritePattern;		using OpInterfaceRewritePattern<LinalgOp>::OpInterfaceRewritePattern;

LogicalResult matchAndRewrite(LinalgOp op,		LogicalResult matchAndRewrite(LinalgOp op,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
for (OpOperand *opOperand : op.getInputAndOutputOperands()) {		for (OpOperand &opOperand : op->getOpOperands()) {
// Linalg "inputs" may be either tensor or memref type.		// Linalg "inputs" may be either tensor or memref type.
// tensor<0xelt_type> is a convention that may not always mean		// tensor<0xelt_type> is a convention that may not always mean
// "0 iterations". Only erase in cases we see memref<...x0x...>.		// "0 iterations". Only erase in cases we see memref<...x0x...>.
auto mt = opOperand->get().getType().dyn_cast<MemRefType>();		auto mt = opOperand.get().getType().dyn_cast<MemRefType>();
if (!mt)		if (!mt)
continue;		continue;
if (llvm::is_contained(op.getShape(opOperand), 0)) {		if (llvm::is_contained(op.getShape(&opOperand), 0)) {
rewriter.eraseOp(op);		rewriter.eraseOp(op);
return success();		return success();
}		}
}		}
return failure();		return failure();
}		}
};		};

struct FoldTensorCastProducerOp : public OpInterfaceRewritePattern<LinalgOp> {		struct FoldTensorCastProducerOp : public OpInterfaceRewritePattern<LinalgOp> {
using OpInterfaceRewritePattern<LinalgOp>::OpInterfaceRewritePattern;		using OpInterfaceRewritePattern<LinalgOp>::OpInterfaceRewritePattern;

LogicalResult matchAndRewrite(LinalgOp op,		LogicalResult matchAndRewrite(LinalgOp op,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
// If no operand comes from a tensor::CastOp and can be folded then fail.		// If no operand comes from a tensor::CastOp and can be folded then fail.
bool hasTensorCastOperand =		bool hasTensorCastOperand =
llvm::any_of(op.getInputAndOutputOperands(), [&](OpOperand *opOperand) {		llvm::any_of(op->getOpOperands(), [&](OpOperand &opOperand) {
if (opOperand->get().isa<BlockArgument>())		if (opOperand.get().isa<BlockArgument>())
return false;		return false;
auto castOp = opOperand->get().getDefiningOp<tensor::CastOp>();		auto castOp = opOperand.get().getDefiningOp<tensor::CastOp>();
return castOp && canFoldIntoConsumerOp(castOp);		return castOp && canFoldIntoConsumerOp(castOp);
});		});
if (!hasTensorCastOperand)		if (!hasTensorCastOperand)
return failure();		return failure();

SmallVector<Type, 4> newResultTypes;		SmallVector<Type, 4> newResultTypes;
newResultTypes.reserve(op->getNumResults());		newResultTypes.reserve(op->getNumResults());
SmallVector<Value, 4> newOperands;		SmallVector<Value, 4> newOperands;
newOperands.reserve(op->getNumOperands());		newOperands.reserve(op->getNumOperands());
// Inputs may fold.		// Inputs may fold.
for (OpOperand *opOperand : op.getInputOperands()) {		for (auto *input : op.getInputOperands()) {
auto tensorCastOp = opOperand->get().getDefiningOp<tensor::CastOp>();		auto tensorCastOp = input->get().getDefiningOp<tensor::CastOp>();
newOperands.push_back(canFoldIntoConsumerOp(tensorCastOp)		newOperands.push_back(canFoldIntoConsumerOp(tensorCastOp)
? tensorCastOp.getSource()		? tensorCastOp.getSource()
: opOperand->get());		: input->get());
}		}
// Init tensors may fold, in which case the resultType must also change.		// Init tensors may fold, in which case the resultType must also change.
for (OpOperand *opOperand : op.getOutputOperands()) {		for (auto *output : op.getOutputOperands()) {
auto tensorCastOp = opOperand->get().getDefiningOp<tensor::CastOp>();		auto tensorCastOp = output->get().getDefiningOp<tensor::CastOp>();
bool fold = canFoldIntoConsumerOp(tensorCastOp);		bool fold = canFoldIntoConsumerOp(tensorCastOp);
newOperands.push_back(fold ? tensorCastOp.getOperand()		newOperands.push_back(fold ? tensorCastOp.getOperand() : output->get());
: opOperand->get());
newResultTypes.push_back(newOperands.back().getType());		newResultTypes.push_back(newOperands.back().getType());
}		}
// Clone op.		// Clone op.
Operation *newOp =		Operation *newOp =
op.clone(rewriter, op->getLoc(), newResultTypes, newOperands);		op.clone(rewriter, op->getLoc(), newResultTypes, newOperands);
SmallVector<Value, 4> replacements;		SmallVector<Value, 4> replacements;
replacements.reserve(newOp->getNumResults());		replacements.reserve(newOp->getNumResults());
for (auto result : llvm::zip(op->getResults(), newOp->getResults())) {		for (auto result : llvm::zip(op->getResults(), newOp->getResults())) {
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	LogicalResult matchAndRewrite(tensor::CastOp castOp,
// Replace the `outs` for the result with a `tensor.cast`. This cast is now		// Replace the `outs` for the result with a `tensor.cast`. This cast is now
// going from a more dynamic shape to a less dynamic shape. If the producer		// going from a more dynamic shape to a less dynamic shape. If the producer
// for this cast, i.e. producer of the out operand, is also an operation		// for this cast, i.e. producer of the out operand, is also an operation
// that folds with tensor.cast consumer (like this pattern), the cast will		// that folds with tensor.cast consumer (like this pattern), the cast will
// continue to propagate as far up the stack as it can go.		// continue to propagate as far up the stack as it can go.
OpOperand *outOperand = linalgOp.getOutputOperand(resultNumber);		OpOperand *outOperand = linalgOp.getOutputOperand(resultNumber);
Value newOperand =		Value newOperand =
rewriter.create<tensor::CastOp>(loc, resultType, outOperand->get());		rewriter.create<tensor::CastOp>(loc, resultType, outOperand->get());
SmallVector<Value> newOperands = linalgOp.getInputOperands();		SmallVector<Value> newOperands{linalgOp.getInputOperands()};
SmallVector<Value> outputOperands = linalgOp.getOutputOperands();		SmallVector<Value> outputOperands{linalgOp.getOutputOperands()};
outputOperands[resultNumber] = newOperand;		outputOperands[resultNumber] = newOperand;
newOperands.append(outputOperands.begin(), outputOperands.end());		newOperands.append(outputOperands.begin(), outputOperands.end());

SmallVector<Type> resultTypes(linalgOp->result_type_begin(),		SmallVector<Type> resultTypes(linalgOp->result_type_begin(),
linalgOp->result_type_end());		linalgOp->result_type_end());
resultTypes[resultNumber] = resultType;		resultTypes[resultNumber] = resultType;
Operation *newOp = linalgOp.clone(rewriter, loc, resultTypes, newOperands);		Operation *newOp = linalgOp.clone(rewriter, loc, resultTypes, newOperands);

// Create a tensor.cast operation back to the original type.		// Create a tensor.cast operation back to the original type.
Value castBack = rewriter.create<tensor::CastOp>(		Value castBack = rewriter.create<tensor::CastOp>(
loc, resultValue.getType(), newOp->getResult(resultNumber));		loc, resultValue.getType(), newOp->getResult(resultNumber));

SmallVector<Value> results(newOp->result_begin(), newOp->result_end());		SmallVector<Value> results(newOp->result_begin(), newOp->result_end());
results[resultNumber] = castBack;		results[resultNumber] = castBack;
rewriter.replaceOp(linalgOp, results);		rewriter.replaceOp(linalgOp, results);
rewriter.replaceOp(castOp, newOp->getResult(resultNumber));		rewriter.replaceOp(castOp, newOp->getResult(resultNumber));
return success();		return success();
}		}
};		};

/// For each of the operand in `operands` this function maps the static sizes of		/// For each of the operand in `operands` this function maps the static sizes of
/// dimensions to their affine dim expressions.		/// dimensions to their affine dim expressions.
static void populateMap(LinalgOp linalgOp, ArrayRef<OpOperand *> operands,		static void populateMap(LinalgOp linalgOp, MutableArrayRef<OpOperand> operands,
llvm::DenseMap<AffineExpr, int64_t> &affineExprToSize) {		llvm::DenseMap<AffineExpr, int64_t> &affineExprToSize) {
for (OpOperand *opOperand : operands) {		for (OpOperand &opOperand : operands) {
if (linalgOp.isScalar(opOperand))		if (linalgOp.isScalar(&opOperand))
continue;		continue;
Value src = opOperand->get();		Value src = opOperand.get();
auto sourceType = src.getType().cast<RankedTensorType>();		auto sourceType = src.getType().cast<RankedTensorType>();
auto sourceMap = linalgOp.getMatchingIndexingMap(opOperand);		auto sourceMap = linalgOp.getMatchingIndexingMap(&opOperand);

// Get the `sourceShape` of the `sourceType`. If the operand is a result of		// Get the `sourceShape` of the `sourceType`. If the operand is a result of
// `tensor.cast` operation and source of the cast operation has a static		// `tensor.cast` operation and source of the cast operation has a static
// shape, then assign it to the `sourceShape`.		// shape, then assign it to the `sourceShape`.
auto *parentOp = src.getDefiningOp();		auto *parentOp = src.getDefiningOp();
ArrayRef<int64_t> sourceShape = sourceType.getShape();		ArrayRef<int64_t> sourceShape = sourceType.getShape();
if (parentOp) {		if (parentOp) {
if (auto castOp = dyn_cast<tensor::CastOp>(parentOp)) {		if (auto castOp = dyn_cast<tensor::CastOp>(parentOp)) {
Show All 26 Lines	static void createNewOperandWithStaticSizes(
SmallVector<Value> &newOperands, SmallVector<Type> &resultTypes,		SmallVector<Value> &newOperands, SmallVector<Type> &resultTypes,
bool &changeNeeded) {		bool &changeNeeded) {
Value src = opOperand->get();		Value src = opOperand->get();
newOperands.push_back(src);		newOperands.push_back(src);
if (linalgOp.isScalar(opOperand))		if (linalgOp.isScalar(opOperand))
return;		return;
auto sourceType = src.getType().cast<RankedTensorType>();		auto sourceType = src.getType().cast<RankedTensorType>();
Type resultType = sourceType;		Type resultType = sourceType;
if (sourceType.hasStaticShape() && linalgOp.isOutputTensor(opOperand)) {		if (sourceType.hasStaticShape() && linalgOp.isOutput(opOperand)) {
resultTypes.push_back(resultType);		resultTypes.push_back(resultType);
return;		return;
}		}
ArrayRef<int64_t> sourceShape = sourceType.getShape();		ArrayRef<int64_t> sourceShape = sourceType.getShape();
AffineMap sourceMap = linalgOp.getMatchingIndexingMap(opOperand);		AffineMap sourceMap = linalgOp.getMatchingIndexingMap(opOperand);
SmallVector<int64_t> newShape;		SmallVector<int64_t> newShape;
// If operand is updated with new shape, `newOperandNeeded` will be		// If operand is updated with new shape, `newOperandNeeded` will be
// true.		// true.
Show All 16 Lines	static void createNewOperandWithStaticSizes(
if (newOperandNeeded) {		if (newOperandNeeded) {
changeNeeded = true;		changeNeeded = true;
// Get the new operand value given its size and element type by		// Get the new operand value given its size and element type by
// casting it.		// casting it.
Value newOperand = rewriter.create<tensor::CastOp>(loc, resultType, src);		Value newOperand = rewriter.create<tensor::CastOp>(loc, resultType, src);
unsigned index = opOperand->getOperandNumber();		unsigned index = opOperand->getOperandNumber();
newOperands[index] = newOperand;		newOperands[index] = newOperand;
}		}
if (linalgOp.isOutputTensor(opOperand))		if (linalgOp.isOutput(opOperand))
resultTypes.push_back(resultType);		resultTypes.push_back(resultType);
}		}

/// Static shapes for the operands can be inferred if any one of the operands		/// Static shapes for the operands can be inferred if any one of the operands
/// have a static shape. This can be done by referring to the affine dim		/// have a static shape. This can be done by referring to the affine dim
/// expressions for the operand.		/// expressions for the operand.
struct InferStaticShapeOfOperands : public OpInterfaceRewritePattern<LinalgOp> {		struct InferStaticShapeOfOperands : public OpInterfaceRewritePattern<LinalgOp> {
using OpInterfaceRewritePattern<LinalgOp>::OpInterfaceRewritePattern;		using OpInterfaceRewritePattern<LinalgOp>::OpInterfaceRewritePattern;
Show All 10 Lines	if (llvm::any_of(linalgOp.getIndexingMapsArray(), [](AffineMap map) {
return failure();		return failure();

// Maps affine dim expressions to the static size of that dimension.		// Maps affine dim expressions to the static size of that dimension.
llvm::DenseMap<AffineExpr, int64_t> affineExprToSize;		llvm::DenseMap<AffineExpr, int64_t> affineExprToSize;
Location loc = linalgOp.getLoc();		Location loc = linalgOp.getLoc();

// For each of the affine dim expression, check if the size is known. If		// For each of the affine dim expression, check if the size is known. If
// known add that in the map.		// known add that in the map.
populateMap(linalgOp, linalgOp.getInputAndOutputOperands(),		populateMap(linalgOp, linalgOp->getOpOperands(), affineExprToSize);
affineExprToSize);

SmallVector<Value> newOperands;		SmallVector<Value> newOperands;
SmallVector<Type> resultTypes;		SmallVector<Type> resultTypes;

// `changeNeeded` is `false` if the operands of `linalgOp` require no		// `changeNeeded` is `false` if the operands of `linalgOp` require no
// change in their types.		// change in their types.
bool changeNeeded = false;		bool changeNeeded = false;
newOperands.reserve(linalgOp.getNumInputsAndOutputs());		newOperands.reserve(linalgOp->getNumOperands());
resultTypes.reserve(linalgOp.getNumOutputs());		resultTypes.reserve(linalgOp.getNumOutputs());

// Iterate over all the operands and update the static sizes.		// Iterate over all the operands and update the static sizes.
for (OpOperand *opOperand : linalgOp.getInputAndOutputOperands()) {		for (OpOperand &opOperand : linalgOp->getOpOperands()) {
createNewOperandWithStaticSizes(loc, rewriter, opOperand,		createNewOperandWithStaticSizes(loc, rewriter, &opOperand,
affineExprToSize, linalgOp, newOperands,		affineExprToSize, linalgOp, newOperands,
resultTypes, changeNeeded);		resultTypes, changeNeeded);
}		}

// If the generic op has all the required static information, no		// If the generic op has all the required static information, no
// canonicalization needed.		// canonicalization needed.
if (!changeNeeded)		if (!changeNeeded)
return failure();		return failure();
▲ Show 20 Lines • Show All 42 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/BubbleUpExtractSlice.cpp

Show First 20 Lines • Show All 106 Lines • ▼ Show 20 Lines	SmallVector<OpFoldResult> tileOffsets(sizeBounds.size(),
rewriter.getIndexAttr(0));		rewriter.getIndexAttr(0));
SmallVector<OpFoldResult> tileSizes = sizeBounds;		SmallVector<OpFoldResult> tileSizes = sizeBounds;
for (auto const &result : enumerate(indexingMap.getResults())) {		for (auto const &result : enumerate(indexingMap.getResults())) {
unsigned position = result.value().cast<AffineDimExpr>().getPosition();		unsigned position = result.value().cast<AffineDimExpr>().getPosition();
tileOffsets[position] = sliceOp.getMixedOffsets()[result.index()];		tileOffsets[position] = sliceOp.getMixedOffsets()[result.index()];
tileSizes[position] = sliceOp.getMixedSizes()[result.index()];		tileSizes[position] = sliceOp.getMixedSizes()[result.index()];
}		}

SmallVector<Value> valuesToTile = linalgOp.getInputAndOutputOperands();		SmallVector<Value> valuesToTile = linalgOp->getOperands();
SmallVector<Value> tiledOperands =		SmallVector<Value> tiledOperands =
makeTiledShapes(rewriter, linalgLoc, linalgOp, valuesToTile,		makeTiledShapes(rewriter, linalgLoc, linalgOp, valuesToTile,
tileOffsets, tileSizes, sizeBounds,		tileOffsets, tileSizes, sizeBounds,
/omitPartialTileCheck=/true);		/omitPartialTileCheck=/true);

SmallVector<Type, 4> resultTensorTypes;		SmallVector<Type, 4> resultTensorTypes;
for (OpOperand *opOperand : linalgOp.getOutputTensorOperands())		for (OpOperand *opOperand : linalgOp.getOutputOperands())
resultTensorTypes.push_back(		resultTensorTypes.push_back(
tiledOperands[opOperand->getOperandNumber()].getType());		tiledOperands[opOperand->getOperandNumber()].getType());

Operation *newOp =		Operation *newOp =
linalgOp.clone(rewriter, linalgLoc, resultTensorTypes, tiledOperands);		linalgOp.clone(rewriter, linalgLoc, resultTensorTypes, tiledOperands);
rewriter.replaceOp(sliceOp, newOp->getResults());		rewriter.replaceOp(sliceOp, newOp->getResults());
return success();		return success();
}		}
};		};
} // namespace		} // namespace

void mlir::linalg::populateBubbleUpExtractSliceOpPatterns(		void mlir::linalg::populateBubbleUpExtractSliceOpPatterns(
RewritePatternSet &patterns) {		RewritePatternSet &patterns) {
auto *context = patterns.getContext();		auto *context = patterns.getContext();
patterns.add<BubbleUpExtractSliceOpPattern>(context);		patterns.add<BubbleUpExtractSliceOpPattern>(context);
}		}

mlir/lib/Dialect/Linalg/Transforms/BufferizableOpInterfaceImpl.cpp

Show First 20 Lines • Show All 112 Lines • ▼ Show 20 Lines	getAliasingOpOperand(Operation *op, OpResult opResult,
return {genericOp.getOutputOperand(opResult.getResultNumber())};		return {genericOp.getOutputOperand(opResult.getResultNumber())};
}		}

SmallVector<OpResult> getAliasingOpResult(Operation *op, OpOperand &opOperand,		SmallVector<OpResult> getAliasingOpResult(Operation *op, OpOperand &opOperand,
const AnalysisState &state) const {		const AnalysisState &state) const {
auto genericOp = cast<linalg::DestinationStyleOpInterface>(op);		auto genericOp = cast<linalg::DestinationStyleOpInterface>(op);

// The i-th "out" tensor may alias with the i-th OpResult.		// The i-th "out" tensor may alias with the i-th OpResult.
if (genericOp.isOutputTensor(&opOperand))		if (genericOp.isOutput(&opOperand))
return {genericOp.getTiedOpResult(&opOperand)};		return {genericOp.getTiedOpResult(&opOperand)};
return {};		return {};
}		}

BufferRelation bufferRelation(Operation *op, OpResult opResult,		BufferRelation bufferRelation(Operation *op, OpResult opResult,
const AnalysisState &state) const {		const AnalysisState &state) const {
return BufferRelation::Equivalent;		return BufferRelation::Equivalent;
}		}
Show All 30 Lines

mlir/lib/Dialect/Linalg/Transforms/ConstantFold.cpp

Show First 20 Lines • Show All 62 Lines • ▼ Show 20 Lines	if (genericOp.getNumOutputs() != 1)
return failure();		return failure();

auto outputType = genericOp.getResultTypes().front().dyn_cast<ShapedType>();		auto outputType = genericOp.getResultTypes().front().dyn_cast<ShapedType>();
// Require the output types to be static given that we are generating		// Require the output types to be static given that we are generating
// constants.		// constants.
if (!outputType \|\| !outputType.hasStaticShape())		if (!outputType \|\| !outputType.hasStaticShape())
return failure();		return failure();

if (!llvm::all_of(genericOp.getInputOperands(), [](OpOperand *operand) {		if (!llvm::all_of(genericOp.getInputs(), [](Value input) {
return operand->get().getType().isa<ShapedType>();		return input.getType().isa<ShapedType>();
}))		}))
return failure();		return failure();

// Make sure all element types are the same.		// Make sure all element types are the same.
auto getOperandElementType = [](OpOperand *operand) {		auto getOperandElementType = [](Value value) {
return operand->get().getType().cast<ShapedType>().getElementType();		return value.getType().cast<ShapedType>().getElementType();
};		};
if (!llvm::all_equal(llvm::map_range(genericOp.getInputAndOutputOperands(),		if (!llvm::all_equal(
getOperandElementType)))		llvm::map_range(genericOp->getOperands(), getOperandElementType)))
return failure();		return failure();

// We can only handle the case where we have int/float elements.		// We can only handle the case where we have int/float elements.
auto elementType = outputType.getElementType();		auto elementType = outputType.getElementType();
if (!elementType.isIntOrFloat())		if (!elementType.isIntOrFloat())
return failure();		return failure();

// Require all indexing maps to be permutations for now. This is common and		// Require all indexing maps to be permutations for now. This is common and
Show All 19 Lines	LogicalResult matchAndRewrite(GenericOp genericOp,
RegionComputationFn computeFn =		RegionComputationFn computeFn =
static_cast<const ConcreteType *>(this)->getRegionComputeFn(genericOp);		static_cast<const ConcreteType *>(this)->getRegionComputeFn(genericOp);
if (!computeFn)		if (!computeFn)
return failure();		return failure();

// All inputs should be constants.		// All inputs should be constants.
int numInputs = genericOp.getNumInputs();		int numInputs = genericOp.getNumInputs();
SmallVector<DenseIntOrFPElementsAttr> inputValues(numInputs);		SmallVector<DenseIntOrFPElementsAttr> inputValues(numInputs);
for (const auto &operand : llvm::enumerate(genericOp.getInputOperands())) {		for (const auto &en : llvm::enumerate(genericOp.getInputOperands())) {
if (!matchPattern(operand.value()->get(),		if (!matchPattern(en.value()->get(),
m_Constant(&inputValues[operand.index()])))		m_Constant(&inputValues[en.index()])))
return failure();		return failure();
}		}

// Identified this as a potential candidate for folding. Now check the		// Identified this as a potential candidate for folding. Now check the
// policy to see whether we are allowed to proceed.		// policy to see whether we are allowed to proceed.
for (auto *operand : genericOp.getInputOperands()) {		for (OpOperand *operand : genericOp.getInputOperands()) {
if (!controlFn(operand))		if (!controlFn(operand))
return failure();		return failure();
}		}

auto linalgOp = cast<LinalgOp>(genericOp.getOperation());		auto linalgOp = cast<LinalgOp>(genericOp.getOperation());
SmallVector<int64_t, 4> loopBounds = linalgOp.computeStaticLoopSizes();		SmallVector<int64_t, 4> loopBounds = linalgOp.computeStaticLoopSizes();
int64_t numElements = outputType.getNumElements();		int64_t numElements = outputType.getNumElements();

Show All 32 Lines	LogicalResult matchAndRewrite(GenericOp genericOp,
SmallVector<uint64_t> srcLinearIndices(numInputs, 0);		SmallVector<uint64_t> srcLinearIndices(numInputs, 0);
uint64_t dstLinearIndex = 0;		uint64_t dstLinearIndex = 0;

// Allocate spaces for compute function inputs. Initial values do not matter		// Allocate spaces for compute function inputs. Initial values do not matter
// here as they will be overwritten later.		// here as they will be overwritten later.
APIntOrFloatArray computeFnInputs;		APIntOrFloatArray computeFnInputs;

auto inputShapes = llvm::to_vector<4>(		auto inputShapes = llvm::to_vector<4>(
llvm::map_range(genericOp.getInputOperands(), [](OpOperand *operand) {		llvm::map_range(genericOp.getInputs(), [](Value value) {
return operand->get().getType().cast<ShapedType>().getShape();		return value.getType().cast<ShapedType>().getShape();
}));		}));

// Given a `linearIndex`, remap it to a linear index to access linalg op		// Given a `linearIndex`, remap it to a linear index to access linalg op
// inputs/ouputs. This mutates `indices`, `srcIndices`, `dstIndices`,		// inputs/ouputs. This mutates `indices`, `srcIndices`, `dstIndices`,
// `srcLinearIndices`, `dstLinearIndex` in place.		// `srcLinearIndices`, `dstLinearIndex` in place.
auto computeRemappedLinearIndex = [&](int linearIndex) {		auto computeRemappedLinearIndex = [&](int linearIndex) {
int totalCount = linearIndex;		int totalCount = linearIndex;
for (int dim = loopBounds.size() - 1; dim >= 0; --dim) {		for (int dim = loopBounds.size() - 1; dim >= 0; --dim) {
▲ Show 20 Lines • Show All 123 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/DecomposeLinalgOps.cpp

Show First 20 Lines • Show All 188 Lines • ▼ Show 20 Lines	for (auto scalarOpResult : peeledScalarOperation->getResults()) {
Value emptyTensor =		Value emptyTensor =
rewriter.create<tensor::EmptyOp>(loc, domain, scalarOpResult.getType());		rewriter.create<tensor::EmptyOp>(loc, domain, scalarOpResult.getType());
newInitValues.push_back(emptyTensor);		newInitValues.push_back(emptyTensor);
newResultTypes.push_back(emptyTensor.getType());		newResultTypes.push_back(emptyTensor.getType());
peeledGenericOpIndexingMaps.push_back(indexingMap);		peeledGenericOpIndexingMaps.push_back(indexingMap);
}		}

/// Create the peeled generic op with an empty body.		/// Create the peeled generic op with an empty body.
SmallVector<Value> outsOperands = genericOp.getOutputOperands();		SmallVector<Value> outsOperands = genericOp.getOutputs();
outsOperands.append(newInitValues.begin(), newInitValues.end());		outsOperands.append(newInitValues.begin(), newInitValues.end());
SmallVector<Type> resultTypes = llvm::to_vector(genericOp.getResultTypes());		SmallVector<Type> resultTypes = llvm::to_vector(genericOp.getResultTypes());
resultTypes.append(newResultTypes.begin(), newResultTypes.end());		resultTypes.append(newResultTypes.begin(), newResultTypes.end());
auto indexingMapAttr =		auto indexingMapAttr =
rewriter.getAffineMapArrayAttr(peeledGenericOpIndexingMaps);		rewriter.getAffineMapArrayAttr(peeledGenericOpIndexingMaps);
return rewriter.create<GenericOp>(		return rewriter.create<GenericOp>(
loc, resultTypes, genericOp.getInputs(), outsOperands, indexingMapAttr,		loc, resultTypes, genericOp.getInputs(), outsOperands, indexingMapAttr,
genericOp.getIteratorTypes(), /doc=/nullptr, /libraryCall=/nullptr,		genericOp.getIteratorTypes(), /doc=/nullptr, /libraryCall=/nullptr,
[](OpBuilder, Location, ValueRange) {});		[](OpBuilder, Location, ValueRange) {});
}		}

GenericOp		GenericOp
DecomposeLinalgOp::createResidualGenericOp(GenericOp genericOp,		DecomposeLinalgOp::createResidualGenericOp(GenericOp genericOp,
GenericOp peeledGenericOp,		GenericOp peeledGenericOp,
PatternRewriter &rewriter) const {		PatternRewriter &rewriter) const {
/// Append all results from the peeledGenericOps as `ins` operand for the		/// Append all results from the peeledGenericOps as `ins` operand for the
/// residual generic op.		/// residual generic op.
SmallVector<Value> residualGenericOpOperands = llvm::to_vector(		SmallVector<Value> residualGenericOpOperands = genericOp.getInputs();
llvm::map_range(genericOp.getInputOperands(),
[](OpOperand *operand) { return operand->get(); }));
unsigned origNumResults = genericOp.getNumResults();		unsigned origNumResults = genericOp.getNumResults();
unsigned peeledGenericOpNumResults = peeledGenericOp.getNumResults();		unsigned peeledGenericOpNumResults = peeledGenericOp.getNumResults();
SmallVector<Value> extraIns;		SmallVector<Value> extraIns;
for (auto resultNum :		for (auto resultNum :
llvm::seq<unsigned>(origNumResults, peeledGenericOpNumResults))		llvm::seq<unsigned>(origNumResults, peeledGenericOpNumResults))
extraIns.push_back(peeledGenericOp->getResult(resultNum));		extraIns.push_back(peeledGenericOp->getResult(resultNum));
residualGenericOpOperands.append(extraIns);		residualGenericOpOperands.append(extraIns);

▲ Show 20 Lines • Show All 157 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/Detensorize.cpp

	Show First 20 Lines • Show All 49 Lines • ▼ Show 20 Lines
	/// Returns true if tensorType can be detensored.			/// Returns true if tensorType can be detensored.
	bool canBeDetensored(TensorType tensorType) {			bool canBeDetensored(TensorType tensorType) {
	return tensorType.hasRank() && tensorType.getRank() == 0;			return tensorType.hasRank() && tensorType.getRank() == 0;
	}			}

	bool shouldBeDetensored(Operation *op, TypeConverter typeConverter) {			bool shouldBeDetensored(Operation *op, TypeConverter typeConverter) {
	GenericOp genericOp = dyn_cast_or_null<GenericOp>(op);			GenericOp genericOp = dyn_cast_or_null<GenericOp>(op);
	return genericOp &&			return genericOp &&
	llvm::all_of(			llvm::all_of(genericOp->getOpOperands(), [&](OpOperand &opOperand) {
	genericOp.getInputAndOutputOperands(), [&](OpOperand *opOperand) {			return !typeConverter.isLegal(opOperand.get().getType());
	return !typeConverter.isLegal(opOperand->get().getType());
	});			});
	}			}

	/// A conversion patttern for detensoring `linalg.generic` ops.			/// A conversion patttern for detensoring `linalg.generic` ops.
	class DetensorizeGenericOp : public OpConversionPattern<GenericOp> {			class DetensorizeGenericOp : public OpConversionPattern<GenericOp> {
	public:			public:
	using OpConversionPattern::OpConversionPattern;			using OpConversionPattern::OpConversionPattern;
	LogicalResult			LogicalResult
	matchAndRewrite(GenericOp op, OpAdaptor adaptor,			matchAndRewrite(GenericOp op, OpAdaptor adaptor,
	▲ Show 20 Lines • Show All 494 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/DropUnitDims.cpp

Show First 20 Lines • Show All 371 Lines • ▼ Show 20 Lines	if (llvm::any_of(genericOp->getOperandTypes(), [](Type type) {
return failure();		return failure();
MLIRContext *context = rewriter.getContext();		MLIRContext *context = rewriter.getContext();
Location loc = genericOp.getLoc();		Location loc = genericOp.getLoc();

SmallVector<AffineMap> newIndexingMaps;		SmallVector<AffineMap> newIndexingMaps;
SmallVector<ArrayAttr> reassociationMaps;		SmallVector<ArrayAttr> reassociationMaps;
SmallVector<Type> newInputOutputTypes;		SmallVector<Type> newInputOutputTypes;
bool doCanonicalization = false;		bool doCanonicalization = false;
for (OpOperand *opOperand : genericOp.getInputAndOutputOperands()) {		for (OpOperand &opOperand : genericOp->getOpOperands()) {
auto replacementInfo = replaceUnitExtents(genericOp, opOperand, context);		auto replacementInfo = replaceUnitExtents(genericOp, &opOperand, context);
if (replacementInfo) {		if (replacementInfo) {
reassociationMaps.push_back(replacementInfo->reassociation);		reassociationMaps.push_back(replacementInfo->reassociation);
newIndexingMaps.push_back(replacementInfo->indexMap);		newIndexingMaps.push_back(replacementInfo->indexMap);
newInputOutputTypes.push_back(replacementInfo->type);		newInputOutputTypes.push_back(replacementInfo->type);
doCanonicalization \|=		doCanonicalization \|=
replacementInfo->type != opOperand->get().getType();		replacementInfo->type != opOperand.get().getType();
} else {		} else {
// If replaceUnitExtents cannot handle this case, maintain the same		// If replaceUnitExtents cannot handle this case, maintain the same
// type, indexing map, and create a set of mappings representing an		// type, indexing map, and create a set of mappings representing an
// identity matrix.		// identity matrix.
newInputOutputTypes.push_back(opOperand->get().getType());		newInputOutputTypes.push_back(opOperand.get().getType());
newIndexingMaps.push_back(genericOp.getMatchingIndexingMap(opOperand));		newIndexingMaps.push_back(genericOp.getMatchingIndexingMap(&opOperand));
int64_t origRank = genericOp.getRank(opOperand);		int64_t origRank = genericOp.getRank(&opOperand);
auto maps = llvm::to_vector<8>(llvm::map_range(		auto maps = llvm::to_vector<8>(llvm::map_range(
llvm::seq<int64_t>(0, origRank), [&](int64_t dim) -> Attribute {		llvm::seq<int64_t>(0, origRank), [&](int64_t dim) -> Attribute {
return AffineMapAttr::get(		return AffineMapAttr::get(
AffineMap::get(origRank, /symbolCount = / 0,		AffineMap::get(origRank, /symbolCount = / 0,
getAffineDimExpr(dim, context), context));		getAffineDimExpr(dim, context), context));
}));		}));
reassociationMaps.push_back(ArrayAttr::get(context, maps));		reassociationMaps.push_back(ArrayAttr::get(context, maps));
}		}
▲ Show 20 Lines • Show All 165 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/ElementwiseOpFusion.cpp

Show First 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	bool mlir::linalg::areElementwiseOpsFusable(OpOperand *fusedOperand) {

// Verify that		// Verify that
// - the producer has all "parallel" iterator type.		// - the producer has all "parallel" iterator type.
if (producer.getNumParallelLoops() != producer.getNumLoops())		if (producer.getNumParallelLoops() != producer.getNumLoops())
return false;		return false;

// Only allow fusing the producer of an input operand for now.		// Only allow fusing the producer of an input operand for now.
// TODO: allow fusing the producer of an output operand.		// TODO: allow fusing the producer of an output operand.
if (!consumer.isInputTensor(fusedOperand))		if (!consumer.isInput(fusedOperand))
return false;		return false;

// Get the consumer index map. The number of results of the consumer index		// Get the consumer index map. The number of results of the consumer index
// map must match the number of loops of the producer.		// map must match the number of loops of the producer.
AffineMap consumerIndexMap = consumer.getMatchingIndexingMap(fusedOperand);		AffineMap consumerIndexMap = consumer.getMatchingIndexingMap(fusedOperand);
if (consumerIndexMap.getNumResults() != producer.getNumLoops())		if (consumerIndexMap.getNumResults() != producer.getNumLoops())
return false;		return false;

▲ Show 20 Lines • Show All 72 Lines • ▼ Show 20 Lines	for (IndexOp indexOp :
llvm::make_early_inc_range(producerBlock.getOps<IndexOp>())) {		llvm::make_early_inc_range(producerBlock.getOps<IndexOp>())) {
Value newIndex = rewriter.create<mlir::AffineApplyOp>(		Value newIndex = rewriter.create<mlir::AffineApplyOp>(
producer.getLoc(),		producer.getLoc(),
consumerToProducerLoopsMap.getSubMap(indexOp.getDim()), fusedIndices);		consumerToProducerLoopsMap.getSubMap(indexOp.getDim()), fusedIndices);
mapper.map(indexOp.getResult(), newIndex);		mapper.map(indexOp.getResult(), newIndex);
}		}
}		}
// TODO: allow fusing the producer of an output operand.		// TODO: allow fusing the producer of an output operand.
assert(consumer.isInputTensor(fusedOperand) &&		assert(consumer.isInput(fusedOperand) &&
"expected producer of input operand");		"expected producer of input operand");
// 3. Consumer input operands up to consumerIdx (exclusive).		// 3. Consumer input operands up to consumerIdx (exclusive).
for (BlockArgument bbArg : consumerBlock.getArguments().take_front(		for (BlockArgument bbArg : consumerBlock.getArguments().take_front(
fusedOperand->getOperandNumber())) // input assumption.		fusedOperand->getOperandNumber())) // input assumption.
mapper.map(bbArg, fusedBlock->addArgument(bbArg.getType(), bbArg.getLoc()));		mapper.map(bbArg, fusedBlock->addArgument(bbArg.getType(), bbArg.getLoc()));

// Replacing consumerIdx requires getting the cloned, yielded, value from		// Replacing consumerIdx requires getting the cloned, yielded, value from
// the (cloned) producer block. This happens in step 9.		// the (cloned) producer block. This happens in step 9.
▲ Show 20 Lines • Show All 71 Lines • ▼ Show 20 Lines
mlir::linalg::fuseElementwiseOps(RewriterBase &rewriter,		mlir::linalg::fuseElementwiseOps(RewriterBase &rewriter,
OpOperand *fusedOperand) {		OpOperand *fusedOperand) {
assert(areElementwiseOpsFusable(fusedOperand) &&		assert(areElementwiseOpsFusable(fusedOperand) &&
"expected elementwise operation pre-conditions to pass");		"expected elementwise operation pre-conditions to pass");
auto producerResult = fusedOperand->get().cast<OpResult>();		auto producerResult = fusedOperand->get().cast<OpResult>();
auto producer = cast<GenericOp>(producerResult.getOwner());		auto producer = cast<GenericOp>(producerResult.getOwner());
auto consumer = cast<GenericOp>(fusedOperand->getOwner());		auto consumer = cast<GenericOp>(fusedOperand->getOwner());
// TODO: allow fusing the producer of an output operand.		// TODO: allow fusing the producer of an output operand.
assert(consumer.isInputTensor(fusedOperand) &&		assert(consumer.isInput(fusedOperand) &&
"expected producer of input operand");		"expected producer of input operand");

// Compute the fused operands list and indexing maps.		// Compute the fused operands list and indexing maps.
SmallVector<Value> fusedInputOperands, fusedOutputOperands;		SmallVector<Value> fusedInputOperands, fusedOutputOperands;
SmallVector<Type> fusedResultTypes;		SmallVector<Type> fusedResultTypes;
SmallVector<AffineMap> fusedIndexMaps;		SmallVector<AffineMap> fusedIndexMaps;
fusedInputOperands.reserve(producer.getNumInputs() + consumer.getNumInputs());		fusedInputOperands.reserve(producer.getNumInputs() + consumer.getNumInputs());
fusedOutputOperands.reserve(producer.getNumOutputs() +		fusedOutputOperands.reserve(producer.getNumOutputs() +
consumer.getNumOutputs());		consumer.getNumOutputs());
fusedResultTypes.reserve(producer.getNumOutputs() + consumer.getNumOutputs());		fusedResultTypes.reserve(producer.getNumOutputs() + consumer.getNumOutputs());
fusedIndexMaps.reserve(producer.getNumInputsAndOutputs() +		fusedIndexMaps.reserve(producer->getNumOperands() +
consumer.getNumInputsAndOutputs());		consumer->getNumOperands());
// In the following, numbering matches that of `generateFusedTensorOpRegion`.		// In the following, numbering matches that of `generateFusedTensorOpRegion`.
// 3. Consumer input operands/maps up to consumerIdx (exclusive).		// 3. Consumer input operands/maps up to consumerIdx (exclusive).
SmallVector<OpOperand *> consumerInputs = consumer.getInputOperands();		auto consumerInputs = consumer.getInputOperands();
SmallVector<OpOperand *>::iterator it =		auto it = llvm::find_if(consumerInputs, [&](OpOperand operand) {
llvm::find(consumerInputs, fusedOperand);		return operand == fusedOperand;
		});
assert(it != consumerInputs.end() && "expected to find the consumer operand");		assert(it != consumerInputs.end() && "expected to find the consumer operand");
for (OpOperand *opOperand : llvm::make_range(consumerInputs.begin(), it)) {		for (OpOperand *opOperand : llvm::make_range(consumerInputs.begin(), it)) {
fusedInputOperands.push_back(opOperand->get());		fusedInputOperands.push_back(opOperand->get());
fusedIndexMaps.push_back(consumer.getMatchingIndexingMap(opOperand));		fusedIndexMaps.push_back(consumer.getMatchingIndexingMap(opOperand));
}		}
// 4. Splice in producer's input operands/maps.		// 4. Splice in producer's input operands/maps.
AffineMap producerResultIndexMap =		AffineMap producerResultIndexMap =
producer.getIndexingMapMatchingResult(producerResult);		producer.getIndexingMapMatchingResult(producerResult);
▲ Show 20 Lines • Show All 72 Lines • ▼ Show 20 Lines	public:
FuseElementwiseOps(MLIRContext *context, ControlFusionFn fun,		FuseElementwiseOps(MLIRContext *context, ControlFusionFn fun,
PatternBenefit benefit = 1)		PatternBenefit benefit = 1)
: OpRewritePattern<GenericOp>(context, benefit),		: OpRewritePattern<GenericOp>(context, benefit),
controlFn(std::move(fun)) {}		controlFn(std::move(fun)) {}

LogicalResult matchAndRewrite(GenericOp genericOp,		LogicalResult matchAndRewrite(GenericOp genericOp,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
// Find the first operand that is defined by another generic op on tensors.		// Find the first operand that is defined by another generic op on tensors.
for (OpOperand *opOperand : genericOp.getInputAndOutputOperands()) {		for (OpOperand &opOperand : genericOp->getOpOperands()) {
if (!areElementwiseOpsFusable(opOperand))		if (!areElementwiseOpsFusable(&opOperand))
continue;		continue;
if (!controlFn(opOperand))		if (!controlFn(&opOperand))
continue;		continue;

FailureOr<Operation *> fusedOp = fuseElementwiseOps(rewriter, opOperand);		FailureOr<Operation *> fusedOp = fuseElementwiseOps(rewriter, &opOperand);
if (succeeded(fusedOp)) {		if (succeeded(fusedOp)) {
auto replacements =		auto replacements =
fusedOp.value()->getResults().take_back(genericOp.getNumResults());		fusedOp.value()->getResults().take_back(genericOp.getNumResults());
rewriter.replaceOp(genericOp, replacements);		rewriter.replaceOp(genericOp, replacements);
return success();		return success();
}		}
}		}
return failure();		return failure();
▲ Show 20 Lines • Show All 331 Lines • ▼ Show 20 Lines	fuseWithReshapeByExpansion(GenericOp genericOp, Operation *reshapeOp,
SmallVector<Value> expandedOpOperands;		SmallVector<Value> expandedOpOperands;
expandedOpOperands.reserve(genericOp.getNumInputs());		expandedOpOperands.reserve(genericOp.getNumInputs());
for (OpOperand *opOperand : genericOp.getInputOperands()) {		for (OpOperand *opOperand : genericOp.getInputOperands()) {
if (opOperand == fusableOpOperand) {		if (opOperand == fusableOpOperand) {
expandedOpOperands.push_back(isExpanding ? expandingReshapeOp.getSrc()		expandedOpOperands.push_back(isExpanding ? expandingReshapeOp.getSrc()
: collapsingReshapeOp.getSrc());		: collapsingReshapeOp.getSrc());
continue;		continue;
}		}
if (genericOp.isInputTensor(opOperand)) {		if (auto opOperandType =
		opOperand->get().getType().dyn_cast<RankedTensorType>()) {
AffineMap indexingMap = genericOp.getMatchingIndexingMap(opOperand);		AffineMap indexingMap = genericOp.getMatchingIndexingMap(opOperand);
auto opOperandType = opOperand->get().getType().cast<RankedTensorType>();
RankedTensorType expandedOperandType =		RankedTensorType expandedOperandType =
getExpandedType(opOperandType, indexingMap, expansionInfo);		getExpandedType(opOperandType, indexingMap, expansionInfo);
if (expandedOperandType != opOperand->get().getType()) {		if (expandedOperandType != opOperand->get().getType()) {
// Reshape the operand to get the right type.		// Reshape the operand to get the right type.
SmallVector<ReassociationIndices> reassociation =		SmallVector<ReassociationIndices> reassociation =
getReassociationForExpansion(indexingMap, expansionInfo);		getReassociationForExpansion(indexingMap, expansionInfo);
if (failed(reshapeLikeShapesAreCompatible(		if (failed(reshapeLikeShapesAreCompatible(
[&](const Twine &msg) {		[&](const Twine &msg) {
▲ Show 20 Lines • Show All 87 Lines • ▼ Show 20 Lines	public:
FoldWithProducerReshapeOpByExpansion(MLIRContext *context,		FoldWithProducerReshapeOpByExpansion(MLIRContext *context,
ControlFusionFn foldReshapes,		ControlFusionFn foldReshapes,
PatternBenefit benefit = 1)		PatternBenefit benefit = 1)
: OpRewritePattern<GenericOp>(context, benefit),		: OpRewritePattern<GenericOp>(context, benefit),
controlFoldingReshapes(std::move(foldReshapes)) {}		controlFoldingReshapes(std::move(foldReshapes)) {}

LogicalResult matchAndRewrite(GenericOp genericOp,		LogicalResult matchAndRewrite(GenericOp genericOp,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
for (OpOperand *opOperand : genericOp.getInputTensorOperands()) {		for (OpOperand *opOperand : genericOp.getInputOperands()) {
tensor::CollapseShapeOp reshapeOp =		tensor::CollapseShapeOp reshapeOp =
opOperand->get().getDefiningOp<tensor::CollapseShapeOp>();		opOperand->get().getDefiningOp<tensor::CollapseShapeOp>();
if (!reshapeOp)		if (!reshapeOp)
continue;		continue;
// Fold only if		// Fold only if
// - The tensor reshape op is folding.		// - The tensor reshape op is folding.
// - All constraints of fusing with reshape by expansion are met.		// - All constraints of fusing with reshape by expansion are met.
if (!isFusableWithReshapeByDimExpansion(genericOp, opOperand) \|\|		if (!isFusableWithReshapeByDimExpansion(genericOp, opOperand) \|\|
▲ Show 20 Lines • Show All 644 Lines • ▼ Show 20 Lines	public:
FoldWithProducerReshapeOpByCollapsing(MLIRContext *context,		FoldWithProducerReshapeOpByCollapsing(MLIRContext *context,
ControlFusionFn foldReshapes,		ControlFusionFn foldReshapes,
PatternBenefit benefit = 1)		PatternBenefit benefit = 1)
: OpRewritePattern<GenericOp>(context, benefit),		: OpRewritePattern<GenericOp>(context, benefit),
controlFoldingReshapes(std::move(foldReshapes)) {}		controlFoldingReshapes(std::move(foldReshapes)) {}

LogicalResult matchAndRewrite(GenericOp genericOp,		LogicalResult matchAndRewrite(GenericOp genericOp,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
for (OpOperand *opOperand : genericOp.getInputTensorOperands()) {		for (OpOperand &opOperand : genericOp->getOpOperands()) {
tensor::ExpandShapeOp reshapeOp =		tensor::ExpandShapeOp reshapeOp =
opOperand->get().getDefiningOp<tensor::ExpandShapeOp>();		opOperand.get().getDefiningOp<tensor::ExpandShapeOp>();
if (!reshapeOp)		if (!reshapeOp)
continue;		continue;

SmallVector<ReassociationIndices> collapsableIterationDims =		SmallVector<ReassociationIndices> collapsableIterationDims =
getCollapsableIterationSpaceDims(genericOp, opOperand,		getCollapsableIterationSpaceDims(genericOp, &opOperand,
reshapeOp.getReassociationIndices());		reshapeOp.getReassociationIndices());
if (collapsableIterationDims.empty() \|\|		if (collapsableIterationDims.empty() \|\|
!controlFoldingReshapes(opOperand)) {		!controlFoldingReshapes(&opOperand)) {
continue;		continue;
}		}

Optional<SmallVector<Value>> replacements =		Optional<SmallVector<Value>> replacements =
collapseGenericOpIterationDims(genericOp, collapsableIterationDims,		collapseGenericOpIterationDims(genericOp, collapsableIterationDims,
rewriter);		rewriter);
if (!replacements) {		if (!replacements) {
return rewriter.notifyMatchFailure(		return rewriter.notifyMatchFailure(
▲ Show 20 Lines • Show All 93 Lines • ▼ Show 20 Lines	for (OpOperand *opOperand : genericOp.getInputOperands()) {
continue;		continue;

// The operands and the indexing_maps of the fused operation the same as		// The operands and the indexing_maps of the fused operation the same as
// the operands and indexing_maps of the generic operations with the		// the operands and indexing_maps of the generic operations with the
// values at the constant index dropped.		// values at the constant index dropped.
SmallVector<AffineMap> fusedIndexMaps;		SmallVector<AffineMap> fusedIndexMaps;
SmallVector<Value> fusedOperands;		SmallVector<Value> fusedOperands;
SmallVector<Location> fusedLocs{genericOp.getLoc()};		SmallVector<Location> fusedLocs{genericOp.getLoc()};
fusedIndexMaps.reserve(genericOp.getNumInputsAndOutputs());		fusedIndexMaps.reserve(genericOp->getNumOperands());
fusedOperands.reserve(genericOp.getNumInputs());		fusedOperands.reserve(genericOp.getNumInputs());
fusedLocs.reserve(fusedLocs.size() + genericOp.getNumInputs());		fusedLocs.reserve(fusedLocs.size() + genericOp.getNumInputs());
for (OpOperand *inputOperand : genericOp.getInputOperands()) {		for (OpOperand *inputOperand : genericOp.getInputOperands()) {
if (inputOperand == opOperand)		if (inputOperand == opOperand)
continue;		continue;
Value inputValue = inputOperand->get();		Value inputValue = inputOperand->get();
fusedIndexMaps.push_back(		fusedIndexMaps.push_back(
genericOp.getMatchingIndexingMap(inputOperand));		genericOp.getMatchingIndexingMap(inputOperand));
Show All 9 Lines	for (OpOperand *opOperand : genericOp.getInputOperands()) {
return rewriter.notifyMatchFailure(		return rewriter.notifyMatchFailure(
genericOp, "fused op loop bound computation failed");		genericOp, "fused op loop bound computation failed");
}		}

// Create a constant scalar value from the splat constant.		// Create a constant scalar value from the splat constant.
Value scalarConstant = rewriter.create<arith::ConstantOp>(		Value scalarConstant = rewriter.create<arith::ConstantOp>(
def->getLoc(), constantAttr, constantAttr.getType());		def->getLoc(), constantAttr, constantAttr.getType());

SmallVector<Value> outputOperands = genericOp.getOutputOperands();		SmallVector<Value> outputOperands = genericOp.getOutputs();
auto fusedOp = rewriter.create<GenericOp>(		auto fusedOp = rewriter.create<GenericOp>(
rewriter.getFusedLoc(fusedLocs), genericOp->getResultTypes(),		rewriter.getFusedLoc(fusedLocs), genericOp->getResultTypes(),
/inputs=/fusedOperands,		/inputs=/fusedOperands,
/outputs=/outputOperands,		/outputs=/outputOperands,
rewriter.getAffineMapArrayAttr(fusedIndexMaps),		rewriter.getAffineMapArrayAttr(fusedIndexMaps),
genericOp.getIteratorTypes(),		genericOp.getIteratorTypes(),
/doc=/nullptr,		/doc=/nullptr,
/library_call=/nullptr);		/library_call=/nullptr);
▲ Show 20 Lines • Show All 186 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/Fusion.cpp

Show First 20 Lines • Show All 62 Lines • ▼ Show 20 Lines
// guarantees at least one such dimension is found. If multiple candidates exist		// guarantees at least one such dimension is found. If multiple candidates exist
// they must agree by construction (i.e. have the same size) and we just return		// they must agree by construction (i.e. have the same size) and we just return
// the first one.		// the first one.
static ShapeDimension		static ShapeDimension
getShapeDefiningLoopRange(LinalgOp op, unsigned loopDepth,		getShapeDefiningLoopRange(LinalgOp op, unsigned loopDepth,
bool fromSubViewOpOnly = false) {		bool fromSubViewOpOnly = false) {
// Iterate over the inputs and outputs in order.		// Iterate over the inputs and outputs in order.
// Extract the subranges from the linearized ranges.		// Extract the subranges from the linearized ranges.
for (OpOperand *opOperand : op.getInputAndOutputOperands()) {		for (OpOperand &opOperand : op->getOpOperands()) {
// The method `getRangeFromOperandShape` requires using SubViewOp or		// The method `getRangeFromOperandShape` requires using SubViewOp or
// ExtractSliceOps. If the value isn't defined from there continue.		// ExtractSliceOps. If the value isn't defined from there continue.
// todo: The method should be adapted to get the values from		// todo: The method should be adapted to get the values from
// `ViewInterface`. The interface needs a `getOrCreateRanges` method which		// `ViewInterface`. The interface needs a `getOrCreateRanges` method which
// currently returns a `linalg.range`. The fix here is to move this op to		// currently returns a `linalg.range`. The fix here is to move this op to
// `std` dialect and add the method to `ViewInterface`.		// `std` dialect and add the method to `ViewInterface`.
if (fromSubViewOpOnly &&		if (fromSubViewOpOnly &&
!isa_and_nonnull<memref::SubViewOp, tensor::ExtractSliceOp>(		!isa_and_nonnull<memref::SubViewOp, tensor::ExtractSliceOp>(
opOperand->get().getDefiningOp()))		opOperand.get().getDefiningOp()))
continue;		continue;

AffineMap map = op.getMatchingIndexingMap(opOperand);		AffineMap map = op.getMatchingIndexingMap(&opOperand);
LLVM_DEBUG(llvm::dbgs() << "getShapeDefiningLoopRange I/O idx: "		LLVM_DEBUG(llvm::dbgs() << "getShapeDefiningLoopRange I/O idx: "
<< opOperand->getOperandNumber() << "\n");		<< opOperand.getOperandNumber() << "\n");
LLVM_DEBUG(llvm::dbgs()		LLVM_DEBUG(llvm::dbgs()
<< "getShapeDefiningLoopRange map: " << map << "\n");		<< "getShapeDefiningLoopRange map: " << map << "\n");
SmallVector<Value, 8> shapeRanges(map.getNumResults(), nullptr);		SmallVector<Value, 8> shapeRanges(map.getNumResults(), nullptr);
for (const auto &en : llvm::enumerate(map.getResults())) {		for (const auto &en : llvm::enumerate(map.getResults())) {
auto dimExpr = en.value().dyn_cast<AffineDimExpr>();		auto dimExpr = en.value().dyn_cast<AffineDimExpr>();
if (!dimExpr)		if (!dimExpr)
continue;		continue;
if (loopDepth == en.value().cast<AffineDimExpr>().getPosition()) {		if (loopDepth == en.value().cast<AffineDimExpr>().getPosition()) {
LLVM_DEBUG(llvm::dbgs() << "getShapeDefiningLoopRange loopDepth: "		LLVM_DEBUG(llvm::dbgs() << "getShapeDefiningLoopRange loopDepth: "
<< loopDepth << "\n");		<< loopDepth << "\n");
LLVM_DEBUG(llvm::dbgs() << "getShapeDefiningLoopRange shape: "		LLVM_DEBUG(llvm::dbgs() << "getShapeDefiningLoopRange shape: "
<< opOperand->get() << "\n");		<< opOperand.get() << "\n");
return ShapeDimension{opOperand->get(),		return ShapeDimension{opOperand.get(),
static_cast<unsigned>(en.index())};		static_cast<unsigned>(en.index())};
}		}
}		}
}		}
llvm_unreachable("Expect to be able to extract a shape defining loop range");		llvm_unreachable("Expect to be able to extract a shape defining loop range");
}		}

static SmallVector<Value> getTiledOperands(LinalgOp producer) {		static SmallVector<Value> getTiledOperands(LinalgOp producer) {
return producer.getInputAndOutputOperands();		return producer->getOperands();
}		}

/// Fuses the producer by cloning the `producer`. The `fusedLoopsAndRanges`		/// Fuses the producer by cloning the `producer`. The `fusedLoopsAndRanges`
/// provides the loop range information for the fused loops. The rest are		/// provides the loop range information for the fused loops. The rest are
/// obtained from the producer itself, since they are not tiled + fused.		/// obtained from the producer itself, since they are not tiled + fused.
static LinalgOp fuse(OpBuilder &b, LinalgOp producer,		static LinalgOp fuse(OpBuilder &b, LinalgOp producer,
const DenseMap<unsigned, Range> &fusedLoopsAndRanges) {		const DenseMap<unsigned, Range> &fusedLoopsAndRanges) {
SmallVector<OpFoldResult> ivs, tileSizes, sizeBounds;		SmallVector<OpFoldResult> ivs, tileSizes, sizeBounds;
Show All 16 Lines	if (it != fusedLoopsAndRanges.end()) {
tileSizes.push_back(b.getIndexAttr(0));		tileSizes.push_back(b.getIndexAttr(0));
loopRanges.push_back(Range{b.getIndexAttr(0), dim, b.getIndexAttr(1)});		loopRanges.push_back(Range{b.getIndexAttr(0), dim, b.getIndexAttr(1)});
LLVM_DEBUG(llvm::dbgs() << "full loop#" << i << " with LoopRange "		LLVM_DEBUG(llvm::dbgs() << "full loop#" << i << " with LoopRange "
<< loopRanges.back() << "\n");		<< loopRanges.back() << "\n");
}		}
}		}

SmallVector<Value, 8> clonedShapes;		SmallVector<Value, 8> clonedShapes;
clonedShapes.reserve(producer.getNumInputsAndOutputs());		clonedShapes.reserve(producer->getNumOperands());

// Compute subranges for all tensor input/output operands.		// Compute subranges for all tensor input/output operands.
clonedShapes.append(makeTiledShapes(		clonedShapes.append(makeTiledShapes(
b, loc, producer, getTiledOperands(producer), ivs, tileSizes, sizeBounds,		b, loc, producer, getTiledOperands(producer), ivs, tileSizes, sizeBounds,
/*omitPartialTileCheck=/false));		/*omitPartialTileCheck=/false));

// Iterate over the results in order.		// Iterate over the results in order.
// Extract the subtensor type from the linearized range.		// Extract the subtensor type from the linearized range.
// Since we do not enforce any canonicalizations on the fly, this is always		// Since we do not enforce any canonicalizations on the fly, this is always
// fully dynamic at construction time.		// fully dynamic at construction time.
SmallVector<Type, 4> resultTypes;		SmallVector<Type, 4> resultTypes;
resultTypes.reserve(producer->getNumResults());		resultTypes.reserve(producer->getNumResults());
for (RankedTensorType t : producer.getOutputTensorTypes()) {		for (OpOperand *operand : producer.getOutputOperands()) {
unsigned rank = t.getRank();		auto tensorType = operand->get().getType().dyn_cast<RankedTensorType>();
		if (!tensorType)
		continue;
		unsigned rank = tensorType.getRank();
SmallVector<int64_t, 4> staticOffsetsVector(		SmallVector<int64_t, 4> staticOffsetsVector(
rank, ShapedType::kDynamicStrideOrOffset);		rank, ShapedType::kDynamicStrideOrOffset);
SmallVector<int64_t, 4> staticSizesVector(rank, ShapedType::kDynamicSize);		SmallVector<int64_t, 4> staticSizesVector(rank, ShapedType::kDynamicSize);
SmallVector<int64_t, 4> staticStridesVector(		SmallVector<int64_t, 4> staticStridesVector(
rank, ShapedType::kDynamicStrideOrOffset);		rank, ShapedType::kDynamicStrideOrOffset);
resultTypes.push_back(tensor::ExtractSliceOp::inferResultType(		resultTypes.push_back(tensor::ExtractSliceOp::inferResultType(
t.cast<RankedTensorType>(), staticOffsetsVector, staticSizesVector,		tensorType, staticOffsetsVector, staticSizesVector,
staticStridesVector));		staticStridesVector));
}		}

Operation *clonedOp = producer.clone(b, loc, resultTypes, clonedShapes);		Operation *clonedOp = producer.clone(b, loc, resultTypes, clonedShapes);

// Shift all IndexOp results by the tile offset.		// Shift all IndexOp results by the tile offset.
SmallVector<OpFoldResult> allIvs = llvm::to_vector(		SmallVector<OpFoldResult> allIvs = llvm::to_vector(
llvm::map_range(loopRanges, [&](Range range) { return range.offset; }));		llvm::map_range(loopRanges, [&](Range range) { return range.offset; }));
▲ Show 20 Lines • Show All 290 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/FusionOnTensors.cpp

Show First 20 Lines • Show All 155 Lines • ▼ Show 20 Lines	static LinalgOp getTiledProducer(OpBuilder &b, OpResult producerResult,
for (auto it : zip(tiledSliceDimIndices, tiledProducerLoopIndices)) {		for (auto it : zip(tiledSliceDimIndices, tiledProducerLoopIndices)) {
int64_t tiledSliceDim = std::get<0>(it);		int64_t tiledSliceDim = std::get<0>(it);
int64_t tiledProducerLoop = std::get<1>(it);		int64_t tiledProducerLoop = std::get<1>(it);
tileIvs[tiledProducerLoop] = sliceOpRanges[tiledSliceDim].offset;		tileIvs[tiledProducerLoop] = sliceOpRanges[tiledSliceDim].offset;
tileSizes[tiledProducerLoop] = sliceOpRanges[tiledSliceDim].size;		tileSizes[tiledProducerLoop] = sliceOpRanges[tiledSliceDim].size;
allIvs[tiledProducerLoop] = tileIvs[tiledProducerLoop];		allIvs[tiledProducerLoop] = tileIvs[tiledProducerLoop];
}		}
erase_value(tileIvs, OpFoldResult());		erase_value(tileIvs, OpFoldResult());
SmallVector<Value> tiledOperands = producerOp.getInputAndOutputOperands();		SmallVector<Value> tiledOperands = producerOp->getOperands();
tiledOperands = makeTiledShapes(b, loc, producerOp, tiledOperands, tileIvs,		tiledOperands = makeTiledShapes(b, loc, producerOp, tiledOperands, tileIvs,
tileSizes, producerLoopBounds,		tileSizes, producerLoopBounds,
/*omitPartialTileCheck=/false);		/*omitPartialTileCheck=/false);

// Output fusion has to update the iteration arguments of the tile loop nest.		// Output fusion has to update the iteration arguments of the tile loop nest.
// In particular, the iteration argument of the outermost tile loop needs to		// In particular, the iteration argument of the outermost tile loop needs to
// be set to the producer output instead of the producer result and `clonedOp`		// be set to the producer output instead of the producer result and `clonedOp`
// shall use the existing `sliceOp` result instead of the tiled producer		// shall use the existing `sliceOp` result instead of the tiled producer
▲ Show 20 Lines • Show All 244 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/Generalization.cpp

Show First 20 Lines • Show All 44 Lines • ▼ Show 20 Lines	static LogicalResult generalizeNamedOpPrecondition(LinalgOp linalgOp) {
return success();		return success();
}		}

FailureOr<GenericOp> mlir::linalg::generalizeNamedOp(RewriterBase &rewriter,		FailureOr<GenericOp> mlir::linalg::generalizeNamedOp(RewriterBase &rewriter,
LinalgOp linalgOp) {		LinalgOp linalgOp) {
if (failed(generalizeNamedOpPrecondition(linalgOp)))		if (failed(generalizeNamedOpPrecondition(linalgOp)))
return rewriter.notifyMatchFailure(linalgOp, "preconditions not met");		return rewriter.notifyMatchFailure(linalgOp, "preconditions not met");

SmallVector<Value> inputOperands = linalgOp.getInputOperands();		SmallVector<Value> inputs = linalgOp.getInputOperands();
SmallVector<Value> outputOperands = linalgOp.getOutputOperands();		SmallVector<Value> outputs = linalgOp.getOutputOperands();
SmallVector<AffineMap> indexingMaps = linalgOp.getIndexingMapsArray();		SmallVector<AffineMap> indexingMaps = linalgOp.getIndexingMapsArray();
SmallVector<StringRef> iterators = linalgOp.getIteratorTypesArray();		SmallVector<StringRef> iterators = linalgOp.getIteratorTypesArray();
SmallVector<RankedTensorType> resultTypes = linalgOp.getOutputTensorTypes();		SmallVector<Type> resultTypes = linalgOp.hasTensorSemantics()
SmallVector<Type> types(resultTypes.begin(), resultTypes.end());		? TypeRange(ValueRange(outputs))
		: TypeRange{};

// All named ops have a region attached that can be inlined.		// All named ops have a region attached that can be inlined.
assert(linalgOp->getNumRegions() == 1 &&		assert(linalgOp->getNumRegions() == 1 &&
"expect named op to have one region attached");		"expect named op to have one region attached");
GenericOp genericOp =		GenericOp genericOp = rewriter.create<GenericOp>(
rewriter.create<GenericOp>(linalgOp.getLoc(), types, inputOperands,		linalgOp.getLoc(), resultTypes, inputs, outputs, indexingMaps, iterators);
outputOperands, indexingMaps, iterators);
rewriter.inlineRegionBefore(linalgOp->getRegion(0), genericOp.getRegion(),		rewriter.inlineRegionBefore(linalgOp->getRegion(0), genericOp.getRegion(),
genericOp.getRegion().begin());		genericOp.getRegion().begin());
rewriter.replaceOp(linalgOp, genericOp->getResults());		rewriter.replaceOp(linalgOp, genericOp->getResults());
return genericOp;		return genericOp;
}		}

namespace {		namespace {

Show All 23 Lines

mlir/lib/Dialect/Linalg/Transforms/HoistPadding.cpp

Show First 20 Lines • Show All 105 Lines • ▼ Show 20 Lines	private:
bool valid;		bool valid;
};		};

/// Return true if all uses of `padOp` are an input tensor of some		/// Return true if all uses of `padOp` are an input tensor of some
/// LinalgOp.		/// LinalgOp.
static bool isOnlyUsedAsInputOfLinalgOp(tensor::PadOp padOp) {		static bool isOnlyUsedAsInputOfLinalgOp(tensor::PadOp padOp) {
for (OpOperand &use : padOp.getResult().getUses()) {		for (OpOperand &use : padOp.getResult().getUses()) {
auto linalgUser = dyn_cast<linalg::LinalgOp>(use.getOwner());		auto linalgUser = dyn_cast<linalg::LinalgOp>(use.getOwner());
if (!linalgUser \|\| !linalgUser.isInputTensor(&use)) {		if (!linalgUser \|\| !linalgUser.isInput(&use)) {
LLVM_DEBUG(DBGS() << "Found a use of " << *(padOp)		LLVM_DEBUG(DBGS() << "Found a use of " << *(padOp)
<< "\nthat is not an input tensor of a LinalgOp, "		<< "\nthat is not an input tensor of a LinalgOp, "
<< "cannot hoist\n"		<< "cannot hoist\n"
<< *(use.getOwner()) << "\n");		<< *(use.getOwner()) << "\n");
return false;		return false;
}		}
}		}
return true;		return true;
▲ Show 20 Lines • Show All 435 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/InlineScalarOperands.cpp

Show All 37 Lines	LogicalResult matchAndRewrite(GenericOp genericOp,
if (!genericOp.hasTensorSemantics())		if (!genericOp.hasTensorSemantics())
return failure();		return failure();

SmallVector<size_t> scalarOperands;		SmallVector<size_t> scalarOperands;
SmallVector<AffineMap> newIndexingMaps;		SmallVector<AffineMap> newIndexingMaps;
SmallVector<Value> newOperands;		SmallVector<Value> newOperands;
for (OpOperand *opOperand : genericOp.getInputOperands()) {		for (OpOperand *opOperand : genericOp.getInputOperands()) {
AffineMap map = genericOp.getMatchingIndexingMap(opOperand);		AffineMap map = genericOp.getMatchingIndexingMap(opOperand);
if (genericOp.isInputTensor(opOperand) && map.isConstant()) {		if (genericOp.isInput(opOperand) && map.isConstant()) {
scalarOperands.emplace_back(opOperand->getOperandNumber());		scalarOperands.emplace_back(opOperand->getOperandNumber());
} else {		} else {
newIndexingMaps.emplace_back(map);		newIndexingMaps.emplace_back(map);
newOperands.emplace_back(opOperand->get());		newOperands.emplace_back(opOperand->get());
}		}
}		}

if (scalarOperands.empty())		if (scalarOperands.empty())
return failure();		return failure();

for (OpOperand *opOperand : genericOp.getOutputOperands())		for (OpOperand *opOperand : genericOp.getOutputOperands())
newIndexingMaps.emplace_back(genericOp.getMatchingIndexingMap(opOperand));		newIndexingMaps.emplace_back(genericOp.getMatchingIndexingMap(opOperand));

Location loc = genericOp->getLoc();		Location loc = genericOp->getLoc();
SmallVector<Value> outputOperands = genericOp.getOutputOperands();		SmallVector<Value> outputOperands = genericOp.getOutputs();
auto newOp = rewriter.create<GenericOp>(		auto newOp = rewriter.create<GenericOp>(
loc, genericOp->getResultTypes(), newOperands, outputOperands,		loc, genericOp->getResultTypes(), newOperands, outputOperands,
newIndexingMaps, genericOp.getIteratorTypesArray());		newIndexingMaps, genericOp.getIteratorTypesArray());
rewriter.cloneRegionBefore(genericOp.getRegion(), newOp.getRegion(),		rewriter.cloneRegionBefore(genericOp.getRegion(), newOp.getRegion(),
newOp.getRegion().begin());		newOp.getRegion().begin());

Block *body = newOp.getBody();		Block *body = newOp.getBody();
PatternRewriter::InsertionGuard guard(rewriter);		PatternRewriter::InsertionGuard guard(rewriter);
▲ Show 20 Lines • Show All 50 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/Interchange.cpp

Show First 20 Lines • Show All 61 Lines • ▼ Show 20 Lines	mlir::linalg::interchangeGenericOp(RewriterBase &rewriter, GenericOp genericOp,

// Start a guarded inplace update.		// Start a guarded inplace update.
rewriter.startRootUpdate(genericOp);		rewriter.startRootUpdate(genericOp);
auto guard =		auto guard =
llvm::make_scope_exit([&]() { rewriter.finalizeRootUpdate(genericOp); });		llvm::make_scope_exit([&]() { rewriter.finalizeRootUpdate(genericOp); });

// 2. Compute the interchanged indexing maps.		// 2. Compute the interchanged indexing maps.
SmallVector<AffineMap> newIndexingMaps;		SmallVector<AffineMap> newIndexingMaps;
for (OpOperand *opOperand : genericOp.getInputAndOutputOperands()) {		for (OpOperand &opOperand : genericOp->getOpOperands()) {
AffineMap m = genericOp.getMatchingIndexingMap(opOperand);		AffineMap m = genericOp.getMatchingIndexingMap(&opOperand);
if (!permutationMap.isEmpty())		if (!permutationMap.isEmpty())
m = m.compose(permutationMap);		m = m.compose(permutationMap);
newIndexingMaps.push_back(m);		newIndexingMaps.push_back(m);
}		}
genericOp->setAttr(getIndexingMapsAttrName(),		genericOp->setAttr(getIndexingMapsAttrName(),
rewriter.getAffineMapArrayAttr(newIndexingMaps));		rewriter.getAffineMapArrayAttr(newIndexingMaps));

// 3. Compute the interchanged iterator types.		// 3. Compute the interchanged iterator types.
Show All 28 Lines

mlir/lib/Dialect/Linalg/Transforms/Loops.cpp

Show First 20 Lines • Show All 125 Lines • ▼ Show 20 Lines
/// ```		/// ```
template <typename LoadOpTy, typename StoreOpTy>		template <typename LoadOpTy, typename StoreOpTy>
static void emitScalarImplementation(OpBuilder &b, Location loc,		static void emitScalarImplementation(OpBuilder &b, Location loc,
ArrayRef<Value> allIvs,		ArrayRef<Value> allIvs,
LinalgOp linalgOp) {		LinalgOp linalgOp) {
assert(linalgOp.hasBufferSemantics() &&		assert(linalgOp.hasBufferSemantics() &&
"expected linalg op with buffer semantics");		"expected linalg op with buffer semantics");
SmallVector<Value> indexedValues;		SmallVector<Value> indexedValues;
indexedValues.reserve(linalgOp.getNumInputsAndOutputs());		indexedValues.reserve(linalgOp->getNumOperands());

auto allIvsPlusDims = SmallVector<Value>(allIvs.begin(), allIvs.end());		auto allIvsPlusDims = SmallVector<Value>(allIvs.begin(), allIvs.end());

// TODO: Avoid the loads if the corresponding argument of the		// TODO: Avoid the loads if the corresponding argument of the
// region has no uses.		// region has no uses.
// 1.a. Emit load from input operand or for scalars access the operand itself.		// 1.a. Emit load from input operand or for scalars access the operand itself.
for (OpOperand *inputOperand : linalgOp.getInputOperands()) {		for (OpOperand *inputOperand : linalgOp.getInputOperands()) {
if (linalgOp.isScalar(inputOperand)) {		if (linalgOp.isScalar(inputOperand)) {
Show All 13 Lines	indexedValues.push_back(
b.create<LoadOpTy>(loc, outputOperand->get(), indexing));		b.create<LoadOpTy>(loc, outputOperand->get(), indexing));
}		}

// TODO: When a region inliner exists, use it.		// TODO: When a region inliner exists, use it.
// 2. Inline region, currently only works for a single basic block.		// 2. Inline region, currently only works for a single basic block.
// 3. Emit store.		// 3. Emit store.
SmallVector<SmallVector<Value>, 8> indexing;		SmallVector<SmallVector<Value>, 8> indexing;
SmallVector<Value> outputBuffers;		SmallVector<Value> outputBuffers;
for (OpOperand *outputOperand : linalgOp.getOutputBufferOperands()) {		for (OpOperand *outputOperand : linalgOp.getOutputOperands()) {
		if (!outputOperand->get().getType().isa<MemRefType>())
		continue;
indexing.push_back(makeCanonicalAffineApplies(		indexing.push_back(makeCanonicalAffineApplies(
b, loc, linalgOp.getMatchingIndexingMap(outputOperand),		b, loc, linalgOp.getMatchingIndexingMap(outputOperand),
allIvsPlusDims));		allIvsPlusDims));
outputBuffers.push_back(outputOperand->get());		outputBuffers.push_back(outputOperand->get());
}		}
inlineRegionAndEmitStore<LoadOpTy, StoreOpTy>(b, loc, linalgOp, indexedValues,		inlineRegionAndEmitStore<LoadOpTy, StoreOpTy>(b, loc, linalgOp, indexedValues,
indexing, outputBuffers);		indexing, outputBuffers);
}		}
▲ Show 20 Lines • Show All 210 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/Promotion.cpp

Show First 20 Lines • Show All 139 Lines • ▼ Show 20 Lines
} // namespace		} // namespace

LinalgOpInstancePromotionOptions::LinalgOpInstancePromotionOptions(		LinalgOpInstancePromotionOptions::LinalgOpInstancePromotionOptions(
LinalgOp linalgOp, const LinalgPromotionOptions &options)		LinalgOp linalgOp, const LinalgPromotionOptions &options)
: subViews(), alignment(options.alignment) {		: subViews(), alignment(options.alignment) {
assert(linalgOp.hasBufferSemantics() && "revisit usage of shaped operand");		assert(linalgOp.hasBufferSemantics() && "revisit usage of shaped operand");
auto vUseFullTileBuffers =		auto vUseFullTileBuffers =
options.useFullTileBuffers.value_or(llvm::SmallBitVector());		options.useFullTileBuffers.value_or(llvm::SmallBitVector());
vUseFullTileBuffers.resize(linalgOp.getNumInputsAndOutputs(),		vUseFullTileBuffers.resize(linalgOp->getNumOperands(),
options.useFullTileBuffersDefault);		options.useFullTileBuffersDefault);

for (OpOperand *opOperand : linalgOp.getInputAndOutputOperands()) {		for (OpOperand &opOperand : linalgOp->getOpOperands()) {
int64_t operandNumber = opOperand->getOperandNumber();		int64_t operandNumber = opOperand.getOperandNumber();
if (options.operandsToPromote &&		if (options.operandsToPromote &&
!options.operandsToPromote->count(operandNumber))		!options.operandsToPromote->count(operandNumber))
continue;		continue;
Operation *op = opOperand->get().getDefiningOp();		Operation *op = opOperand.get().getDefiningOp();
if (auto sv = dyn_cast_or_null<memref::SubViewOp>(op)) {		if (auto sv = dyn_cast_or_null<memref::SubViewOp>(op)) {
subViews[operandNumber] = sv;		subViews[operandNumber] = sv;
useFullTileBuffers[sv] = vUseFullTileBuffers[operandNumber];		useFullTileBuffers[sv] = vUseFullTileBuffers[operandNumber];
}		}
}		}

if (options.allocationFn) {		if (options.allocationFn) {
allocationFn = *options.allocationFn;		allocationFn = *options.allocationFn;
▲ Show 20 Lines • Show All 156 Lines • ▼ Show 20 Lines	promoteSubViews(ImplicitLocOpBuilder &b, LinalgOp op,
if (failed(promotedBuffersAndViews) \|\|		if (failed(promotedBuffersAndViews) \|\|
promotedBuffersAndViews->size() != options.subViews.size())		promotedBuffersAndViews->size() != options.subViews.size())
return failure();		return failure();

// 2. Append all other operands as they appear, this enforces that such		// 2. Append all other operands as they appear, this enforces that such
// operands are not views. This is to support cases such as FillOp taking		// operands are not views. This is to support cases such as FillOp taking
// extra scalars etc. Keep a reference to output buffers;		// extra scalars etc. Keep a reference to output buffers;
SmallVector<Value, 8> opViews;		SmallVector<Value, 8> opViews;
opViews.reserve(op.getNumInputsAndOutputs());		opViews.reserve(op->getNumOperands());
SmallVector<std::pair<Value, Value>, 8> writebackViews;		SmallVector<std::pair<Value, Value>, 8> writebackViews;
writebackViews.reserve(promotedBuffersAndViews->size());		writebackViews.reserve(promotedBuffersAndViews->size());
for (OpOperand *opOperand : op.getInputAndOutputOperands()) {		for (OpOperand &opOperand : op->getOpOperands()) {
int64_t operandNumber = opOperand->getOperandNumber();		int64_t operandNumber = opOperand.getOperandNumber();
if (options.subViews.count(operandNumber) != 0) {		if (options.subViews.count(operandNumber) != 0) {
if (options.useFullTileBuffers[opOperand->get()])		if (options.useFullTileBuffers[opOperand.get()])
opViews.push_back(		opViews.push_back(
(*promotedBuffersAndViews)[operandNumber].fullLocalView);		(*promotedBuffersAndViews)[operandNumber].fullLocalView);
else		else
opViews.push_back(		opViews.push_back(
(*promotedBuffersAndViews)[operandNumber].partialLocalView);		(*promotedBuffersAndViews)[operandNumber].partialLocalView);
if (operandNumber >= op.getNumInputs())		if (operandNumber >= op.getNumInputs())
writebackViews.emplace_back(std::make_pair(		writebackViews.emplace_back(std::make_pair(
opOperand->get(),		opOperand.get(),
(*promotedBuffersAndViews)[operandNumber].partialLocalView));		(*promotedBuffersAndViews)[operandNumber].partialLocalView));
} else {		} else {
opViews.push_back(opOperand->get());		opViews.push_back(opOperand.get());
}		}
}		}
op->setOperands(0, opViews.size(), opViews);		op->setOperands(0, opViews.size(), opViews);

OpBuilder::InsertionGuard guard(b);		OpBuilder::InsertionGuard guard(b);
b.setInsertionPointAfter(op);		b.setInsertionPointAfter(op);
// 3. Emit write-back for the promoted output views: copy the partial view.		// 3. Emit write-back for the promoted output views: copy the partial view.
for (auto viewAndPartialLocalView : writebackViews) {		for (auto viewAndPartialLocalView : writebackViews) {
Show All 11 Lines
LogicalResult		LogicalResult
mlir::linalg::promoteSubviewsPrecondition(Operation *op,		mlir::linalg::promoteSubviewsPrecondition(Operation *op,
LinalgPromotionOptions options) {		LinalgPromotionOptions options) {
LinalgOp linalgOp = dyn_cast<LinalgOp>(op);		LinalgOp linalgOp = dyn_cast<LinalgOp>(op);
// Transformation applies to buffers only.		// Transformation applies to buffers only.
if (!linalgOp \|\| !linalgOp.hasBufferSemantics())		if (!linalgOp \|\| !linalgOp.hasBufferSemantics())
return failure();		return failure();
// Check that at least one of the requested operands is indeed a subview.		// Check that at least one of the requested operands is indeed a subview.
for (OpOperand *opOperand : linalgOp.getInputAndOutputOperands()) {		for (OpOperand &opOperand : linalgOp->getOpOperands()) {
auto sv =		auto sv =
isa_and_nonnull<memref::SubViewOp>(opOperand->get().getDefiningOp());		isa_and_nonnull<memref::SubViewOp>(opOperand.get().getDefiningOp());
if (sv) {		if (sv) {
if (!options.operandsToPromote \|\|		if (!options.operandsToPromote \|\|
options.operandsToPromote->count(opOperand->getOperandNumber()))		options.operandsToPromote->count(opOperand.getOperandNumber()))
return success();		return success();
}		}
}		}
// TODO: Check all subviews requested are bound by a static constant.		// TODO: Check all subviews requested are bound by a static constant.
// TODO: Check that the total footprint fits within a given size.		// TODO: Check that the total footprint fits within a given size.
return failure();		return failure();
}		}

Show All 11 Lines

mlir/lib/Dialect/Linalg/Transforms/SplitReduction.cpp

Show First 20 Lines • Show All 208 Lines • ▼ Show 20 Lines	GenericOp genericOp = b.create<GenericOp>(
ValueRange({identityTensor}), newMaps, newIteratorTypes);		ValueRange({identityTensor}), newMaps, newIteratorTypes);
b.inlineRegionBefore(op->getRegion(0), genericOp.getRegion(),		b.inlineRegionBefore(op->getRegion(0), genericOp.getRegion(),
genericOp.getRegion().begin());		genericOp.getRegion().begin());

// Then create a new reduction that only reduce the newly added dimension		// Then create a new reduction that only reduce the newly added dimension
// from the previous op.		// from the previous op.
unsigned intermRank = newOutputShape.size();		unsigned intermRank = newOutputShape.size();
AffineMap inputMap = b.getMultiDimIdentityMap(intermRank);		AffineMap inputMap = b.getMultiDimIdentityMap(intermRank);
SmallVector<Value> outputOperands = op.getOutputOperands();
SmallVector<StringRef> reductionIteratorTypes;		SmallVector<StringRef> reductionIteratorTypes;
SmallVector<AffineExpr> exprs;		SmallVector<AffineExpr> exprs;
for (unsigned i : llvm::seq<unsigned>(0, intermRank)) {		for (unsigned i : llvm::seq<unsigned>(0, intermRank)) {
if (insertSplitDimension == i) {		if (insertSplitDimension == i) {
reductionIteratorTypes.push_back(getReductionIteratorTypeName());		reductionIteratorTypes.push_back(getReductionIteratorTypeName());
} else {		} else {
exprs.push_back(b.getAffineDimExpr(i));		exprs.push_back(b.getAffineDimExpr(i));
reductionIteratorTypes.push_back(getParallelIteratorTypeName());		reductionIteratorTypes.push_back(getParallelIteratorTypeName());
}		}
}		}
AffineMap outputMap = AffineMap::get(intermRank, 0, exprs, op.getContext());		AffineMap outputMap = AffineMap::get(intermRank, 0, exprs, op.getContext());
SmallVector<AffineMap> reductionMaps = {inputMap, outputMap};		SmallVector<AffineMap> reductionMaps = {inputMap, outputMap};

auto reduction = b.create<GenericOp>(		auto reduction = b.create<GenericOp>(
loc, op->getResultTypes(), ValueRange({genericOp.getResult(0)}),		loc, op->getResultTypes(), ValueRange({genericOp.getResult(0)}),
outputOperands, reductionMaps, reductionIteratorTypes,		SmallVector<Value>{op.getOutputOperands()}, reductionMaps,
		reductionIteratorTypes,
[reductionOp](OpBuilder &b, Location loc, ValueRange inputs) {		[reductionOp](OpBuilder &b, Location loc, ValueRange inputs) {
Operation clonedReductionOp = b.clone(reductionOp);		Operation clonedReductionOp = b.clone(reductionOp);
clonedReductionOp->setOperand(0, inputs[0]);		clonedReductionOp->setOperand(0, inputs[0]);
clonedReductionOp->setOperand(1, inputs[1]);		clonedReductionOp->setOperand(1, inputs[1]);
b.create<linalg::YieldOp>(loc, clonedReductionOp->getResult(0));		b.create<linalg::YieldOp>(loc, clonedReductionOp->getResult(0));
});		});
b.replaceOp(op, reduction.getResults());		b.replaceOp(op, reduction.getResults());

▲ Show 20 Lines • Show All 94 Lines • ▼ Show 20 Lines	FailureOr<SplitReductionResult> mlir::linalg::splitReductionByScaling(
MLIRContext *context = op.getContext();		MLIRContext *context = op.getContext();
// For now assume outputs are 1-1 with reduction neutralElements.		// For now assume outputs are 1-1 with reduction neutralElements.
// TODO: generalize when multi-reduction support is available.		// TODO: generalize when multi-reduction support is available.
SmallVector<Value> newOutputs;		SmallVector<Value> newOutputs;
newOutputs.reserve(op.getNumOutputs());		newOutputs.reserve(op.getNumOutputs());
SmallVector<Operation *> emptyOrAllocTensorOps;		SmallVector<Operation *> emptyOrAllocTensorOps;
SmallVector<linalg::FillOp> fillOps;		SmallVector<linalg::FillOp> fillOps;
fillOps.reserve(op.getNumOutputs());		fillOps.reserve(op.getNumOutputs());
for (auto it : llvm::zip(op.getOutputs(), neutralElements)) {		for (auto it : llvm::zip(op.getOutputOperands(), neutralElements)) {
Value rankedTensor = std::get<0>(it);		Value rankedTensor = std::get<0>(it)->get();
auto t = rankedTensor.getType().cast<RankedTensorType>();		auto t = rankedTensor.getType().cast<RankedTensorType>();
RankedTensorType newT = RankedTensorType::Builder(t).insertDim(		RankedTensorType newT = RankedTensorType::Builder(t).insertDim(
reductionDimSize / splitFactor, insertSplitDimension);		reductionDimSize / splitFactor, insertSplitDimension);
SmallVector<Value> dims =		SmallVector<Value> dims =
tensor::createDynamicDimValues(b, loc, rankedTensor);		tensor::createDynamicDimValues(b, loc, rankedTensor);
Value emptyOrAllocTensor;		Value emptyOrAllocTensor;
if (useAlloc) {		if (useAlloc) {
emptyOrAllocTensor =		emptyOrAllocTensor =
b.create<bufferization::AllocTensorOp>(loc, newT, dims);		b.create<bufferization::AllocTensorOp>(loc, newT, dims);
} else {		} else {
emptyOrAllocTensor = b.create<tensor::EmptyOp>(loc, newT.getShape(),		emptyOrAllocTensor = b.create<tensor::EmptyOp>(loc, newT.getShape(),
t.getElementType(), dims);		t.getElementType(), dims);
}		}
Value constantOp = b.create<arith::ConstantOp>(loc, std::get<1>(it));		Value constantOp = b.create<arith::ConstantOp>(loc, std::get<1>(it));
fillOps.push_back(		fillOps.push_back(
b.create<linalg::FillOp>(op->getLoc(), constantOp, emptyOrAllocTensor));		b.create<linalg::FillOp>(op->getLoc(), constantOp, emptyOrAllocTensor));
newOutputs.push_back(fillOps.back().getResult(0));		newOutputs.push_back(fillOps.back().getResult(0));
emptyOrAllocTensorOps.push_back(emptyOrAllocTensor.getDefiningOp());		emptyOrAllocTensorOps.push_back(emptyOrAllocTensor.getDefiningOp());
}		}

// Step 2. Reindex / expand indexing maps.		// Step 2. Reindex / expand indexing maps.
// Reindex existing input indexings: k -> k * splitFactor + k'.		// Reindex existing input indexings: k -> k * splitFactor + k'.
SmallVector<AffineMap> newMaps;		SmallVector<AffineMap> newMaps;
newMaps.reserve(op.getNumInputsAndOutputs() + 1);		newMaps.reserve(op->getNumOperands() + 1);
for (OpOperand *o : op.getInputOperands())		for (OpOperand *o : op.getInputOperands())
newMaps.push_back(scaleReductionDim(op, *o, reductionDimPos, splitFactor));		newMaps.push_back(scaleReductionDim(op, *o, reductionDimPos, splitFactor));
// Provision a new indexing for the shape-only tensor.		// Provision a new indexing for the shape-only tensor.
auto nDims = op.getNumLoops() + 1;		auto nDims = op.getNumLoops() + 1;
auto redDim = getAffineDimExpr(reductionDimPos, context);		auto redDim = getAffineDimExpr(reductionDimPos, context);
auto redDimP1 = getAffineDimExpr(reductionDimPos + 1, context);		auto redDimP1 = getAffineDimExpr(reductionDimPos + 1, context);
newMaps.push_back(AffineMap::get(nDims, 0, {redDim, redDimP1}, context));		newMaps.push_back(AffineMap::get(nDims, 0, {redDim, redDimP1}, context));
// Expand existing output indexings.		// Expand existing output indexings.
// TODO: a subset of these may not reduce along reducePos and should be		// TODO: a subset of these may not reduce along reducePos and should be
// reindexed: k -> k * splitFactor + k', when multi-reduction support is		// reindexed: k -> k * splitFactor + k', when multi-reduction support is
// available.		// available.
for (OpOperand *o : op.getOutputOperands())		for (OpOperand *o : op.getOutputOperands())
newMaps.push_back(insertParallelDim(op, *o, reductionDimPos,		newMaps.push_back(insertParallelDim(op, *o, reductionDimPos,
reductionDimSize / splitFactor));		reductionDimSize / splitFactor));

// Step 3. Handle operands.		// Step 3. Handle operands.
// Compute the new input tensors.		// Compute the new input tensors.
auto newInputs = llvm::to_vector<4>(op.getInputs());		SmallVector<Value> newInputs(op.getInputOperands());
// Add a single shape-only tensor to carry the dimensions without resorting to		// Add a single shape-only tensor to carry the dimensions without resorting to
// more complex inversions.		// more complex inversions.
newInputs.push_back(b.create<tensor::EmptyOp>(		newInputs.push_back(b.create<tensor::EmptyOp>(
loc, ArrayRef<int64_t>{reductionDimSize / splitFactor, splitFactor},		loc, ArrayRef<int64_t>{reductionDimSize / splitFactor, splitFactor},
b.getIntegerType(1)));		b.getIntegerType(1)));
// Output tensors are already good to go.		// Output tensors are already good to go.

// Step 4. Create the new op matching the original op with an extra parallel		// Step 4. Create the new op matching the original op with an extra parallel
Show All 12 Lines	FailureOr<SplitReductionResult> mlir::linalg::splitReductionByScaling(
// Step 5. Create new reduction ops that only reduce the newly added		// Step 5. Create new reduction ops that only reduce the newly added
// dimensions from the previous op.		// dimensions from the previous op.
// For now assume outputs are 1-1 with reduction ops.		// For now assume outputs are 1-1 with reduction ops.
// TODO: a subset of these may not reduce in the first place and do not		// TODO: a subset of these may not reduce in the first place and do not
// require a new op, when multi-reduction support is available.		// require a new op, when multi-reduction support is available.
// TODO: all results can be handled in a single GenericOp, when		// TODO: all results can be handled in a single GenericOp, when
// multi-reduction support is available.		// multi-reduction support is available.
SmallVector<LinalgOp> results;		SmallVector<LinalgOp> results;
for (auto it :		for (auto it : llvm::zip(genericOp->getResults(), op.getOutputOperands(),
llvm::zip(genericOp->getResults(), op.getOutputs(), combinerOps)) {		combinerOps)) {
Value reindexedOutput = std::get<0>(it);		Value reindexedOutput = std::get<0>(it);
Value originalOutput = std::get<1>(it);		Value originalOutput = std::get<1>(it)->get();
auto originalOutputType = originalOutput.getType().cast<RankedTensorType>();		auto originalOutputType = originalOutput.getType().cast<RankedTensorType>();
Operation *combinerOp = std::get<2>(it);		Operation *combinerOp = std::get<2>(it);

AffineMap map = b.getMultiDimIdentityMap(originalOutputType.getRank() + 1);		AffineMap map = b.getMultiDimIdentityMap(originalOutputType.getRank() + 1);
SmallVector<AffineMap> indexingMaps = {		SmallVector<AffineMap> indexingMaps = {
map, map.dropResult(insertSplitDimension)};		map, map.dropResult(insertSplitDimension)};
SmallVector<StringRef> reductionIteratorTypes(		SmallVector<StringRef> reductionIteratorTypes(
originalOutputType.getRank() + 1, getParallelIteratorTypeName());		originalOutputType.getRank() + 1, getParallelIteratorTypeName());
▲ Show 20 Lines • Show All 59 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/Tiling.cpp

Show First 20 Lines • Show All 497 Lines • ▼ Show 20 Lines	auto tiledLoopBodyBuilder =
if (!options.interchangeVector.empty())		if (!options.interchangeVector.empty())
interchangedIvs = applyMapToValues(b, loc, invPermutationMap, ivs);		interchangedIvs = applyMapToValues(b, loc, invPermutationMap, ivs);
else		else
interchangedIvs.assign(ivs.begin(), ivs.end());		interchangedIvs.assign(ivs.begin(), ivs.end());

// Tile the `operandValuesToUse` that either match the `op` operands		// Tile the `operandValuesToUse` that either match the `op` operands
// themselves or the tile loop arguments forwarding them.		// themselves or the tile loop arguments forwarding them.
assert(operandValuesToUse.size() ==		assert(operandValuesToUse.size() ==
static_cast<size_t>(op.getNumInputsAndOutputs()) &&		static_cast<size_t>(op->getNumOperands()) &&
"expect the number of operands and inputs and outputs to match");		"expect the number of operands and inputs and outputs to match");
SmallVector<Value> valuesToTile = operandValuesToUse;		SmallVector<Value> valuesToTile = operandValuesToUse;
SmallVector<OpFoldResult> sizeBounds =		SmallVector<OpFoldResult> sizeBounds =
makeComposedFoldedMultiResultAffineApply(b, loc, shapeSizesToLoopsMap,		makeComposedFoldedMultiResultAffineApply(b, loc, shapeSizesToLoopsMap,
allShapeSizes);		allShapeSizes);
SmallVector<Value> tiledOperands = makeTiledShapes(		SmallVector<Value> tiledOperands = makeTiledShapes(
b, loc, op, valuesToTile, getAsOpFoldResult(interchangedIvs), tileSizes,		b, loc, op, valuesToTile, getAsOpFoldResult(interchangedIvs), tileSizes,
sizeBounds,		sizeBounds,
▲ Show 20 Lines • Show All 221 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/TilingInterfaceImpl.cpp

Show First 20 Lines • Show All 119 Lines • ▼ Show 20 Lines	struct LinalgOpTilingInterface
SmallVector<Operation *>		SmallVector<Operation *>
getTiledImplementation(Operation *op, OpBuilder &b,		getTiledImplementation(Operation *op, OpBuilder &b,
ArrayRef<OpFoldResult> offsets,		ArrayRef<OpFoldResult> offsets,
ArrayRef<OpFoldResult> sizes) const {		ArrayRef<OpFoldResult> sizes) const {
// Leave the `sizeBounds` value empty. That is only needed when the `sizes`		// Leave the `sizeBounds` value empty. That is only needed when the `sizes`
// specified could lead to out of bounds accesses.		// specified could lead to out of bounds accesses.
Location loc = op->getLoc();		Location loc = op->getLoc();
LinalgOp linalgOp = cast<LinalgOp>(op);		LinalgOp linalgOp = cast<LinalgOp>(op);
SmallVector<Value> valuesToTile = linalgOp.getInputAndOutputOperands();		SmallVector<Value> valuesToTile = linalgOp->getOperands();
SmallVector<Value, 4> tiledOperands = makeTiledShapes(		SmallVector<Value, 4> tiledOperands = makeTiledShapes(
b, loc, linalgOp, valuesToTile, offsets, sizes, {}, true);		b, loc, linalgOp, valuesToTile, offsets, sizes, {}, true);

SmallVector<Type> resultTensorTypes = llvm::to_vector(llvm::map_range(		SmallVector<Type> resultTensorTypes =
linalgOp.getOutputTensorOperands(), [&](OpOperand *opOperand) {		getTensorOutputTypes(linalgOp, tiledOperands);
return tiledOperands[opOperand->getOperandNumber()].getType();
}));

Operation *tiledOp =		Operation *tiledOp =
linalgOp.clone(b, loc, resultTensorTypes, tiledOperands);		linalgOp.clone(b, loc, resultTensorTypes, tiledOperands);
offsetIndices(b, cast<LinalgOp>(tiledOp), offsets);		offsetIndices(b, cast<LinalgOp>(tiledOp), offsets);

return {tiledOp};		return {tiledOp};
}		}

▲ Show 20 Lines • Show All 73 Lines • ▼ Show 20 Lines	struct LinalgOpTilingInterface
LogicalResult generateScalarImplementation(Operation *op, OpBuilder &builder,		LogicalResult generateScalarImplementation(Operation *op, OpBuilder &builder,
Location loc,		Location loc,
ValueRange ivs) const {		ValueRange ivs) const {
auto linalgOp = cast<LinalgOp>(op);		auto linalgOp = cast<LinalgOp>(op);
if (!linalgOp.hasBufferSemantics())		if (!linalgOp.hasBufferSemantics())
return op->emitOpError("expected operation to have buffer semantics");		return op->emitOpError("expected operation to have buffer semantics");

SmallVector<Value> indexedValues;		SmallVector<Value> indexedValues;
indexedValues.reserve(linalgOp.getNumInputsAndOutputs());		indexedValues.reserve(linalgOp->getNumOperands());
Location linalgOpLoc = op->getLoc();		Location linalgOpLoc = op->getLoc();
/// Load the data corresponding to the block arguments that		/// Load the data corresponding to the block arguments that
/// represent input operands.		/// represent input operands.
for (OpOperand *operand : linalgOp.getInputAndOutputOperands()) {		for (OpOperand &operand : linalgOp->getOpOperands()) {
if (!linalgOp.payloadUsesValueFromOperand(operand)) {		if (!linalgOp.payloadUsesValueFromOperand(&operand)) {
indexedValues.push_back(nullptr);		indexedValues.push_back(nullptr);
continue;		continue;
}		}
if (linalgOp.isScalar(operand)) {		if (linalgOp.isScalar(&operand)) {
indexedValues.push_back(operand->get());		indexedValues.push_back(operand.get());
continue;		continue;
}		}
SmallVector<Value> indices = getIndicesForAccess(		SmallVector<Value> indices = getIndicesForAccess(
builder, linalgOpLoc, linalgOp.getMatchingIndexingMap(operand), ivs);		builder, linalgOpLoc, linalgOp.getMatchingIndexingMap(&operand), ivs);
Value load =		Value load =
builder.create<memref::LoadOp>(linalgOpLoc, operand->get(), indices);		builder.create<memref::LoadOp>(linalgOpLoc, operand.get(), indices);
indexedValues.push_back(load);		indexedValues.push_back(load);
}		}

/// Inline the op payload and store the result.		/// Inline the op payload and store the result.
return inlinePayload(builder, linalgOp, ivs, indexedValues);		return inlinePayload(builder, linalgOp, ivs, indexedValues);
}		}
};		};

Show All 24 Lines

mlir/lib/Dialect/Linalg/Transforms/Transforms.cpp

Show First 20 Lines • Show All 197 Lines • ▼ Show 20 Lines	linalg::rewriteAsPaddedOp(OpBuilder &b, LinalgOp opToPad,
assert(opToPad.hasTensorSemantics() &&		assert(opToPad.hasTensorSemantics() &&
"expected operation to have tensor semantics");		"expected operation to have tensor semantics");

OpBuilder::InsertionGuard g(b);		OpBuilder::InsertionGuard g(b);
// Set IP after op because we also take the dims of the original output.		// Set IP after op because we also take the dims of the original output.
b.setInsertionPointAfter(opToPad);		b.setInsertionPointAfter(opToPad);
// Make a copy of the shaped operands and update it.		// Make a copy of the shaped operands and update it.
SmallVector<Value> newOperands;		SmallVector<Value> newOperands;
newOperands.reserve(opToPad.getNumInputsAndOutputs());		newOperands.reserve(opToPad->getNumOperands());
for (OpOperand *opOperand : opToPad.getInputAndOutputOperands()) {		for (OpOperand &opOperand : opToPad->getOpOperands()) {
FailureOr<Value> paddedOperand = padOperandToSmallestStaticBoundingBox(		FailureOr<Value> paddedOperand = padOperandToSmallestStaticBoundingBox(
b, opToPad, opOperand, paddingDimensions, paddingValues, packPaddings);		b, opToPad, &opOperand, paddingDimensions, paddingValues, packPaddings);
// Exit if `paddingDimensions` cannot be bounded statically.		// Exit if `paddingDimensions` cannot be bounded statically.
if (failed(paddedOperand))		if (failed(paddedOperand))
return failure();		return failure();
newOperands.push_back(*paddedOperand);		newOperands.push_back(*paddedOperand);
}		}

SmallVector<SmallVector<Value>> reifiedResultShapes;		SmallVector<SmallVector<Value>> reifiedResultShapes;
if (failed(cast<ReifyRankedShapedTypeOpInterface>(opToPad.getOperation())		if (failed(cast<ReifyRankedShapedTypeOpInterface>(opToPad.getOperation())
▲ Show 20 Lines • Show All 104 Lines • ▼ Show 20 Lines	mlir::linalg::LinalgPaddingPattern::returningMatchAndRewrite(
FailureOr<SmallVector<Value>> newResults =		FailureOr<SmallVector<Value>> newResults =
rewriteAsPaddedOp(rewriter, linalgOp, options.paddingDimensions,		rewriteAsPaddedOp(rewriter, linalgOp, options.paddingDimensions,
options.paddingValues, options.packPaddings, paddedOp);		options.paddingValues, options.packPaddings, paddedOp);
if (failed(newResults))		if (failed(newResults))
return failure();		return failure();

// Hoist the padding.		// Hoist the padding.
for (const auto &en : enumerate(options.hoistPaddings)) {		for (const auto &en : enumerate(options.hoistPaddings)) {
if (static_cast<int64_t>(en.index()) >= paddedOp.getNumInputsAndOutputs())		if (static_cast<int64_t>(en.index()) >= paddedOp->getNumOperands())
break;		break;
OpOperand *opOperand = &paddedOp->getOpOperand(en.index());		OpOperand &opOperand = paddedOp->getOpOperand(en.index());
auto padOp = opOperand->get().getDefiningOp<tensor::PadOp>();		auto padOp = opOperand.get().getDefiningOp<tensor::PadOp>();
if (!padOp \|\| en.value() == 0)		if (!padOp \|\| en.value() == 0)
continue;		continue;

// Fail hoisting if the operand shape is not fully static.		// Fail hoisting if the operand shape is not fully static.
if (llvm::any_of(paddedOp.getShape(opOperand), ShapedType::isDynamic))		if (llvm::any_of(paddedOp.getShape(&opOperand), ShapedType::isDynamic))
return failure();		return failure();

tensor::PadOp hoistedOp;		tensor::PadOp hoistedOp;
SmallVector<GenericOp> transposeOps;		SmallVector<GenericOp> transposeOps;
SmallVector<int64_t> transposeVector =		SmallVector<int64_t> transposeVector =
en.index() < options.transposePaddings.size()		en.index() < options.transposePaddings.size()
? options.transposePaddings[en.index()]		? options.transposePaddings[en.index()]
: SmallVector<int64_t>{};		: SmallVector<int64_t>{};
▲ Show 20 Lines • Show All 372 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp

Show First 20 Lines • Show All 453 Lines • ▼ Show 20 Lines	vectorizeAsLinalgGeneric(OpBuilder &b, LinalgOp linalgOp,
// TODO: the common vector shape is equal to the static loop sizes only when		// TODO: the common vector shape is equal to the static loop sizes only when
// all indexing maps are projected permutations. For convs and stencils the		// all indexing maps are projected permutations. For convs and stencils the
// logic will need to evolve.		// logic will need to evolve.
SmallVector<int64_t> commonVectorShape = linalgOp.computeStaticLoopSizes();		SmallVector<int64_t> commonVectorShape = linalgOp.computeStaticLoopSizes();

// 3. Turn all BBArgs into vector.transfer_read / load.		// 3. Turn all BBArgs into vector.transfer_read / load.
Location loc = linalgOp.getLoc();		Location loc = linalgOp.getLoc();
Value zero = b.create<arith::ConstantIndexOp>(loc, 0);		Value zero = b.create<arith::ConstantIndexOp>(loc, 0);
for (OpOperand *opOperand : linalgOp.getInputAndOutputOperands()) {		for (OpOperand &opOperand : linalgOp->getOpOperands()) {
BlockArgument bbarg = block->getArgument(opOperand->getOperandNumber());		BlockArgument bbarg = block->getArgument(opOperand.getOperandNumber());
if (linalgOp.isScalar(opOperand)) {		if (linalgOp.isScalar(&opOperand)) {
bvm.map(bbarg, opOperand->get());		bvm.map(bbarg, opOperand.get());
continue;		continue;
}		}
VectorType readType;		VectorType readType;
AffineMap map;		AffineMap map;
// TODO: can we keep this simplification?		// TODO: can we keep this simplification?
// if (linalgOp.getShape(opOperand).empty()) {		// if (linalgOp.getShape(&opOperand).empty()) {
// readType = VectorType::get({}, bbarg.getType());		// readType = VectorType::get({}, bbarg.getType());
// } else {		// } else {
if (opOperand->getOperandNumber() < linalgOp.getNumInputs()) {		if (opOperand.getOperandNumber() < linalgOp.getNumInputs()) {
map = inverseAndBroadcastProjectedPermutation(		map = inverseAndBroadcastProjectedPermutation(
linalgOp.getMatchingIndexingMap(opOperand));		linalgOp.getMatchingIndexingMap(&opOperand));
readType = VectorType::get(commonVectorShape,		readType = VectorType::get(commonVectorShape,
getElementTypeOrSelf(opOperand->get()));		getElementTypeOrSelf(opOperand.get()));
} else {		} else {
map = inversePermutation(		map = inversePermutation(
reindexIndexingMap(linalgOp.getMatchingIndexingMap(opOperand)));		reindexIndexingMap(linalgOp.getMatchingIndexingMap(&opOperand)));
readType = VectorType::get(map.compose(linalgOp.getShape(opOperand)),		readType = VectorType::get(map.compose(linalgOp.getShape(&opOperand)),
getElementTypeOrSelf(opOperand->get()));		getElementTypeOrSelf(opOperand.get()));
}		}
// }		// }

auto shape = linalgOp.getShape(opOperand);		auto shape = linalgOp.getShape(&opOperand);
SmallVector<Value> indices(shape.size(), zero);		SmallVector<Value> indices(shape.size(), zero);
Value readValue = b.create<vector::TransferReadOp>(		Value readValue = b.create<vector::TransferReadOp>(
loc, readType, opOperand->get(), indices, map);		loc, readType, opOperand.get(), indices, map);
// Not all ops support 0-d vectors, extract the scalar for now.		// Not all ops support 0-d vectors, extract the scalar for now.
// TODO: remove this.		// TODO: remove this.
if (readValue.getType().cast<VectorType>().getRank() == 0)		if (readValue.getType().cast<VectorType>().getRank() == 0)
readValue = b.create<vector::ExtractElementOp>(loc, readValue);		readValue = b.create<vector::ExtractElementOp>(loc, readValue);

LDBG("new vectorized bbarg(" << bbarg.getArgNumber() << "): " << readValue);		LDBG("new vectorized bbarg(" << bbarg.getArgNumber() << "): " << readValue);
bvm.map(bbarg, readValue);		bvm.map(bbarg, readValue);
bvm.map(opOperand->get(), readValue);		bvm.map(opOperand.get(), readValue);
}		}

SmallVector<CustomVectorizationHook> hooks;		SmallVector<CustomVectorizationHook> hooks;
// 4a. Register CustomVectorizationHook for yieldOp.		// 4a. Register CustomVectorizationHook for yieldOp.
CustomVectorizationHook vectorizeYield =		CustomVectorizationHook vectorizeYield =
[&](Operation *op,		[&](Operation *op,
const BlockAndValueMapping &bvm) -> VectorizationResult {		const BlockAndValueMapping &bvm) -> VectorizationResult {
return vectorizeLinalgYield(b, op, bvm, linalgOp, newResults);		return vectorizeLinalgYield(b, op, bvm, linalgOp, newResults);
▲ Show 20 Lines • Show All 830 Lines • ▼ Show 20 Lines
struct Conv1DGenerator : public StructuredGenerator<LinalgOp> {		struct Conv1DGenerator : public StructuredGenerator<LinalgOp> {
Conv1DGenerator(OpBuilder &builder, LinalgOp linalgOp, int strideW,		Conv1DGenerator(OpBuilder &builder, LinalgOp linalgOp, int strideW,
int dilationW)		int dilationW)
: StructuredGenerator<LinalgOp>(builder, linalgOp), strideW(strideW),		: StructuredGenerator<LinalgOp>(builder, linalgOp), strideW(strideW),
dilationW(dilationW) {		dilationW(dilationW) {
// Determine whether `linalgOp` can be generated with this generator		// Determine whether `linalgOp` can be generated with this generator
if (linalgOp.getNumInputs() != 2 \|\| linalgOp.getNumOutputs() != 1)		if (linalgOp.getNumInputs() != 2 \|\| linalgOp.getNumOutputs() != 1)
return;		return;
lhsShaped = linalgOp.getInputs()[0];		lhsShaped = linalgOp.getInputOperand(0)->get();
rhsShaped = linalgOp.getInputs()[1];		rhsShaped = linalgOp.getInputOperand(1)->get();
resShaped = linalgOp.getOutputs()[0];		resShaped = linalgOp.getOutputOperand(0)->get();
lhsShapedType = lhsShaped.getType().dyn_cast<ShapedType>();		lhsShapedType = lhsShaped.getType().dyn_cast<ShapedType>();
rhsShapedType = rhsShaped.getType().dyn_cast<ShapedType>();		rhsShapedType = rhsShaped.getType().dyn_cast<ShapedType>();
resShapedType = resShaped.getType().dyn_cast<ShapedType>();		resShapedType = resShaped.getType().dyn_cast<ShapedType>();
if (!lhsShapedType \|\| !rhsShapedType \|\| !resShapedType)		if (!lhsShapedType \|\| !rhsShapedType \|\| !resShapedType)
return;		return;
if (lhsShapedType.getRank() != 3 \|\|		if (lhsShapedType.getRank() != 3 \|\|
(rhsShapedType.getRank() != 2 && rhsShapedType.getRank() != 3) \|\|		(rhsShapedType.getRank() != 2 && rhsShapedType.getRank() != 3) \|\|
resShapedType.getRank() != 3)		resShapedType.getRank() != 3)
▲ Show 20 Lines • Show All 438 Lines • Show Last 20 Lines

mlir/lib/Dialect/Linalg/Utils/Utils.cpp

Show First 20 Lines • Show All 484 Lines • ▼ Show 20 Lines	void GenerateLoopNest<scf::ForOp>::doit(
ArrayRef<StringRef> iteratorTypes,		ArrayRef<StringRef> iteratorTypes,
function_ref<scf::ValueVector(OpBuilder &, Location, ValueRange,		function_ref<scf::ValueVector(OpBuilder &, Location, ValueRange,
ValueRange)>		ValueRange)>
bodyBuilderFn,		bodyBuilderFn,
ArrayRef<linalg::ProcInfo> procInfo) {		ArrayRef<linalg::ProcInfo> procInfo) {
assert((procInfo.empty() \|\| (procInfo.size() == loopRanges.size())) &&		assert((procInfo.empty() \|\| (procInfo.size() == loopRanges.size())) &&
"expected as many entries for proc info as number of loops, even if "		"expected as many entries for proc info as number of loops, even if "
"they are null entries");		"they are null entries");
SmallVector<Value> iterArgInitValues = linalgOp.getOutputTensorOperands();		SmallVector<Value> iterArgInitValues = linalgOp.hasBufferSemantics()
		? SmallVector<Value>{}
		: linalgOp.getOutputOperands();

SmallVector<Value, 4> lbs, ubs, steps;		SmallVector<Value, 4> lbs, ubs, steps;
unpackRanges(b, loc, loopRanges, lbs, ubs, steps);		unpackRanges(b, loc, loopRanges, lbs, ubs, steps);
LoopNest loopNest = mlir::scf::buildLoopNest(		LoopNest loopNest = mlir::scf::buildLoopNest(
b, loc, lbs, ubs, steps, iterArgInitValues,		b, loc, lbs, ubs, steps, iterArgInitValues,
[&](OpBuilder &b, Location loc, ValueRange ivs, ValueRange iterArgs) {		[&](OpBuilder &b, Location loc, ValueRange ivs, ValueRange iterArgs) {
assert(iterArgs.size() == linalgOp.getOutputTensorOperands().size() &&		assert(iterArgs.size() == iterArgInitValues.size() &&
"expect the number of output tensors and iter args to match");		"expect the number of output tensors and iter args to match");
SmallVector<Value> operandValuesToUse =		SmallVector<Value> operandValuesToUse = linalgOp->getOperands();
linalgOp.getInputAndOutputOperands();
if (!iterArgs.empty()) {		if (!iterArgs.empty()) {
operandValuesToUse = linalgOp.getInputOperands();		operandValuesToUse = linalgOp.getInputOperands();
operandValuesToUse.append(iterArgs.begin(), iterArgs.end());		operandValuesToUse.append(iterArgs.begin(), iterArgs.end());
}		}
return bodyBuilderFn(b, loc, ivs, operandValuesToUse);		return bodyBuilderFn(b, loc, ivs, operandValuesToUse);
});		});

if (loopNest.loops.empty() \|\| procInfo.empty())		if (loopNest.loops.empty() \|\| procInfo.empty())
Show All 13 Lines
template <>		template <>
void GenerateLoopNest<AffineForOp>::doit(		void GenerateLoopNest<AffineForOp>::doit(
OpBuilder &b, Location loc, ArrayRef<Range> loopRanges, LinalgOp linalgOp,		OpBuilder &b, Location loc, ArrayRef<Range> loopRanges, LinalgOp linalgOp,
ArrayRef<StringRef> iteratorTypes,		ArrayRef<StringRef> iteratorTypes,
function_ref<scf::ValueVector(OpBuilder &, Location, ValueRange,		function_ref<scf::ValueVector(OpBuilder &, Location, ValueRange,
ValueRange)>		ValueRange)>
bodyBuilderFn,		bodyBuilderFn,
ArrayRef<linalg::ProcInfo> /procInfo/) {		ArrayRef<linalg::ProcInfo> /procInfo/) {
SmallVector<Value> iterArgInitValues = linalgOp.getOutputTensorOperands();		SmallVector<Value> iterArgInitValues = linalgOp.hasBufferSemantics()
		? SmallVector<Value>{}
		: linalgOp.getOutputOperands();
assert(iterArgInitValues.empty() && "unexpected AffineForOp init values");		assert(iterArgInitValues.empty() && "unexpected AffineForOp init values");
SmallVector<Value, 4> lbs, ubs, steps;		SmallVector<Value, 4> lbs, ubs, steps;
unpackRanges(b, loc, loopRanges, lbs, ubs, steps);		unpackRanges(b, loc, loopRanges, lbs, ubs, steps);

// Affine loops require constant steps.		// Affine loops require constant steps.
SmallVector<int64_t, 4> constantSteps;		SmallVector<int64_t, 4> constantSteps;
constantSteps.reserve(steps.size());		constantSteps.reserve(steps.size());
for (Value v : steps) {		for (Value v : steps) {
auto op = v.getDefiningOp<arith::ConstantIndexOp>();		auto op = v.getDefiningOp<arith::ConstantIndexOp>();
assert(op && "Affine loops require constant steps");		assert(op && "Affine loops require constant steps");
constantSteps.push_back(op.value());		constantSteps.push_back(op.value());
}		}

mlir::buildAffineLoopNest(b, loc, lbs, ubs, constantSteps,		mlir::buildAffineLoopNest(b, loc, lbs, ubs, constantSteps,
[&](OpBuilder &b, Location loc, ValueRange ivs) {		[&](OpBuilder &b, Location loc, ValueRange ivs) {
SmallVector<Value> operandValuesToUse =		bodyBuilderFn(b, loc, ivs,
linalgOp.getInputAndOutputOperands();		linalgOp->getOperands());
bodyBuilderFn(b, loc, ivs, operandValuesToUse);
});		});
}		}

/// Update the `lb`, `ub` and `step` to get per processor `lb`, `ub` and `step`.		/// Update the `lb`, `ub` and `step` to get per processor `lb`, `ub` and `step`.
void updateBoundsForCyclicDistribution(OpBuilder &b, Location loc, Value procId,		void updateBoundsForCyclicDistribution(OpBuilder &b, Location loc, Value procId,
Value nprocs, Value &lb, Value &ub,		Value nprocs, Value &lb, Value &ub,
Value &step) {		Value &step) {
AffineExpr d0, d1;		AffineExpr d0, d1;
▲ Show 20 Lines • Show All 130 Lines • ▼ Show 20 Lines
template <>		template <>
void GenerateLoopNest<scf::ParallelOp>::doit(		void GenerateLoopNest<scf::ParallelOp>::doit(
OpBuilder &b, Location loc, ArrayRef<Range> loopRanges, LinalgOp linalgOp,		OpBuilder &b, Location loc, ArrayRef<Range> loopRanges, LinalgOp linalgOp,
ArrayRef<StringRef> iteratorTypes,		ArrayRef<StringRef> iteratorTypes,
function_ref<scf::ValueVector(OpBuilder &, Location, ValueRange,		function_ref<scf::ValueVector(OpBuilder &, Location, ValueRange,
ValueRange)>		ValueRange)>
bodyBuilderFn,		bodyBuilderFn,
ArrayRef<linalg::ProcInfo> procInfo) {		ArrayRef<linalg::ProcInfo> procInfo) {
SmallVector<Value> iterArgInitValues = linalgOp.getOutputTensorOperands();		SmallVector<Value> iterArgInitValues = linalgOp.hasBufferSemantics()
		? SmallVector<Value>{}
		: linalgOp.getOutputOperands();
assert(iterArgInitValues.empty() && "unexpected ParallelOp init values");		assert(iterArgInitValues.empty() && "unexpected ParallelOp init values");
// This function may be passed more iterator types than ranges.		// This function may be passed more iterator types than ranges.
assert(iteratorTypes.size() >= loopRanges.size() &&		assert(iteratorTypes.size() >= loopRanges.size() &&
"expected iterator type for all ranges");		"expected iterator type for all ranges");
assert((procInfo.empty() \|\| (procInfo.size() == loopRanges.size())) &&		assert((procInfo.empty() \|\| (procInfo.size() == loopRanges.size())) &&
"expected proc information for all loops when present");		"expected proc information for all loops when present");
iteratorTypes = iteratorTypes.take_front(loopRanges.size());		iteratorTypes = iteratorTypes.take_front(loopRanges.size());
SmallVector<Value, 8> lbsStorage, ubsStorage, stepsStorage, ivs;		SmallVector<Value, 8> lbsStorage, ubsStorage, stepsStorage, ivs;
Show All 13 Lines	if (it.value().distributionMethod != linalg::DistributionMethod::None) {
b, loc, it.value().procId, it.value().nprocs, lbsStorage[it.index()],		b, loc, it.value().procId, it.value().nprocs, lbsStorage[it.index()],
ubsStorage[it.index()], stepsStorage[it.index()]);		ubsStorage[it.index()], stepsStorage[it.index()]);
}		}
}		}
ValueRange lbs(lbsStorage), ubs(ubsStorage), steps(stepsStorage);		ValueRange lbs(lbsStorage), ubs(ubsStorage), steps(stepsStorage);
generateParallelLoopNest(		generateParallelLoopNest(
b, loc, lbs, ubs, steps, iteratorTypes, procInfo,		b, loc, lbs, ubs, steps, iteratorTypes, procInfo,
[&](OpBuilder &b, Location loc, ValueRange ivs) {		[&](OpBuilder &b, Location loc, ValueRange ivs) {
SmallVector<Value> operandValuesToUse =		bodyBuilderFn(b, loc, ivs, linalgOp->getOperands());
linalgOp.getInputAndOutputOperands();
bodyBuilderFn(b, loc, ivs, operandValuesToUse);
},		},
ivs);		ivs);

assert(ivs.size() == iteratorTypes.size() && "did not generate enough loops");		assert(ivs.size() == iteratorTypes.size() && "did not generate enough loops");
}		}

static Value materializeTiledShape(OpBuilder &builder, Location loc,		static Value materializeTiledShape(OpBuilder &builder, Location loc,
Value valueToTile,		Value valueToTile,
▲ Show 20 Lines • Show All 161 Lines • ▼ Show 20 Lines	for (unsigned idx = 0, e = tileSizes.size(); idx < e; ++idx) {
IRRewriter rewriter(b);		IRRewriter rewriter(b);
sizes.push_back(makeComposedFoldedAffineApply(rewriter, loc, d0 - 1, size));		sizes.push_back(makeComposedFoldedAffineApply(rewriter, loc, d0 - 1, size));
LLVM_DEBUG(llvm::dbgs() << "computeTileSizes: " << sizes.back() << "\n");		LLVM_DEBUG(llvm::dbgs() << "computeTileSizes: " << sizes.back() << "\n");
}		}
return sizes;		return sizes;
}		}

SmallVector<Type> getTensorOutputTypes(LinalgOp op, ValueRange operands) {		SmallVector<Type> getTensorOutputTypes(LinalgOp op, ValueRange operands) {
// TODO: use an interface/adaptor to avoid leaking position in		if (op.hasBufferSemantics())
// `tiledOperands`.		return {};
return llvm::to_vector(		return llvm::to_vector(
llvm::map_range(op.getOutputTensorOperands(), [&](OpOperand *opOperand) {		llvm::map_range(op.getOutputOperands(), [&](OpOperand *opOperand) {
return operands[opOperand->getOperandNumber()].getType();		return operands[opOperand->getOperandNumber()].getType();
}));		}));
}		}

SmallVector<Value> insertSlicesBack(OpBuilder &builder, Location loc,		SmallVector<Value> insertSlicesBack(OpBuilder &builder, Location loc,
LinalgOp op, ValueRange operands,		LinalgOp op, ValueRange operands,
ValueRange results) {		ValueRange results) {
		if (op.hasBufferSemantics())
		return {};
SmallVector<Value> tensorResults;		SmallVector<Value> tensorResults;
tensorResults.reserve(results.size());		tensorResults.reserve(results.size());
// Insert a insert_slice for each output tensor.		// Insert a insert_slice for each output tensor.
unsigned resultIdx = 0;		unsigned resultIdx = 0;
for (OpOperand *opOperand : op.getOutputTensorOperands()) {		for (OpOperand *opOperand : op.getOutputOperands()) {
// TODO: use an interface/adaptor to avoid leaking position in		// TODO: use an interface/adaptor to avoid leaking position in
// `tiledOperands`.		// `tiledOperands`.
Value outputTensor = operands[opOperand->getOperandNumber()];		Value outputTensor = operands[opOperand->getOperandNumber()];
if (auto sliceOp = outputTensor.getDefiningOp<tensor::ExtractSliceOp>()) {		if (auto sliceOp = outputTensor.getDefiningOp<tensor::ExtractSliceOp>()) {
Value inserted = builder.create<tensor::InsertSliceOp>(		Value inserted = builder.create<tensor::InsertSliceOp>(
loc, sliceOp.getSource().getType(), results[resultIdx],		loc, sliceOp.getSource().getType(), results[resultIdx],
sliceOp.getSource(), sliceOp.getOffsets(), sliceOp.getSizes(),		sliceOp.getSource(), sliceOp.getOffsets(), sliceOp.getSizes(),
sliceOp.getStrides(), sliceOp.getStaticOffsets(),		sliceOp.getStrides(), sliceOp.getStaticOffsets(),
Show All 21 Lines	computeAllSliceParameters(OpBuilder &builder, Location loc, LinalgOp linalgOp,
// Construct (potentially temporary) mins and maxes on which to apply maps		// Construct (potentially temporary) mins and maxes on which to apply maps
// that define tile subshapes.		// that define tile subshapes.
SmallVector<OpFoldResult> lbs =		SmallVector<OpFoldResult> lbs =
computeTileOffsets(builder, loc, ivs, tileSizes);		computeTileOffsets(builder, loc, ivs, tileSizes);
SmallVector<OpFoldResult> subShapeSizes =		SmallVector<OpFoldResult> subShapeSizes =
computeTileSizes(builder, loc, tileSizes, sizeBounds);		computeTileSizes(builder, loc, tileSizes, sizeBounds);

assert(static_cast<int64_t>(valuesToTile.size()) ==		assert(static_cast<int64_t>(valuesToTile.size()) ==
linalgOp.getNumInputsAndOutputs() &&		linalgOp->getNumOperands() &&
"expected one value to tile for every operand");		"expected one value to tile for every operand");
SmallVector<Optional<SliceParameters>> allSliceParams;		SmallVector<Optional<SliceParameters>> allSliceParams;
allSliceParams.reserve(valuesToTile.size());		allSliceParams.reserve(valuesToTile.size());
for (OpOperand *opOperand : linalgOp.getInputAndOutputOperands()) {		for (OpOperand &opOperand : linalgOp->getOpOperands()) {
Value shapedOp = valuesToTile[opOperand->getOperandNumber()];		Value shapedOp = valuesToTile[opOperand.getOperandNumber()];
LLVM_DEBUG(llvm::dbgs() << "makeTiledShapes: for operand " << shapedOp);		LLVM_DEBUG(llvm::dbgs() << "makeTiledShapes: for operand " << shapedOp);
AffineMap map = linalgOp.getMatchingIndexingMap(opOperand);		AffineMap map = linalgOp.getMatchingIndexingMap(&opOperand);
// Use `opOperand` as is if it is not tiled and not an output tensor. Having		// Use `opOperand` as is if it is not tiled and not an output tensor. Having
// an extract/insert slice pair for all output tensors simplifies follow up		// an extract/insert slice pair for all output tensors simplifies follow up
// transformations such as padding and bufferization since the		// transformations such as padding and bufferization since the
// extract/insert slice pairs make the accessed iteration argument		// extract/insert slice pairs make the accessed iteration argument
// subdomains explicit.		// subdomains explicit.
if (!isTiled(map, tileSizes) && !linalgOp.isOutputTensor(opOperand)) {
		Type operandType = opOperand.get().getType();
		if (!isTiled(map, tileSizes) && !(operandType.isa<RankedTensorType>() &&
		linalgOp.isOutput(&opOperand))) {
allSliceParams.push_back(llvm::None);		allSliceParams.push_back(llvm::None);
LLVM_DEBUG(llvm::dbgs() << ": not tiled: use shape: "		LLVM_DEBUG(llvm::dbgs()
<< opOperand->get().getType() << "\n");		<< ": not tiled: use shape: " << operandType << "\n");
continue;		continue;
}		}
LLVM_DEBUG(llvm::dbgs() << ": tiled: figure out subshape...\n");		LLVM_DEBUG(llvm::dbgs() << ": tiled: figure out subshape...\n");

allSliceParams.push_back(computeSliceParameters(		allSliceParams.push_back(computeSliceParameters(
builder, loc, shapedOp, tileSizes, map, lbs, sizeBounds, subShapeSizes,		builder, loc, shapedOp, tileSizes, map, lbs, sizeBounds, subShapeSizes,
omitPartialTileCheck));		omitPartialTileCheck));
}		}
▲ Show 20 Lines • Show All 84 Lines • Show Last 20 Lines

mlir/lib/Dialect/SparseTensor/Transforms/SparseTensorRewriting.cpp

Show First 20 Lines • Show All 99 Lines • ▼ Show 20 Lines	static bool isSumOfMul(GenericOp op) {
return false;		return false;
}		}

// Helper to detect direct yield of a zero value.		// Helper to detect direct yield of a zero value.
static bool isZeroYield(GenericOp op) {		static bool isZeroYield(GenericOp op) {
auto yieldOp = cast<linalg::YieldOp>(op.getRegion().front().getTerminator());		auto yieldOp = cast<linalg::YieldOp>(op.getRegion().front().getTerminator());
if (auto arg = yieldOp.getOperand(0).dyn_cast<BlockArgument>()) {		if (auto arg = yieldOp.getOperand(0).dyn_cast<BlockArgument>()) {
if (arg.getOwner()->getParentOp() == op) {		if (arg.getOwner()->getParentOp() == op) {
OpOperand *t = op.getInputAndOutputOperands()[arg.getArgNumber()];		return isZeroValue(op->getOperand(arg.getArgNumber()));
return isZeroValue(t->get());
}		}
}		}
return isZeroValue(yieldOp.getOperand(0));		return isZeroValue(yieldOp.getOperand(0));
}		}

/// Populates given sizes array from type (for static sizes) and from		/// Populates given sizes array from type (for static sizes) and from
/// the tensor (for dynamic sizes).		/// the tensor (for dynamic sizes).
static void sizesForTensor(OpBuilder &builder, SmallVector<Value, 4> &sizes,		static void sizesForTensor(OpBuilder &builder, SmallVector<Value, 4> &sizes,
▲ Show 20 Lines • Show All 119 Lines • ▼ Show 20 Lines	if (!prod \|\| !prod.hasTensorSemantics() \|\| prod.getNumResults() != 1 \|\|
return failure();		return failure();
// Sampling consumer and sum of multiplication chain producer.		// Sampling consumer and sum of multiplication chain producer.
if (!isAlloc(op.getOutputOperand(0), /isZero=/false) \|\|		if (!isAlloc(op.getOutputOperand(0), /isZero=/false) \|\|
!isAlloc(prod.getOutputOperand(0), /isZero=/true) \|\|		!isAlloc(prod.getOutputOperand(0), /isZero=/true) \|\|
!isSampling(op) \|\| !isSumOfMul(prod))		!isSampling(op) \|\| !isSumOfMul(prod))
return failure();		return failure();
// Modify operand structure of producer and consumer.		// Modify operand structure of producer and consumer.
Location loc = prod.getLoc();		Location loc = prod.getLoc();
SmallVector<Value> inputOps = prod.getInputOperands();		SmallVector<Value> inputOps = prod.getInputs();
SmallVector<Value> outputOps = op.getOutputOperands();		SmallVector<Value> outputOps = op.getOutputs();
SmallVector<AffineMap> fusedIndexMaps = prod.getIndexingMapsArray();		SmallVector<AffineMap> fusedIndexMaps = prod.getIndexingMapsArray();
inputOps.push_back(op.getInputOperand(1 - other)->get());		inputOps.push_back(op.getInputOperand(1 - other)->get());
fusedIndexMaps.push_back(fusedIndexMaps.back()); // mimic other		fusedIndexMaps.push_back(fusedIndexMaps.back()); // mimic other
// Fuse producer and consumer into a new generic op.		// Fuse producer and consumer into a new generic op.
auto fusedOp = rewriter.create<GenericOp>(		auto fusedOp = rewriter.create<GenericOp>(
loc, op.getResult(0).getType(), inputOps, outputOps,		loc, op.getResult(0).getType(), inputOps, outputOps,
rewriter.getAffineMapArrayAttr(fusedIndexMaps), prod.getIteratorTypes(),		rewriter.getAffineMapArrayAttr(fusedIndexMaps), prod.getIteratorTypes(),
/doc=/nullptr, /library_call=/nullptr);		/doc=/nullptr, /library_call=/nullptr);
▲ Show 20 Lines • Show All 372 Lines • Show Last 20 Lines

mlir/lib/Dialect/SparseTensor/Transforms/Sparsification.cpp

Show First 20 Lines • Show All 188 Lines • ▼ Show 20 Lines

/// Helper method to inspect sparse encodings in the tensor types.		/// Helper method to inspect sparse encodings in the tensor types.
/// Fills the per-dimension sparsity information for all tensors.		/// Fills the per-dimension sparsity information for all tensors.
/// Returns true if the sparse annotations and affine subscript		/// Returns true if the sparse annotations and affine subscript
/// expressions of all tensors are admissible. Returns false if		/// expressions of all tensors are admissible. Returns false if
/// no annotations are found or inadmissible constructs occur.		/// no annotations are found or inadmissible constructs occur.
static bool findSparseAnnotations(Merger &merger, linalg::GenericOp op) {		static bool findSparseAnnotations(Merger &merger, linalg::GenericOp op) {
bool annotated = false;		bool annotated = false;
for (OpOperand *t : op.getInputAndOutputOperands()) {		for (OpOperand &t : op->getOpOperands()) {
auto map = op.getMatchingIndexingMap(t);		auto map = op.getMatchingIndexingMap(&t);
auto enc = getSparseTensorEncoding(t->get().getType());		auto enc = getSparseTensorEncoding(t.get().getType());
if (enc)		if (enc)
annotated = true;		annotated = true;
assert(map.getNumResults() == op.getRank(t));		assert(map.getNumResults() == op.getRank(&t));
for (unsigned d = 0, rank = map.getNumResults(); d < rank; d++) {		for (unsigned d = 0, rank = map.getNumResults(); d < rank; d++) {
unsigned tensor = t->getOperandNumber();		unsigned tensor = t.getOperandNumber();
AffineExpr a = map.getResult(toOrigDim(enc, d));		AffineExpr a = map.getResult(toOrigDim(enc, d));
if (!findAffine(merger, tensor, a, toDimLevelFormat(enc, d)))		if (!findAffine(merger, tensor, a, toDimLevelFormat(enc, d)))
return false; // inadmissible affine expression		return false; // inadmissible affine expression
}		}
}		}
return annotated;		return annotated;
}		}

▲ Show 20 Lines • Show All 73 Lines • ▼ Show 20 Lines	static bool computeIterationGraph(Merger &merger, linalg::GenericOp op,
OpOperand *skip = nullptr) {		OpOperand *skip = nullptr) {
// Set up an n x n from/to adjacency matrix of the iteration graph		// Set up an n x n from/to adjacency matrix of the iteration graph
// for the implicit loop indices i_0 .. i_n-1.		// for the implicit loop indices i_0 .. i_n-1.
unsigned n = op.getNumLoops();		unsigned n = op.getNumLoops();
std::vector<std::vector<bool>> adjM(n, std::vector<bool>(n, false));		std::vector<std::vector<bool>> adjM(n, std::vector<bool>(n, false));
std::vector<unsigned> inDegree(n, 0); // in-degree of each node.		std::vector<unsigned> inDegree(n, 0); // in-degree of each node.
auto iteratorTypes = op.getIteratorTypesArray();		auto iteratorTypes = op.getIteratorTypesArray();
// Iterate over the indexing maps of every tensor in the tensor expression.		// Iterate over the indexing maps of every tensor in the tensor expression.
for (OpOperand *t : op.getInputAndOutputOperands()) {		for (OpOperand &t : op->getOpOperands()) {
// Skip tensor during cycle resolution.		// Skip tensor during cycle resolution.
if (t == skip)		if (&t == skip)
continue;		continue;
// Get map and encoding.		// Get map and encoding.
auto map = op.getMatchingIndexingMap(t);		auto map = op.getMatchingIndexingMap(&t);
auto enc = getSparseTensorEncoding(t->get().getType());		auto enc = getSparseTensorEncoding(t.get().getType());
assert(map.getNumDims() == n);		assert(map.getNumDims() == n);
// Skip dense tensor constraints when not requested.		// Skip dense tensor constraints when not requested.
if (!(mask & SortMask::kIncludeDense) && !enc)		if (!(mask & SortMask::kIncludeDense) && !enc)
continue;		continue;
// Each tensor expression and optional dimension ordering (row-major		// Each tensor expression and optional dimension ordering (row-major
// by default) puts an ordering constraint on the loop indices. For		// by default) puts an ordering constraint on the loop indices. For
// example, the tensor expresion A_ijk forces the ordering i < j < k		// example, the tensor expresion A_ijk forces the ordering i < j < k
// on the loop indices if no explicit dimension ordering is given.		// on the loop indices if no explicit dimension ordering is given.
for (unsigned d = 1, rank = map.getNumResults(); d < rank; d++) {		for (unsigned d = 1, rank = map.getNumResults(); d < rank; d++) {
AffineExpr f = map.getResult(toOrigDim(enc, d - 1));		AffineExpr f = map.getResult(toOrigDim(enc, d - 1));
AffineExpr t = map.getResult(toOrigDim(enc, d));		AffineExpr t = map.getResult(toOrigDim(enc, d));
addAffineOrderings(adjM, inDegree, f, t, 0);		addAffineOrderings(adjM, inDegree, f, t, 0);
}		}
// Push unrelated loops into sparse iteration space, so these		// Push unrelated loops into sparse iteration space, so these
// will be skipped more often.		// will be skipped more often.
if (mask & SortMask::kIncludeUndef) {		if (mask & SortMask::kIncludeUndef) {
unsigned tensor = t->getOperandNumber();		unsigned tensor = t.getOperandNumber();
for (unsigned i = 0; i < n; i++)		for (unsigned i = 0; i < n; i++)
if (merger.isDimLevelType(tensor, i, DimLvlType::kCompressed) \|\|		if (merger.isDimLevelType(tensor, i, DimLvlType::kCompressed) \|\|
merger.isDimLevelType(tensor, i, DimLvlType::kSingleton)) {		merger.isDimLevelType(tensor, i, DimLvlType::kSingleton)) {
for (unsigned j = 0; j < n; j++)		for (unsigned j = 0; j < n; j++)
if (merger.isDimLevelType(tensor, j, DimLvlType::kUndef)) {		if (merger.isDimLevelType(tensor, j, DimLvlType::kUndef)) {
adjM[i][j] = true;		adjM[i][j] = true;
inDegree[j]++;		inDegree[j]++;
}		}
▲ Show 20 Lines • Show All 203 Lines • ▼ Show 20 Lines	static Value genOutputBuffer(CodeGen &codegen, OpBuilder &builder,
}		}
return init;		return init;
}		}

/// Local bufferization of all dense and sparse data structures.		/// Local bufferization of all dense and sparse data structures.
static void genBuffers(Merger &merger, CodeGen &codegen, OpBuilder &builder,		static void genBuffers(Merger &merger, CodeGen &codegen, OpBuilder &builder,
linalg::GenericOp op) {		linalg::GenericOp op) {
Location loc = op.getLoc();		Location loc = op.getLoc();
assert(op.getNumInputsAndOutputs() == op.getNumInputs() + 1);		assert(op->getNumOperands() == op.getNumInputs() + 1);
// For every tensor, find lower and upper bound on dimensions, set the		// For every tensor, find lower and upper bound on dimensions, set the
// same bounds on loop indices, and obtain dense or sparse buffer(s).		// same bounds on loop indices, and obtain dense or sparse buffer(s).
auto dynShape = {ShapedType::kDynamicSize};		auto dynShape = {ShapedType::kDynamicSize};
SmallVector<Value, 4> args;		SmallVector<Value, 4> args;
for (OpOperand *t : op.getInputAndOutputOperands()) {		for (OpOperand &t : op->getOpOperands()) {
unsigned tensor = t->getOperandNumber();		unsigned tensor = t.getOperandNumber();
auto shape = op.getShape(t);		auto shape = op.getShape(&t);
auto map = op.getMatchingIndexingMap(t);		auto map = op.getMatchingIndexingMap(&t);
auto enc = getSparseTensorEncoding(t->get().getType());		auto enc = getSparseTensorEncoding(t.get().getType());
// Scan all dimensions of current tensor.		// Scan all dimensions of current tensor.
args.clear();		args.clear();
for (unsigned d = 0, rank = map.getNumResults(); d < rank; d++) {		for (unsigned d = 0, rank = map.getNumResults(); d < rank; d++) {
AffineExpr a = map.getResult(toOrigDim(enc, d));		AffineExpr a = map.getResult(toOrigDim(enc, d));
if (a.getKind() != AffineExprKind::DimId)		if (a.getKind() != AffineExprKind::DimId)
continue; // compound		continue; // compound
unsigned idx = a.cast<AffineDimExpr>().getPosition();		unsigned idx = a.cast<AffineDimExpr>().getPosition();
// Handle the different storage schemes.		// Handle the different storage schemes.
if (merger.isDimLevelType(tensor, idx, DimLvlType::kCompressed)) {		if (merger.isDimLevelType(tensor, idx, DimLvlType::kCompressed)) {
// Compressed dimension, fetch pointer and indices.		// Compressed dimension, fetch pointer and indices.
auto ptrTp =		auto ptrTp =
MemRefType::get(dynShape, getPointerOverheadType(builder, enc));		MemRefType::get(dynShape, getPointerOverheadType(builder, enc));
auto indTp =		auto indTp =
MemRefType::get(dynShape, getIndexOverheadType(builder, enc));		MemRefType::get(dynShape, getIndexOverheadType(builder, enc));
auto dim = builder.getIndexAttr(d);		auto dim = builder.getIndexAttr(d);
codegen.pointers[tensor][idx] =		codegen.pointers[tensor][idx] =
builder.create<ToPointersOp>(loc, ptrTp, t->get(), dim);		builder.create<ToPointersOp>(loc, ptrTp, t.get(), dim);
codegen.indices[tensor][idx] =		codegen.indices[tensor][idx] =
builder.create<ToIndicesOp>(loc, indTp, t->get(), dim);		builder.create<ToIndicesOp>(loc, indTp, t.get(), dim);
} else if (merger.isDimLevelType(tensor, idx, DimLvlType::kSingleton)) {		} else if (merger.isDimLevelType(tensor, idx, DimLvlType::kSingleton)) {
// Singleton dimension, fetch indices.		// Singleton dimension, fetch indices.
auto indTp =		auto indTp =
MemRefType::get(dynShape, getIndexOverheadType(builder, enc));		MemRefType::get(dynShape, getIndexOverheadType(builder, enc));
auto dim = builder.getIndexAttr(d);		auto dim = builder.getIndexAttr(d);
codegen.indices[tensor][idx] =		codegen.indices[tensor][idx] =
builder.create<ToIndicesOp>(loc, indTp, t->get(), dim);		builder.create<ToIndicesOp>(loc, indTp, t.get(), dim);
} else {		} else {
// Dense dimension, nothing to fetch.		// Dense dimension, nothing to fetch.
assert(merger.isDimLevelType(tensor, idx, DimLvlType::kDense));		assert(merger.isDimLevelType(tensor, idx, DimLvlType::kDense));
}		}
// Find upper bound in current dimension.		// Find upper bound in current dimension.
unsigned p = toOrigDim(enc, d);		unsigned p = toOrigDim(enc, d);
Value up = linalg::createOrFoldDimOp(builder, loc, t->get(), p);		Value up = linalg::createOrFoldDimOp(builder, loc, t.get(), p);
if (ShapedType::isDynamic(shape[p]))		if (ShapedType::isDynamic(shape[p]))
args.push_back(up);		args.push_back(up);
assert(codegen.highs[tensor][idx] == nullptr);		assert(codegen.highs[tensor][idx] == nullptr);
codegen.sizes[idx] = codegen.highs[tensor][idx] = up;		codegen.sizes[idx] = codegen.highs[tensor][idx] = up;
}		}
// Perform the required bufferization. Dense inputs materialize		// Perform the required bufferization. Dense inputs materialize
// from the input tensors. Dense outputs need special handling.		// from the input tensors. Dense outputs need special handling.
// Sparse inputs use sparse primitives to obtain the values.		// Sparse inputs use sparse primitives to obtain the values.
Type elementType = getElementTypeOrSelf(t->get().getType());		Type elementType = getElementTypeOrSelf(t.get().getType());
if (!enc) {		if (!enc) {
// Non-annotated dense tensors.		// Non-annotated dense tensors.
auto denseTp = MemRefType::get(shape, elementType);		auto denseTp = MemRefType::get(shape, elementType);
if (tensor < op.getNumInputs())		if (tensor < op.getNumInputs())
codegen.buffers[tensor] =		codegen.buffers[tensor] =
builder.create<bufferization::ToMemrefOp>(loc, denseTp, t->get());		builder.create<bufferization::ToMemrefOp>(loc, denseTp, t.get());
else		else
codegen.buffers[tensor] =		codegen.buffers[tensor] =
genOutputBuffer(codegen, builder, op, denseTp, args);		genOutputBuffer(codegen, builder, op, denseTp, args);
} else if (t != codegen.sparseOut) {		} else if (&t != codegen.sparseOut) {
// Annotated sparse tensors (not involved in output).		// Annotated sparse tensors (not involved in output).
auto sparseTp = MemRefType::get(dynShape, elementType);		auto sparseTp = MemRefType::get(dynShape, elementType);
codegen.buffers[tensor] =		codegen.buffers[tensor] =
builder.create<ToValuesOp>(loc, sparseTp, t->get());		builder.create<ToValuesOp>(loc, sparseTp, t.get());
}		}
}		}
}		}

/// Constructs vector type.		/// Constructs vector type.
static VectorType vectorType(CodeGen &codegen, Type etp) {		static VectorType vectorType(CodeGen &codegen, Type etp) {
unsigned numScalableDims = codegen.options.enableVLAVectorization;		unsigned numScalableDims = codegen.options.enableVLAVectorization;
return VectorType::get(codegen.curVecLength, etp, numScalableDims);		return VectorType::get(codegen.curVecLength, etp, numScalableDims);
▲ Show 20 Lines • Show All 229 Lines • ▼ Show 20 Lines	static Value genTensorLoad(Merger &merger, CodeGen &codegen, OpBuilder &builder,
// Test if the load was hoisted to a higher loop nest.		// Test if the load was hoisted to a higher loop nest.
Value val = merger.exp(exp).val;		Value val = merger.exp(exp).val;
if (val) {		if (val) {
if (codegen.curVecLength > 1 && !val.getType().isa<VectorType>())		if (codegen.curVecLength > 1 && !val.getType().isa<VectorType>())
return genVectorInvariantValue(codegen, builder, val);		return genVectorInvariantValue(codegen, builder, val);
return val;		return val;
}		}
// Load during insertion.		// Load during insertion.
OpOperand *t = op.getInputAndOutputOperands()[merger.exp(exp).tensor];		OpOperand &t = op->getOpOperand(merger.exp(exp).tensor);
if (t == codegen.sparseOut) {		if (&t == codegen.sparseOut) {
if (codegen.redCustom != -1u)		if (codegen.redCustom != -1u)
return genInsertionLoadReduce(merger, codegen, builder, op, t);		return genInsertionLoadReduce(merger, codegen, builder, op, &t);
return genInsertionLoad(codegen, builder, op, t);		return genInsertionLoad(codegen, builder, op, &t);
}		}
// Actual load.		// Actual load.
SmallVector<Value, 4> args;		SmallVector<Value, 4> args;
Value ptr = genSubscript(codegen, builder, op, t, args);		Value ptr = genSubscript(codegen, builder, op, &t, args);
if (codegen.curVecLength > 1)		if (codegen.curVecLength > 1)
return genVectorLoad(codegen, builder, ptr, args);		return genVectorLoad(codegen, builder, ptr, args);
return builder.create<memref::LoadOp>(op.getLoc(), ptr, args);		return builder.create<memref::LoadOp>(op.getLoc(), ptr, args);
}		}

/// Generates a store on a dense or sparse tensor.		/// Generates a store on a dense or sparse tensor.
static void genTensorStore(Merger &merger, CodeGen &codegen, OpBuilder &builder,		static void genTensorStore(Merger &merger, CodeGen &codegen, OpBuilder &builder,
linalg::GenericOp op, unsigned exp, Value rhs) {		linalg::GenericOp op, unsigned exp, Value rhs) {
▲ Show 20 Lines • Show All 223 Lines • ▼ Show 20 Lines
static void genInvariants(Merger &merger, CodeGen &codegen, OpBuilder &builder,		static void genInvariants(Merger &merger, CodeGen &codegen, OpBuilder &builder,
linalg::GenericOp op, unsigned exp, unsigned ldx,		linalg::GenericOp op, unsigned exp, unsigned ldx,
bool atStart, unsigned last = -1u) {		bool atStart, unsigned last = -1u) {
if (exp == -1u)		if (exp == -1u)
return;		return;
if (merger.exp(exp).kind == Kind::kTensor) {		if (merger.exp(exp).kind == Kind::kTensor) {
// Inspect tensor indices.		// Inspect tensor indices.
bool atLevel = ldx == -1u;		bool atLevel = ldx == -1u;
OpOperand *t = op.getInputAndOutputOperands()[merger.exp(exp).tensor];		OpOperand &t = op->getOpOperand(merger.exp(exp).tensor);
auto map = op.getMatchingIndexingMap(t);		auto map = op.getMatchingIndexingMap(&t);
auto enc = getSparseTensorEncoding(t->get().getType());		auto enc = getSparseTensorEncoding(t.get().getType());
for (unsigned d = 0, rank = map.getNumResults(); d < rank; d++) {		for (unsigned d = 0, rank = map.getNumResults(); d < rank; d++) {
AffineExpr a = map.getResult(toOrigDim(enc, d));		AffineExpr a = map.getResult(toOrigDim(enc, d));
if (!isInvariantAffine(codegen, a, ldx, atLevel))		if (!isInvariantAffine(codegen, a, ldx, atLevel))
return; // still in play		return; // still in play
}		}
// All exhausted at this level (atLevel denotes exactly at this level).		// All exhausted at this level (atLevel denotes exactly at this level).
if (!atLevel)		if (!atLevel)
return;		return;
OpOperand *lhs = op.getOutputOperand(0);		OpOperand *lhs = op.getOutputOperand(0);
if (lhs == t) {		if (lhs == &t) {
// Start or end a scalarized reduction		// Start or end a scalarized reduction
if (atStart) {		if (atStart) {
Kind kind = merger.exp(last).kind;		Kind kind = merger.exp(last).kind;
Value load = kind == Kind::kReduce		Value load = kind == Kind::kReduce
? getCustomRedId(merger.exp(last).op)		? getCustomRedId(merger.exp(last).op)
: genTensorLoad(merger, codegen, builder, op, exp);		: genTensorLoad(merger, codegen, builder, op, exp);
codegen.redKind = getReduction(kind);		codegen.redKind = getReduction(kind);
codegen.redExp = exp;		codegen.redExp = exp;
▲ Show 20 Lines • Show All 166 Lines • ▼ Show 20 Lines
}		}

/// Checks unit stride for dense tensors. The iteration graph may have ignored		/// Checks unit stride for dense tensors. The iteration graph may have ignored
/// dense access patterns in order to avoid cycles (sparse access patterns are		/// dense access patterns in order to avoid cycles (sparse access patterns are
/// always placed innermost), but that means dense access has become strided.		/// always placed innermost), but that means dense access has become strided.
/// This prevents effective vectorization.		/// This prevents effective vectorization.
static bool denseUnitStrides(Merger &merger, linalg::GenericOp op,		static bool denseUnitStrides(Merger &merger, linalg::GenericOp op,
unsigned idx) {		unsigned idx) {
for (OpOperand *t : op.getInputAndOutputOperands()) {		for (OpOperand &t : op->getOpOperands()) {
if (!getSparseTensorEncoding(t->get().getType())) {		if (!getSparseTensorEncoding(t.get().getType())) {
auto map = op.getMatchingIndexingMap(t);		auto map = op.getMatchingIndexingMap(&t);
for (unsigned d = 0, rank = map.getNumResults(); d < rank; d++) {		for (unsigned d = 0, rank = map.getNumResults(); d < rank; d++) {
AffineExpr a = map.getResult(d);		AffineExpr a = map.getResult(d);
// Report non-unit stride if innermost index appears at an outer		// Report non-unit stride if innermost index appears at an outer
// dimension (true non-unit stride) or if the innermost index appears		// dimension (true non-unit stride) or if the innermost index appears
// in a compound subscript in the innermost dimension. Even if the		// in a compound subscript in the innermost dimension. Even if the
// latter is unit stride, it does not play well with scatter/gather.		// latter is unit stride, it does not play well with scatter/gather.
// TODO: accept unit stride affine innermost like a[i,j+k+1]?		// TODO: accept unit stride affine innermost like a[i,j+k+1]?
if (a.isFunctionOfDim(idx) &&		if (a.isFunctionOfDim(idx) &&
▲ Show 20 Lines • Show All 549 Lines • ▼ Show 20 Lines	GenericOpSparsifier(MLIRContext *context, SparsificationOptions o)
: OpRewritePattern<linalg::GenericOp>(context), options(o) {}		: OpRewritePattern<linalg::GenericOp>(context), options(o) {}

LogicalResult matchAndRewrite(linalg::GenericOp op,		LogicalResult matchAndRewrite(linalg::GenericOp op,
PatternRewriter &rewriter) const override {		PatternRewriter &rewriter) const override {
// Detects sparse annotations and translate the per-dimension sparsity		// Detects sparse annotations and translate the per-dimension sparsity
// information for all tensors to loop indices in the kernel.		// information for all tensors to loop indices in the kernel.
if (op.getNumOutputs() != 1)		if (op.getNumOutputs() != 1)
return failure();		return failure();
unsigned numTensors = op.getNumInputsAndOutputs();		unsigned numTensors = op->getNumOperands();
unsigned numLoops = op.getNumLoops();		unsigned numLoops = op.getNumLoops();
Merger merger(numTensors, numLoops);		Merger merger(numTensors, numLoops);
if (!findSparseAnnotations(merger, op))		if (!findSparseAnnotations(merger, op))
return failure();		return failure();

// Builds the tensor expression for the Linalg operation in SSA form.		// Builds the tensor expression for the Linalg operation in SSA form.
Optional<unsigned> optExp = merger.buildTensorExpFromLinalg(op);		Optional<unsigned> optExp = merger.buildTensorExpFromLinalg(op);
if (!optExp.has_value())		if (!optExp.has_value())
▲ Show 20 Lines • Show All 100 Lines • Show Last 20 Lines

mlir/lib/Dialect/SparseTensor/Utils/Merger.cpp

	Show First 20 Lines • Show All 904 Lines • ▼ Show 20 Lines

	Optional<unsigned> Merger::buildTensorExp(linalg::GenericOp op, Value v) {			Optional<unsigned> Merger::buildTensorExp(linalg::GenericOp op, Value v) {
	if (auto arg = v.dyn_cast<BlockArgument>()) {			if (auto arg = v.dyn_cast<BlockArgument>()) {
	unsigned argN = arg.getArgNumber();			unsigned argN = arg.getArgNumber();
	// Any argument of the generic op that is not marked as a scalar			// Any argument of the generic op that is not marked as a scalar
	// argument is considered a tensor, indexed by the implicit loop			// argument is considered a tensor, indexed by the implicit loop
	// bounds. This includes rank-0 tensor arguments.			// bounds. This includes rank-0 tensor arguments.
	if (arg.getOwner()->getParentOp() == op) {			if (arg.getOwner()->getParentOp() == op) {
	OpOperand *t = op.getInputAndOutputOperands()[argN];			OpOperand &t = op->getOpOperand(argN);
	if (!op.isScalar(t))			if (!op.isScalar(&t))
	return addExp(kTensor, argN);			return addExp(kTensor, argN);
	v = t->get(); // get scalar value			v = t.get(); // get scalar value
	}			}
	// Any other argument (marked as scalar argument for the generic op			// Any other argument (marked as scalar argument for the generic op
	// or belonging to an enveloping op) is considered invariant.			// or belonging to an enveloping op) is considered invariant.
	return addExp(kInvariant, v);			return addExp(kInvariant, v);
	}			}
	// Something defined outside is invariant.			// Something defined outside is invariant.
	Operation *def = v.getDefiningOp();			Operation *def = v.getDefiningOp();
	if (def->getBlock() != &op.getRegion().front())			if (def->getBlock() != &op.getRegion().front())
	▲ Show 20 Lines • Show All 345 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/canonicalize.mlir

	Show First 20 Lines • Show All 269 Lines • ▼ Show 20 Lines

	// CHECK: return			// CHECK: return
	return			return
	}			}

	// -----			// -----

	// CHECK-LABEL: func @remove_deadargs_generic_basic			// CHECK-LABEL: func @remove_deadargs_generic_basic
	// CHECK-SAME: (%[[ARG0:.*]]: tensor<?xf32>) -> tensor<?xf32> {			// CHECK-SAME: (%[[ARG0:.*]]: tensor<?xf32>) -> tensor<?xf32> {
	// CHECK: %[[GENERIC_OP:.*]] = linalg.generic			// CHECK: %[[GENERIC_OP:.*]] = linalg.generic
	// CHECK-SAME: ins(%[[ARG0]] : tensor<?xf32>)			// CHECK-SAME: ins(%[[ARG0]] : tensor<?xf32>)
	// CHECK-SAME: outs({{.*}} : tensor<?xf32>) {			// CHECK-SAME: outs({{.*}} : tensor<?xf32>) {
	#map0 = affine_map<(d0) -> (d0)>			#map0 = affine_map<(d0) -> (d0)>
	func.func @remove_deadargs_generic_basic(%arg0: tensor<?xf32>) -> (tensor<?xf32>) {			func.func @remove_deadargs_generic_basic(%arg0: tensor<?xf32>) -> (tensor<?xf32>) {
	%c0 = arith.constant 0 : index			%c0 = arith.constant 0 : index
	%cst = arith.constant 7.0 : f32			%cst = arith.constant 7.0 : f32
	%0 = tensor.dim %arg0, %c0 : tensor<?xf32>			%0 = tensor.dim %arg0, %c0 : tensor<?xf32>
	▲ Show 20 Lines • Show All 537 Lines • Show Last 20 Lines

mlir/test/Dialect/Linalg/roundtrip.mlir

	Show First 20 Lines • Show All 115 Lines • ▼ Show 20 Lines
	// CHECK: linalg.generic {			// CHECK: linalg.generic {
	// CHECK-SAME: indexing_maps = [#{{[0-9a-z]}}, #{{[0-9a-z]}}, #{{[0-9a-z]*}}],			// CHECK-SAME: indexing_maps = [#{{[0-9a-z]}}, #{{[0-9a-z]}}, #{{[0-9a-z]*}}],
	// CHECK-SAME: iterator_types = ["parallel", "parallel", "parallel"],			// CHECK-SAME: iterator_types = ["parallel", "parallel", "parallel"],
	// CHECK-SAME: library_call = "some_external_function_name_1"}			// CHECK-SAME: library_call = "some_external_function_name_1"}
	// CHECK-SAME: ins({{.}}, {{.}} : memref<?x?xvector<3x4xi4>, strided<[?, 1], offset: ?>>, f32)			// CHECK-SAME: ins({{.}}, {{.}} : memref<?x?xvector<3x4xi4>, strided<[?, 1], offset: ?>>, f32)
	// CHECK-SAME: outs({{.*}} : memref<?x?x?xf32, strided<[?, ?, 1], offset: ?>>)			// CHECK-SAME: outs({{.*}} : memref<?x?x?xf32, strided<[?, ?, 1], offset: ?>>)
	// CHECK-SAME: {foo = 1 : i64}			// CHECK-SAME: {foo = 1 : i64}

	func.func @generic_with_tensor_input(%arg0: tensor<?x?xvector<3x4xi4>>,
	akuegelUnsubmitted Done Reply Inline Actions Maybe now we should have a test case in invalid.mlir instead? Or is it not actually invalid? akuegel: Maybe now we should have a test case in invalid.mlir instead? Or is it not actually invalid?
	pifon2aAuthorUnsubmitted Done Reply Inline Actions it has a mixed form with tensors and buffers. Should we still support mixed tensor-memref inputs? pifon2a: it has a mixed form with tensors and buffers. Should we still support mixed tensor-memref…
	%arg1: memref<?x?x?xf32, strided<[?, ?, 1], offset: ?>>) {
	%cst = arith.constant 0.0 : f32
	linalg.generic #trait_0
	ins(%arg0, %cst : tensor<?x?xvector<3x4xi4>>, f32)
	outs(%arg1 : memref<?x?x?xf32, strided<[?, ?, 1], offset: ?>>)
	attrs = {foo = 1} {
	^bb(%0: vector<3x4xi4>, %1: f32, %2: f32) :
	linalg.yield %1 : f32
	}
	return
	}
	// CHECK-LABEL: func @generic_with_tensor_input
	// CHECK: linalg.generic {
	// CHECK-SAME: indexing_maps = [#{{.}}, #{{.}}], iterator_types = ["parallel", "parallel", "parallel"],
	// CHECK-SAME: library_call = "some_external_function_name_1"}
	// CHECK-SAME: ins({{.}}, {{.}} : tensor<?x?xvector<3x4xi4>>, f32)
	// CHECK-SAME: outs({{.*}} : memref<?x?x?xf32, strided<[?, ?, 1], offset: ?>>)
	// CHECK-SAME: {foo = 1 : i64}

	// -----			// -----

	#map0 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>			#map0 = affine_map<(d0, d1, d2) -> (d0, d1, d2)>
	func.func @generic_without_inputs(%arg0 : memref<?x?x?xf32>) {			func.func @generic_without_inputs(%arg0 : memref<?x?x?xf32>) {
	linalg.generic {indexing_maps = [#map0],			linalg.generic {indexing_maps = [#map0],
	iterator_types = ["parallel", "parallel", "parallel"]}			iterator_types = ["parallel", "parallel", "parallel"]}
	outs(%arg0 : memref<?x?x?xf32>) {			outs(%arg0 : memref<?x?x?xf32>) {
	^bb0(%arg3: f32):			^bb0(%arg3: f32):
	▲ Show 20 Lines • Show All 143 Lines • ▼ Show 20 Lines
	// CHECK: %{{.*}} = linalg.index 2 : index			// CHECK: %{{.*}} = linalg.index 2 : index
	// CHECK: linalg.yield %{{.*}} : f32			// CHECK: linalg.yield %{{.*}} : f32

	// -----			// -----


	func.func @named_ops(%a3: memref<?x?x?xf32>, %b3: memref<?x?x?xf32>, %c3: memref<?x?x?xf32>,			func.func @named_ops(%a3: memref<?x?x?xf32>, %b3: memref<?x?x?xf32>, %c3: memref<?x?x?xf32>,
	%ta3: tensor<?x?x?xf32>, %tb3: tensor<?x?x?xf32>, %tc3: tensor<?x?x?xf32>)			%ta3: tensor<?x?x?xf32>, %tb3: tensor<?x?x?xf32>, %tc3: tensor<?x?x?xf32>)
	-> (tensor<?x?x?xf32>, tensor<?x?x?xf32>)			-> (tensor<?x?x?xf32>)
	{			{
	linalg.batch_matmul ins(%a3, %b3: memref<?x?x?xf32>, memref<?x?x?xf32>)			linalg.batch_matmul ins(%a3, %b3: memref<?x?x?xf32>, memref<?x?x?xf32>)
	outs(%c3: memref<?x?x?xf32>)			outs(%c3: memref<?x?x?xf32>)
	linalg.batch_matmul ins(%ta3, %tb3: tensor<?x?x?xf32>, tensor<?x?x?xf32>)
	outs(%c3: memref<?x?x?xf32>)
	%res1 = linalg.batch_matmul			%res1 = linalg.batch_matmul
	ins(%ta3, %tb3: tensor<?x?x?xf32>, tensor<?x?x?xf32>)			ins(%ta3, %tb3: tensor<?x?x?xf32>, tensor<?x?x?xf32>)
	outs(%tc3: tensor<?x?x?xf32>)			outs(%tc3: tensor<?x?x?xf32>)
	-> tensor<?x?x?xf32>			-> tensor<?x?x?xf32>
	%res2 = linalg.batch_matmul			return %res1 : tensor<?x?x?xf32>
	ins(%ta3, %b3: tensor<?x?x?xf32>, memref<?x?x?xf32>)
	outs(%tc3: tensor<?x?x?xf32>)
	-> tensor<?x?x?xf32>
	return %res1, %res2 : tensor<?x?x?xf32>, tensor<?x?x?xf32>
	}			}
	// CHECK-LABEL: func @named_ops			// CHECK-LABEL: func @named_ops
	// CHECK: linalg.batch_matmul			// CHECK: linalg.batch_matmul
	// CHECK: linalg.batch_matmul			// CHECK: linalg.batch_matmul
	// CHECK: linalg.batch_matmul
	// CHECK: linalg.batch_matmul

	// -----			// -----

	func.func @fill_tensor(%arg0 : index, %arg1 : index, %arg2 : f32) -> tensor<?x?xf32> {			func.func @fill_tensor(%arg0 : index, %arg1 : index, %arg2 : f32) -> tensor<?x?xf32> {
	%0 = tensor.empty(%arg0, %arg1) : tensor<?x?xf32>			%0 = tensor.empty(%arg0, %arg1) : tensor<?x?xf32>
	%1 = linalg.fill ins(%arg2 : f32) outs(%0 : tensor<?x?xf32>) -> tensor<?x?xf32>			%1 = linalg.fill ins(%arg2 : f32) outs(%0 : tensor<?x?xf32>) -> tensor<?x?xf32>
	return %1 : tensor<?x?xf32>			return %1 : tensor<?x?xf32>
	}			}
	▲ Show 20 Lines • Show All 156 Lines • Show Last 20 Lines

mlir/test/lib/Dialect/Linalg/TestLinalgElementwiseFusion.cpp

Show All 20 Lines

using namespace mlir;		using namespace mlir;

static void addOperands(Operation *op, SetVector<Value> &operandSet) {		static void addOperands(Operation *op, SetVector<Value> &operandSet) {
if (!op)		if (!op)
return;		return;
TypeSwitch<Operation *, void>(op)		TypeSwitch<Operation *, void>(op)
.Case<linalg::LinalgOp>([&](linalg::LinalgOp linalgOp) {		.Case<linalg::LinalgOp>([&](linalg::LinalgOp linalgOp) {
SmallVector<Value> inputOperands = linalgOp.getInputOperands();		SmallVector<Value> inputOperands{linalgOp.getInputOperands()};
operandSet.insert(inputOperands.begin(), inputOperands.end());		operandSet.insert(inputOperands.begin(), inputOperands.end());
})		})
.Default([&](Operation *operation) {		.Default([&](Operation *operation) {
operandSet.insert(operation->operand_begin(), operation->operand_end());		operandSet.insert(operation->operand_begin(), operation->operand_end());
});		});
}		}

template <int limit = 3>		template <int limit = 3>
▲ Show 20 Lines • Show All 104 Lines • ▼ Show 20 Lines	if (controlFuseByExpansion) {
}		}
}		}

Operation *consumer = fusedOperand->getOwner();		Operation *consumer = fusedOperand->getOwner();
if (auto expandOp = dyn_cast<tensor::ExpandShapeOp>(consumer)) {		if (auto expandOp = dyn_cast<tensor::ExpandShapeOp>(consumer)) {
if (expandOp->hasOneUse()) {		if (expandOp->hasOneUse()) {
OpOperand &use = *expandOp->getUses().begin();		OpOperand &use = *expandOp->getUses().begin();
auto linalgOp = dyn_cast<linalg::LinalgOp>(use.getOwner());		auto linalgOp = dyn_cast<linalg::LinalgOp>(use.getOwner());
if (linalgOp && linalgOp.isOutputTensor(&use))		if (linalgOp && linalgOp.isOutput(&use))
return true;		return true;
}		}
return false;		return false;
}		}
return true;		return true;
};		};

linalg::populateFoldReshapeOpsByExpansionPatterns(fusionPatterns,		linalg::populateFoldReshapeOpsByExpansionPatterns(fusionPatterns,
▲ Show 20 Lines • Show All 52 Lines • Show Last 20 Lines

mlir/test/lib/Dialect/Linalg/TestLinalgFusionTransforms.cpp

Show All 32 Lines	f.walk([&](LinalgOp op) {
// TODO: support multi-results.		// TODO: support multi-results.
if (op->getNumResults() <= 1)		if (op->getNumResults() <= 1)
linalgOps.push_back(op);		linalgOps.push_back(op);
});		});

// Tile and Fuse for tensors inputs (TODO: all tensor operands).		// Tile and Fuse for tensors inputs (TODO: all tensor operands).
bool changed = false;		bool changed = false;
for (LinalgOp linalgOp : llvm::reverse(linalgOps)) {		for (LinalgOp linalgOp : llvm::reverse(linalgOps)) {
for (OpOperand *opOperand : linalgOp.getInputAndOutputOperands()) {		for (OpOperand &opOperand : linalgOp->getOpOperands()) {
if (opOperand->get().getType().isa<MemRefType>()) {		if (opOperand.get().getType().isa<MemRefType>()) {
// TODO: LinalgDependenceGraph should be able to update itself.		// TODO: LinalgDependenceGraph should be able to update itself.
// The current naive and expensive reconstruction of the graph should be		// The current naive and expensive reconstruction of the graph should be
// removed.		// removed.
linalg::Aliases aliases;		linalg::Aliases aliases;
linalg::LinalgDependenceGraph graph(aliases, linalgOps);		linalg::LinalgDependenceGraph graph(aliases, linalgOps);
auto info = fuseProducerOfBuffer(b, *opOperand, graph);		auto info = fuseProducerOfBuffer(b, opOperand, graph);
if (failed(info))		if (failed(info))
continue;		continue;
auto *originalOp = info->originalProducer.getOperation();		auto *originalOp = info->originalProducer.getOperation();
eraseSet.insert(originalOp);		eraseSet.insert(originalOp);
auto *originalOpInLinalgOpsVector =		auto *originalOpInLinalgOpsVector =
std::find(linalgOps.begin(), linalgOps.end(), originalOp);		std::find(linalgOps.begin(), linalgOps.end(), originalOp);
*originalOpInLinalgOpsVector = info->fusedProducer.getOperation();		*originalOpInLinalgOpsVector = info->fusedProducer.getOperation();
changed = true;		changed = true;
} else if (opOperand->get().getType().isa<RankedTensorType>()) {		} else if (opOperand.get().getType().isa<RankedTensorType>()) {
// Tile and Fuse tensor input.		// Tile and Fuse tensor input.
if (opOperand->getOperandNumber() >= linalgOp.getNumInputs())		if (opOperand.getOperandNumber() >= linalgOp.getNumInputs())
continue;		continue;
auto info = fuseProducerOfTensor(b, *opOperand);		auto info = fuseProducerOfTensor(b, opOperand);
if (failed(info))		if (failed(info))
continue;		continue;
auto *originalOp = info->originalProducer.getOperation();		auto *originalOp = info->originalProducer.getOperation();
auto *originalOpInLinalgOpsVector =		auto *originalOpInLinalgOpsVector =
std::find(linalgOps.begin(), linalgOps.end(), originalOp);		std::find(linalgOps.begin(), linalgOps.end(), originalOp);
*originalOpInLinalgOpsVector = info->fusedProducer.getOperation();		*originalOpInLinalgOpsVector = info->fusedProducer.getOperation();
// Don't mark for erasure in the tensor case, let DCE handle this.		// Don't mark for erasure in the tensor case, let DCE handle this.
changed = true;		changed = true;
▲ Show 20 Lines • Show All 55 Lines • Show Last 20 Lines

mlir/test/lib/Dialect/Test/TestOps.td

Show First 20 Lines • Show All 2,829 Lines • ▼ Show 20 Lines	let extraClassDeclaration = [{
mlir::ArrayAttr getIndexingMaps() {		mlir::ArrayAttr getIndexingMaps() {
return getOperation()->getAttrOfType<mlir::ArrayAttr>("indexing_maps");		return getOperation()->getAttrOfType<mlir::ArrayAttr>("indexing_maps");
}		}

std::string getLibraryCallName() {		std::string getLibraryCallName() {
return "";		return "";
}		}

// To conform with interface requirement on operand naming.		std::pair<int64_t, int64_t> getOutputsPositionRange() {
mlir::ValueRange inputs() { return getInputs(); }		int64_t getNumOperands = this->getNumOperands();
mlir::ValueRange outputs() { return getOutputs(); }		return {getNumOperands - 1, getNumOperands};
		}
}];		}];
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Test LinalgFillOpInterface.		// Test LinalgFillOpInterface.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def TestLinalgFillOpNotLinalgOp : TEST_Op<"fill_op_not_linalg_op", [		def TestLinalgFillOpNotLinalgOp : TEST_Op<"fill_op_not_linalg_op", [
Show All 40 Lines	let extraClassDeclaration = [{
mlir::ArrayAttr getIndexingMaps() {		mlir::ArrayAttr getIndexingMaps() {
return getOperation()->getAttrOfType<mlir::ArrayAttr>("indexing_maps");		return getOperation()->getAttrOfType<mlir::ArrayAttr>("indexing_maps");
}		}

std::string getLibraryCallName() {		std::string getLibraryCallName() {
return "";		return "";
}		}

// To conform with interface requirement on operand naming.		std::pair<int64_t, int64_t> getOutputsPositionRange() {
mlir::ValueRange inputs() { return getInputs(); }		int64_t getNumOperands = this->getNumOperands();
mlir::ValueRange outputs() { return getOutputs(); }		return {getNumOperands - 1, getNumOperands};
		}
}];		}];
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Test Ops with Default-Valued String Attributes		// Test Ops with Default-Valued String Attributes
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

def TestDefaultStrAttrNoValueOp : TEST_Op<"no_str_value"> {		def TestDefaultStrAttrNoValueOp : TEST_Op<"no_str_value"> {
▲ Show 20 Lines • Show All 187 Lines • Show Last 20 Lines

mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-yaml-gen.cpp

Show First 20 Lines • Show All 557 Lines • ▼ Show 20 Lines	let extraClassDeclaration = structuredOpsBaseDecls # [{{
static void regionBuilder(ImplicitLocOpBuilder &b,		static void regionBuilder(ImplicitLocOpBuilder &b,
Block &block, ArrayRef<NamedAttribute> attrs);		Block &block, ArrayRef<NamedAttribute> attrs);
static std::function<void(ImplicitLocOpBuilder &,		static std::function<void(ImplicitLocOpBuilder &,
Block &, ArrayRef<NamedAttribute>)>		Block &, ArrayRef<NamedAttribute>)>
getRegionBuilder() {{		getRegionBuilder() {{
return regionBuilder;		return regionBuilder;
}		}

		std::pair<int64_t, int64_t> getOutputsPositionRange() {{
		int64_t getNumOperands = this->getNumOperands();
		return {{getNumOperands - 1, getNumOperands};
		}

// Generic methods.		// Generic methods.
static unsigned getNumRegionArgs();		static unsigned getNumRegionArgs();
std::string getLibraryCallName();		std::string getLibraryCallName();
{7}		{7}
}];		}];
}		}
)FMT";		)FMT";

▲ Show 20 Lines • Show All 59 Lines • ▼ Show 20 Lines
// {0}: Class name		// {0}: Class name
static const char rankPolyStructuredOpIndexingMapsFormat[] = R"FMT(		static const char rankPolyStructuredOpIndexingMapsFormat[] = R"FMT(
ArrayAttr {0}::getIndexingMaps() {{		ArrayAttr {0}::getIndexingMaps() {{
MLIRContext *context = getContext();		MLIRContext *context = getContext();
AffineMap scalarMap = AffineMap::get(getNumParallelLoops(), 0, context);		AffineMap scalarMap = AffineMap::get(getNumParallelLoops(), 0, context);
AffineMap tensorMap = AffineMap::getMultiDimIdentityMap(		AffineMap tensorMap = AffineMap::getMultiDimIdentityMap(
getNumParallelLoops(), context);		getNumParallelLoops(), context);
SmallVector<AffineMap> indexingMaps;		SmallVector<AffineMap> indexingMaps;
for (OpOperand *opOperand : getInputAndOutputOperands())		for (OpOperand &opOperand : getOperation()->getOpOperands())
indexingMaps.push_back(getRank(opOperand) == 0 ? scalarMap : tensorMap);		indexingMaps.push_back(getRank(&opOperand) == 0 ? scalarMap : tensorMap);
return Builder(getContext()).getAffineMapArrayAttr(indexingMaps);		return Builder(getContext()).getAffineMapArrayAttr(indexingMaps);
}		}
)FMT";		)FMT";

// Implementations of fold and getEffects.		// Implementations of fold and getEffects.
// Parameters:		// Parameters:
// {0}: Class name		// {0}: Class name
const char structuredOpFoldersFormat[] = R"FMT(		const char structuredOpFoldersFormat[] = R"FMT(
LogicalResult {0}::fold(ArrayRef<Attribute>,		LogicalResult {0}::fold(ArrayRef<Attribute>,
SmallVectorImpl<OpFoldResult> &) {{		SmallVectorImpl<OpFoldResult> &) {{
return foldMemRefCast(*this);		return foldMemRefCast(*this);
}		}
void {0}::getEffects(SmallVectorImpl<		void {0}::getEffects(SmallVectorImpl<
SideEffects::EffectInstance<MemoryEffects::Effect> >&effects) {{		SideEffects::EffectInstance<MemoryEffects::Effect> >&effects) {{
SmallVector<Value> inputBuffers = getInputBufferOperands();		if (hasTensorSemantics()) return;
SmallVector<Value> outputBuffers = getOutputBufferOperands();
getGenericEffectsImpl(effects,		getGenericEffectsImpl(effects,
getOperation()->getResults(), inputBuffers, outputBuffers);		getOperation()->getResults(), getInputOperands(), getOutputOperands());
}		}
)FMT";		)FMT";

// Implementation of parse/print.		// Implementation of parse/print.
// Parameters:		// Parameters:
// {0}: Class name		// {0}: Class name
static const char structuredOpParserFormat[] = R"FMT(		static const char structuredOpParserFormat[] = R"FMT(
ParseResult {0}::parse(OpAsmParser &parser, OperationState &result) {{		ParseResult {0}::parse(OpAsmParser &parser, OperationState &result) {{
▲ Show 20 Lines • Show All 597 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[mlir] Simplify DestinationStyleOpInterface.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 468150

mlir/include/mlir/Dialect/Linalg/IR/LinalgInterfaces.td

mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOps.td

mlir/lib/CAPI/Dialect/Linalg.cpp

mlir/lib/Dialect/Linalg/Analysis/DependenceAnalysis.cpp

mlir/lib/Dialect/Linalg/IR/LinalgInterfaces.cpp

mlir/lib/Dialect/Linalg/IR/LinalgOps.cpp

mlir/lib/Dialect/Linalg/Transforms/BubbleUpExtractSlice.cpp

mlir/lib/Dialect/Linalg/Transforms/BufferizableOpInterfaceImpl.cpp

mlir/lib/Dialect/Linalg/Transforms/ConstantFold.cpp

mlir/lib/Dialect/Linalg/Transforms/DecomposeLinalgOps.cpp

mlir/lib/Dialect/Linalg/Transforms/Detensorize.cpp

mlir/lib/Dialect/Linalg/Transforms/DropUnitDims.cpp

mlir/lib/Dialect/Linalg/Transforms/ElementwiseOpFusion.cpp

mlir/lib/Dialect/Linalg/Transforms/Fusion.cpp

mlir/lib/Dialect/Linalg/Transforms/FusionOnTensors.cpp

mlir/lib/Dialect/Linalg/Transforms/Generalization.cpp

mlir/lib/Dialect/Linalg/Transforms/HoistPadding.cpp

mlir/lib/Dialect/Linalg/Transforms/InlineScalarOperands.cpp

mlir/lib/Dialect/Linalg/Transforms/Interchange.cpp

mlir/lib/Dialect/Linalg/Transforms/Loops.cpp

mlir/lib/Dialect/Linalg/Transforms/Promotion.cpp

mlir/lib/Dialect/Linalg/Transforms/SplitReduction.cpp

mlir/lib/Dialect/Linalg/Transforms/Tiling.cpp

mlir/lib/Dialect/Linalg/Transforms/TilingInterfaceImpl.cpp

mlir/lib/Dialect/Linalg/Transforms/Transforms.cpp

mlir/lib/Dialect/Linalg/Transforms/Vectorization.cpp

mlir/lib/Dialect/Linalg/Utils/Utils.cpp

mlir/lib/Dialect/SparseTensor/Transforms/SparseTensorRewriting.cpp

mlir/lib/Dialect/SparseTensor/Transforms/Sparsification.cpp

mlir/lib/Dialect/SparseTensor/Utils/Merger.cpp

mlir/test/Dialect/Linalg/canonicalize.mlir

mlir/test/Dialect/Linalg/roundtrip.mlir

mlir/test/lib/Dialect/Linalg/TestLinalgElementwiseFusion.cpp

mlir/test/lib/Dialect/Linalg/TestLinalgFusionTransforms.cpp

mlir/test/lib/Dialect/Test/TestOps.td

mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-yaml-gen.cpp

[mlir] Simplify DestinationStyleOpInterface.
ClosedPublic