This is an archive of the discontinued LLVM Phabricator instance.

TOSA-to-Linalg lowering for element-wise ops
ClosedPublic

Authored by rafaelubalmw on Jun 19 2023, 8:47 AM.

Details

Summary
  • Wrote complete documentation for the Broadcastable op trait. This is mostly meant as a thorough description of its previous behavior, with the exception of minor feature updates.
  • Restricted the legality criteria for Broadcastable ops in order to simplify current and future lowering passes and to increase the efficiency of the code generated by those passes. The new restrictions are: 1) a dynamic dimension in an inferred result is not compatible with a static dimension in the actual result; 2) broadcast semantics are restricted to input operands and are not supported between inferred and actual result shapes. (See the sketch after this list.)
  • Implemented TOSA-to-Linalg lowering support for unary, binary, and ternary element-wise ops. This support is complete for all legal cases described in the Broadcastable trait documentation.
  • Added unit tests for tosa.abs, tosa.add, and tosa.select as examples of unary, binary, and ternary ops.
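
A minimal illustrative sketch of what the new restrictions permit and forbid (hypothetical operands %a, %b, %c; not taken from the patch's test suite):

```mlir
// Legal: broadcast between operands (1 -> ?); the inferred result is
// tensor<?xf32>, and the declared result matches it exactly.
%0 = "tosa.add"(%a, %b) : (tensor<?xf32>, tensor<1xf32>) -> tensor<?xf32>

// Illegal under restriction 1: the inferred result dimension is dynamic,
// so it may not be declared with the static size 2.
%1 = "tosa.add"(%a, %b) : (tensor<?xf32>, tensor<1xf32>) -> tensor<2xf32>

// Illegal under restriction 2: the inferred result is tensor<1xf32>;
// widening it to the declared tensor<3xf32> would broadcast the result.
%2 = "tosa.add"(%c, %c) : (tensor<1xf32>, tensor<1xf32>) -> tensor<3xf32>
```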

Diff Detail

Event Timeline

rafaelubalmw created this revision.Jun 19 2023, 8:47 AM

Added documentation for the 'Broadcastable' trait.

  • Documentation
rafaelubalmw edited the summary of this revision. (Show Details)Jun 19 2023, 5:24 PM
rafaelubalmw added reviewers: sabauma, eric-k256.
This comment was removed by rafaelubalmw.
rafaelubalmw published this revision for review.Jun 19 2023, 5:26 PM

Created TableGen class 'Tosa_ElementwiseOp'.

Merged 'Tosa_ElemWiseUnaryOp' and 'Tosa_ElemWiseBinaryOp' into
'Tosa_ElementwiseOp'. Revisited all element-wise op definitions to
use this class when possible. Restrictions on element data types are
left out of 'Tosa_ElementwiseOp'.

This is complementary to something we have been working on.

Please see this RFC: https://discourse.llvm.org/t/rfc-adding-allranksmatchifknown-trait-to-tosa-broadcastable-operators/71500
which lays out our proposal for (1) adding a verifier requiring that TOSA broadcastable operators have equal ranks, and (2) using the EqualizeRanks helper
function to make operands have equal ranks when legalizing to TOSA.

The EqualizeRanks function is already merged into LLVM and available for use.

I see two differences: (1) EqualizeRanks strictly uses the ranks of the operands (and not that of the result) to determine what the equalized rank should be, and
(2) it uses tosa::ReshapeOp to extend the ranks instead of tensor::ExpandShapeOp.
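
For illustration, a rough sketch of the rewrite EqualizeRanks performs (hypothetical operands; the exact tosa.reshape attribute syntax may vary across MLIR versions):

```mlir
// Before: the operands of tosa.add have ranks 2 and 1.
%0 = "tosa.add"(%lhs, %rhs) : (tensor<2x3xf32>, tensor<3xf32>) -> tensor<2x3xf32>

// After EqualizeRanks: the lower-rank operand is reshaped to rank 2 by
// prepending a unit dimension with tosa.reshape.
%r = "tosa.reshape"(%rhs) {new_shape = array<i64: 1, 3>} : (tensor<3xf32>) -> tensor<1x3xf32>
%1 = "tosa.add"(%lhs, %r) : (tensor<2x3xf32>, tensor<1x3xf32>) -> tensor<2x3xf32>
```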

Also, see PR https://github.com/tensorflow/tensorflow/pull/60753, where we apply EqualizeRanks in the TF/TFL legalization to TOSA.

Once PR 60753 lands in TensorFlow, we would then add the AllRanksMatchIfKnown trait to the TOSA dialect.

Hopefully we can converge on a common solution.

sabauma added inline comments.Jun 22 2023, 9:09 AM
mlir/docs/Traits/Broadcastable.md
38

This separation of behavior between static and dynamic dimensions is a little odd. In the current formulation of TOSA, tensors' static shapes and ranks are useful analysis results that allow for the generation of better code. Ideally, the behavior of an operator would not change when rank/dimension information is erased (it would just perform worse).

This specification changes that, by eliminating implicit broadcasting only for dynamic dimensions. For instance, if I erase the dimensions of the network below:

```mlir
func.func @example_static(%arg0: tensor<2xf32>, %arg1: tensor<1xf32>) -> tensor<2xf32> {
  %0 = "tosa.add"(%arg0, %arg1) : (tensor<2xf32>, tensor<1xf32>) -> tensor<2xf32>
  return %0 : tensor<2xf32>
}

// to

func.func @example_dynamic(%arg0: tensor<?xf32>, %arg1: tensor<?xf32>) -> tensor<?xf32> {
  %0 = "tosa.add"(%arg0, %arg1) : (tensor<?xf32>, tensor<?xf32>) -> tensor<?xf32>
  return %0 : tensor<?xf32>
}
```

then you would reasonably expect that the set of runtime values example_dynamic can operate on is a strict superset of the runtime values that example_static can operate on, but that is not the case (in fact, it cannot consume any of the runtime values that were legal inputs to example_static).

Another possible design would be to eliminate _implicit_ broadcasting entirely (maybe have an explicit broadcast operator; see the hypothetical sketch below).
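
For example, an explicit-broadcast design might look like the following (tosa.broadcast is a made-up op used purely for illustration; no such TOSA op exists):

```mlir
// Broadcasting is materialized explicitly, so tosa.add then requires
// exactly matching operand shapes.
%b = "tosa.broadcast"(%arg1) : (tensor<1xf32>) -> tensor<2xf32>  // hypothetical op
%0 = "tosa.add"(%arg0, %b) : (tensor<2xf32>, tensor<2xf32>) -> tensor<2xf32>
```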

Support for dynamic dimension broadcasting in TOSA-to-Linalg lowering of element-wise ops

Added comprehensive unit tests for TOSA element-wise lowering

Hi all,

This is an updated implementation for TOSA-to-Linalg lowering of unary/binary/ternary element-wise ops, according to the latest changes proposed in this RFC:

https://discourse.llvm.org/t/rfc-tosa-to-linalg-lowering-of-element-wise-ops/71559

@mehdi_amini @eric-k256 @sabauma @mamrami @rsuderman @nicolasvasilache

Hi everyone. Just wanted to leave a quick note here as a reminder. Since these changes are quite widespread, it'll be increasingly likely for them to conflict with other work as time passes. Thanks everyone for your time and efforts.

Hi, this is a great and useful feature. I just have a few questions about it.

mlir/docs/Traits/Broadcastable.md
30

Sorry, could you explain this a bit more? What is meant by "1) Dynamic dimensions are never broadcast even if their runtime size is one"? The dynamic dimension of a tensor (marked as ?) does perform broadcasting in certain cases.

159

If I read this correctly, this trait can be shared with other dialects whose ops are broadcastable.
Do you think this test case is similar to the case below, mentioned in https://github.com/openxla/stablehlo/blob/43f3eb6b43eb9d1ce0bce9ecfc8e1b62b81f5268/rfcs/20230704-dynamism-101.md?

```mlir
// Dynamic result type - doesn't make sense as is.
// How does the operation know what result to produce? 1x1xf32? 1x2xf32? etc.
// Resolving this would need an additional argument - see below.
%1 = stablehlo.broadcast_in_dim %arg0, dims = [0] : (tensor<1xf32>) -> tensor<1x?xf32>
```

The description change for this trait is great. I just wonder whether it could conflict with other dialects, thereby making the trait harder to use.

mlir/lib/Conversion/TosaToLinalg/TosaToLinalg.cpp
527

I guess making more systematic use of notifyMatchFailure would be better, similar to other spots in this patch.

557

As mentioned by Tai Ly, could the merged EqualizeRanks help here?

rafaelubalmw marked 5 inline comments as done.Jul 18 2023, 10:22 AM
rafaelubalmw added inline comments.
mlir/docs/Traits/Broadcastable.md
30

The comment you are referring to was a mistake in the summary of this Phabricator review, which I have now corrected. An initial version of this pull request proposed removing broadcast semantics from dynamic dimensions for the sake of simplicity in the lowering pass. However, several sources pointed out the importance of this feature, which was then added back to the pull request.

159

The example in the external RFC you are pointing to seems to focus on the concept of "load-bearing" dimensions. As I understand it, these are static dimensions in result types that provide crucial information for interpreting the operation's behavior, and therefore, such dimensions should never be dynamic.

I'm inclined to avoid the introduction of load-bearing dimensions when possible, as they obscure operation semantics with counterintuitive implicit behavior. This is precisely the reason why this pull request proposes to forbid implicit broadcast semantics for the result. Once you know that such a broadcast never occurs, you may still allow an inferred result to be of type tensor<1xf32> while the actual result is of type tensor<?xf32>. Here, you know the result has one element, but the programmer has decided to use a dynamic dimension in its type.

This flexibility may come in handy during a shape inference pass. In a given intermediate state, input shapes may have been resolved as static, while the output shape is not updated until the current operation is processed. Such a state does not violate operation invariants.
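
As an illustration of such an intermediate state (a hypothetical snippet, not from the patch's tests):

```mlir
// Operand shapes already refined to static; the result type not yet updated.
// The inferred result is tensor<2xf32>, and declaring it as tensor<?xf32>
// remains legal under the new rules.
%0 = "tosa.add"(%a, %b) : (tensor<2xf32>, tensor<1xf32>) -> tensor<?xf32>
```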

The Broadcastable trait is definitely usable in other dialects. As a matter of fact, it is currently used by the tf and tfl dialects in TensorFlow, in spite of its previously vague definition. The current definition has been designed with those particular dialects in mind. Regarding the stablehlo op you are pointing to, I'm afraid its syntax differs enough from element-wise TOSA/TensorFlow ops that it is not a good candidate for adopting the Broadcastable trait in the first place.

mlir/lib/Conversion/TosaToLinalg/TosaToLinalg.cpp
527

This is an intentional use of assert as opposed to notifyMatchFailure(). While the latter indicates the failure of a pass to process a plausible input, the former flags a violation of a logical invariant. Function expandRank() can never be invoked with a rank argument smaller than the current rank of the tensor argument.

557

There are some key differences between the merged (1) ConversionUtils.h:EqualizeRanks() and (2) TosaToLinalg.cpp:equalizeRanks():

  • (1) focuses on equalizing the ranks of two input values to the highest rank, while (2) matches all input ranks to the op result. This is intended to work for ops with any number of input arguments. To avoid confusion with naming, I have renamed (2) to expandInputRanks().
  • (1) is aimed at a TOSA-to-TOSA conversion pass, where rank expansion is carried out by a tosa.reshape op. While we could use recursive pattern application with (1), (2) gets us out of the TOSA dialect directly by emitting a tensor.expand_shape op instead (see the sketch below). This op also takes additional dimension reassociation information, which is necessary when dealing with dynamic dimensions.
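
For illustration, the kind of rank expansion (2) emits (a sketch assuming a rank-1 operand expanded to rank 2; the actual reassociation depends on the shapes involved):

```mlir
// tensor.expand_shape prepends a unit dimension; the reassociation [[0, 1]]
// records how the expanded dims map back to the original one, which is what
// makes dynamic dimensions tractable.
%e = tensor.expand_shape %rhs [[0, 1]] : tensor<?xf32> into tensor<1x?xf32>
```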

rafaelubalmw edited the summary of this revision. (Show Details)Jul 18 2023, 10:31 AM
rafaelubalmw edited the summary of this revision. (Show Details)
rafaelubalmw marked 4 inline comments as done.Jul 18 2023, 10:40 AM

Renamed equalizeRanks() to expandInputRanks() to avoid confusion with the new mlir::tosa::ExpandRank() function.

Thanks for the explanation. Your statement is clear, and I don't have further questions.

eric-k256 accepted this revision.Jul 20 2023, 9:47 AM

Thanks for the updates. This looks good to me.

This revision is now accepted and ready to land.Jul 20 2023, 9:47 AM

Thank you, @eric-k256 . Mind landing it for me?

jpienaar added inline comments.
mlir/docs/Traits/Broadcastable.md
25

Nit: I think we primarily use ` (backticks) for vars rather than * * (asterisks).

mlir/lib/Dialect/Traits.cpp
205

Why?

rafaelubalmw marked 2 inline comments as done.Jul 20 2023, 12:59 PM
rafaelubalmw added inline comments.
mlir/lib/Dialect/Traits.cpp
205

See section "Modification 2: Forbidding implicit dynamic-to-static dimension cast in result dimensions" in the RFC:

https://discourse.llvm.org/t/rfc-tosa-to-linalg-lowering-of-element-wise-ops/71559

rafaelubalmw marked an inline comment as done.Jul 20 2023, 1:00 PM

Using backticks for variable names in the documentation.

eric-k256 accepted this revision.Jul 21 2023, 8:36 AM

Looks okay to me. I'll land it unless there are any remaining issues.

This revision was automatically updated to reflect the committed changes.
jpienaar added inline comments.Jul 24 2023, 9:51 AM
mlir/lib/Conversion/TosaToLinalg/TosaToLinalg.cpp
590

Is this verified or checked before this point?

645

Let's convert this to a range-based loop (I don't think the index is used outside of it). Also see https://llvm.org/docs/CodingStandards.html#don-t-evaluate-end-every-time-through-a-loop

842

nit: 'unranked', to be consistent with the rest.

mlir/lib/Dialect/Traits.cpp
205

I know I've run into this on the TF side (and it sounds like the current integrate rotation is running into it). I mostly err on the side of not failing verification unless something is definitely wrong. For TF in particular, I know one can explicitly set shapes in a GraphDef (in some cases), so post-import you end up with weirdness such as a static shape into an identity op and a dynamic shape out, or vice versa, and you then need a shape inference pass to fix it up. So the IR is invalid post-import until a cleanup.
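
For instance, a sketch of the kind of post-import IR being described (illustrative tf-dialect snippet, not from an actual GraphDef):

```mlir
// Dynamic shape in, static shape out of an identity op: under the new,
// stricter verification this is rejected until shape inference cleans it up.
%0 = "tf.Identity"(%arg0) : (tensor<?xf32>) -> tensor<2xf32>
```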

eric-k256 added inline comments.Jul 24 2023, 4:05 PM
mlir/lib/Dialect/Traits.cpp
205

I wasn't aware of this, that's good to know.

I don't suppose there is a test suite that I can run to verify the TF side that would have caught this? Right now I run the check-mlir tests within the repo when about to merge, but no TF-side tests.

rafaelubalmw added inline comments.Jul 26 2023, 7:20 AM
mlir/lib/Dialect/Traits.cpp
205

Does this mean we need to update the behavior for this case? If this restriction becomes problematic, a reasonable alternative would be to allow an implicit dynamic-to-static cast, with undefined behavior if the resulting runtime dimension size does not match the given static size (see the sketch below). We would continue to forbid implicit result broadcasting. Let me know if I should go ahead with this change.
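
A sketch of what the relaxed rule would accept (hypothetical IR):

```mlir
// The inferred result is tensor<?xf32>; under the proposed relaxation the
// declared tensor<2xf32> would verify, with undefined behavior if the
// runtime size is not 2.
%0 = "tosa.add"(%a, %b) : (tensor<?xf32>, tensor<?xf32>) -> tensor<2xf32>
```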

Also, given that this patch already landed, should I be creating a new Phabricator review to address the latest comments?

jpienaar added inline comments.Jul 26 2023, 8:06 AM
mlir/lib/Dialect/Traits.cpp
205

I mean, we could have patched it into the TF repo and run it ... but that's an unreasonable bar for an external contributor, having to test all downstream projects. In most of these cases it could just be bad tests too (but there is a batch of them).

Yes to a new patch. I like the alternative here; I think it is in line with https://mlir.llvm.org/getting_started/DeveloperGuide/#ir-verifier and would mean that materializing runtime asserts that abort would be an allowed lowering (see the sketch below).
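
For instance, such a lowering could materialize the implicit dynamic-to-static cast as a runtime check (a minimal sketch, assuming a 1-D tensor %arg0 whose declared static size is 2):

```mlir
%c0 = arith.constant 0 : index
%c2 = arith.constant 2 : index
// Assert that the dynamic dimension has the declared static size, then cast
// to the static type; the assert aborts at runtime on a mismatch.
%d = tensor.dim %arg0, %c0 : tensor<?xf32>
%ok = arith.cmpi eq, %d, %c2 : index
cf.assert %ok, "runtime dimension does not match declared static size"
%cast = tensor.cast %arg0 : tensor<?xf32> to tensor<2xf32>
```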