This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/IR/
-
mlir/
-
IR/
1/1
AffineExpr.h
-
lib/IR/
-
IR/
-
AffineExpr.cpp
-
test/
-
CMakeLists.txt
-
lit.cfg.py
-
mlir-linalg-ods-gen/
15/15
test-linalg-ods-gen.tc
-
tools/
-
CMakeLists.txt
-
mlir-linalg-ods-gen/
-
CMakeLists.txt
83/83
mlir-linalg-ods-gen.cpp

Differential D77067

[mlir][Linalg] Create a tool to generate named Linalg ops from a Tensor Comprehensions-like specification.
ClosedPublic

Authored by nicolasvasilache on Mar 30 2020, 8:56 AM.

Download Raw Diff

Details

Reviewers

rriddle
silvas
stellaraccident
ftynse
mehdi_amini
aartbik
asaadaldien
antiagainst

Commits

rG882ba4847437: [mlir][Linalg] Create a tool to generate named Linalg ops from a Tensor…

Summary

This revision adds a tool that generates the ODS and C++ implementation for "named" Linalg ops according to the [RFC discussion](https://llvm.discourse.group/t/rfc-declarative-named-ops-in-the-linalg-dialect/745).

While the mechanisms and language aspects are by no means set in stone, this revision allows connecting the pieces end-to-end from a mathematical-like specification.

Some implementation details and short-term decisions taken for the purpose of bootstrapping and that are not set in stone include:
1. using a "[Tensor Comprehension](https://arxiv.org/abs/1802.04730)-inspired" syntax
2. implicit and eager discovery of dims and symbols when parsing
3. using EDSC ops to specify the computation (e.g. std_addf, std_mul_f, ...)

A followup revision will connect this tool to tablegen mechanisms and allow the emission of named Linalg ops that automatically lower to various loop forms and run end to end.

For the following "Tensor Comprehension-inspired" string:
```

def batch_matmul(A: f32(Batch, M, K), B: f32(K, N)) -> (C: f32(Batch, M, N)) {
  C(b, m, n) = std_addf<k>(std_mulf(A(b, m, k), B(k, n)));
}

```

With -gen-ods-decl=1, this emits (modulo formatting):
```
  def batch_matmulOp : LinalgNamedStructured_Op<"batch_matmul", [
    NInputs<2>,
    NOutputs<1>,
    NamedStructuredOpTraits]> {
      let arguments = (ins Variadic<LinalgOperand>:$views);
      let results = (outs Variadic<AnyRankedTensor>:$output_tensors);
      let extraClassDeclaration = [{
        llvm::Optional<SmallVector<StringRef, 8>> referenceIterators();
        llvm::Optional<SmallVector<AffineMap, 8>> referenceIndexingMaps();
        void regionBuilder(ArrayRef<BlockArgument> args);
      }];
      let hasFolder = 1;
  }
```

With -gen-ods-impl, this emits (modulo formatting):
```
  llvm::Optional<SmallVector<StringRef, 8>> batch_matmul::referenceIterators() {
      return SmallVector<StringRef, 8>{ getParallelIteratorTypeName(),
                                        getParallelIteratorTypeName(),
                                        getParallelIteratorTypeName(),
                                        getReductionIteratorTypeName() };
  }
  llvm::Optional<SmallVector<AffineMap, 8>> batch_matmul::referenceIndexingMaps()
  {
    MLIRContext *context = getContext();
    AffineExpr d0, d1, d2, d3;
    bindDims(context, d0, d1, d2, d3);
    return SmallVector<AffineMap, 8>{
        AffineMap::get(4, 0, {d0, d1, d3}),
        AffineMap::get(4, 0, {d3, d2}),
        AffineMap::get(4, 0, {d0, d1, d2}) };
  }
  void batch_matmul::regionBuilder(ArrayRef<BlockArgument> args) {
    using namespace edsc;
    using namespace intrinsics;
    ValueHandle _0(args[0]), _1(args[1]), _2(args[2]);

    ValueHandle _4 = std_mulf(_0, _1);
    ValueHandle _5 = std_addf(_2, _4);
    (linalg_yield(ValueRange{ _5 }));
  }
```

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	230 ms	LLVM.Other::Unknown Unit Message ("")

Event Timeline

nicolasvasilache created this revision.Mar 30 2020, 8:56 AM

Herald added a reviewer: rriddle. · View Herald TranscriptMar 30 2020, 8:56 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: llvm-commits, Joonsoo, liufengdb and 11 others. · View Herald Transcript

Harbormaster failed remote builds in B50966: Diff 253607!Mar 30 2020, 9:44 AM

nicolasvasilache added reviewers: silvas, stellaraccident, ftynse.Mar 30 2020, 3:34 PM

Herald added a subscriber: grosul1. · View Herald TranscriptMar 30 2020, 3:34 PM

nicolasvasilache added a reviewer: mehdi_amini.Mar 30 2020, 3:34 PM

Do you intend for this to be "approaching production quality code" and reviewed as such or still proof-of-concept level?

silvas added inline comments.Mar 30 2020, 7:32 PM

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
24	This actually looks like it could be reasonably parsed with a custom `linalg_named_op_gen.def` op? Or is there a dependency issue with using MLIR for this as this needs to happen during build time?

Format

@silvas I'd hope closer to "approaching production quality", it is missing comments though which will make it easier to read.
Note that almost everything above l. 1000 in mlir-linalg-ods-gen.cpp is borrowed from other places.
For some reason Token, Lexer and core Parser are kept hidden within MLIR and I would very much like to expose them and avoid the copy-pasta (@rriddle what's your take on this?).

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
24	This needs to generate ODS which in turn defines new ops. I think what you are referring to would be the `linalg.generic` form which has verbosity issues as well as does not let you refer to `isa/cast/dyn_cast<MatmulOp>()`or let you matchAndRewrite easily.

Harbormaster failed remote builds in B51081: Diff 253772!Mar 30 2020, 9:18 PM

Refactorings, cleanups and reformat.

@silvas refactored so that things are better layereed.
Code above the following code block at line ~800 ish is taken from other places in MLIR and should be refactored out once lexer/parser is exposed.

//===----------------------------------------------------------------------===//
// TC parsing.
//===----------------------------------------------------------------------===//

Harbormaster failed remote builds in B51183: Diff 253979!Mar 31 2020, 2:19 PM

Add a test line to pipe the generated ODS through mlir-tblgen.

Harbormaster failed remote builds in B51317: Diff 254251!Apr 1 2020, 11:31 AM

mehdi_amini added inline comments.Apr 2 2020, 9:56 AM

mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp
2	Missing license header?

nicolasvasilache added a reviewer: aartbik.Apr 3 2020, 9:36 AM

nicolasvasilache added a reviewer: asaadaldien.Apr 3 2020, 10:25 AM

nicolasvasilache added a reviewer: antiagainst.Apr 3 2020, 12:11 PM

First round of comments.

mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOps.td
698 ↗	(On Diff #254251)	Is there another diff that includes this?
mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
2	add test that exercises multiple comprehensions in the body
6	layering-wise would prefer to not test this here. If needed, we can add a separate test elsewhere that does this .td -> .inc file check. Strictly speaking what ends up in the .inc file is not really the concern of this component, only the contents of the .td file.
mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp
975	some spaces needed around first tensor-def-list
978	I don't see affine-expr or tensor-typedef mentioned locally in this comment. move this to the appropriate comment?
1371	should this be a diagnostic?
1380	How can we emit ODS before we finish processing the whole `tc-def` production?
1411	maybe rename to "parseAndEmitTCDef"? Also probably rename processOneComprehension to parseAndEmitOneComprehension to be consistent with that.
1429	typo in the "expected" string.
1439–1440	Can you make this comment a bit easier to understand. What is an "eagerly discovered symbol" and how does this "normalize" it?
1445–1446	Instead of the ternary, use static AffineMap get(unsigned dimCount, unsigned symbolCount, ArrayRef<AffineExpr> results, MLIRContext *context);
1455	comma separated comprehensions seems to contradict the grammar?
1656	auto here obscures things IMO
1662	auto here obscures things IMO

This revision now requires changes to proceed.Apr 3 2020, 5:40 PM

nicolasvasilache marked 20 inline comments as done.Apr 4 2020, 11:40 AM

nicolasvasilache added inline comments.

mlir/include/mlir/Dialect/Linalg/IR/LinalgStructuredOps.td
698 ↗	(On Diff #254251)	This is eagerly included to allow the test to pipe through mlir-tblgen and verify the ODS is well-formed (as was suggested by @ftynse, see other comment). The companion revision https://reviews.llvm.org/D76456 does the plumbing assuming a parser exists and shows how to make this run end-to-end. In the current form this is non-functional and only exists for the purpose of verifying well-formedness and avoiding a giant diff when things can be (reasonably well) separated. If you have strong objections against this interim state, I would rather drop the piping through tablegen rather than merge revisions (but @ftynse may have his own objections to this).
mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
2	Despite the parser accepting it, there is no support for that atm and some eperiment + design is required here. Emitting an error for now.
6	This was suggested by @ftynse to show the ODS is valid and how it connects to tblgen by mirroring this test: https://github.com/llvm/llvm-project/blob/master/mlir/test/mlir-tblgen/llvm-intrinsics.td#L11. I am fine either way, would just like consensus on this before reverting back to the previous state. Please reopen if you feel strongly about this. @ftynse any strong opinion?
mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp
1380	Thanks!
1455	you're right, thanks!

Address review comments + refactor ComprehensionParserState.

nicolasvasilache retitled this revision from [mlir][Linalg] Create a tool to generate named Linalg ops from a Tensor Comprehensions-like specification. to [mlir][Linalg] Add a linalg.tensor_reshape to operate on tensors.Apr 4 2020, 11:49 AM

nicolasvasilache edited the summary of this revision. (Show Details)

nicolasvasilache retitled this revision from [mlir][Linalg] Add a linalg.tensor_reshape to operate on tensors to [mlir][Linalg] Create a tool to generate named Linalg ops from a Tensor Comprehensions-like specification..

nicolasvasilache edited the summary of this revision. (Show Details)

Harbormaster failed remote builds in B51788: Diff 255067!Apr 4 2020, 1:18 PM

It would be great to share some parts with the main parser, for example affine expression parsing. I think we can pretty much have parseAffineExpr(StringRef) declared in a private header and use it here, possibly with some semantic post-checks on the expression not involving, e.g., SSA values.

mlir/include/mlir/IR/AffineExpr.h
222	Nit: AffineExpr is a value-type, can't we just pass it by-value ?
mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp
86	This makes it look like it's a token `start`... How about `FIRST_KEYWORD = kw_def`, `LAST_KEYWORD=kw_select`?
135	Copy-pasta comment, this is not "operation assembly"
288	Missed `select` keyword. We could have some macro magic to make sure modifying the list of tokens also handles them in the lexer.
294	The code in getUInt64IntegerValue seems to support hex integers, but this clearly does not.
371	Nit: llvm::function_ref if you don't store the argument
545	Nit: the operand in `consumeToken` here and below is redundant, the case-expression just above ensures that the token is of the right kind.
840	This should be trivial to implement without resorting to virtual functions, dispatching on `kind` and using static_cast.
849	Nit: tensor-id is not defined
855	Nit: `= default` would also work
859	If you use LLVM-style type system, you would normally want to avoid virtual functions...
873	Nit: given that `PreOrder` is a boolean template parameter, I am not sure what "perfoms `PreOrder` traversal` means when the parameter is false. Post-order? In-order? Compilation error?
893	Would MutableArrayRef work instead of hardcoding SmallVector with a given size?
910	Do you care about the order of reduction dimensions?
928	Why SetVector? In TC, we wouldn't care about the order of reduction dimensions.
940–941	And what if discovery mode != symbols ?
950	Nit: something went wrong with formatting here: `\|` ran away to the right. I personally prefer something like foo ::= token token continuation line of the same rule \| another rule
974	Nit: this comment repeats the comment on `struct TensorExpr`. I am worried about it getting out of sync if the syntax evolves. My recommendation would be to only keep the syntax in a single comment (preferably, the implementation of this method), and just refer to that from the other comments.
981	Nit: why pass by-pointer rather than by-reference?

ftynse added inline comments.Apr 6 2020, 4:25 AM

mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp
396	"proper token" is unclear as error message
927	Do you actually need pointers? Can't we just store `Expression`s as is, eventually with appropriate move semantics to avoid extra copies?
989	Why does a parsing function accept an _output_ stream?
994	Plz document what does it print
1050	Do you actually need this? I only see `DenseMap<TensorExpr *>`, which should be using a generic pointer-based map implementation.
1053	How about `DenseMapInfo<StringRef>::getTombstoneKey()` instead?
1066	Same as above, AffineMap also has a DenseMapInfo if I'm not mistaken.
1083	Nit: document how the visitation behaves if the callback mutates the visited object
1126	Nit: would emplace_back work?
1135	Have you considered storing tensors in an llvm::StringMap indexed by name instead of doing linear lookups every time?
1147	Naming nit: `isa` is widely used for downcasting, this is just a lookup; prefer `is`.
1172	Would emplace_back work?
1192	Could you just have a default message `expected %tokenname%` instead of having a similar string everywhere
1196	It looks like it would parse just about any id. "expected a type id" sounds a bit misleading because "type id" is not a production rule, and there's no additional check on the id somehow being a type.
1201	Nit: add a description in the assertion. Also, are we sure this can never happen?
1280	Ultra-nit: we tend to use single quotes rather than backticks in error messages
1282	Nit: `/allowEmptyList=/true`
1314	`/allowEmptyList=/true`
1322	This may crash if you have less LHS declarations than RHS definitions.
1327	`/allowEmptyList=/true`
1332	`dimCount` and `symbolCount` make the comment look outdated, is it?
1352	Did you check that indexings were different?
1365	Nit: I'd use early return here
1373	[Not for this commit]: I would rather have the parser accept the correct syntax, and have a separate check that implements "semantic" rules.
1414	`/allowEmptyList=/true`
1425	`/allowEmptyList=/true`
1430	typo: "symbolicc"
1490	Why is the result optional?
1560	Nit: could we use more meaningful names than `ss2`?
1626	C++14 supports `auto` for lambda arguments
1645	Alternatively, you could use `ss.str()` instead of `valueHandleStr` below. Also, consider better names than ss, ss2, ss3. One `ss` is acceptable in a short function, but here it's really tricky to keep in mind which stream is associated with which string.

ftynse requested changes to this revision.Apr 6 2020, 4:25 AM

This revision now requires changes to proceed.Apr 6 2020, 4:25 AM

ftynse added inline comments.Apr 6 2020, 5:28 AM

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
6	I'm not sure I understand what is the concern here? The `ODS` check verifies the content of the produced .td file, _not_ the result of feeding that .td file to `mlir-tblgen -gen-op-defs`, which is indeed a separate concern. The `IMPL` check verifies the implementations of methods that are declared in the `.td` file and there is simply no other place where we can verify them. The staging here is: 1a. mlir-linalg-ods-gen -gen-ods-decl %this_file% > ods.td 1b. mlir-linalg-ods-gen -gen-impl %this_file% > impl.cc 2a. mlir-tblgen -gen-op-decl ods.td > ods.h 2b. mlir-tblgen -gen-op-decl ods.td > ods.cc include impl.cc and ods.cc into the implementation file; and ods.h into the header file. @nicolasvasilache the test you referenced also has `RUN` lines making sure `mlir-tblgen` can consume what the first stage produces. Consider adding them here as well. This could help detect cases of ODS syntax change (the simple syntactic test passes, but not the piping check). That's why there is only a trivial check to make sure FileCheck eats something.

nicolasvasilache marked 65 inline comments as done.Apr 6 2020, 8:07 PM

nicolasvasilache added inline comments.

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
6	I'm not sure I understand what is the concern here? ... Consider adding them here as well. That's precisely what the concern was IIUC, piping through mlir-tblgen (see previous snapshot that I updated improperly https://reviews.llvm.org/D77067?id=254251). Restored that part of the test.
mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp
288	Rather than continue duplicating here, MLIR should expose the lexer and parser and everyone's life will be better.
294	Yes, I am trimming liberally until MLIR exposes its lexer and parser at which point all this can disappear.
859	why ? https://llvm.org/docs/HowToSetUpLLVMStyleRTTI.html#basic-setup shows it's perfectly fine to use abstract base classes and LLVM RTTI.
910	you should otherwise your computation is non-deterministic
927	It's this or uniqu'ing, underlying storage, placement new etc etc. I went for the simple solution. When we have strong data that we need to scale much more we can revisit.
928	reductions loops don't commute in FP land
1135	I need 2 extra maps and really don't anticipate a single named op to ever to a point where this would matter. Of course if proven otherwise I'm happy to reconsider.
1192	I'm reluctant to invest more in duplicating something that should be exposed by core in a later NFC revision.
1373	Agreed, there are a few other things for follwups too, thanks!
1430	it's a faster `symbolcc`
1490	this is what the ODS currently is because of manual "named ops", will be cleaned later.

Address review comments.

meta-point: @ftynse let's not review the core parser code at the top of the file, as Nicolas says that they are just copypasta from the .mlir parser and won't be in the final patch.

Otherwise, thanks @ftynse for helping with the review :)

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
6	Ah, okay. Sorry for the confusion! When I saw C++ code I was assuming it was emitted by mlir-tblgen gen-op-def. But I see now that there is a mlir-linalg-tblgen -gen-impl that emits C++ as well. Sorry for the noise!!!

Thanks for your details reviews @silvas @ftynse !
Anything else ?

silvas added inline comments.Apr 6 2020, 8:27 PM

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
6	Actually, when rereading I see that we do indeed invoke `mlir-tblgen -gen-op-decls`. I specifically object to the `TBLGEN` check prefixes here. I consider it a bug to do that (although i see the precendent in llvm-intrinsics.td, but I would have raised the same objection there), since it violates the layering: somebody updating mlir-tblgen shouldn't be able to break this test. Consider the implications of what is being checked now in this test... // TBLGEN-LABEL: linalg::batchmatmulOp declarations ^ could be broken by a change in a comment in the generated file :x // TBLGEN: class batchmatmulOpOperandAdaptor { ^ could be broken by adding a common base class to the operand adaptor classes, or a change in naming convention for the adaptor classes // TBLGEN: class batchmatmulOp : public Op< ^ could be changed by a change in base classes or naming convention. Note that none of those changes I've indicated would actually break any actual use of this code. So this test is just artificially constraining the implementation of mlir-tblgen for no real value. And even if you strip it down, all you would really be testing is `def batchmatmulOp` results in a `class batchmatmulOp` in the output, which is already tested in many places, such as, say, https://github.com/llvm/llvm-project/blob/master/mlir/test/mlir-tblgen/op-decl.td We need to be courteous to the maintainers of other components and give them the flexibility to adjust the implementations of their components.

@nicolasvasilache any progress on reusing the MLIR parser? I consider that refactoring as blocking for submitting this patch. I don't want us to have a custom parser copypasted here that somebody has to clean up later without a strong reason.

Harbormaster failed remote builds in B52101: Diff 255573!Apr 6 2020, 9:16 PM

ftynse added inline comments.Apr 7 2020, 2:15 AM

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
6	I agree that this specific test is over-constraining for mlir-tblgen implementation. What I intended to test in intrinsicgen, and what I would like to see replicated here, is that the tablegen input produced by intrinsicgen, or my mlir-linalg-ods-gen, can be consumed by mlir-tblgen at all. Basically, we don't need to check for any output if we can find a way to check that mlir-tblgen exited with code 0 on the produced file. FileChecking the class name is just a workaround. If we don't do this check, we risk ending up in a situation where all of the existing tests pass (mlir-tblgen still generates expected C++ from ODS, and mlir-linalg-ods-gen still generates the strings expected by its test, just those strings are no longer valid ODS), but the pipeline fails. And given mlir-tblgen's tendency to assert or crash on improperly structured yet valid TableGen, it would be annoying to debug.

meta-point: @ftynse let's not review the core parser code at the top of the file, as Nicolas says that they are just copypasta from the .mlir parser and won't be in the final patch.

@silvas I wouldn't review it if it was actual copy-pasta. It is an incomplete and modified copy, which is therefore likely to have some weird behavior or be able to get into an irreversible state where the original code wouldn't get.

mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp
859	Because you pay the runtime overhead price for two abstractions serving essentially the same goal. Why?
910	I suppose you mean the IR you produce does not have a deterministic order of dimensions, which makes it hard to check. TC semantics says that all dimensions are interchangable, so their textual order should not matter. If it does, we should discuss the semantics and avoid branding this input as TC-like.
927	I don't think vectors of unique_ptr are simpler than vectors of values. This is an extra abstraction with associated cognitive overhead. This is also one extra dynamic allocation per element, as opposed to occasional allocations in the vector, and no strong reason to maintain the pointer as unique or to auto-deallocate (other than you forced the allocation in the first place). There is no actual uniquing of expression, neither is there underlying storage or placement new, you seem to be mistaking this with how types/attributes are handled in MLIR.
1135	Well, you currently have two extra vectors. I just don't see why prefer using a vector of pairs and implementing a search for _every one of them_ is better than using a dedicated container with accessor immediately available.

@silvasean I consider that refactoring as blocking for submitting this patch. I don't want us to have a custom parser copypasted here that somebody has to clean up later without a strong reason.
I have been following precedent here, see https://reviews.llvm.org/D73405 which also introduces its own tokenizer / lexer / parser.

As far as I understand it, MLIR has been pretty opinionated about not wanting to expose its tokenizer / lexer / parser: I tried to have them exposed in the past but objections have been along the line of "it's very easy code to write anyway".
I would strongly prefer we revisit that but IMO it would be unfortunate that work is blocked on this refactoring.

Does this help mitigate your position?

Does this help mitigate your position?

Yes. I take back my request to break it out. I buy Chris' statement "I don’t think that splitting this out and pretending it is reusable is a good idea - too much of it is specific to decisions in the MLIR syntax".

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
6	Ah, ok. Then you can just remove the `\| FileCheck`. The RUN line checks that the program has exit code 0, which won't be the case if mlir-tblgen runs into a syntax or processing error.

ftynse added inline comments.Apr 7 2020, 11:59 AM

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
6	Perfect, let's do this!

LGTM. Let's do this!

Herald added a subscriber: frgossen. · View Herald TranscriptApr 8 2020, 2:46 PM

nicolasvasilache marked 13 inline comments as done.Apr 9 2020, 12:15 PM

nicolasvasilache added inline comments.

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
6	Updated the test to get the minimal checkable thing: the class name.
mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp
859	Marking as done, this is part of the more global reply on downcasting, uptr etc.
910	Added a documentation section in `Linalg.md`.
927	unique_ptr + abstract base class: basically because I use downcasting. `vector<Expression>` will slice unless derived classes have sizeof == 0 (i.e. there is an underlying pointer payload). An option is to implement a similar arena + pImpl to what MLIR does for the "by-value" abstractions. I consider this to be unnecessarily complex for my use case (parser that runs at compiler compile time): `vector<unique_ptr<...>>` is a standard and simple way to solve the slicing and ownership issue, its performance drawback are not relevant at this time IMO.
1135	fair enough, done, thanks!

Address review comments.

Harbormaster failed remote builds in B52548: Diff 256354!Apr 9 2020, 12:21 PM

Addressed

Please fix the Windows build problem before landing. It looks like the pre-merge testing has such build now so you can use it for the initial check.

mlir/docs/Dialects/Linalg.md
473 ↗	(On Diff #256354)	Nit: angle bracket notation
mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc
6	This won't work on Windows. Consider adding a `-test-emit-additional-includes` flag to `mlir-linalg-ods-gen` and use it here instead of trying shell magic.
mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp
823	llvm_unreachable ?

Address last review comments.

Format

Doc

This revision was not accepted when it landed; it landed in state Needs Review.Apr 10 2020, 11:05 AM

Closed by commit rG882ba4847437: [mlir][Linalg] Create a tool to generate named Linalg ops from a Tensor… (authored by nicolasvasilache). · Explain Why

This revision was automatically updated to reflect the committed changes.

Harbormaster failed remote builds in B52697: Diff 256612!Apr 10 2020, 11:15 AM

Harbormaster failed remote builds in B52699: Diff 256614!

Harbormaster failed remote builds in B52700: Diff 256615!Apr 10 2020, 11:21 AM

Revision Contents

Path

Size

mlir/

include/

mlir/

IR/

AffineExpr.h

2 lines

lib/

IR/

AffineExpr.cpp

2 lines

test/

CMakeLists.txt

1 line

lit.cfg.py

2 lines

mlir-linalg-ods-gen/

test-linalg-ods-gen.tc

71 lines

tools/

CMakeLists.txt

1 line

mlir-linalg-ods-gen/

CMakeLists.txt

10 lines

mlir-linalg-ods-gen.cpp

1691 lines

Diff 255067

mlir/include/mlir/IR/AffineExpr.h

	//===- AffineExpr.h - MLIR Affine Expr Class --------------------- C++ --===//			//===- AffineExpr.h - MLIR Affine Expr Class --------------------- C++ --===//
				Lint: Lint Inline Actions clang-format-diff not found in user's PATH; not linting file. Lint: Lint: clang-format-diff not found in user's PATH; not linting file.
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	▲ Show 20 Lines • Show All 205 Lines • ▼ Show 20 Lines
	/// products expression, 'localExprs' is expected to have the AffineExpr			/// products expression, 'localExprs' is expected to have the AffineExpr
	/// for it, and is substituted into. The ArrayRef 'eq' is expected to be in the			/// for it, and is substituted into. The ArrayRef 'eq' is expected to be in the
	/// format [dims, symbols, locals, constant term].			/// format [dims, symbols, locals, constant term].
	AffineExpr getAffineExprFromFlatForm(ArrayRef<int64_t> flatExprs,			AffineExpr getAffineExprFromFlatForm(ArrayRef<int64_t> flatExprs,
	unsigned numDims, unsigned numSymbols,			unsigned numDims, unsigned numSymbols,
	ArrayRef<AffineExpr> localExprs,			ArrayRef<AffineExpr> localExprs,
	MLIRContext *context);			MLIRContext *context);

	raw_ostream &operator<<(raw_ostream &os, AffineExpr &expr);			raw_ostream &operator<<(raw_ostream &os, const AffineExpr &expr);
				ftynseUnsubmitted Done Reply Inline Actions Nit: AffineExpr is a value-type, can't we just pass it by-value ? ftynse: Nit: AffineExpr is a value-type, can't we just pass it by-value ?

	template <typename U> bool AffineExpr::isa() const {			template <typename U> bool AffineExpr::isa() const {
	if (std::is_same<U, AffineBinaryOpExpr>::value)			if (std::is_same<U, AffineBinaryOpExpr>::value)
	return getKind() <= AffineExprKind::LAST_AFFINE_BINARY_OP;			return getKind() <= AffineExprKind::LAST_AFFINE_BINARY_OP;
	if (std::is_same<U, AffineDimExpr>::value)			if (std::is_same<U, AffineDimExpr>::value)
	return getKind() == AffineExprKind::DimId;			return getKind() == AffineExprKind::DimId;
	if (std::is_same<U, AffineSymbolExpr>::value)			if (std::is_same<U, AffineSymbolExpr>::value)
	return getKind() == AffineExprKind::SymbolId;			return getKind() == AffineExprKind::SymbolId;
	▲ Show 20 Lines • Show All 65 Lines • Show Last 20 Lines

mlir/lib/IR/AffineExpr.cpp

//===- AffineExpr.cpp - MLIR Affine Expr Classes --------------------------===//		//===- AffineExpr.cpp - MLIR Affine Expr Classes --------------------------===//
		Lint: Lint Inline Actions clang-format-diff not found in user's PATH; not linting file. Lint: Lint: clang-format-diff not found in user's PATH; not linting file.
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

▲ Show 20 Lines • Show All 599 Lines • ▼ Show 20 Lines	return uniquer.get<AffineBinaryOpExprStorage>(
/initFn=/{}, static_cast<unsigned>(AffineExprKind::Mod), *this, other);		/initFn=/{}, static_cast<unsigned>(AffineExprKind::Mod), *this, other);
}		}

AffineExpr AffineExpr::compose(AffineMap map) const {		AffineExpr AffineExpr::compose(AffineMap map) const {
SmallVector<AffineExpr, 8> dimReplacements(map.getResults().begin(),		SmallVector<AffineExpr, 8> dimReplacements(map.getResults().begin(),
map.getResults().end());		map.getResults().end());
return replaceDimsAndSymbols(dimReplacements, {});		return replaceDimsAndSymbols(dimReplacements, {});
}		}
raw_ostream &mlir::operator<<(raw_ostream &os, AffineExpr &expr) {		raw_ostream &mlir::operator<<(raw_ostream &os, const AffineExpr &expr) {
expr.print(os);		expr.print(os);
return os;		return os;
}		}

/// Constructs an affine expression from a flat ArrayRef. If there are local		/// Constructs an affine expression from a flat ArrayRef. If there are local
/// identifiers (neither dimensional nor symbolic) that appear in the sum of		/// identifiers (neither dimensional nor symbolic) that appear in the sum of
/// products expression, `localExprs` is expected to have the AffineExpr		/// products expression, `localExprs` is expected to have the AffineExpr
/// for it, and is substituted into. The ArrayRef `flatExprs` is expected to be		/// for it, and is substituted into. The ArrayRef `flatExprs` is expected to be
▲ Show 20 Lines • Show All 265 Lines • Show Last 20 Lines

mlir/test/CMakeLists.txt

Show All 29 Lines	configure_lit_site_cfg(
${CMAKE_CURRENT_SOURCE_DIR}/Unit/lit.cfg.py		${CMAKE_CURRENT_SOURCE_DIR}/Unit/lit.cfg.py
)		)

set(MLIR_TEST_DEPENDS		set(MLIR_TEST_DEPENDS
FileCheck count not		FileCheck count not
MLIRUnitTests		MLIRUnitTests
mlir-cpu-runner		mlir-cpu-runner
mlir-edsc-builder-api-test		mlir-edsc-builder-api-test
		mlir-linalg-ods-gen
mlir-opt		mlir-opt
mlir-sdbm-api-test		mlir-sdbm-api-test
mlir-tblgen		mlir-tblgen
mlir-translate		mlir-translate
cblas		cblas
cblas_interface		cblas_interface
mlir_runner_utils		mlir_runner_utils
mlir_c_runner_utils		mlir_c_runner_utils
Show All 35 Lines

mlir/test/lit.cfg.py

	Show All 15 Lines
	# Configuration file for the 'lit' test runner.			# Configuration file for the 'lit' test runner.

	# name: The name of this test suite.			# name: The name of this test suite.
	config.name = 'MLIR'			config.name = 'MLIR'

	config.test_format = lit.formats.ShTest(not llvm_config.use_lit_shell)			config.test_format = lit.formats.ShTest(not llvm_config.use_lit_shell)

	# suffixes: A list of file extensions to treat as test files.			# suffixes: A list of file extensions to treat as test files.
	config.suffixes = ['.td', '.mlir', '.toy', '.ll']			config.suffixes = ['.td', '.mlir', '.toy', '.ll', '.tc']

	# test_source_root: The root path where tests are located.			# test_source_root: The root path where tests are located.
	config.test_source_root = os.path.dirname(__file__)			config.test_source_root = os.path.dirname(__file__)

	# test_exec_root: The root path where tests should be run.			# test_exec_root: The root path where tests should be run.
	config.test_exec_root = os.path.join(config.mlir_obj_root, 'test')			config.test_exec_root = os.path.join(config.mlir_obj_root, 'test')

	config.substitutions.append(('%PATH%', config.environment['PATH']))			config.substitutions.append(('%PATH%', config.environment['PATH']))
	▲ Show 20 Lines • Show All 43 Lines • Show Last 20 Lines

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc

This file was added.

				// RUN: mlir-linalg-ods-gen %s -gen-ods-decl=1 \| FileCheck %s --check-prefix=ODS
				// RUN: mlir-linalg-ods-gen %s -gen-impl=1 \| FileCheck %s --check-prefix=IMPL
				silvasUnsubmitted Done Reply Inline Actions add test that exercises multiple comprehensions in the body silvas: add test that exercises multiple comprehensions in the body
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Despite the parser accepting it, there is no support for that atm and some eperiment + design is required here. Emitting an error for now. nicolasvasilache: Despite the parser accepting it, there is no support for that atm and some eperiment + design…

				// ODS-LABEL: def matvecOp : LinalgNamedStructured_Op<"matvec", [
				// ODS-NEXT: NInputs<2>,
				// ODS-NEXT: NOutputs<1>,
				silvasUnsubmitted Done Reply Inline Actions layering-wise would prefer to not test this here. If needed, we can add a separate test elsewhere that does this .td -> .inc file check. Strictly speaking what ends up in the .inc file is not really the concern of this component, only the contents of the .td file. silvas: layering-wise would prefer to not test this here. If needed, we can add a separate test…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions This was suggested by @ftynse to show the ODS is valid and how it connects to tblgen by mirroring this test: https://github.com/llvm/llvm-project/blob/master/mlir/test/mlir-tblgen/llvm-intrinsics.td#L11. I am fine either way, would just like consensus on this before reverting back to the previous state. Please reopen if you feel strongly about this. @ftynse any strong opinion? nicolasvasilache: This was suggested by @ftynse to show the ODS is valid and how it connects to tblgen by…
				ftynseUnsubmitted Done Reply Inline Actions I'm not sure I understand what is the concern here? The `ODS` check verifies the content of the produced .td file, _not_ the result of feeding that .td file to `mlir-tblgen -gen-op-defs`, which is indeed a separate concern. The `IMPL` check verifies the implementations of methods that are declared in the `.td` file and there is simply no other place where we can verify them. The staging here is: 1a. mlir-linalg-ods-gen -gen-ods-decl %this_file% > ods.td 1b. mlir-linalg-ods-gen -gen-impl %this_file% > impl.cc 2a. mlir-tblgen -gen-op-decl ods.td > ods.h 2b. mlir-tblgen -gen-op-decl ods.td > ods.cc include impl.cc and ods.cc into the implementation file; and ods.h into the header file. @nicolasvasilache the test you referenced also has `RUN` lines making sure `mlir-tblgen` can consume what the first stage produces. Consider adding them here as well. This could help detect cases of ODS syntax change (the simple syntactic test passes, but not the piping check). That's why there is only a trivial check to make sure FileCheck eats something. ftynse: I'm not sure I understand what is the concern here? The `ODS` check verifies the content of the…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions I'm not sure I understand what is the concern here? ... Consider adding them here as well. That's precisely what the concern was IIUC, piping through mlir-tblgen (see previous snapshot that I updated improperly https://reviews.llvm.org/D77067?id=254251). Restored that part of the test. nicolasvasilache: ``` I'm not sure I understand what is the concern here? ... Consider adding them here as well.
				silvasUnsubmitted Done Reply Inline Actions Ah, okay. Sorry for the confusion! When I saw C++ code I was assuming it was emitted by mlir-tblgen gen-op-def. But I see now that there is a mlir-linalg-tblgen -gen-impl that emits C++ as well. Sorry for the noise!!! silvas: Ah, okay. Sorry for the confusion! When I saw C++ code I was assuming it was emitted by mlir…
				silvasUnsubmitted Done Reply Inline Actions Actually, when rereading I see that we do indeed invoke `mlir-tblgen -gen-op-decls`. I specifically object to the `TBLGEN` check prefixes here. I consider it a bug to do that (although i see the precendent in llvm-intrinsics.td, but I would have raised the same objection there), since it violates the layering: somebody updating mlir-tblgen shouldn't be able to break this test. Consider the implications of what is being checked now in this test... // TBLGEN-LABEL: linalg::batchmatmulOp declarations ^ could be broken by a change in a comment in the generated file :x // TBLGEN: class batchmatmulOpOperandAdaptor { ^ could be broken by adding a common base class to the operand adaptor classes, or a change in naming convention for the adaptor classes // TBLGEN: class batchmatmulOp : public Op< ^ could be changed by a change in base classes or naming convention. Note that none of those changes I've indicated would actually break any actual use of this code. So this test is just artificially constraining the implementation of mlir-tblgen for no real value. And even if you strip it down, all you would really be testing is `def batchmatmulOp` results in a `class batchmatmulOp` in the output, which is already tested in many places, such as, say, https://github.com/llvm/llvm-project/blob/master/mlir/test/mlir-tblgen/op-decl.td We need to be courteous to the maintainers of other components and give them the flexibility to adjust the implementations of their components. silvas: Actually, when rereading I see that we do indeed invoke `mlir-tblgen -gen-op-decls`. I…
				ftynseUnsubmitted Done Reply Inline Actions I agree that this specific test is over-constraining for mlir-tblgen implementation. What I intended to test in intrinsicgen, and what I would like to see replicated here, is that the tablegen input produced by intrinsicgen, or my mlir-linalg-ods-gen, can be consumed by mlir-tblgen at all. Basically, we don't need to check for any output if we can find a way to check that mlir-tblgen exited with code 0 on the produced file. FileChecking the class name is just a workaround. If we don't do this check, we risk ending up in a situation where all of the existing tests pass (mlir-tblgen still generates expected C++ from ODS, and mlir-linalg-ods-gen still generates the strings expected by its test, just those strings are no longer valid ODS), but the pipeline fails. And given mlir-tblgen's tendency to assert or crash on improperly structured yet valid TableGen, it would be annoying to debug. ftynse: I agree that this specific test is over-constraining for mlir-tblgen implementation. What I…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Updated the test to get the minimal checkable thing: the class name. nicolasvasilache: Updated the test to get the minimal checkable thing: the class name.
				silvasUnsubmitted Done Reply Inline Actions Ah, ok. Then you can just remove the `\| FileCheck`. The RUN line checks that the program has exit code 0, which won't be the case if mlir-tblgen runs into a syntax or processing error. silvas: Ah, ok. Then you can just remove the `\| FileCheck`. The RUN line checks that the program has…
				ftynseUnsubmitted Done Reply Inline Actions Perfect, let's do this! ftynse: Perfect, let's do this!
				ftynseUnsubmitted Done Reply Inline Actions This won't work on Windows. Consider adding a `-test-emit-additional-includes` flag to `mlir-linalg-ods-gen` and use it here instead of trying shell magic. ftynse: This won't work on Windows. Consider adding a `-test-emit-additional-includes` flag to `mlir…
				// ODS-NEXT: NamedStructuredOpTraits]>
				//
				// IMPL-LABEL: matvec::referenceIterators() {
				// IMPL-NEXT: { {{.}}Parallel{{.}}, {{.}}Reduction{{.}} }
				//
				// IMPL: matvec::referenceIndexingMaps() {
				// IMPL: AffineMap::get(2, 0, {d0, d1}),
				// IMPL-NEXT: AffineMap::get(2, 0, {d1}),
				// IMPL-NEXT: AffineMap::get(2, 0, {d0}) };
				//
				// IMPL: matvec::regionBuilder(ArrayRef<BlockArgument> args) {
				// IMPL: ValueHandle [[a:.]](args[0]), [[b:.]](args[1]), [[c:.*]](args[2]);
				// IMPL: ValueHandle [[d:.*]] = std_mulf([[a]], [[b]]);
				// IMPL: ValueHandle [[e:.*]] = std_addf([[c]], [[d]]);
				// IMPL: (linalg_yield(ValueRange{ [[e]] }));
				//
				def matvec(A: f32(M, K), B: f32(K)) -> (C: f32(M)) {
				C(m) = std_addf<k>(std_mulf(A(m, k), B(k)));
				silvasUnsubmitted Done Reply Inline Actions This actually looks like it could be reasonably parsed with a custom `linalg_named_op_gen.def` op? Or is there a dependency issue with using MLIR for this as this needs to happen during build time? silvas: This actually looks like it could be reasonably parsed with a custom `linalg_named_op_gen.def`…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions This needs to generate ODS which in turn defines new ops. I think what you are referring to would be the `linalg.generic` form which has verbosity issues as well as does not let you refer to `isa/cast/dyn_cast<MatmulOp>()`or let you matchAndRewrite easily. nicolasvasilache: This needs to generate ODS which in turn defines new ops. I think what you are referring to…
				}

				// ODS-LABEL: def matmulOp : LinalgNamedStructured_Op<"matmul", [
				// ODS-NEXT: NInputs<2>,
				// ODS-NEXT: NOutputs<1>,
				// ODS-NEXT: NamedStructuredOpTraits]>
				//
				// IMPL-LABEL: matmul::referenceIterators() {
				// IMPL-NEXT: { {{.}}Parallel{{.}}, {{.}}Parallel{{.}}, {{.}}Reduction{{.}} }
				//
				// IMPL: matmul::referenceIndexingMaps() {
				// IMPL: AffineMap::get(3, 0, {d0, d2}),
				// IMPL-NEXT: AffineMap::get(3, 0, {d2, d1}),
				// IMPL-NEXT: AffineMap::get(3, 0, {d0, d1}) };
				//
				// IMPL: matmul::regionBuilder(ArrayRef<BlockArgument> args) {
				// IMPL: ValueHandle [[a:.]](args[0]), [[b:.]](args[1]), [[c:.*]](args[2]);
				// IMPL: ValueHandle [[d:.*]] = std_mulf([[a]], [[b]]);
				// IMPL: ValueHandle [[e:.*]] = std_addf([[c]], [[d]]);
				// IMPL: (linalg_yield(ValueRange{ [[e]] }));
				//
				def matmul(A: f32(M, K), B: f32(K, N)) -> (C: f32(M, N)) {
				C(m, n) = std_addf<k>(std_mulf(A(m, k), B(k, n)));
				}

				// ODS-LABEL: def batch_matmulOp : LinalgNamedStructured_Op<"batch_matmul", [
				// ODS-NEXT: NInputs<2>,
				// ODS-NEXT: NOutputs<1>,
				// ODS-NEXT: NamedStructuredOpTraits]>
				//
				// IMPL-LABEL: batch_matmul::referenceIterators() {
				// IMPL-NEXT: { {{.}}Parallel{{.}}, {{.}}Parallel{{.}}, {{.}}Reduction{{.}} }
				//
				// IMPL: batch_matmul::referenceIndexingMaps() {
				// IMPL: AffineMap::get(4, 0, {d0, d1, d3}),
				// IMPL-NEXT: AffineMap::get(4, 0, {d3, d2}),
				// IMPL-NEXT: AffineMap::get(4, 0, {d0, d1, d2}) };
				//
				// IMPL: batch_matmul::regionBuilder(ArrayRef<BlockArgument> args) {
				// IMPL: ValueHandle [[a:.]](args[0]), [[b:.]](args[1]), [[c:.*]](args[2]);
				// IMPL: ValueHandle [[d:.*]] = std_mulf([[a]], [[b]]);
				// IMPL: ValueHandle [[e:.*]] = std_addf([[c]], [[d]]);
				// IMPL: (linalg_yield(ValueRange{ [[e]] }));
				//
				def batch_matmul(A: f32(Batch, M, K), B: f32(K, N)) -> (C: f32(Batch, M, N)) {
				C(b, m, n) = std_addf<k>(std_mulf(A(b, m, k), B(k, n)));
				}

mlir/tools/CMakeLists.txt

	add_subdirectory(mlir-cuda-runner)			add_subdirectory(mlir-cuda-runner)
	add_subdirectory(mlir-cpu-runner)			add_subdirectory(mlir-cpu-runner)
				add_subdirectory(mlir-linalg-ods-gen)
	add_subdirectory(mlir-opt)			add_subdirectory(mlir-opt)
	add_subdirectory(mlir-translate)			add_subdirectory(mlir-translate)
	add_subdirectory(mlir-vulkan-runner)			add_subdirectory(mlir-vulkan-runner)
	add_subdirectory(mlir-shlib)			add_subdirectory(mlir-shlib)

mlir/tools/mlir-linalg-ods-gen/CMakeLists.txt

This file was added.

				add_llvm_tool(mlir-linalg-ods-gen
				mlir-linalg-ods-gen.cpp
				)
				llvm_update_compile_flags(mlir-linalg-ods-gen)
				target_link_libraries(mlir-linalg-ods-gen PRIVATE
				MLIRParser
				MLIRSupport
				LLVMCore
				LLVMSupport
				)

mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp

This file was added.

				//===- mlir-linalg-ods-gen.cpp - Linalg ODS generation from math form -----===//
				Lint: Lint Inline Actions clang-format-diff not found in user's PATH; not linting file. Lint: Lint: clang-format-diff not found in user's PATH; not linting file.
				//
				mehdi_aminiUnsubmitted Done Reply Inline Actions Missing license header? mehdi_amini: Missing license header?
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file contains the implementation for the Tensor Comprehension-inspired
				// parser and ODS pretty-printer for specifying Linalg "named ops" from a
				// mathematical form.
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/IR/AffineExpr.h"
				#include "mlir/IR/AffineMap.h"
				#include "mlir/IR/MLIRContext.h"
				#include "mlir/IR/OpImplementation.h"
				#include "mlir/Support/FileUtilities.h"
				#include "mlir/Support/LLVM.h"
				#include "mlir/Support/LogicalResult.h"
				#include "mlir/Support/STLExtras.h"
				#include "llvm/ADT/SetVector.h"
				#include "llvm/Support/Casting.h"
				#include "llvm/Support/CommandLine.h"
				#include "llvm/Support/FormatVariadic.h"
				#include "llvm/Support/ToolOutputFile.h"

				#define DEBUG_TYPE "linalg-ods-gen"

				static llvm::cl::OptionCategory IntrinsicGenCat("Linalg ODS Gen");

				// Commandline options
				static llvm::cl::opt<std::string>
				inputFilename(llvm::cl::Positional, llvm::cl::desc("<input file>"),
				llvm::cl::init("-"), llvm::cl::value_desc("filename"));

				static llvm::cl::opt<std::string>
				outputFilename("o", llvm::cl::desc("Output filename"),
				llvm::cl::value_desc("filename"), llvm::cl::init("-"));

				static llvm::cl::opt<bool>
				genODSDecl("gen-ods-decl", llvm::cl::desc("Emit the ODS ops declarations."),
				llvm::cl::cat(IntrinsicGenCat));

				static llvm::cl::opt<bool>
				genODSImpl("gen-impl", llvm::cl::desc("Emit the ops implementations"),
				llvm::cl::init(false), llvm::cl::cat(IntrinsicGenCat));

				using llvm::SetVector;
				using llvm::SMLoc;
				using llvm::StringRef;
				using llvm::Twine;

				using namespace mlir;

				//===----------------------------------------------------------------------===//
				// Lexer
				//===----------------------------------------------------------------------===//

				namespace {
				/// This class represents a specific token in the input format.
				class Token {
				public:
				enum Kind {
				// Markers.
				eof,
				error,

				// Tokens with no info.
				colon,
				comma,
				equal,
				gt,
				l_brace,
				l_paren,
				lt,
				minus,
				plus,
				r_brace,
				r_paren,
				semicolon,
				star,

				// Keywords.
				keyword_start,
				ftynseUnsubmitted Done Reply Inline Actions This makes it look like it's a token `start`... How about `FIRST_KEYWORD = kw_def`, `LAST_KEYWORD=kw_select`? ftynse: This makes it look like it's a token `start`... How about `FIRST_KEYWORD = kw_def`…
				kw_def,
				kw_floordiv,
				kw_ceildiv,
				kw_mod,
				kw_select,
				keyword_end,

				// String valued tokens.
				id,
				integer,
				};

				Token(Kind kind, StringRef spelling) : kind(kind), spelling(spelling) {}

				/// Return the bytes that make up this token.
				StringRef getSpelling() const { return spelling; }

				/// Return the kind of this token.
				Kind getKind() const { return kind; }

				/// Return a location for this token.
				llvm::SMLoc getLoc() const {
				return llvm::SMLoc::getFromPointer(spelling.data());
				}

				/// Return if this token is a keyword.
				bool isKeyword() const { return kind > keyword_start && kind < keyword_end; }
				bool is(Kind k) const { return kind == k; }
				bool isNot(Kind k) const { return kind != k; }

				Optional<uint64_t> getUInt64IntegerValue() const {
				bool isHex = spelling.size() > 1 && spelling[1] == 'x';

				uint64_t result = 0;
				if (spelling.getAsInteger(isHex ? 0 : 10, result))
				return None;
				return result;
				}

				private:
				/// Discriminator that indicates the kind of token this is.
				Kind kind;

				/// A reference to the entire token contents; this is always a pointer into
				/// a memory buffer owned by the source manager.
				StringRef spelling;
				};

				/// This class implements a simple lexer for operation assembly format strings.
				ftynseUnsubmitted Done Reply Inline Actions Copy-pasta comment, this is not "operation assembly" ftynse: Copy-pasta comment, this is not "operation assembly"
				class Lexer {
				public:
				Lexer(llvm::SourceMgr &mgr);

				/// Lex the next token and return it.
				Token lexToken();

				/// Emit an error to the lexer with the given location and message.
				Token emitError(llvm::SMLoc loc, const Twine &msg);
				Token emitError(const char *loc, const Twine &msg);

				private:
				Token formToken(Token::Kind kind, const char *tokStart) {
				return Token(kind, StringRef(tokStart, curPtr - tokStart));
				}

				/// Return the next character in the stream.
				int getNextChar();

				/// Lex an identifier.
				Token lexIdentifier(const char *tokStart);

				// Lex an integer.
				Token lexInteger(const char *tokStart);

				// Skip a comment line, starting with a '//'.
				void skipComment();

				llvm::SourceMgr &srcMgr;
				StringRef curBuffer;
				const char *curPtr;
				};
				} // end anonymous namespace

				Lexer::Lexer(llvm::SourceMgr &mgr) : srcMgr(mgr) {
				curBuffer = srcMgr.getMemoryBuffer(mgr.getMainFileID())->getBuffer();
				curPtr = curBuffer.begin();
				}

				Token Lexer::emitError(llvm::SMLoc loc, const Twine &msg) {
				srcMgr.PrintMessage(loc, llvm::SourceMgr::DK_Error, msg);
				return formToken(Token::error, loc.getPointer());
				}
				Token Lexer::emitError(const char *loc, const Twine &msg) {
				return emitError(llvm::SMLoc::getFromPointer(loc), msg);
				}

				int Lexer::getNextChar() {
				char curChar = *curPtr++;
				switch (curChar) {
				default:
				return (unsigned char)curChar;
				case 0: {
				// A nul character in the stream is either the end of the current buffer
				// or a random nul in the file. Disambiguate that here.
				if (curPtr - 1 != curBuffer.end())
				return 0;

				// Otherwise, return end of file.
				--curPtr;
				return EOF;
				}
				case '\n':
				case '\r':
				// Handle the newline character by ignoring it and incrementing the line
				// count. However, be careful about 'dos style' files with \n\r in them.
				// Only treat a \n\r or \r\n as a single line.
				if ((curPtr == '\n' \|\| (curPtr == '\r')) && *curPtr != curChar)
				++curPtr;
				return '\n';
				}
				}

				Token Lexer::lexToken() {
				while (true) {
				const char *tokStart = curPtr;

				// This always consumes at least one character.
				int curChar = getNextChar();
				switch (curChar) {
				default:
				// Handle identifiers: [a-zA-Z_]
				if (isalpha(curChar) \|\| curChar == '_')
				return lexIdentifier(tokStart);

				// Handle integers: [0-9]
				if (isdigit(curChar))
				return lexInteger(tokStart);

				// Unknown character, emit an error.
				return emitError(tokStart, "unexpected character");

				case EOF:
				// Return EOF denoting the end of lexing.
				return formToken(Token::eof, tokStart);

				// Lex punctuation.
				case ':':
				return formToken(Token::colon, tokStart);
				case ',':
				return formToken(Token::comma, tokStart);
				case '=':
				return formToken(Token::equal, tokStart);
				case '{':
				return formToken(Token::l_brace, tokStart);
				case '(':
				return formToken(Token::l_paren, tokStart);
				case '}':
				return formToken(Token::r_brace, tokStart);
				case ')':
				return formToken(Token::r_paren, tokStart);
				case '<':
				return formToken(Token::lt, tokStart);
				case '>':
				return formToken(Token::gt, tokStart);
				case '+':
				return formToken(Token::plus, tokStart);
				case '-':
				return formToken(Token::minus, tokStart);
				case ';':
				return formToken(Token::semicolon, tokStart);
				case '*':
				return formToken(Token::star, tokStart);
				case '/':
				if (*curPtr == '/') {
				skipComment();
				continue;
				}
				// Unknown character, emit an error.
				return emitError(tokStart, "unexpected character: not a comment");

				// Ignore whitespace characters.
				case 0:
				case ' ':
				case '\t':
				case '\n':
				return lexToken();
				}
				}
				}

				Token Lexer::lexIdentifier(const char *tokStart) {
				// Match the rest of the identifier regex: [0-9a-zA-Z_\-]*
				while (isalnum(curPtr) \|\| curPtr == '_' \|\| *curPtr == '-')
				++curPtr;

				// Check to see if this identifier is a keyword.
				StringRef str(tokStart, curPtr - tokStart);
				Token::Kind kind = llvm::StringSwitch<Token::Kind>(str)
				.Case("def", Token::kw_def)
				.Case("floordiv", Token::kw_floordiv)
				.Case("ceildiv", Token::kw_ceildiv)
				.Case("mod", Token::kw_mod)
				ftynseUnsubmitted Done Reply Inline Actions Missed `select` keyword. We could have some macro magic to make sure modifying the list of tokens also handles them in the lexer. ftynse: Missed `select` keyword. We could have some macro magic to make sure modifying the list of…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Rather than continue duplicating here, MLIR should expose the lexer and parser and everyone's life will be better. nicolasvasilache: Rather than continue duplicating here, MLIR should expose the lexer and parser and everyone's…
				.Default(Token::id);

				return Token(kind, str);
				}

				Token Lexer::lexInteger(const char *tokStart) {
				ftynseUnsubmitted Done Reply Inline Actions The code in getUInt64IntegerValue seems to support hex integers, but this clearly does not. ftynse: The code in getUInt64IntegerValue seems to support hex integers, but this clearly does not.
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Yes, I am trimming liberally until MLIR exposes its lexer and parser at which point all this can disappear. nicolasvasilache: Yes, I am trimming liberally until MLIR exposes its lexer and parser at which point all this…
				// Match the rest of the identifier regex: [0-9a-zA-Z_\-]*
				while (isdigit(*curPtr))
				++curPtr;

				StringRef str(tokStart, curPtr - tokStart);
				return Token(Token::integer, str);
				}

				/// Skip a comment line, starting with a '//'.
				void Lexer::skipComment() {
				// Advance over the second '/' in a '//' comment.
				assert(*curPtr == '/');
				++curPtr;

				while (true) {
				switch (*curPtr++) {
				case '\n':
				case '\r':
				// Newline is end of comment.
				return;
				case 0:
				// If this is the end of the buffer, end the comment.
				if (curPtr - 1 == curBuffer.end()) {
				--curPtr;
				return;
				}
				LLVM_FALLTHROUGH;
				default:
				// Skip over other characters.
				break;
				}
				}
				}

				namespace {

				class Parser {
				public:
				Parser(llvm::SourceMgr &mgr, MLIRContext *ctx)
				: lexer(mgr), curToken(lexer.lexToken()), context(ctx) {}

				//===--------------------------------------------------------------------===//
				// Lexer Utilities
				//===--------------------------------------------------------------------===//

				/// Advance the current lexer onto the next token.
				void consumeToken() {
				assert(curToken.getKind() != Token::eof &&
				curToken.getKind() != Token::error &&
				"shouldn't advance past EOF or errors");
				curToken = lexer.lexToken();
				}
				void consumeToken(Token::Kind kind) {
				assert(curToken.getKind() == kind && "unexpected token");
				curToken = lexer.lexToken();
				}
				LogicalResult parseToken(Token::Kind kind, const Twine &msg) {
				if (curToken.getKind() != kind)
				return emitError(curToken.getLoc(), msg);
				consumeToken();
				return success();
				}
				LogicalResult emitError(llvm::SMLoc loc, const Twine &msg) {
				lexer.emitError(loc, msg);
				return failure();
				}
				LogicalResult emitError(const Twine &msg) {
				return emitError(curToken.getLoc(), msg);
				}
				bool consumeIf(Token::Kind kind) {
				if (curToken.isNot(kind))
				return false;
				consumeToken(kind);
				return true;
				}
				LogicalResult
				parseCommaSeparatedList(const std::function<ParseResult()> &parseElement) {
				ftynseUnsubmitted Done Reply Inline Actions Nit: llvm::function_ref if you don't store the argument ftynse: Nit: llvm::function_ref if you don't store the argument
				// Non-empty case starts with an element.
				if (parseElement())
				return failure();

				// Otherwise we have a list of comma separated elements.
				while (consumeIf(Token::comma)) {
				if (parseElement())
				return failure();
				}
				return success();
				}
				LogicalResult
				parseCommaSeparatedListUntil(Token::Kind rightToken,
				const std::function<ParseResult()> &parseElement,
				bool allowEmptyList) {
				// Handle the empty case.
				if (curToken.is(rightToken)) {
				if (!allowEmptyList)
				return emitError("expected list element");
				consumeToken(rightToken);
				return success();
				}

				if (failed(parseCommaSeparatedList(parseElement)) \|\|
				failed(parseToken(rightToken, "expected ',' or proper token")))
				ftynseUnsubmitted Done Reply Inline Actions "proper token" is unclear as error message ftynse: "proper token" is unclear as error message
				return failure();

				return success();
				}

				Lexer lexer;
				Token curToken;
				MLIRContext *context;
				};
				} // namespace

				//===----------------------------------------------------------------------===//
				// Affine parsing.
				//===----------------------------------------------------------------------===//

				namespace {

				/// Lower precedence ops (all at the same precedence level). LNoOp is false in
				/// the boolean sense.
				enum AffineLowPrecOp {
				/// Null value.
				LNoOp,
				Add,
				Sub
				};

				/// Higher precedence ops - all at the same precedence level. HNoOp is false
				/// in the boolean sense.
				enum AffineHighPrecOp {
				/// Null value.
				HNoOp,
				Mul,
				FloorDiv,
				CeilDiv,
				Mod
				};

				using AffineDimList = SmallVector<std::pair<StringRef, AffineExpr>, 4>;
				using AffineSymbolList = SmallVector<std::pair<StringRef, AffineExpr>, 4>;

				/// This is a specialized parser for affine expressions.
				class AffineParser {
				public:
				explicit AffineParser(Parser &p,
				std::function<AffineExpr(StringRef)> bareIdParsingHook,
				AffineDimList &dimList, AffineSymbolList &symbolList)
				: parser(p), bareIdFallback(bareIdParsingHook), dims(dimList),
				symbols(symbolList) {}

				/// Parse a comma-separated list of affine exprs.
				SmallVector<AffineExpr, 4>
				parseAffineExprs(Token::Kind lDelim = Token::l_paren,
				Token::Kind rDelim = Token::r_paren);

				/// Parse a single affine expr.`.
				AffineExpr parseAffineExpr();

				private:
				// Binary affine op parsing.
				AffineLowPrecOp consumeIfLowPrecOp();
				AffineHighPrecOp consumeIfHighPrecOp();

				// AffineExpr parsing.
				AffineExpr parseParentheticalExpr();
				AffineExpr parseNegateExpression(AffineExpr lhs);
				AffineExpr parseIntegerExpr();
				AffineExpr parseBareIdExpr();

				AffineExpr getAffineBinaryOpExpr(AffineHighPrecOp op, AffineExpr lhs,
				AffineExpr rhs, SMLoc opLoc);
				AffineExpr getAffineBinaryOpExpr(AffineLowPrecOp op, AffineExpr lhs,
				AffineExpr rhs);
				AffineExpr parseAffineOperandExpr(AffineExpr lhs);
				AffineExpr parseAffineLowPrecOpExpr(AffineExpr llhs, AffineLowPrecOp llhsOp);
				AffineExpr parseAffineHighPrecOpExpr(AffineExpr llhs, AffineHighPrecOp llhsOp,
				SMLoc llhsOpLoc);

				Parser &parser;
				std::function<AffineExpr(StringRef)> bareIdFallback;
				AffineDimList &dims;
				AffineSymbolList &symbols;
				};
				} // end anonymous namespace

				/// Create an affine binary high precedence op expression (mul's, div's, mod).
				/// opLoc is the location of the op token to be used to report errors
				/// for non-conforming expressions.
				AffineExpr AffineParser::getAffineBinaryOpExpr(AffineHighPrecOp op,
				AffineExpr lhs, AffineExpr rhs,
				SMLoc opLoc) {
				switch (op) {
				case Mul:
				if (!lhs.isSymbolicOrConstant() && !rhs.isSymbolicOrConstant()) {
				parser.emitError(opLoc,
				"non-affine expression: at least one of the multiply "
				"operands has to be either a constant or symbolic");
				return nullptr;
				}
				return lhs * rhs;
				case FloorDiv:
				if (!rhs.isSymbolicOrConstant()) {
				parser.emitError(opLoc,
				"non-affine expression: right operand of floordiv "
				"has to be either a constant or symbolic");
				return nullptr;
				}
				return lhs.floorDiv(rhs);
				case CeilDiv:
				if (!rhs.isSymbolicOrConstant()) {
				parser.emitError(opLoc, "non-affine expression: right operand of ceildiv "
				"has to be either a constant or symbolic");
				return nullptr;
				}
				return lhs.ceilDiv(rhs);
				case Mod:
				if (!rhs.isSymbolicOrConstant()) {
				parser.emitError(opLoc, "non-affine expression: right operand of mod "
				"has to be either a constant or symbolic");
				return nullptr;
				}
				return lhs % rhs;
				case HNoOp:
				llvm_unreachable("can't create affine expression for null high prec op");
				return nullptr;
				}
				llvm_unreachable("Unknown AffineHighPrecOp");
				}

				/// Create an affine binary low precedence op expression (add, sub).
				AffineExpr AffineParser::getAffineBinaryOpExpr(AffineLowPrecOp op,
				AffineExpr lhs, AffineExpr rhs) {
				switch (op) {
				case AffineLowPrecOp::Add:
				return lhs + rhs;
				case AffineLowPrecOp::Sub:
				return lhs - rhs;
				case AffineLowPrecOp::LNoOp:
				llvm_unreachable("can't create affine expression for null low prec op");
				return nullptr;
				}
				llvm_unreachable("Unknown AffineLowPrecOp");
				}

				/// Consume this token if it is a lower precedence affine op (there are only
				/// two precedence levels).
				AffineLowPrecOp AffineParser::consumeIfLowPrecOp() {
				switch (parser.curToken.getKind()) {
				case Token::plus:
				parser.consumeToken(Token::plus);
				ftynseUnsubmitted Done Reply Inline Actions Nit: the operand in `consumeToken` here and below is redundant, the case-expression just above ensures that the token is of the right kind. ftynse: Nit: the operand in `consumeToken` here and below is redundant, the case-expression just above…
				return AffineLowPrecOp::Add;
				case Token::minus:
				parser.consumeToken(Token::minus);
				return AffineLowPrecOp::Sub;
				default:
				return AffineLowPrecOp::LNoOp;
				}
				}

				/// Consume this token if it is a higher precedence affine op (there are only
				/// two precedence levels)
				AffineHighPrecOp AffineParser::consumeIfHighPrecOp() {
				switch (parser.curToken.getKind()) {
				case Token::star:
				parser.consumeToken(Token::star);
				return Mul;
				case Token::kw_floordiv:
				parser.consumeToken(Token::kw_floordiv);
				return FloorDiv;
				case Token::kw_ceildiv:
				parser.consumeToken(Token::kw_ceildiv);
				return CeilDiv;
				case Token::kw_mod:
				parser.consumeToken(Token::kw_mod);
				return Mod;
				default:
				return HNoOp;
				}
				}

				/// Parse a high precedence op expression list: mul, div, and mod are high
				/// precedence binary ops, i.e., parse a
				/// expr_1 op_1 expr_2 op_2 ... expr_n
				/// where op_1, op_2 are all a AffineHighPrecOp (mul, div, mod).
				/// All affine binary ops are left associative.
				/// Given llhs, returns (llhs llhsOp lhs) op rhs, or (lhs op rhs) if llhs is
				/// null. If no rhs can be found, returns (llhs llhsOp lhs) or lhs if llhs is
				/// null. llhsOpLoc is the location of the llhsOp token that will be used to
				/// report an error for non-conforming expressions.
				AffineExpr AffineParser::parseAffineHighPrecOpExpr(AffineExpr llhs,
				AffineHighPrecOp llhsOp,
				SMLoc llhsOpLoc) {
				AffineExpr lhs = parseAffineOperandExpr(llhs);
				if (!lhs)
				return nullptr;

				// Found an LHS. Parse the remaining expression.
				auto opLoc = parser.curToken.getLoc();
				if (AffineHighPrecOp op = consumeIfHighPrecOp()) {
				if (llhs) {
				AffineExpr expr = getAffineBinaryOpExpr(llhsOp, llhs, lhs, opLoc);
				if (!expr)
				return nullptr;
				return parseAffineHighPrecOpExpr(expr, op, opLoc);
				}
				// No LLHS, get RHS
				return parseAffineHighPrecOpExpr(lhs, op, opLoc);
				}

				// This is the last operand in this expression.
				if (llhs)
				return getAffineBinaryOpExpr(llhsOp, llhs, lhs, llhsOpLoc);

				// No llhs, 'lhs' itself is the expression.
				return lhs;
				}

				/// Parse an affine expression inside parentheses.
				///
				/// affine-expr ::= `(` affine-expr `)`
				AffineExpr AffineParser::parseParentheticalExpr() {
				if (failed(parser.parseToken(Token::l_paren, "expected '('")))
				return nullptr;
				if (parser.curToken.is(Token::r_paren))
				return (parser.emitError("no expression inside parentheses"), nullptr);

				auto expr = parseAffineExpr();
				if (!expr)
				return nullptr;
				if (failed(parser.parseToken(Token::r_paren, "expected ')'")))
				return nullptr;

				return expr;
				}

				/// Parse the negation expression.
				///
				/// affine-expr ::= `-` affine-expr
				AffineExpr AffineParser::parseNegateExpression(AffineExpr lhs) {
				if (failed(parser.parseToken(Token::minus, "expected '-'")))
				return nullptr;

				AffineExpr operand = parseAffineOperandExpr(lhs);
				// Since negation has the highest precedence of all ops (including high
				// precedence ops) but lower than parentheses, we are only going to use
				// parseAffineOperandExpr instead of parseAffineExpr here.
				if (!operand)
				// Extra error message although parseAffineOperandExpr would have
				// complained. Leads to a better diagnostic.
				return (parser.emitError("missing operand of negation"), nullptr);
				return (-1) * operand;
				}

				/// Parse a bare id that may appear in an affine expression.
				///
				/// affine-expr ::= bare-id
				AffineExpr AffineParser::parseBareIdExpr() {
				if (parser.curToken.isNot(Token::id))
				return (parser.emitError("expected id"), nullptr);

				StringRef sRef = parser.curToken.getSpelling();
				for (auto &list : {dims, symbols}) {
				for (auto entry : list) {
				if (entry.first == sRef) {
				parser.consumeToken(Token::id);
				return entry.second;
				}
				}
				}

				// Not found, check fallback path.
				AffineExpr expr = bareIdFallback(sRef);
				if (expr) {
				parser.consumeToken(Token::id);
				return expr;
				}

				return (parser.emitError("use of undeclared id"), nullptr);
				}

				/// Parse a positive integral constant appearing in an affine expression.
				///
				/// affine-expr ::= integer-literal
				AffineExpr AffineParser::parseIntegerExpr() {
				auto val = parser.curToken.getUInt64IntegerValue();
				if (!val.hasValue() \|\| (int64_t)val.getValue() < 0)
				return (parser.emitError("constant too large for index"), nullptr);

				parser.consumeToken(Token::integer);
				return getAffineConstantExpr((int64_t)val.getValue(), parser.context);
				}

				/// Parses an expression that can be a valid operand of an affine expression.
				/// lhs: if non-null, lhs is an affine expression that is the lhs of a binary
				/// operator, the rhs of which is being parsed. This is used to determine
				/// whether an error should be emitted for a missing right operand.
				// Eg: for an expression without parentheses (like i + j + k + l), each
				// of the four identifiers is an operand. For i + jk + l, jk is not an
				// operand expression, it's an op expression and will be parsed via
				// parseAffineHighPrecOpExpression(). However, for i + (jk) + -l, (jk) and
				// -l are valid operands that will be parsed by this function.
				AffineExpr AffineParser::parseAffineOperandExpr(AffineExpr lhs) {
				switch (parser.curToken.getKind()) {
				case Token::id:
				return parseBareIdExpr();
				case Token::integer:
				return parseIntegerExpr();
				case Token::l_paren:
				return parseParentheticalExpr();
				case Token::minus:
				return parseNegateExpression(lhs);
				case Token::kw_ceildiv:
				case Token::kw_floordiv:
				case Token::kw_mod:
				case Token::plus:
				case Token::star:
				if (lhs)
				parser.emitError("missing right operand of binary operator");
				else
				parser.emitError("missing left operand of binary operator");
				return nullptr;
				default:
				if (lhs)
				parser.emitError("missing right operand of binary operator");
				else
				parser.emitError("expected affine expression");
				return nullptr;
				}
				}

				/// Parse affine expressions that are bare-id's, integer constants,
				/// parenthetical affine expressions, and affine op expressions that are a
				/// composition of those.
				///
				/// All binary op's associate from left to right.
				///
				/// {add, sub} have lower precedence than {mul, div, and mod}.
				///
				/// Add, sub'are themselves at the same precedence level. Mul, floordiv,
				/// ceildiv, and mod are at the same higher precedence level. Negation has
				/// higher precedence than any binary op.
				///
				/// llhs: the affine expression appearing on the left of the one being parsed.
				/// This function will return ((llhs llhsOp lhs) op rhs) if llhs is non null,
				/// and lhs op rhs otherwise; if there is no rhs, llhs llhsOp lhs is returned
				/// if llhs is non-null; otherwise lhs is returned. This is to deal with left
				/// associativity.
				///
				/// Eg: when the expression is e1 + e2*e3 + e4, with e1 as llhs, this function
				/// will return the affine expr equivalent of (e1 + (e2*e3)) + e4, where
				/// (e2*e3) will be parsed using parseAffineHighPrecOpExpr().
				AffineExpr AffineParser::parseAffineLowPrecOpExpr(AffineExpr llhs,
				AffineLowPrecOp llhsOp) {
				AffineExpr lhs;
				if (!(lhs = parseAffineOperandExpr(llhs)))
				return nullptr;

				// Found an LHS. Deal with the ops.
				if (AffineLowPrecOp lOp = consumeIfLowPrecOp()) {
				if (llhs) {
				AffineExpr sum = getAffineBinaryOpExpr(llhsOp, llhs, lhs);
				return parseAffineLowPrecOpExpr(sum, lOp);
				}
				// No LLHS, get RHS and form the expression.
				return parseAffineLowPrecOpExpr(lhs, lOp);
				}
				auto opLoc = parser.curToken.getLoc();
				if (AffineHighPrecOp hOp = consumeIfHighPrecOp()) {
				// We have a higher precedence op here. Get the rhs operand for the llhs
				// through parseAffineHighPrecOpExpr.
				AffineExpr highRes = parseAffineHighPrecOpExpr(lhs, hOp, opLoc);
				if (!highRes)
				return nullptr;

				// If llhs is null, the product forms the first operand of the yet to be
				// found expression. If non-null, the op to associate with llhs is llhsOp.
				AffineExpr expr =
				llhs ? getAffineBinaryOpExpr(llhsOp, llhs, highRes) : highRes;

				// Recurse for subsequent low prec op's after the affine high prec op
				// expression.
				if (AffineLowPrecOp nextOp = consumeIfLowPrecOp())
				return parseAffineLowPrecOpExpr(expr, nextOp);
				return expr;
				}
				// Last operand in the expression list.
				if (llhs)
				return getAffineBinaryOpExpr(llhsOp, llhs, lhs);
				// No llhs, 'lhs' itself is the expression.
				return lhs;
				}

				/// Parse an affine expression.
				/// affine-expr ::= `(` affine-expr `)`
				/// \| `-` affine-expr
				/// \| affine-expr `+` affine-expr
				/// \| affine-expr `-` affine-expr
				/// \| affine-expr `*` affine-expr
				/// \| affine-expr `floordiv` affine-expr
				/// \| affine-expr `ceildiv` affine-expr
				/// \| affine-expr `mod` affine-expr
				/// \| bare-id
				/// \| integer-literal
				///
				/// Additional conditions are checked depending on the production. For eg.,
				/// one of the operands for `*` has to be either constant/symbolic; the second
				/// operand for floordiv, ceildiv, and mod has to be a positive integer.
				AffineExpr AffineParser::parseAffineExpr() {
				return parseAffineLowPrecOpExpr(nullptr, AffineLowPrecOp::LNoOp);
				}

				SmallVector<AffineExpr, 4> AffineParser::parseAffineExprs(Token::Kind lDelim,
				Token::Kind rDelim) {
				parser.parseToken(lDelim, "expected lDelim at start of affine expr list");

				SmallVector<AffineExpr, 4> exprs;
				auto parseElt = [&]() -> LogicalResult {
				auto elt = parseAffineExpr();
				exprs.push_back(elt);
				return elt ? success() : failure();
				};

				if (failed(parser.parseCommaSeparatedListUntil(rDelim, parseElt, true)))
				assert(false);

				return exprs;
				}

				ftynseUnsubmitted Done Reply Inline Actions llvm_unreachable ? ftynse: llvm_unreachable ?
				//===----------------------------------------------------------------------===//
				// TC parsing.
				//===----------------------------------------------------------------------===//

				namespace {

				/// Base class for expressions involved in TC parsing.
				struct Expression {
				enum class Kind {
				Uninitialized = 0,
				TensorExpr = 1,
				TensorUse = 2,
				};

				explicit Expression(Kind k = Kind::Uninitialized) : kind(k) {}
				virtual ~Expression() {}
				virtual bool equals(const Expression &e) const = 0;
				ftynseUnsubmitted Done Reply Inline Actions This should be trivial to implement without resorting to virtual functions, dispatching on `kind` and using static_cast. ftynse: This should be trivial to implement without resorting to virtual functions, dispatching on…
				operator bool() const { return kind != Kind::Uninitialized; }

				Kind kind;
				};

				/// Encodes a tensor use of the form:
				///
				/// affine-expr-list ::= affine-expr (`,` affine-expr)*
				/// tensor-use ::= tensor-id `(` `)` \|
				ftynseUnsubmitted Done Reply Inline Actions Nit: tensor-id is not defined ftynse: Nit: tensor-id is not defined
				/// tensor-id `(` affine-expr-list `)`
				///
				/// The affine-expr-list is stored as an AffineMap.
				struct TensorUse : public Expression {
				TensorUse() : TensorUse("", AffineMap()) {}
				TensorUse(const TensorUse &use) : TensorUse(use.tensorId, use.indexingMap) {}
				ftynseUnsubmitted Done Reply Inline Actions Nit: `= default` would also work ftynse: Nit: `= default` would also work
				TensorUse(StringRef name, AffineMap map)
				: Expression(Kind::TensorUse), tensorId(name), indexingMap(map) {}

				static bool classof(const Expression *e) {
				ftynseUnsubmitted Done Reply Inline Actions If you use LLVM-style type system, you would normally want to avoid virtual functions... ftynse: If you use LLVM-style type system, you would normally want to avoid virtual functions...
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions why ? https://llvm.org/docs/HowToSetUpLLVMStyleRTTI.html#basic-setup shows it's perfectly fine to use abstract base classes and LLVM RTTI. nicolasvasilache: why ? https://llvm.org/docs/HowToSetUpLLVMStyleRTTI.html#basic-setup shows it's perfectly fine…
				ftynseUnsubmitted Done Reply Inline Actions Because you pay the runtime overhead price for two abstractions serving essentially the same goal. Why? ftynse: Because you pay the runtime overhead price for two abstractions serving essentially the same…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Marking as done, this is part of the more global reply on downcasting, uptr etc. nicolasvasilache: Marking as done, this is part of the more global reply on downcasting, uptr etc.
				return e->kind == Kind::TensorUse;
				}

				bool operator==(const TensorUse &other) const {
				return tensorId == other.tensorId && indexingMap == other.indexingMap;
				}

				bool equals(const Expression &e) const override {
				if (e.kind != Expression::Kind::TensorUse)
				return false;
				return *this == static_cast<const TensorUse &>(e);
				}

				/// Visitation function. Performs `PreOrder` traversal and applies `callback`
				ftynseUnsubmitted Done Reply Inline Actions Nit: given that `PreOrder` is a boolean template parameter, I am not sure what "perfoms `PreOrder` traversal` means when the parameter is false. Post-order? In-order? Compilation error? ftynse: Nit: given that `PreOrder` is a boolean template parameter, I am not sure what "perfoms…
				/// on each node.
				template <typename Lambda, bool PreOrder> void visit(Lambda callback);
				Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - template <typename Lambda, bool PreOrder> void visit(Lambda callback); + template <typename Lambda, bool PreOrder> + void visit(Lambda callback); Lint: Pre-merge checks: clang-format: please reformat the code ``` - template <typename Lambda, bool PreOrder> void…

				StringRef tensorId;
				AffineMap indexingMap;
				};

				/// Encodes a tensor expression of the form:
				///
				/// op-spec ::= id `<` reduction-dims-list `>` \|
				/// id
				/// op-arg ::= tensor-expr \|
				/// tensor-use
				/// op-arg-list ::= op-arg (`,` op-arg)*
				/// tensor-expr ::= op-spec `(` op-arg-list `)`
				///
				/// Underlying op-arg are stored by unique_ptr to base class.
				struct TensorExpr : public Expression {
				TensorExpr(StringRef name,
				SmallVector<std::unique_ptr<Expression>, 4> &&exprs,
				ftynseUnsubmitted Done Reply Inline Actions Would MutableArrayRef work instead of hardcoding SmallVector with a given size? ftynse: Would MutableArrayRef work instead of hardcoding SmallVector with a given size?
				ArrayRef<unsigned> reductionDims)
				: Expression(Kind::TensorExpr), opId(name), expressions(std::move(exprs)),
				reductionDimensions(reductionDims.begin(), reductionDims.end()) {}

				static bool classof(const Expression *e) {
				return e->kind == Kind::TensorExpr;
				}

				bool operator==(const TensorExpr &other) const {
				if (opId != other.opId)
				return false;
				if (expressions.size() != other.expressions.size())
				return false;
				for (unsigned i = 0, e = expressions.size(); i < e; ++i)
				if (!expressions[i]->equals(*other.expressions[i]))
				return false;
				for (unsigned i = 0, e = reductionDimensions.size(); i < e; ++i)
				ftynseUnsubmitted Done Reply Inline Actions Do you care about the order of reduction dimensions? ftynse: Do you care about the order of reduction dimensions?
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions you should otherwise your computation is non-deterministic nicolasvasilache: you should otherwise your computation is non-deterministic
				ftynseUnsubmitted Done Reply Inline Actions I suppose you mean the IR you produce does not have a deterministic order of dimensions, which makes it hard to check. TC semantics says that all dimensions are interchangable, so their textual order should not matter. If it does, we should discuss the semantics and avoid branding this input as TC-like. ftynse: I suppose you mean the IR you produce does not have a deterministic order of dimensions, which…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Added a documentation section in `Linalg.md`. nicolasvasilache: Added a documentation section in `Linalg.md`.
				if (reductionDimensions[i] != other.reductionDimensions[i])
				return false;
				return true;
				}

				bool equals(const Expression &e) const override {
				if (e.kind != Expression::Kind::TensorExpr)
				return false;
				return *this == static_cast<const TensorExpr &>(e);
				}

				/// Visitation function. Performs `PreOrder` traversal and applies `callback`
				/// on each node.
				template <typename Lambda, bool PreOrder> void visit(Lambda callback);
				Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - template <typename Lambda, bool PreOrder> void visit(Lambda callback); + template <typename Lambda, bool PreOrder> + void visit(Lambda callback); Lint: Pre-merge checks: clang-format: please reformat the code ``` - template <typename Lambda, bool PreOrder> void…

				StringRef opId;
				SmallVector<std::unique_ptr<Expression>, 4> expressions;
				ftynseUnsubmitted Done Reply Inline Actions Do you actually need pointers? Can't we just store `Expression`s as is, eventually with appropriate move semantics to avoid extra copies? ftynse: Do you actually need pointers? Can't we just store `Expression`s as is, eventually with…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions It's this or uniqu'ing, underlying storage, placement new etc etc. I went for the simple solution. When we have strong data that we need to scale much more we can revisit. nicolasvasilache: It's this or uniqu'ing, underlying storage, placement new etc etc. I went for the simple…
				ftynseUnsubmitted Done Reply Inline Actions I don't think vectors of unique_ptr are simpler than vectors of values. This is an extra abstraction with associated cognitive overhead. This is also one extra dynamic allocation per element, as opposed to occasional allocations in the vector, and no strong reason to maintain the pointer as unique or to auto-deallocate (other than you forced the allocation in the first place). There is no actual uniquing of expression, neither is there underlying storage or placement new, you seem to be mistaking this with how types/attributes are handled in MLIR. ftynse: I don't think vectors of unique_ptr are simpler than vectors of values. This is an extra…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions unique_ptr + abstract base class: basically because I use downcasting. `vector<Expression>` will slice unless derived classes have sizeof == 0 (i.e. there is an underlying pointer payload). An option is to implement a similar arena + pImpl to what MLIR does for the "by-value" abstractions. I consider this to be unnecessarily complex for my use case (parser that runs at compiler compile time): `vector<unique_ptr<...>>` is a standard and simple way to solve the slicing and ownership issue, its performance drawback are not relevant at this time IMO. nicolasvasilache: unique_ptr + abstract base class: basically because I use downcasting. `vector<Expression>`…
				SetVector<unsigned> reductionDimensions;
				ftynseUnsubmitted Done Reply Inline Actions Why SetVector? In TC, we wouldn't care about the order of reduction dimensions. ftynse: Why SetVector? In TC, we wouldn't care about the order of reduction dimensions.
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions reductions loops don't commute in FP land nicolasvasilache: reductions loops don't commute in FP land
				};

				/// This is a specialized parser for a TCDef.
				/// This maintains the dims it finds in an eager fashion.
				class TCParser {
				enum class EagerDiscoveryMode { None = 0, Symbols, Dimensions };

				public:
				explicit TCParser(Parser &p);

				/// Uses the AffineParser to parse the affine exprs used in a tensor
				/// definition. All identifiers are interpreted as symbols, new symbols are
				/// added eagerly.
				ftynseUnsubmitted Done Reply Inline Actions And what if discovery mode != symbols ? ftynse: And what if discovery mode != symbols ?
				SmallVector<AffineExpr, 4>
				parseAffineExprs(EagerDiscoveryMode discoveryMode, AffineDimList &dims,
				Token::Kind lDelim = Token::l_paren,
				Token::Kind rDelim = Token::r_paren);

				/// Parse the information for a tensor def of the form:
				///
				/// affine-expr-list ::= affine-expr (`,` affine-expr )*
				/// tensor-typedef ::= type `(` `)` \|
				ftynseUnsubmitted Done Reply Inline Actions Nit: something went wrong with formatting here: `\|` ran away to the right. I personally prefer something like foo ::= token token continuation line of the same rule \| another rule ftynse: Nit: something went wrong with formatting here: `\|` ran away to the right. I personally prefer…
				/// type `(` affine-expr-list `)`
				/// tensor-def ::= tensor-id `:` tensor-typedef
				///
				/// All the affine-expr in a `tensor-typedef` must be dimensionless (i.e.
				/// contain only expressions involving symbols and constants), but can
				/// otherwise contain arbitrary affine expressions.
				LogicalResult parseTensorDef(bool isOutput);

				/// Parses a tensor use of the form:
				///
				/// affine-expr-list ::= affine-expr (`,` affine-expr)*
				/// tensor-use ::= tensor-id `(` `)` \|
				/// tensor-id `(` affine-expr-list `)`
				struct ComprehensionParsingState {
				AffineDimList dims;
				SmallVector<std::unique_ptr<Expression>, 4> expressions;
				llvm::DenseMap<TensorUse, unsigned> orderedTensorArgs;
				};
				LogicalResult parseTensorUse(TensorUse &result,
				ComprehensionParsingState &state);

				/// Parses a tensor expression of the form:
				///
				/// op-spec ::= id `<` reduction-dims-list `>` \|
				ftynseUnsubmitted Done Reply Inline Actions Nit: this comment repeats the comment on `struct TensorExpr`. I am worried about it getting out of sync if the syntax evolves. My recommendation would be to only keep the syntax in a single comment (preferably, the implementation of this method), and just refer to that from the other comments. ftynse: Nit: this comment repeats the comment on `struct TensorExpr`. I am worried about it getting out…
				/// id
				silvasUnsubmitted Done Reply Inline Actions some spaces needed around first tensor-def-list silvas: some spaces needed around first tensor-def-list
				/// op-arg ::= tensor-expr \|
				/// tensor-use
				/// op-arg-list ::= op-arg (`,` op-arg)*
				silvasUnsubmitted Done Reply Inline Actions I don't see affine-expr or tensor-typedef mentioned locally in this comment. move this to the appropriate comment? silvas: I don't see affine-expr or tensor-typedef mentioned locally in this comment. move this to the…
				/// tensor-expr ::= op-spec `(` op-arg-list `)`
				LogicalResult parseExpression(TensorUse currentDefinition,
				std::unique_ptr<Expression> *result,
				ftynseUnsubmitted Done Reply Inline Actions Nit: why pass by-pointer rather than by-reference? ftynse: Nit: why pass by-pointer rather than by-reference?
				ComprehensionParsingState &state);

				/// Parse a single comprehension.
				///
				/// tensor-def-list ::= tensor-def (`,` tensor-def)*
				/// tensor-expr-list ::= tensor-expr (`,` tensor-expr)*
				/// comprehension ::= tensor-def-list `=` tensor-expr-list `;`
				LogicalResult parseOneComprehension(llvm::raw_ostream &os,
				ftynseUnsubmitted Done Reply Inline Actions Why does a parsing function accept an _output_ stream? ftynse: Why does a parsing function accept an _output_ stream?
				StringRef cppOpName,
				StringRef linalgOpName,
				ComprehensionParsingState &state);

				/// Parse and print the information for a TC def.
				ftynseUnsubmitted Done Reply Inline Actions Plz document what does it print ftynse: Plz document what does it print
				///
				/// tensor-def-list ::= tensor-def (`,` tensor-def )*
				///
				/// comprehension-list ::= comprehension comprehension*
				///
				/// tc-def ::= `def` tc-id `(` tensor-def-list `)`
				/// `->` `(` tensor-def-list `)` `{` comprehension-list `}`
				LogicalResult parseAndEmitTCDef(llvm::raw_ostream &os);

				/// Print the ODS class that defines a new `cppOpName` for a `linalgOpName`.
				void printODS(llvm::raw_ostream &os, StringRef cppOpName,
				StringRef linalgOpName);

				/// Print the C++ StructuredOpsInterface impl of `referenceIterators`.
				void printReferenceIterators(llvm::raw_ostream &os, StringRef opId,
				ComprehensionParsingState &state);

				/// Print the C++ StructuredOpsInterface impl of `referenceIndexingMaps`.
				void printReferenceIndexingMaps(llvm::raw_ostream &os, StringRef opId,
				ComprehensionParsingState &state);

				/// Print the C++ StructuredOpsInterface impl of `regionBuilder`.
				void printRegionBuilder(llvm::raw_ostream &os, StringRef opId,
				ComprehensionParsingState &state);

				private:
				//===--------------------------------------------------------------------===//
				// Internal bookkeeping of tensors.
				//===--------------------------------------------------------------------===//
				struct RegisteredTensor {
				StringRef name;
				StringRef type;
				AffineMap shape;
				bool isOutput;
				AffineMap indexingMap;
				};
				void registerTensor(StringRef tensorId, StringRef tensorType, AffineMap map,
				bool isOutput);
				RegisteredTensor &getRegisteredTensor(StringRef id);
				unsigned getRegisteredTensorIndex(StringRef id);
				bool isaRegisteredTensor(StringRef id);

				//===--------------------------------------------------------------------===//
				// Per-TC def state.
				//===--------------------------------------------------------------------===//
				/// Symbols are per TC def.
				AffineSymbolList symbols;
				/// Tensors are per TC def.
				SmallVector<RegisteredTensor, 4> tensors;

				Parser &parser;
				};
				} // namespace

				namespace llvm {
				template <> struct DenseMapInfo<TensorExpr> {
				Lint: Pre-merge checks Inline Actions clang-format: please reformat the code -template <> struct DenseMapInfo<TensorExpr> { +template <> +struct DenseMapInfo<TensorExpr> { Lint: Pre-merge checks: clang-format: please reformat the code ``` -template <> struct DenseMapInfo<TensorExpr> {…
				ftynseUnsubmitted Done Reply Inline Actions Do you actually need this? I only see `DenseMap<TensorExpr >`, which should be using a generic pointer-based map implementation. ftynse:* Do you actually need this? I only see `DenseMap<TensorExpr *>`, which should be using a generic…
				static TensorExpr getEmptyKey() { return TensorExpr("", {}, {}); }
				static TensorExpr getTombstoneKey() {
				return TensorExpr("__tombstone__", {}, {});
				ftynseUnsubmitted Done Reply Inline Actions How about `DenseMapInfo<StringRef>::getTombstoneKey()` instead? ftynse: How about `DenseMapInfo<StringRef>::getTombstoneKey()` instead?
				}
				static unsigned getHashValue(const TensorExpr &val) {
				return ::llvm::hash_value(val.opId); // don't care about collisions.
				}
				static bool isEqual(const TensorExpr &LHS, const TensorExpr &RHS) {
				return LHS == RHS;
				}
				};

				template <> struct DenseMapInfo<TensorUse> {
				Lint: Pre-merge checks Inline Actions clang-format: please reformat the code -template <> struct DenseMapInfo<TensorUse> { +template <> +struct DenseMapInfo<TensorUse> { Lint: Pre-merge checks: clang-format: please reformat the code ``` -template <> struct DenseMapInfo<TensorUse> {…
				static TensorUse getEmptyKey() { return TensorUse("", AffineMap()); }
				static TensorUse getTombstoneKey() {
				return TensorUse("__tombstone__", AffineMap());
				ftynseUnsubmitted Done Reply Inline Actions Same as above, AffineMap also has a DenseMapInfo if I'm not mistaken. ftynse: Same as above, AffineMap also has a DenseMapInfo if I'm not mistaken.
				}
				static unsigned getHashValue(const TensorUse &val) {
				return ::llvm::hash_value(val.tensorId); // don't care about collisions.
				}
				static bool isEqual(const TensorUse &LHS, const TensorUse &RHS) {
				return LHS == RHS;
				}
				};

				} // namespace llvm

				//===----------------------------------------------------------------------===//
				// Visitation functions.
				//===----------------------------------------------------------------------===//

				template <typename Lambda, bool PreOrder>
				void visit(Expression &expr, Lambda callback) {
				ftynseUnsubmitted Done Reply Inline Actions Nit: document how the visitation behaves if the callback mutates the visited object ftynse: Nit: document how the visitation behaves if the callback mutates the visited object
				switch (expr.kind) {
				default:
				llvm_unreachable("Unexpected kind");
				case Expression::Kind::TensorExpr:
				static_cast<TensorExpr &>(expr).visit<Lambda, PreOrder>(callback);
				break;
				case Expression::Kind::TensorUse:
				static_cast<TensorUse &>(expr).visit<Lambda, PreOrder>(callback);
				break;
				}
				}

				template <typename Lambda>
				void visitPreorder(Expression &expr, Lambda callback) {
				visit<Lambda, false>(expr, callback);
				}

				template <typename Lambda>
				void visitPostorder(Expression &expr, Lambda callback) {
				visit<Lambda, true>(expr, callback);
				}

				template <typename Lambda, bool PreOrder>
				void TensorExpr::visit(Lambda callback) {
				if (!PreOrder)
				callback(*this);
				for (auto &e : expressions)
				::visit<Lambda, PreOrder>(*e, callback);
				if (PreOrder)
				callback(*this);
				}

				template <typename Lambda, bool PreOrder>
				void TensorUse::visit(Lambda callback) {
				callback(*this);
				}

				//===----------------------------------------------------------------------===//
				// Internal bookkeeping of tensors.
				//===----------------------------------------------------------------------===//
				void TCParser::registerTensor(StringRef tensorId, StringRef tensorType,
				AffineMap map, bool isOutput) {
				tensors.push_back(
				ftynseUnsubmitted Done Reply Inline Actions Nit: would emplace_back work? ftynse: Nit: would emplace_back work?
				RegisteredTensor{tensorId, tensorType, map, isOutput, AffineMap()});
				LLVM_DEBUG(llvm::dbgs() << "Recorded: " << tensorId << " "
				<< "with typeString: " << tensorType << " "
				<< "and shape: " << map << "\n");
				}

				TCParser::RegisteredTensor &TCParser::getRegisteredTensor(StringRef id) {
				for (unsigned i = 0, e = tensors.size(); i < e; ++i)
				if (tensors[i].name == id)
				ftynseUnsubmitted Done Reply Inline Actions Have you considered storing tensors in an llvm::StringMap indexed by name instead of doing linear lookups every time? ftynse: Have you considered storing tensors in an llvm::StringMap indexed by name instead of doing…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions I need 2 extra maps and really don't anticipate a single named op to ever to a point where this would matter. Of course if proven otherwise I'm happy to reconsider. nicolasvasilache: I need 2 extra maps and really don't anticipate a single named op to ever to a point where this…
				ftynseUnsubmitted Done Reply Inline Actions Well, you currently have two extra vectors. I just don't see why prefer using a vector of pairs and implementing a search for _every one of them_ is better than using a dedicated container with accessor immediately available. ftynse: Well, you currently have two extra vectors. I just don't see why prefer using a vector of pairs…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions fair enough, done, thanks! nicolasvasilache: fair enough, done, thanks!
				return tensors[i];
				llvm_unreachable("Unregistered tensor");
				}

				unsigned TCParser::getRegisteredTensorIndex(StringRef id) {
				for (unsigned i = 0, e = tensors.size(); i < e; ++i)
				if (tensors[i].name == id)
				return i;
				llvm_unreachable("Unregistered tensor");
				}

				bool TCParser::isaRegisteredTensor(StringRef id) {
				ftynseUnsubmitted Done Reply Inline Actions Naming nit: `isa` is widely used for downcasting, this is just a lookup; prefer `is`. ftynse: Naming nit: `isa` is widely used for downcasting, this is just a lookup; prefer `is`.
				for (unsigned i = 0, e = tensors.size(); i < e; ++i)
				if (tensors[i].name == id)
				return true;
				return false;
				}

				//===----------------------------------------------------------------------===//
				// TC parsing functions.
				//===----------------------------------------------------------------------===//
				TCParser::TCParser(Parser &p) : parser(p) {}

				/// Uses the AffineParser to parse the affine exprs used in a tensor
				/// definition. All identifiers are interpreted as symbols, new symbols are
				/// added eagerly.
				SmallVector<AffineExpr, 4>
				TCParser::parseAffineExprs(EagerDiscoveryMode discoveryMode,
				AffineDimList &dims, Token::Kind lDelim,
				Token::Kind rDelim) {
				AffineParser affineParser(
				parser,
				[&](StringRef sRef) {
				AffineExpr expr;
				if (discoveryMode == EagerDiscoveryMode::Symbols) {
				expr = getAffineSymbolExpr(symbols.size(), parser.context);
				symbols.push_back(std::make_pair(sRef, expr));
				ftynseUnsubmitted Done Reply Inline Actions Would emplace_back work? ftynse: Would emplace_back work?
				} else if (discoveryMode == EagerDiscoveryMode::Dimensions) {
				expr = getAffineDimExpr(dims.size(), parser.context);
				dims.push_back(std::make_pair(sRef, expr));
				}
				return expr;
				},
				dims, symbols);
				return affineParser.parseAffineExprs(lDelim, rDelim);
				}

				/// Parse the information for a tensor def of the form:
				///
				/// affine-expr-list ::= affine-expr (`,` affine-expr )*
				/// tensor-typedef ::= type `(` `)` \|
				/// type `(` affine-expr-list `)`
				/// tensor-def ::= tensor-id `:` tensor-typedef
				LogicalResult TCParser::parseTensorDef(bool isOutput) {
				StringRef tensorId = parser.curToken.getSpelling();
				if (failed(parser.parseToken(Token::id, "expected a name id")) \|\|
				failed(parser.parseToken(Token::colon, "expected colon")))
				ftynseUnsubmitted Done Reply Inline Actions Could you just have a default message `expected %tokenname%` instead of having a similar string everywhere ftynse: Could you just have a default message `expected %tokenname%` instead of having a similar string…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions I'm reluctant to invest more in duplicating something that should be exposed by core in a later NFC revision. nicolasvasilache: I'm reluctant to invest more in duplicating something that should be exposed by core in a later…
				return failure();

				StringRef tensorType = parser.curToken.getSpelling();
				if (failed(parser.parseToken(Token::id, "expected a type id")))
				ftynseUnsubmitted Done Reply Inline Actions It looks like it would parse just about any id. "expected a type id" sounds a bit misleading because "type id" is not a production rule, and there's no additional check on the id somehow being a type. ftynse: It looks like it would parse just about any id. "expected a type id" sounds a bit misleading…
				return failure();

				AffineDimList emptyDims;
				auto exprs = parseAffineExprs(EagerDiscoveryMode::Symbols, emptyDims);
				assert(emptyDims.empty());
				ftynseUnsubmitted Done Reply Inline Actions Nit: add a description in the assertion. Also, are we sure this can never happen? ftynse: Nit: add a description in the assertion. Also, are we sure this can never happen?
				AffineMap map =
				AffineMap::get(/dimCount=/0, symbols.size(), exprs, parser.context);

				registerTensor(tensorId, tensorType, map, isOutput);

				return success();
				}

				/// Parses a tensor use of the form:
				///
				/// affine-expr-list ::= affine-expr (`,` affine-expr)*
				/// tensor-use ::= tensor-id `(` `)` \|
				/// tensor-id `(` affine-expr-list `)`
				LogicalResult TCParser::parseTensorUse(TensorUse &result,
				ComprehensionParsingState &state) {
				StringRef tensorId = parser.curToken.getSpelling();
				if (failed(parser.parseToken(Token::id, "expected an id")))
				return failure();

				auto exprs = parseAffineExprs(EagerDiscoveryMode::Dimensions, state.dims);
				AffineMap map =
				AffineMap::get(state.dims.size(), symbols.size(), exprs, parser.context);
				LLVM_DEBUG(llvm::dbgs() << "Use of tensor: " << tensorId << " map: " << map
				<< "\n");

				result = TensorUse(tensorId, map);
				return success();
				}

				/// Parses a tensor expression of the form:
				///
				/// op-spec ::= id `<` reduction-dims-list `>` \|
				/// id
				/// op-arg ::= tensor-expr \|
				/// tensor-use
				/// op-arg-list ::= op-arg (`,` op-arg)*
				/// tensor-expr ::= op-spec `(` op-arg-list `)`
				LogicalResult TCParser::parseExpression(TensorUse currentDefinition,
				std::unique_ptr<Expression> *result,
				ComprehensionParsingState &state) {
				StringRef opOrTensor = parser.curToken.getSpelling();
				if (isaRegisteredTensor(opOrTensor)) {
				TensorUse use;
				auto res = parseTensorUse(use, state);
				if (failed(res))
				return res;
				*result = std::make_unique<TensorUse>(use);
				return success();
				}

				if (failed(parser.parseToken(Token::id, "expected an operation")))
				return failure();

				// This is an op.
				SmallVector<unsigned, 4> reductionDims;
				SmallVector<std::unique_ptr<Expression>, 4> expressions;

				// Check if it has a reduction set, discover dimensions eagerly.
				if (parser.curToken.is(Token::lt)) {
				auto iters = parseAffineExprs(EagerDiscoveryMode::Dimensions, state.dims,
				Token::lt, Token::gt);
				for (auto iter : iters)
				reductionDims.push_back(iter.cast<AffineDimExpr>().getPosition());
				}

				// If this op is a reduction, it's first argument is the `currentDefinition`
				// tensor use.
				if (!reductionDims.empty())
				expressions.push_back(std::make_unique<TensorUse>(currentDefinition));
				LLVM_DEBUG(llvm::dbgs() << "op: " << opOrTensor << "\n");

				auto parseExpr = [&]() -> LogicalResult {
				std::unique_ptr<Expression> e;
				if (failed(parseExpression(currentDefinition, &e, state)))
				return failure();
				expressions.push_back(std::move(e));
				return success();
				};
				if (failed(parser.parseToken(Token::l_paren, "expected `(`")) \|\|
				ftynseUnsubmitted Done Reply Inline Actions Ultra-nit: we tend to use single quotes rather than backticks in error messages ftynse: Ultra-nit: we tend to use single quotes rather than backticks in error messages
				failed(
				parser.parseCommaSeparatedListUntil(Token::r_paren, parseExpr, true)))
				ftynseUnsubmitted Done Reply Inline Actions Nit: `/allowEmptyList=/true` ftynse: Nit: `/allowEmptyList=/true`
				return failure();

				*result = std::make_unique<TensorExpr>(opOrTensor, std::move(expressions),
				reductionDims);

				return success();
				}

				//===----------------------------------------------------------------------===//
				// Parse and Emit functions.
				//===----------------------------------------------------------------------===//

				/// Parse and print the information for a single comprehension.
				///
				/// tensor-def-list ::= tensor-def (`,` tensor-def)*
				/// tensor-expr-list ::= tensor-expr (`,` tensor-expr)*
				/// comprehension ::= tensor-def-list `=` tensor-expr-list `;`
				LogicalResult
				TCParser::parseOneComprehension(llvm::raw_ostream &os, StringRef cppOpName,
				StringRef linalgOpName,
				ComprehensionParsingState &state) {
				// 1. Parse LHS of `=`, these become the definitions that appear as the output
				// tensors or read/write buffers.
				SmallVector<TensorUse, 4> definitions;
				auto parseUse = [&]() -> LogicalResult {
				TensorUse use;
				if (failed(parseTensorUse(use, state)))
				return failure();
				definitions.push_back(use);
				return success();
				};
				if (failed(parser.parseCommaSeparatedListUntil(Token::equal, parseUse, true)))
				ftynseUnsubmitted Done Reply Inline Actions `/allowEmptyList=/true` ftynse: `/allowEmptyList=/true`
				return failure();

				// 2. Parse RHS of `=`, this becomes the expressions from which we emit
				// computations.
				unsigned idx = 0;
				auto parseExpr = [&]() -> LogicalResult {
				std::unique_ptr<Expression> expr;
				if (failed(parseExpression(definitions[idx++], &expr, state)))
				ftynseUnsubmitted Done Reply Inline Actions This may crash if you have less LHS declarations than RHS definitions. ftynse: This may crash if you have less LHS declarations than RHS definitions.
				return failure();
				state.expressions.push_back(std::move(expr));
				return success();
				};
				if (failed(parser.parseCommaSeparatedListUntil(Token::semicolon, parseExpr,
				ftynseUnsubmitted Done Reply Inline Actions `/allowEmptyList=/true` ftynse: `/allowEmptyList=/true`
				true)))
				return failure();

				// 3. Postprocess.
				// 3.a. Normalize all maps to the proper dimCount and symbolCount.
				ftynseUnsubmitted Done Reply Inline Actions `dimCount` and `symbolCount` make the comment look outdated, is it? ftynse: `dimCount` and `symbolCount` make the comment look outdated, is it?
				SmallVector<TensorUse, 4> allUses;
				allUses.reserve(tensors.size());
				for (auto &def : definitions)
				allUses.push_back(def);
				for (auto &pExpr : state.expressions)
				visitPostorder(*pExpr, [&](Expression &e) {
				if (auto *use = dyn_cast<TensorUse>(&e))
				allUses.push_back(*use);
				});
				for (auto &use : allUses)
				use.indexingMap =
				AffineMap::get(state.dims.size(), symbols.size(),
				use.indexingMap.getResults(), parser.context);

				// 3.b. Traverse definitions
				llvm::DenseSet<StringRef> seenDefs;
				for (auto &def : definitions) {
				auto &tensor = getRegisteredTensor(def.tensorId);
				if (seenDefs.count(def.tensorId) > 0) {
				parser.emitError("Unexpected multi-write with different indexings to a "
				ftynseUnsubmitted Done Reply Inline Actions Did you check that indexings were different? ftynse: Did you check that indexings were different?
				"single tensor");
				}
				seenDefs.insert(def.tensorId);
				tensor.indexingMap = def.indexingMap;
				state.orderedTensorArgs[def] = getRegisteredTensorIndex(def.tensorId);
				}

				bool failed = false;
				for (auto &pExpr : state.expressions)
				visitPostorder(*pExpr, [&](Expression &e) {
				if (failed)
				return;
				if (auto *pUse = dyn_cast<TensorUse>(&e)) {
				ftynseUnsubmitted Done Reply Inline Actions Nit: I'd use early return here ftynse: Nit: I'd use early return here
				auto &use = *pUse;
				auto &tensor = getRegisteredTensor(use.tensorId);
				LLVM_DEBUG(llvm::dbgs()
				<< "\nuse: " << use.tensorId << " map: " << use.indexingMap);
				if (tensor.indexingMap && state.orderedTensorArgs.count(use) == 0) {
				LLVM_DEBUG(llvm::dbgs() << "\nexisting: " << tensor.indexingMap);
				silvasUnsubmitted Done Reply Inline Actions should this be a diagnostic? silvas: should this be a diagnostic?
				parser.emitError(
				"Unexpected multi-read of a tensor with different accesses");
				ftynseUnsubmitted Done Reply Inline Actions [Not for this commit]: I would rather have the parser accept the correct syntax, and have a separate check that implements "semantic" rules. ftynse: [Not for this commit]: I would rather have the parser accept the correct syntax, and have a…
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Agreed, there are a few other things for follwups too, thanks! nicolasvasilache: Agreed, there are a few other things for follwups too, thanks!
				failed = true;
				return;
				}
				seenDefs.insert(use.tensorId);
				tensor.indexingMap = use.indexingMap;
				state.orderedTensorArgs[use] = getRegisteredTensorIndex(use.tensorId);
				}
				silvasUnsubmitted Done Reply Inline Actions How can we emit ODS before we finish processing the whole `tc-def` production? silvas: How can we emit ODS before we finish processing the whole `tc-def` production?
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions Thanks! nicolasvasilache: Thanks!
				});
				if (failed)
				return failure();

				return success();
				}

				/// Parse and print the information for a TC def.
				///
				/// tensor-def-list ::= tensor-def (`,` tensor-def )*
				///
				/// comprehension-list ::= comprehension comprehension*
				///
				/// tc-def ::= `def` tc-id `(`tensor-def-list`)` `->` `(` tensor-def-list`)`
				/// `{` comprehension-list `}`
				///
				/// All the affine-expr in a `tensor-typedef` must be dimensionless (i.e.
				/// contain only expressions involving symbols and constants), but can
				/// otherwise contain arbitrary affine expressions.
				LogicalResult TCParser::parseAndEmitTCDef(llvm::raw_ostream &os) {
				if (failed(parser.parseToken(Token::kw_def, "expected 'def' to define a TC")))
				return failure();

				StringRef tcName = parser.curToken.getSpelling();
				LLVM_DEBUG(llvm::dbgs() << "\n\nStart parsing tc: " << tcName << "\n");
				if (failed(parser.parseToken(Token::id, "expected id")) \|\|
				failed(parser.parseToken(Token::l_paren, "expected '('")))
				return failure();

				auto parseInputDef = [&]() -> LogicalResult {
				return parseTensorDef(/isOutput=/false);
				silvasUnsubmitted Done Reply Inline Actions maybe rename to "parseAndEmitTCDef"? Also probably rename processOneComprehension to parseAndEmitOneComprehension to be consistent with that. silvas: maybe rename to "parseAndEmitTCDef"? Also probably rename processOneComprehension to…
				};
				if (failed(parser.parseCommaSeparatedListUntil(Token::r_paren, parseInputDef,
				false)))
				ftynseUnsubmitted Done Reply Inline Actions `/allowEmptyList=/true` ftynse: `/allowEmptyList=/true`
				return failure();

				if (failed(parser.parseToken(Token::minus, "expected '-'")) \|\|
				failed(parser.parseToken(Token::gt, "expected '>'")) \|\|
				failed(parser.parseToken(Token::l_paren, "expected '('")))
				return failure();
				auto parseOutputDef = [&]() -> LogicalResult {
				return parseTensorDef(/isOutput=/true);
				};
				if (failed(parser.parseCommaSeparatedListUntil(Token::r_paren, parseOutputDef,
				false)))
				ftynseUnsubmitted Done Reply Inline Actions `/allowEmptyList=/true` ftynse: `/allowEmptyList=/true`
				return failure();

				// Since we don't declare symbols separately, we discover them eagerly: each
				// newly encountered id in a tensor shape expression is treated as a new
				silvasUnsubmitted Done Reply Inline Actions typo in the "expected" string. silvas: typo in the "expected" string.
				// symbolicc. At this point, all tensors have been parsed and all the symbols
				ftynseUnsubmitted Done Reply Inline Actions typo: "symbolicc" ftynse: typo: "symbolicc"
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions it's a faster `symbolcc` nicolasvasilache: it's a faster `symbolcc`
				// that could be discovered eagerly are now known. Resize all AffineMaps to
				// normalize the number of eagerly discovered symbols.
				for (auto &tensor : tensors) {
				auto &map = tensor.shape;
				map = AffineMap::get(/dimCount=/0, symbols.size(), map.getResults(),
				parser.context);
				}

				if (failed(parser.parseToken(Token::l_brace, "expected '{'")))
				return failure();
				silvasUnsubmitted Done Reply Inline Actions Can you make this comment a bit easier to understand. What is an "eagerly discovered symbol" and how does this "normalize" it? silvas: Can you make this comment a bit easier to understand. What is an "eagerly discovered symbol"…

				SmallVector<ComprehensionParsingState, 4> perComprehensionStates;
				while (parser.curToken.isNot(Token::r_brace)) {
				perComprehensionStates.push_back(ComprehensionParsingState());
				if (failed(parseOneComprehension(os, tcName, tcName,
				perComprehensionStates.back())))
				silvasUnsubmitted Done Reply Inline Actions Instead of the ternary, use static AffineMap get(unsigned dimCount, unsigned symbolCount, ArrayRef<AffineExpr> results, MLIRContext context); silvas:* Instead of the ternary, use ``` static AffineMap get(unsigned dimCount, unsigned symbolCount…
				return failure();
				};
				parser.parseToken(Token::r_brace, "expected '}'");

				// Print.
				auto nComprehensions = perComprehensionStates.size();
				if (nComprehensions != 1) {
				parser.emitError("only 1 comprehension supported for now, got: " +
				llvm::Twine(nComprehensions));
				silvasUnsubmitted Done Reply Inline Actions comma separated comprehensions seems to contradict the grammar? silvas: comma separated comprehensions seems to contradict the grammar?
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions you're right, thanks! nicolasvasilache: you're right, thanks!
				return failure();
				}
				if (genODSDecl) {
				printODS(os, tcName, tcName);
				os << "\n";
				}
				if (genODSImpl) {
				auto &state = perComprehensionStates.back();
				std::string extraMethods;
				llvm::raw_string_ostream ss(extraMethods);
				printReferenceIterators(ss, tcName, state);
				printReferenceIndexingMaps(ss, tcName, state);
				printRegionBuilder(ss, tcName, state);
				ss.flush();
				os << extraMethods << "\n";
				}

				return success();
				}

				//===----------------------------------------------------------------------===//
				// Printing functions
				//===----------------------------------------------------------------------===//

				/// Print the ODS class that defines a new `cppOpName` for a `linalgOpName`.
				void TCParser::printODS(llvm::raw_ostream &os, StringRef cppOpName,
				StringRef linalgOpName) {
				const char *header = R"FMT( def {0}Op : LinalgNamedStructured_Op<"{1}", [
				NInputs<{2}>,
				NOutputs<{3}>,
				NamedStructuredOpTraits]> {
				let arguments = (ins Variadic<LinalgOperand>:$views);
				let results = (outs Variadic<AnyRankedTensor>:$output_tensors);
				let extraClassDeclaration = [{{
				llvm::Optional<SmallVector<StringRef, 8>> referenceIterators();
				ftynseUnsubmitted Done Reply Inline Actions Why is the result optional? ftynse: Why is the result optional?
				nicolasvasilacheAuthorUnsubmitted Done Reply Inline Actions this is what the ODS currently is because of manual "named ops", will be cleaned later. nicolasvasilache: this is what the ODS currently is because of manual "named ops", will be cleaned later.
				llvm::Optional<SmallVector<AffineMap, 8>> referenceIndexingMaps();
				void regionBuilder(ArrayRef<BlockArgument> args);
				}];
				let hasFolder = 1;
				})FMT";

				unsigned nInputs = 0, nOutputs = 0;
				for (auto &t : tensors) {
				if (t.isOutput)
				nOutputs++;
				else
				nInputs++;
				}

				os << llvm::formatv(header, cppOpName, linalgOpName, nInputs, nOutputs);
				}

				/// Print the C++ StructuredOpsInterface impl of `referenceIterators`.
				void TCParser::printReferenceIterators(llvm::raw_ostream &os, StringRef opId,
				ComprehensionParsingState &state) {
				const char *referenceReferenceIteratorsFmt =
				R"FMT(
				llvm::Optional<SmallVector<StringRef, 8>> {0}::referenceIterators() {
				return SmallVector<StringRef, 8>{{ {1} };
				})FMT";

				std::string iteratorsStr;
				llvm::raw_string_ostream ss(iteratorsStr);
				unsigned pos = 0;
				interleaveComma(state.dims, ss, [&](std::pair<StringRef, AffineExpr> p) {
				bool reduction = false;
				for (auto &expr : state.expressions) {
				visitPostorder(*expr, [&](Expression &e) {
				if (auto *pTensorExpr = dyn_cast<TensorExpr>(&e)) {
				if (pTensorExpr->reductionDimensions.count(pos) > 0)
				reduction = true;
				}
				});
				if (reduction)
				break;
				}
				ss << (reduction ? "getReductionIteratorTypeName()"
				: "getParallelIteratorTypeName()");
				pos++;
				});
				ss.flush();

				os << llvm::formatv(referenceReferenceIteratorsFmt, opId, iteratorsStr);
				}

				/// Print the C++ StructuredOpsInterface impl of `referenceIndexingMaps`.
				void TCParser::printReferenceIndexingMaps(llvm::raw_ostream &os, StringRef opId,
				ComprehensionParsingState &state) {
				const char *referenceIndexingMapsFmt =
				R"FMT(
				llvm::Optional<SmallVector<AffineMap, 8>> {0}::referenceIndexingMaps() {
				MLIRContext *context = getContext();
				AffineExpr {1};
				bindDims(context, {1});
				return SmallVector<AffineMap, 8>{{ {2} };
				})FMT";

				std::string dimsStr;
				llvm::raw_string_ostream ss(dimsStr);
				interleaveComma(state.dims, ss,
				[&](std::pair<StringRef, AffineExpr> p) { ss << p.second; });
				ss.flush();

				std::string mapsStr;
				llvm::raw_string_ostream ss2(mapsStr);
				ftynseUnsubmitted Done Reply Inline Actions Nit: could we use more meaningful names than `ss2`? ftynse: Nit: could we use more meaningful names than `ss2`?
				SmallVector<TensorUse, 4> orderedUses(state.orderedTensorArgs.size());
				for (auto it : state.orderedTensorArgs)
				orderedUses[it.second] = it.first;
				interleaveComma(orderedUses, ss2, [&](TensorUse u) {
				assert(u.indexingMap);
				const char *mapFmt = "\n\tAffineMap::get({0}, 0, {1})";
				if (u.indexingMap.isEmpty()) {
				ss2 << llvm::formatv(mapFmt, state.dims.size(), "context");
				return;
				}

				std::string exprsStr;
				llvm::raw_string_ostream ss3(exprsStr);
				ss3 << "{";
				interleaveComma(u.indexingMap.getResults(), ss3);
				ss3 << "}";
				ss3.flush();

				ss2 << llvm::formatv(mapFmt, state.dims.size(), exprsStr);
				});
				ss2.flush();

				os << llvm::formatv(referenceIndexingMapsFmt, opId, dimsStr, mapsStr);
				}

				/// Print the C++ StructuredOpsInterface impl of `regionBuilder`.
				void TCParser::printRegionBuilder(llvm::raw_ostream &os, StringRef opId,
				ComprehensionParsingState &state) {
				unsigned count = state.orderedTensorArgs.size();
				llvm::DenseMap<TensorExpr *, unsigned> subExprsMap;
				std::function<void(llvm::raw_ostream & os, Expression &)> printExpr;
				printExpr = [&](llvm::raw_ostream &os, Expression &e) -> void {
				if (auto *pUse = dyn_cast<TensorUse>(&e)) {
				os << "_" << state.orderedTensorArgs[*pUse];
				return;
				}
				auto *pTensorExpr = cast<TensorExpr>(&e);
				if (subExprsMap.count(pTensorExpr) > 0) {
				os << "_" << subExprsMap[pTensorExpr];
				} else {
				std::string subExprs;
				llvm::raw_string_ostream ss(subExprs);
				interleaveComma(
				pTensorExpr->expressions, ss,
				[&](const std::unique_ptr<Expression> &e) { printExpr(ss, *e); });
				ss.flush();
				const char *tensorExprFmt = "\n ValueHandle _{0} = {1}({2});";
				os << llvm::formatv(tensorExprFmt, ++count, pTensorExpr->opId, subExprs);
				subExprsMap[pTensorExpr] = count;
				}
				};

				const char *regionBuilderFmt = R"FMT(
				void {0}::regionBuilder(ArrayRef<BlockArgument> args) {
				using namespace edsc;
				using namespace intrinsics;
				ValueHandle {1};
				{2}
				(linalg_yield(ValueRange{ {3} }));
				})FMT";

				unsigned idx = 0;
				std::string valueHandleStr;
				llvm::raw_string_ostream ss(valueHandleStr);
				interleaveComma(state.orderedTensorArgs, ss,
				[&](decltype(state.orderedTensorArgs.begin())::value_type) {
				ftynseUnsubmitted Done Reply Inline Actions C++14 supports `auto` for lambda arguments ftynse: C++14 supports `auto` for lambda arguments
				ss << "_" << idx << "(args[" << idx << "])";
				idx++;
				});

				std::string expressionsStr;
				llvm::raw_string_ostream ss2(expressionsStr);
				for (auto &expr : state.expressions)
				visitPostorder(*expr, [&](Expression &e) {
				if (e.kind == Expression::Kind::TensorExpr)
				printExpr(ss2, e);
				});

				std::string yieldStr;
				llvm::raw_string_ostream ss3(yieldStr);
				interleaveComma(
				state.expressions, ss3,
				[&](const std::unique_ptr<Expression> &e) { printExpr(ss3, *e); });

				ss.flush();
				ftynseUnsubmitted Done Reply Inline Actions Alternatively, you could use `ss.str()` instead of `valueHandleStr` below. Also, consider better names than ss, ss2, ss3. One `ss` is acceptable in a short function, but here it's really tricky to keep in mind which stream is associated with which string. ftynse: Alternatively, you could use `ss.str()` instead of `valueHandleStr` below. Also, consider…
				ss2.flush();
				ss3.flush();

				os << llvm::formatv(regionBuilderFmt, opId, valueHandleStr, expressionsStr,
				yieldStr);
				}

				/// Iterate over each Tensor Comprehension def.
				LogicalResult parseAndEmitAllTensorComprehensions(llvm::raw_ostream &os,
				Parser &parser) {
				while (parser.curToken.getKind() != Token::eof) {
				silvasUnsubmitted Done Reply Inline Actions auto here obscures things IMO silvas: auto here obscures things IMO
				TCParser tcParser(parser);
				if (failed(tcParser.parseAndEmitTCDef(os)))
				return failure();
				}
				return success();
				}
				silvasUnsubmitted Done Reply Inline Actions auto here obscures things IMO silvas: auto here obscures things IMO

				int main(int argc, char **argv) {
				llvm::cl::ParseCommandLineOptions(argc, argv, "Linalg ODS Gen");

				// Set up the input file.
				std::string errorMessage;
				std::unique_ptr<llvm::MemoryBuffer> file =
				mlir::openInputFile(inputFilename, &errorMessage);
				if (!file) {
				llvm::errs() << errorMessage << "\n";
				return 1;
				}

				std::unique_ptr<llvm::ToolOutputFile> output =
				openOutputFile(outputFilename, &errorMessage);
				if (!output) {
				llvm::errs() << errorMessage << "\n";
				exit(1);
				}

				MLIRContext context;
				llvm::SourceMgr mgr;
				mgr.AddNewSourceBuffer(std::move(file), llvm::SMLoc());
				Parser parser(mgr, &context);
				parseAndEmitAllTensorComprehensions(output->os(), parser);
				output->keep();

				return 0;
				}

This is an archive of the discontinued LLVM Phabricator instance.

[mlir][Linalg] Create a tool to generate named Linalg ops from a Tensor Comprehensions-like specification.ClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 255067

mlir/include/mlir/IR/AffineExpr.h

mlir/lib/IR/AffineExpr.cpp

mlir/test/CMakeLists.txt

mlir/test/lit.cfg.py

mlir/test/mlir-linalg-ods-gen/test-linalg-ods-gen.tc

mlir/tools/CMakeLists.txt

mlir/tools/mlir-linalg-ods-gen/CMakeLists.txt

mlir/tools/mlir-linalg-ods-gen/mlir-linalg-ods-gen.cpp

[mlir][Linalg] Create a tool to generate named Linalg ops from a Tensor Comprehensions-like specification.
ClosedPublic