This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/
-
mlir/
-
Bytecode/
-
BytecodeImplementation.h
7/7
BytecodeWriter.h
-
IR/
14/15
AsmState.h
10/10
BuiltinDialectBytecode.h
-
lib/
-
Bytecode/
-
Reader/
4/4
BytecodeReader.cpp
-
Writer/
1/1
BytecodeWriter.cpp
-
IRNumbering.cpp
-
IR/
-
BuiltinDialect.cpp
-
BuiltinDialectBytecode.h
4/5
BuiltinDialectBytecode.cpp
-
test/
-
Bytecode/
-
bytecode_callback.mlir
-
bytecode_callback_full_override.mlir
-
bytecode_callback_with_custom_attribute.mlir
-
bytecode_callback_with_custom_type.mlir
-
invalid/
-
invalid_attr_type_section.mlir
-
lib/
-
Dialect/Test/
-
Test/
-
TestDialect.h
-
TestDialect.cpp
-
TestOps.td
-
TestTypeDefs.td
-
IR/
-
CMakeLists.txt
10/10
TestBytecodeCallbacks.cpp
-
tools/mlir-opt/
-
mlir-opt/
-
mlir-opt.cpp

Differential D153383

Expose callbacks for encoding of types/attributes
ClosedPublic

Authored by mfrancio on Jun 20 2023, 3:33 PM.

Download Raw Diff

Details

Reviewers

mehdi_amini
jpienaar
rriddle
nicolasvasilache

Commits

rGbff6a4292f80: Expose callbacks for encoding of types/attributes
rGb299ec16661f: Expose callbacks for encoding of types/attributes

Summary

[mlir] Expose a mechanism to provide a callback for encoding types and attributes in MLIR bytecode.

Two callbacks are exposed, respectively, to the BytecodeWriterConfig and to the ParserConfig. At bytecode parsing/printing, clients have the ability to specify a callback to be used to optionally read/write the encoding. On failure, fallback path will execute the default parsers and printers for the dialect.

Testing shows how to leverage this functionality to support back-deployment and backward-compatibility usecases when roundtripping to bytecode a client dialect with type/attributes dependencies on upstream.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

mfrancio created this revision.Jun 20 2023, 3:33 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 20 2023, 3:33 PM

Herald added subscribers: bviyer, Moerafaat, zero9178 and 19 others. · View Herald Transcript

mfrancio requested review of this revision.Jun 20 2023, 3:33 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 20 2023, 3:33 PM

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald Transcript

I'm not sure I understand the rationale here. This looks like a very awkward side band, and quite invasive. I don't see why we'd want to open up the bytecode for arbitrary encodings, this should be driven solely by the dialect itself.

This revision now requires changes to proceed.Jun 20 2023, 3:37 PM

mehdi_amini added inline comments.Jun 20 2023, 3:43 PM

mlir/include/mlir/Bytecode/BytecodeWriter.h

Seems like not natural to me as an API to anchor this to the dialect, that seems a bit arbitrary to me.

When we talked about it I was thinking something more flat and simpler:

void addOverrideCallback(std::function<bool(Type)> callback) {
}
Smallvector<std::function<bool(Type)>> typeOverrideCallbacks;

And the writer would do:

// First try to process the given type with the provided override
for (auto &callback : typeOverrideCallbacks)
  if (callback(type)) return

// continue with normal emission

Harbormaster completed remote builds in B240109: Diff 533067.Jun 20 2023, 3:56 PM

mehdi_amini added inline comments.Jun 20 2023, 3:58 PM

mlir/include/mlir/Bytecode/BytecodeWriter.h
63	I guess what I wrote is not enough: there is more than the exact encoding, there may be some remapping to another dialect as well.

In D153383#4436145, @rriddle wrote:

I'm not sure I understand the rationale here. This looks like a very awkward side band, and quite invasive. I don't see why we'd want to open up the bytecode for arbitrary encodings, this should be driven solely by the dialect itself.

Thanks for your feedback. I completely agree in the principle: a dialect should be the only driver of the encoding. However, this comes with a limitation - if a versioned client dialect wants to control its own encoding, there is no way to do it currently without re-defining and reimplementing all the types and attributes. The patch tries to address this problem by exposing a callback, which offers clients the chance to decouple the encoding of types and attributes that are defined as part of the upstream dialects from their upstream encoding, so that a versioned client dialect that uses unversioned upstream types/attributes can maintain forward/backward compatibility independently from the upstream development.

mlir/include/mlir/Bytecode/BytecodeWriter.h
63	I found it as a compelling and explicit way to override Type and Attributes printer/parser of any dialect with a specific encoding. But agreed, it is arbitrary. I can remove this anchor and simplify it a little bit.

Remove ability to specify the dialect associated with each callback.

mfrancio added inline comments.Jun 20 2023, 8:35 PM

mlir/include/mlir/Bytecode/BytecodeWriter.h
63	I think what implemented should be enough to be able to override the encoding of a type/attribute if its definition jumps from one dialect to another - this is true as long as the encoding is always owned by the callback, before and after the jump.

Harbormaster completed remote builds in B240143: Diff 533112.Jun 20 2023, 8:59 PM

Adds a test exercising roundtrip to bytecode with a custom encoding of IntegerType.

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptJul 10 2023, 5:00 PM

mfrancio added inline comments.Jul 10 2023, 5:07 PM

mlir/test/lib/IR/TestBytecodeCallbacks.cpp
91	@mehdi_amini this gives an idea of what we would do on parsing - we would get the dialect version to parse from the version map. Right now though there is no system to specify a version on writing, and this is the closest I could go :). We could post-fix this once the proper API exists if you are ok in leaving the TODO, or I can propose an implementation and finalize the work.

mehdi_amini added inline comments.Jul 10 2023, 8:02 PM

mlir/test/lib/IR/TestBytecodeCallbacks.cpp
52	Can you write `Type entryValue` and drop the `if constexpr` in the body?
66	I'd be interested to see an example where you actually take a `!test.i32` as input and write it as a builtin IntegerType (and show that we can parse it as such), and vice-versa.
91	LG

Harbormaster completed remote builds in B244317: Diff 538873.Jul 10 2023, 8:14 PM

Adds bytecode roundtrip tests with custom integer types.

Harbormaster completed remote builds in B244607: Diff 539292.Jul 11 2023, 2:16 PM

mfrancio marked 3 inline comments as done.Jul 11 2023, 2:22 PM

mfrancio added inline comments.

mlir/test/lib/IR/TestBytecodeCallbacks.cpp
52	No, the callback needs to compile for both Type entryValue and Attribute entryValue. We could do this with two separate callbacks if you feel it's a bit cumbersome to force the use of auto in the callback signature.
66	Test added. Note that it is not possible to parse !test.i32 as a native builtin integer type (hence, without adding a specific callback for it, or without a custom parser that falls back to the builtin integer type parser, which I did not implement for the sake of this test) because the the "owner" of such encoding is still the test dialect.

clang-format commit.

Harbormaster completed remote builds in B244608: Diff 539294.Jul 11 2023, 2:25 PM

mfrancio updated this revision to Diff 539301.Jul 11 2023, 2:52 PM

mehdi_amini added inline comments.Jul 11 2023, 3:21 PM

mlir/test/lib/IR/TestBytecodeCallbacks.cpp
52	What about taking a union of type/attr instead of templating this? Right now this adds some cognitive complexity that does not seems necessarily justified to me. Alternatively we could indeed split it in two callback registration, that can be fine as well.
66	I understand we need a callback, but can't you implement the callback in this file to show we can parse !test.i32 as a native builtin integer type?

Harbormaster completed remote builds in B244613: Diff 539301.Jul 11 2023, 7:27 PM

split callbacks between type and attributes
expose builtin bytecode dialect interface
parse explicitly with bytecode dialect interface when testing interoperability with bytecode types/attributes within a callback

mfrancio marked an inline comment as done.Jul 11 2023, 10:22 PM

mfrancio added inline comments.

mlir/test/lib/IR/TestBytecodeCallbacks.cpp
52	Switched to two callbacks - it is a bit more code, but the function signature is cleaner and more explicit.
66	I did export the integer type parser code within the callback - but effectively was not very clear. Right now I exported the bytecode dialect interface and used this explicitly to write/read, which helps showcasing the feature.

Harbormaster completed remote builds in B244669: Diff 539379.Jul 12 2023, 12:53 AM

mehdi_amini added inline comments.Jul 12 2023, 1:14 AM

mlir/include/mlir/IR/AsmState.h
83	I would think we should distinguish between "no handling" and "error during parsing" here, that is the API should be able to fail the parsing.

Handle failure in read callback API
Add test showcasing the feature

mfrancio marked an inline comment as done.Jul 12 2023, 10:55 AM

mfrancio added inline comments.

mlir/include/mlir/IR/AsmState.h
83	Thanks for pointing this out, it is indeed a very useful feature. I changed the API and added a test for it.

burmako added a subscriber: burmako.Jul 12 2023, 11:10 AM

burmako added inline comments.

mlir/lib/Bytecode/Reader/BytecodeReader.cpp
1248	How do you envision the contract between customized serialization and deserialization? E.g. how does a consumer of bytecode payload know that a specific payload was generated via serializer callbacks and how do they know where to obtain the corresponding deserializer callbacks?

mfrancio marked an inline comment as done.Jul 12 2023, 12:48 PM

mfrancio added inline comments.

mlir/lib/Bytecode/Reader/BytecodeReader.cpp
1248	Since the callbacks are driven by the client, it's up to the client to decide. Assuming you are serializing a module that contains a versioned dialect, I would envision handling such scenario through the dialect version. I don't think upstream dialects should use those callbacks in any way.

Harbormaster completed remote builds in B244850: Diff 539639.Jul 12 2023, 2:05 PM

mfrancio mentioned this in D155340: Add support for versioning properties in MLIR bytecode.Jul 15 2023, 12:29 PM

Simplifies signature of the read callback by leveraging the dialect reader to retrieve context and dialect versions.

Harbormaster completed remote builds in B245639: Diff 540749.Jul 15 2023, 5:14 PM

(forgot to add feedback earlier)

mlir/include/mlir/Bytecode/BytecodeWriter.h
57	Nit: let's keep these called writer rather than printer (could be emitter too as that matches some of the comments).
mlir/include/mlir/IR/AsmState.h
27	Let's keep these sorted
67	I don't think the _ naming convention is used anywhere else here, of you want to signal internal it could be in private section.
73	emitting -> parsing (or reading or ingesting)
562	So these are just flat arrays rather than grouped by (say) type it handles?
mlir/include/mlir/IR/BuiltinDialectBytecode.h
22	I think we should just make this now builtin::detail
24	I was semi in between exposing the interface vs just helper methods generated and then the add method below (the add method doesn't add much given full interface here). But I was thinking of different composition then.
47	Don't know Mehdi s opinion, but I'd probably put this just in builtin namespace along with dialect. This is rather "public" level API for me.

mehdi_amini added inline comments.Jul 15 2023, 7:09 PM

mlir/include/mlir/IR/AsmState.h
561	Nit: can you make the return type explicit (ArrayRef<...>) here? I don't think the `auto` return is widely used in the codebase, and it goes a bit against the "use auto only when the type is obvious from the context" practice.
mlir/include/mlir/IR/BuiltinDialectBytecode.h
47	I agree: in general "detail" namespace aren't meant to contain things to be directly used by clients. But also stepping back: I'm not convinced this entire file should be publicly exposed. We should be able to have a single entry point in MLIR for writing a type or an attribute.
mlir/lib/IR/BuiltinDialectBytecode.cpp
91	I believe that in general we prefer `using namespace` in implementation files, and fully qualify function definition?

Makes BuiltinDialectBytecodeInterface private
Exposes write/read bytecode functions for types and attributes using the builtin encoding
Addresses few nits and comments

mfrancio marked 11 inline comments as done.Jul 16 2023, 10:00 PM

mfrancio added inline comments.

mlir/include/mlir/IR/AsmState.h
562	Well, the idea is that you would encode a function that handles the abstract type and not the concrete, then handling the concrete types using a type switch within the callback itself as needed. I feel passing a callback per concrete type would be more cumbersome to use?
mlir/include/mlir/IR/BuiltinDialectBytecode.h
47	I avoided exposing the interface while adding an entry point to read/write types and attributes using the builtin encoding. I used the builtin namespace even though I found out it wasn't used for the builtin dialect - it seems that this dialect lives directly at the mlir level. However, I have the impression that read/write bytecode functions would be best under the builtin namespace to be explicit about the underlying encoding.

Harbormaster completed remote builds in B245720: Diff 540855.Jul 16 2023, 10:19 PM

mehdi_amini added inline comments.Jul 16 2023, 10:21 PM

mlir/include/mlir/IR/BuiltinDialectBytecode.h
47	I really meant that we should be able to call « writeAttribute » without knowing the dialect to target: I don’t quite get what is special about the built in dialect here?

mfrancio marked an inline comment as done.Jul 17 2023, 8:17 AM

mfrancio added inline comments.

mlir/include/mlir/IR/BuiltinDialectBytecode.h
47	I got what you mean and yes, write functions for attribute and type could be made dialect agnostic - what made me stop is that it's now clear how that would extend to read functions, since the encoding does not contain dialect info, and there is nothing that prevents different dialects to use the same encoding for different things. I found this incongruence a bit confusing, to a point that it seemed suggesting that it would not fit in the current design? For context, see for example Builtin and Quantization (those functions are generated through tablegen) - both dialect encode type 1 and there is no way to disambiguate in a "dialect agnostic" fashion. Builtin: static Type readType(MLIRContext* context, DialectBytecodeReader &reader) { uint64_t kind; if (failed(reader.readVarInt(kind))) return Type(); switch (kind) { case 0: return readIntegerType(context, reader); case 1: return readIndexType(context, reader); Quantization: static Type readType(MLIRContext* context, DialectBytecodeReader &reader) { uint64_t kind; if (failed(reader.readVarInt(kind))) return Type(); switch (kind) { case 1: return readAnyQuantizedType(context, reader); case 2: return readAnyQuantizedTypeWithExpressedType(context, reader); Furthermore, it is reasonable to expect to have more of such conflicts as the MLIR bytecode gains popularity.

mehdi_amini added inline comments.Jul 17 2023, 11:52 AM

mlir/include/mlir/IR/BuiltinDialectBytecode.h
47	OK I get it: it is similar to what we do for print/parse textual assembly. The convention there is that every type/attribute class expose a print/parse method, can we align on the same model for bytecode? We wouldn't need to emit the discriminant for which attribute it is when we know which one to emit! (similar to textual ASM)

Removes readAttribute() and readType() APIs that take a version as argument since the dialect version is available through the dialect reader.

mfrancio added inline comments.Jul 18 2023, 1:55 PM

mlir/include/mlir/IR/BuiltinDialectBytecode.h
47	A non-breaking exension of the ASM approach to bytecode does not seem straightforward because of our current bytecode serialization approach. To summarize the textual ASM approach: builtin types/attributes have known tokens that can be used to detect if a type or an attribute is owned by builtin. If it is not, textual ASM uses a special token (!), which triggers emission of an extended type. At parsing, the extended type token is processed first and this triggers the use of the specific dialect parser. Now porting this approach into bytecode would mean changing how types or attributes are encoded. Types and attributes are grouped together when emitted to bytecode. At parsing, we parse first the dialect owning the group, which will determine the encoding (this is done per-group and not per-attribute, as opposed to the ASM case). We could implement an API that works similarly to textual ASM, but unless i am missing some details, integrating it into the existing bytecode format would require some additional work if we want to maintain backwards compatibility, which is probably beyond the scope of the current patch. We could attempt doing this in the future and I would be happy to take the work.

mehdi_amini added inline comments.Jul 18 2023, 2:49 PM

mlir/include/mlir/IR/BuiltinDialectBytecode.h
47	I agree with you, the encoding difference makes it harder here. Seems reasonable.
mlir/lib/Bytecode/Reader/BytecodeReader.cpp
315	Formatting only?
mlir/lib/Bytecode/Writer/BytecodeWriter.cpp
828–834	Can you extract the above in a lambda "emitAttrOrTypeImpl(..)": you could do a lot of early return which would simplify the control flow and reduce the indentation. The `hasCustomEncoding` boolean should disappear basically.

Harbormaster completed remote builds in B246342: Diff 541718.Jul 18 2023, 8:03 PM

Extend callbacks to allow remapping from one dialect group to another when writing/reading bytecode.

mfrancio marked an inline comment as done.Jul 19 2023, 2:12 PM

Refactors emitAttrOrType lambda to simplify control flow.

mfrancio marked an inline comment as done.Jul 19 2023, 3:01 PM

mfrancio marked 7 inline comments as done.Jul 19 2023, 3:38 PM

mfrancio updated this revision to Diff 542263.Jul 19 2023, 5:43 PM

improve few comments

Harbormaster completed remote builds in B248346: Diff 544481.Jul 26 2023, 12:59 PM

rriddle added inline comments.Jul 26 2023, 1:54 PM

mlir/include/mlir/Bytecode/BytecodeWriter.h
56	Can you ArrayRef here and below instead?
mlir/include/mlir/IR/AsmState.h
41–44	It feels quite weird to have something bytecode related not in Bytecode/. Why does this need to be here instead of hooked into the `BytecodeWriterConfig`?
mlir/lib/IR/BuiltinDialectBytecode.cpp
115–130	Why do we need to expose these at all? As opposed to casting the builtin dialect to DialectBytecodeInterface, and going through the normal path?

mfrancio marked an inline comment as done.Jul 26 2023, 2:35 PM

mfrancio added inline comments.

mlir/include/mlir/Bytecode/BytecodeWriter.h
56	Absolutely, I'll push a revision.
mlir/include/mlir/IR/AsmState.h
41–44	I agree and I tried to search for a better location. The issue is that we currently have a unified entry point for the parser (`ParserConfig`) which works for both text and bytecode. Unless we want to refactor this, at least the reader side of this class need to stay in this header. Since the same logic is used for `AsmResourcePrinter` (defined here, but used in bytecode writer config), I thought this could work?
mlir/lib/IR/BuiltinDialectBytecode.cpp
115–130	That would require exposing the interface for builtin, which I had originally done in a previous version of the patch :). However, after few rounds of revisions the consensus was that it could have been better to leave the interface as an internal implementation detail and expose single hooks. In previous revisions, I was also asked to try to expose top level entry point for writing type/attributes in a dialect agnostic fashion, similarly to what the textual parser does, but we realized that it was not going to fit in the current design. Hence, exposing those hooks was the closest I could go to that idea, which preserves the original intent of @jpienaar (having the bytecode dialect interface as an internal implementation detail and not exposed). Those functions here are only needed for the tests, but it is not unreasonable to expect clients to use them. Let me know if you are strongly against leaving as is, and I can revise as necessary.

rriddle added inline comments.Jul 26 2023, 4:01 PM

mlir/include/mlir/IR/AsmState.h
41–44	Hmrmrm, can we split out all of the bytecode config into a BytecodeReaderConfig and have that in the parser config? Would be good to keep all of the bytecode pieces isolated.
mlir/lib/IR/BuiltinDialectBytecode.cpp
115–130	DialectBytecodeInterface is an already exposed api, you can just do `cast<DialectBytecodeInterface>(type.getDialect())` to get the virtual instance. I don't see which part of that requires exposing anything from the builtin dialect?

mfrancio marked 2 inline comments as done.Jul 26 2023, 4:48 PM

mfrancio added inline comments.

mlir/include/mlir/IR/AsmState.h
41–44	Sure, it seems like a good change. I'll do that and update the patch. Thanks for the feedback.
mlir/lib/IR/BuiltinDialectBytecode.cpp
115–130	I apologize for the confusion - it seems casting directly is not allowed, but I was able to retrieve the virtual instance through getRegisteredInterface<BytecodeDialectInterface>()! For some reason I had assumed at first that type id of the interface was going to be different. Thanks for pointing it out.

Move bytecode related code into Bytecode/. with appropriate header
Make naming convention uniform (parse<->read, print<->write)
Rebase

mfrancio marked an inline comment as done.Jul 26 2023, 9:51 PM

mfrancio added inline comments.

mlir/include/mlir/IR/AsmState.h
41–44	I created a new header specific for the bytecode reader config to avoid a circular dependency (AsmState <-> BytecodeReader).

Harbormaster completed remote builds in B248431: Diff 544596.Jul 26 2023, 10:19 PM

Nice

mlir/include/mlir/IR/AsmState.h
560	Could we have just an accessor for the BytecodeReaderConfig? Would help really limit the bytecode parts of this API.
mlir/lib/Bytecode/Reader/BytecodeReader.cpp
315	Was the formatting off here before?

This revision is now accepted and ready to land.Jul 26 2023, 10:30 PM

Expose the bytecode reader config directly to the parser config.

Harbormaster completed remote builds in B248602: Diff 544817.Jul 27 2023, 9:27 AM

mfrancio marked 2 inline comments as done.Jul 27 2023, 9:28 AM

mfrancio added inline comments.

mlir/include/mlir/IR/AsmState.h
560	done! Thanks for the review.

Rebase

Harbormaster completed remote builds in B248623: Diff 544832.Jul 27 2023, 12:57 PM

Closed by commit rGb299ec16661f: Expose callbacks for encoding of types/attributes (authored by mehdi_amini). · Explain WhyJul 28 2023, 10:44 AM

This revision was automatically updated to reflect the committed changes.

mehdi_amini added a commit: rGb299ec16661f: Expose callbacks for encoding of types/attributes.

mehdi_amini added a commit: rGbff6a4292f80: Expose callbacks for encoding of types/attributes.Jul 28 2023, 4:46 PM

mehdi_amini added a reverting change: rGb86a13211fcd: Revert "Expose callbacks for encoding of types/attributes".

Revision Contents

Path

Size

mlir/

include/

mlir/

Bytecode/

BytecodeImplementation.h

63 lines

BytecodeWriter.h

37 lines

IR/

AsmState.h

129 lines

	include/	mlir/	IR/
		lib/	IR/

BuiltinDialectBytecode.h

18 lines

lib/

Bytecode/

Reader/

BytecodeReader.cpp

173 lines

Writer/

BytecodeWriter.cpp

91 lines

IRNumbering.cpp

40 lines

IR/

BuiltinDialect.cpp

5 lines

BuiltinDialectBytecode.h

BuiltinDialectBytecode.cpp

29 lines

test/

Bytecode/

bytecode_callback.mlir

14 lines

bytecode_callback_full_override.mlir

18 lines

bytecode_callback_with_custom_attribute.mlir

14 lines

bytecode_callback_with_custom_type.mlir

18 lines

invalid/

invalid_attr_type_section.mlir

4 lines

lib/

Dialect/

Test/

16 lines

40 lines

11 lines

4 lines

IR/

CMakeLists.txt

1 line

TestBytecodeCallbacks.cpp

348 lines

tools/

mlir-opt/

mlir-opt.cpp

2 lines

Diff 542263

mlir/include/mlir/Bytecode/BytecodeImplementation.h

Show All 17 Lines
#include "mlir/IR/Diagnostics.h"		#include "mlir/IR/Diagnostics.h"
#include "mlir/IR/Dialect.h"		#include "mlir/IR/Dialect.h"
#include "mlir/IR/DialectInterface.h"		#include "mlir/IR/DialectInterface.h"
#include "mlir/IR/OpImplementation.h"		#include "mlir/IR/OpImplementation.h"
#include "mlir/Support/LogicalResult.h"		#include "mlir/Support/LogicalResult.h"
#include "llvm/ADT/Twine.h"		#include "llvm/ADT/Twine.h"

namespace mlir {		namespace mlir {
		//===--------------------------------------------------------------------===//
		// Dialect Version Interface.
		//===--------------------------------------------------------------------===//

		/// This class is used to represent the version of a dialect, for the purpose
		/// of polymorphic destruction.
		class DialectVersion {
		public:
		virtual ~DialectVersion() = default;
		};

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// DialectBytecodeReader		// DialectBytecodeReader
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// This class defines a virtual interface for reading a bytecode stream,		/// This class defines a virtual interface for reading a bytecode stream,
/// providing hooks into the bytecode reader. As such, this class should only be		/// providing hooks into the bytecode reader. As such, this class should only be
/// derived and defined by the main bytecode reader, users (i.e. dialects)		/// derived and defined by the main bytecode reader, users (i.e. dialects)
/// should generally only interact with this class via the		/// should generally only interact with this class via the
/// BytecodeDialectInterface below.		/// BytecodeDialectInterface below.
class DialectBytecodeReader {		class DialectBytecodeReader {
public:		public:
virtual ~DialectBytecodeReader() = default;		virtual ~DialectBytecodeReader() = default;

/// Emit an error to the reader.		/// Emit an error to the reader.
virtual InFlightDiagnostic emitError(const Twine &msg = {}) = 0;		virtual InFlightDiagnostic emitError(const Twine &msg = {}) const = 0;

		/// Retrieve the dialect version by name if available.
		virtual FailureOr<const DialectVersion *>
		getDialectVersion(StringRef dialectName) const = 0;

		/// Retrieve the context associated to the reader.
		virtual MLIRContext *getContext() const = 0;

/// Read out a list of elements, invoking the provided callback for each		/// Read out a list of elements, invoking the provided callback for each
/// element. The callback function may be in any of the following forms:		/// element. The callback function may be in any of the following forms:
/// * LogicalResult(T &)		/// * LogicalResult(T &)
/// * FailureOr<T>()		/// * FailureOr<T>()
template <typename T, typename CallbackFn>		template <typename T, typename CallbackFn>
LogicalResult readList(SmallVectorImpl<T> &result, CallbackFn &&callback) {		LogicalResult readList(SmallVectorImpl<T> &result, CallbackFn &&callback) {
uint64_t size;		uint64_t size;
▲ Show 20 Lines • Show All 207 Lines • ▼ Show 20 Lines	public:

/// Write a bool to the output stream.		/// Write a bool to the output stream.
virtual void writeOwnedBool(bool value) = 0;		virtual void writeOwnedBool(bool value) = 0;

/// Return the bytecode version being emitted for.		/// Return the bytecode version being emitted for.
virtual int64_t getBytecodeVersion() const = 0;		virtual int64_t getBytecodeVersion() const = 0;
};		};

//===--------------------------------------------------------------------===//
// Dialect Version Interface.
//===--------------------------------------------------------------------===//

/// This class is used to represent the version of a dialect, for the purpose
/// of polymorphic destruction.
class DialectVersion {
public:
virtual ~DialectVersion() = default;
};

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// BytecodeDialectInterface		// BytecodeDialectInterface
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

class BytecodeDialectInterface		class BytecodeDialectInterface
: public DialectInterface::Base<BytecodeDialectInterface> {		: public DialectInterface::Base<BytecodeDialectInterface> {
public:		public:
using Base::Base;		using Base::Base;

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Reading		// Reading
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

/// Read an attribute belonging to this dialect from the given reader. This		/// Read an attribute belonging to this dialect from the given reader. This
/// method should return null in the case of failure.		/// method should return null in the case of failure. Optionally, the dialect
		/// version can be accessed through the reader.
virtual Attribute readAttribute(DialectBytecodeReader &reader) const {		virtual Attribute readAttribute(DialectBytecodeReader &reader) const {
reader.emitError() << "dialect " << getDialect()->getNamespace()		reader.emitError() << "dialect " << getDialect()->getNamespace()
<< " does not support reading attributes from bytecode";		<< " does not support reading attributes from bytecode";
return Attribute();		return Attribute();
}		}

/// Read a versioned attribute encoding belonging to this dialect from the
/// given reader. This method should return null in the case of failure, and
/// falls back to the non-versioned reader in case the dialect implements
/// versioning but it does not support versioned custom encodings for the
/// attributes.
virtual Attribute readAttribute(DialectBytecodeReader &reader,
const DialectVersion &version) const {
reader.emitError()
<< "dialect " << getDialect()->getNamespace()
<< " does not support reading versioned attributes from bytecode";
return Attribute();
}

/// Read a type belonging to this dialect from the given reader. This method		/// Read a type belonging to this dialect from the given reader. This method
/// should return null in the case of failure.		/// should return null in the case of failure. Optionally, the dialect version
		/// can be accessed thorugh the reader.
virtual Type readType(DialectBytecodeReader &reader) const {		virtual Type readType(DialectBytecodeReader &reader) const {
reader.emitError() << "dialect " << getDialect()->getNamespace()		reader.emitError() << "dialect " << getDialect()->getNamespace()
<< " does not support reading types from bytecode";		<< " does not support reading types from bytecode";
return Type();		return Type();
}		}

/// Read a versioned type encoding belonging to this dialect from the given
/// reader. This method should return null in the case of failure, and
/// falls back to the non-versioned reader in case the dialect implements
/// versioning but it does not support versioned custom encodings for the
/// types.
virtual Type readType(DialectBytecodeReader &reader,
const DialectVersion &version) const {
reader.emitError()
<< "dialect " << getDialect()->getNamespace()
<< " does not support reading versioned types from bytecode";
return Type();
}

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Writing		// Writing
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

/// Write the given attribute, which belongs to this dialect, to the given		/// Write the given attribute, which belongs to this dialect, to the given
/// writer. This method may return failure to indicate that the given		/// writer. This method may return failure to indicate that the given
/// attribute could not be encoded, in which case the textual format will be		/// attribute could not be encoded, in which case the textual format will be
/// used to encode this attribute instead.		/// used to encode this attribute instead.
▲ Show 20 Lines • Show All 70 Lines • Show Last 20 Lines

mlir/include/mlir/Bytecode/BytecodeWriter.h

Show First 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	public:
/// the desired version. The bytecode writer entry point will return failure		/// the desired version. The bytecode writer entry point will return failure
/// if it cannot emit the desired version.		/// if it cannot emit the desired version.
void setDesiredBytecodeVersion(int64_t bytecodeVersion);		void setDesiredBytecodeVersion(int64_t bytecodeVersion);

/// Get the set desired bytecode version to emit.		/// Get the set desired bytecode version to emit.
int64_t getDesiredBytecodeVersion() const;		int64_t getDesiredBytecodeVersion() const;

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
		// Types and Attributes encoding
		//===--------------------------------------------------------------------===//

		/// Retrieve the callbacks.
		llvm::SmallVector<std::unique_ptr<AsmAttrTypeBytecodeWriter<Attribute>>> &
		rriddleUnsubmitted Done Reply Inline Actions Can you ArrayRef here and below instead? rriddle: Can you ArrayRef here and below instead?
		mfrancioAuthorUnsubmitted Done Reply Inline Actions Absolutely, I'll push a revision. mfrancio: Absolutely, I'll push a revision.
		getAttributeWriterCallbacks() const;
		jpienaarUnsubmitted Done Reply Inline Actions Nit: let's keep these called writer rather than printer (could be emitter too as that matches some of the comments). jpienaar: Nit: let's keep these called writer rather than printer (could be emitter too as that matches…
		llvm::SmallVector<std::unique_ptr<AsmAttrTypeBytecodeWriter<Type>>> &
		getTypeWriterCallbacks() const;

		/// Attach a custom bytecode printer callback to the configuration for the
		/// emission of custom type/attributes encodings.
		void attachAttributeCallback(
		mehdi_aminiUnsubmitted Done Reply Inline Actions Seems like not natural to me as an API to anchor this to the dialect, that seems a bit arbitrary to me. When we talked about it I was thinking something more flat and simpler: void addOverrideCallback(std::function<bool(Type)> callback) { } Smallvector<std::function<bool(Type)>> typeOverrideCallbacks; And the writer would do: // First try to process the given type with the provided override for (auto &callback : typeOverrideCallbacks) if (callback(type)) return // continue with normal emission mehdi_amini: Seems like not natural to me as an API to anchor this to the dialect, that seems a bit…
		mehdi_aminiUnsubmitted Done Reply Inline Actions I guess what I wrote is not enough: there is more than the exact encoding, there may be some remapping to another dialect as well. mehdi_amini: I guess what I wrote is not enough: there is more than the exact encoding, there may be some…
		mfrancioAuthorUnsubmitted Done Reply Inline Actions I found it as a compelling and explicit way to override Type and Attributes printer/parser of any dialect with a specific encoding. But agreed, it is arbitrary. I can remove this anchor and simplify it a little bit. mfrancio: I found it as a compelling and explicit way to override Type and Attributes printer/parser of…
		mfrancioAuthorUnsubmitted Done Reply Inline Actions I think what implemented should be enough to be able to override the encoding of a type/attribute if its definition jumps from one dialect to another - this is true as long as the encoding is always owned by the callback, before and after the jump. mfrancio: I think what implemented should be enough to be able to override the encoding of a…
		std::unique_ptr<AsmAttrTypeBytecodeWriter<Attribute>> callback);
		void
		attachTypeCallback(std::unique_ptr<AsmAttrTypeBytecodeWriter<Type>> callback);

		/// Attach a custom bytecode printer callback to the configuration for the
		/// emission of custom type/attributes encodings.
		template <typename CallableT>
		std::enable_if_t<std::is_convertible_v<
		CallableT,
		std::function<LogicalResult(Attribute, std::optional<StringRef> &,
		DialectBytecodeWriter &)>>>
		attachAttributeCallback(CallableT &&emitFn) {
		attachAttributeCallback(AsmAttrTypeBytecodeWriter<Attribute>::fromCallable(
		std::forward<CallableT>(emitFn)));
		}
		template <typename CallableT>
		std::enable_if_t<std::is_convertible_v<
		CallableT, std::function<LogicalResult(Type, std::optional<StringRef> &,
		DialectBytecodeWriter &)>>>
		attachTypeCallback(CallableT &&emitFn) {
		attachTypeCallback(AsmAttrTypeBytecodeWriter<Type>::fromCallable(
		std::forward<CallableT>(emitFn)));
		}

		//===--------------------------------------------------------------------===//
// Resources		// Resources
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

/// Attach the given resource printer to the writer configuration.		/// Attach the given resource printer to the writer configuration.
void attachResourcePrinter(std::unique_ptr<AsmResourcePrinter> printer);		void attachResourcePrinter(std::unique_ptr<AsmResourcePrinter> printer);

/// Attach an resource printer, in the form of a callable, to the		/// Attach an resource printer, in the form of a callable, to the
/// configuration.		/// configuration.
Show All 33 Lines

mlir/include/mlir/IR/AsmState.h

Show All 17 Lines
#include "mlir/Support/LLVM.h"		#include "mlir/Support/LLVM.h"
#include "llvm/ADT/MapVector.h"		#include "llvm/ADT/MapVector.h"
#include "llvm/ADT/StringMap.h"		#include "llvm/ADT/StringMap.h"

#include <memory>		#include <memory>
#include <variant>		#include <variant>

namespace mlir {		namespace mlir {
class AsmResourcePrinter;
class AsmDialectResourceHandle;		class AsmDialectResourceHandle;
		class AsmResourcePrinter;
		jpienaarUnsubmitted Done Reply Inline Actions Let's keep these sorted jpienaar: Let's keep these sorted
		class DialectBytecodeReader;
		class DialectBytecodeWriter;
		class DialectVersion;
class Operation;		class Operation;

namespace detail {		namespace detail {
class AsmStateImpl;		class AsmStateImpl;
} // namespace detail		} // namespace detail

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		// AsmAttrTypeBytecode Parser/Printer
		//===----------------------------------------------------------------------===//

		/// A class to interact with the attributes and types printer when emitting MLIR
		/// bytecode.
		template <class T>
		class AsmAttrTypeBytecodeWriter {
		rriddleUnsubmitted Not Done Reply Inline Actions It feels quite weird to have something bytecode related not in Bytecode/. Why does this need to be here instead of hooked into the `BytecodeWriterConfig`? rriddle: It feels quite weird to have something bytecode related not in Bytecode/. Why does this need to…
		mfrancioAuthorUnsubmitted Done Reply Inline Actions I agree and I tried to search for a better location. The issue is that we currently have a unified entry point for the parser (`ParserConfig`) which works for both text and bytecode. Unless we want to refactor this, at least the reader side of this class need to stay in this header. Since the same logic is used for `AsmResourcePrinter` (defined here, but used in bytecode writer config), I thought this could work? mfrancio: I agree and I tried to search for a better location. The issue is that we currently have a…
		rriddleUnsubmitted Done Reply Inline Actions Hmrmrm, can we split out all of the bytecode config into a BytecodeReaderConfig and have that in the parser config? Would be good to keep all of the bytecode pieces isolated. rriddle: Hmrmrm, can we split out all of the bytecode config into a BytecodeReaderConfig and have that…
		mfrancioAuthorUnsubmitted Done Reply Inline Actions Sure, it seems like a good change. I'll do that and update the patch. Thanks for the feedback. mfrancio: Sure, it seems like a good change. I'll do that and update the patch. Thanks for the feedback.
		mfrancioAuthorUnsubmitted Done Reply Inline Actions I created a new header specific for the bytecode reader config to avoid a circular dependency (AsmState <-> BytecodeReader). mfrancio: I created a new header specific for the bytecode reader config to avoid a circular dependency…
		public:
		AsmAttrTypeBytecodeWriter() = default;
		virtual ~AsmAttrTypeBytecodeWriter() = default;

		virtual LogicalResult write(T entry, std::optional<StringRef> &name,
		DialectBytecodeWriter &writer) = 0;

		LogicalResult write(T entry, DialectBytecodeWriter &writer) {
		std::optional<StringRef> dummy;
		return write(entry, dummy, writer);
		}

		/// Return an Attribute/Type printer implemented via the given callable, whose
		/// form should match that of the `write` function above.
		template <typename CallableT,
		std::enable_if_t<std::is_convertible_v<
		CallableT, std::function<LogicalResult(
		T, std::optional<StringRef> &,
		DialectBytecodeWriter &)>>,
		bool> = true>
		static std::unique_ptr<AsmAttrTypeBytecodeWriter<T>>
		fromCallable(CallableT &&writeFn) {
		struct Processor : public AsmAttrTypeBytecodeWriter<T> {
		jpienaarUnsubmitted Done Reply Inline Actions I don't think the _ naming convention is used anywhere else here, of you want to signal internal it could be in private section. jpienaar: I don't think the _ naming convention is used anywhere else here, of you want to signal…
		Processor(CallableT &&writeFn)
		: AsmAttrTypeBytecodeWriter(), writeFn(std::move(writeFn)) {}
		LogicalResult write(T entry, std::optional<StringRef> &name,
		DialectBytecodeWriter &writer) override {
		return writeFn(entry, name, writer);
		}
		jpienaarUnsubmitted Done Reply Inline Actions emitting -> parsing (or reading or ingesting) jpienaar: emitting -> parsing (or reading or ingesting)

		std::decay_t<CallableT> writeFn;
		};
		return std::make_unique<Processor>(std::forward<CallableT>(writeFn));
		}
		};

		/// A class to interact with the attributes and types parser when parsing MLIR
		/// bytecode.
		template <class T>
		mehdi_aminiUnsubmitted Done Reply Inline Actions I would think we should distinguish between "no handling" and "error during parsing" here, that is the API should be able to fail the parsing. mehdi_amini: I would think we should distinguish between "no handling" and "error during parsing" here…
		mfrancioAuthorUnsubmitted Done Reply Inline Actions Thanks for pointing this out, it is indeed a very useful feature. I changed the API and added a test for it. mfrancio: Thanks for pointing this out, it is indeed a very useful feature. I changed the API and added a…
		class AsmAttrTypeBytecodeParser {
		public:
		AsmAttrTypeBytecodeParser() = default;
		virtual ~AsmAttrTypeBytecodeParser() = default;

		virtual LogicalResult parse(DialectBytecodeReader &reader,
		StringRef dialectName, T &entry) = 0;

		/// Return an Attribute/Type printer implemented via the given callable, whose
		/// form should match that of the `parse` function above.
		template <typename CallableT,
		std::enable_if_t<
		std::is_convertible_v<
		CallableT, std::function<LogicalResult(
		DialectBytecodeReader &, StringRef, T &)>>,
		bool> = true>
		static std::unique_ptr<AsmAttrTypeBytecodeParser<T>>
		fromCallable(CallableT &&parseFn) {
		struct Processor : public AsmAttrTypeBytecodeParser<T> {
		Processor(CallableT &&parseFn)
		: AsmAttrTypeBytecodeParser(), parseFn(std::move(parseFn)) {}
		LogicalResult parse(DialectBytecodeReader &reader, StringRef dialectName,
		T &entry) override {
		return parseFn(reader, dialectName, entry);
		}

		std::decay_t<CallableT> parseFn;
		};
		return std::make_unique<Processor>(std::forward<CallableT>(parseFn));
		}
		};

		//===----------------------------------------------------------------------===//
// Resources		// Resources
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// The following classes enable support for parsing and printing resources		/// The following classes enable support for parsing and printing resources
/// within MLIR assembly formats. Resources are a mechanism by which dialects,		/// within MLIR assembly formats. Resources are a mechanism by which dialects,
/// and external clients, may attach additional information when parsing or		/// and external clients, may attach additional information when parsing or
/// printing IR without that information being encoded in the IR itself.		/// printing IR without that information being encoded in the IR itself.
/// Resources are not uniqued within the MLIR context, are not attached directly		/// Resources are not uniqued within the MLIR context, are not attached directly
▲ Show 20 Lines • Show All 427 Lines • ▼ Show 20 Lines	public:
}		}

/// Return the MLIRContext to be used when parsing.		/// Return the MLIRContext to be used when parsing.
MLIRContext *getContext() const { return context; }		MLIRContext *getContext() const { return context; }

/// Returns if the parser should verify the IR after parsing.		/// Returns if the parser should verify the IR after parsing.
bool shouldVerifyAfterParse() const { return verifyAfterParse; }		bool shouldVerifyAfterParse() const { return verifyAfterParse; }

		/// Returns the callbacks available to the parser.
		rriddleUnsubmitted Done Reply Inline Actions Could we have just an accessor for the BytecodeReaderConfig? Would help really limit the bytecode parts of this API. rriddle: Could we have just an accessor for the BytecodeReaderConfig? Would help really limit the…
		mfrancioAuthorUnsubmitted Done Reply Inline Actions done! Thanks for the review. mfrancio: done! Thanks for the review.
		ArrayRef<std::unique_ptr<AsmAttrTypeBytecodeParser<Attribute>>>
		mehdi_aminiUnsubmitted Done Reply Inline Actions Nit: can you make the return type explicit (ArrayRef<...>) here? I don't think the `auto` return is widely used in the codebase, and it goes a bit against the "use auto only when the type is obvious from the context" practice. mehdi_amini: Nit: can you make the return type explicit (ArrayRef<...>) here? I don't think the `auto`…
		getAttributeBytecodeCallbacks() const {
		jpienaarUnsubmitted Done Reply Inline Actions So these are just flat arrays rather than grouped by (say) type it handles? jpienaar: So these are just flat arrays rather than grouped by (say) type it handles?
		mfrancioAuthorUnsubmitted Done Reply Inline Actions Well, the idea is that you would encode a function that handles the abstract type and not the concrete, then handling the concrete types using a type switch within the callback itself as needed. I feel passing a callback per concrete type would be more cumbersome to use? mfrancio: Well, the idea is that you would encode a function that handles the abstract type and not the…
		return attributeBytecodeParsers;
		}
		ArrayRef<std::unique_ptr<AsmAttrTypeBytecodeParser<Type>>>
		getTypeBytecodeCallbacks() const {
		return typeBytecodeParsers;
		}

		/// Attach a custom bytecode parser callback to the configuration for parsing
		/// of custom type/attributes encodings.
		void attachAttributeBytecodeCallback(
		std::unique_ptr<AsmAttrTypeBytecodeParser<Attribute>> parser) {
		attributeBytecodeParsers.emplace_back(std::move(parser));
		}
		void attachTypeBytecodeCallback(
		std::unique_ptr<AsmAttrTypeBytecodeParser<Type>> parser) {
		typeBytecodeParsers.emplace_back(std::move(parser));
		}

		/// Attach a custom bytecode parser callback to the configuration for parsing
		/// of custom type/attributes encodings.
		template <typename CallableT>
		std::enable_if_t<std::is_convertible_v<
		CallableT, std::function<LogicalResult(DialectBytecodeReader &, StringRef,
		Attribute &)>>>
		attachAttributeBytecodeCallback(CallableT &&parserFn) {
		attachAttributeBytecodeCallback(
		AsmAttrTypeBytecodeParser<Attribute>::fromCallable(
		std::forward<CallableT>(parserFn)));
		}
		template <typename CallableT>
		std::enable_if_t<std::is_convertible_v<
		CallableT,
		std::function<LogicalResult(DialectBytecodeReader &, StringRef, Type &)>>>
		attachTypeBytecodeCallback(CallableT &&parserFn) {
		attachTypeBytecodeCallback(AsmAttrTypeBytecodeParser<Type>::fromCallable(
		std::forward<CallableT>(parserFn)));
		}

/// Return the resource parser registered to the given name, or nullptr if no		/// Return the resource parser registered to the given name, or nullptr if no
/// parser with `name` is registered.		/// parser with `name` is registered.
AsmResourceParser *getResourceParser(StringRef name) const {		AsmResourceParser *getResourceParser(StringRef name) const {
auto it = resourceParsers.find(name);		auto it = resourceParsers.find(name);
if (it != resourceParsers.end())		if (it != resourceParsers.end())
return it->second.get();		return it->second.get();
if (fallbackResourceMap)		if (fallbackResourceMap)
return &fallbackResourceMap->getParserFor(name);		return &fallbackResourceMap->getParserFor(name);
Show All 18 Lines	attachResourceParser(AsmResourceParser::fromCallable(
name, std::forward<CallableT>(parserFn)));		name, std::forward<CallableT>(parserFn)));
}		}

private:		private:
MLIRContext *context;		MLIRContext *context;
bool verifyAfterParse;		bool verifyAfterParse;
DenseMap<StringRef, std::unique_ptr<AsmResourceParser>> resourceParsers;		DenseMap<StringRef, std::unique_ptr<AsmResourceParser>> resourceParsers;
FallbackAsmResourceMap *fallbackResourceMap;		FallbackAsmResourceMap *fallbackResourceMap;
		llvm::SmallVector<std::unique_ptr<AsmAttrTypeBytecodeParser<Attribute>>>
		attributeBytecodeParsers;
		llvm::SmallVector<std::unique_ptr<AsmAttrTypeBytecodeParser<Type>>>
		typeBytecodeParsers;
};		};

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AsmState		// AsmState
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// This class provides management for the lifetime of the state used when		/// This class provides management for the lifetime of the state used when
/// printing the IR. It allows for alleviating the cost of recomputing the		/// printing the IR. It allows for alleviating the cost of recomputing the
▲ Show 20 Lines • Show All 80 Lines • Show Last 20 Lines

mlir/include/mlir/IR/BuiltinDialectBytecode.h

This file was moved from mlir/lib/IR/BuiltinDialectBytecode.h.

	//===- BuiltinDialectBytecode.h - MLIR Bytecode Implementation --- C++ --===//			//===- BuiltinDialectBytecode.h - MLIR Bytecode Implementation --- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This header defines hooks into the builtin dialect bytecode implementation.			// This header defines hooks into the builtin dialect bytecode implementation.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LIB_MLIR_IR_BUILTINDIALECTBYTECODE_H			#ifndef MLIR_IR_BUILTINDIALECTBYTECODE_H
	#define LIB_MLIR_IR_BUILTINDIALECTBYTECODE_H			#define MLIR_IR_BUILTINDIALECTBYTECODE_H

				#include "mlir/Bytecode/BytecodeImplementation.h"

	namespace mlir {			namespace mlir {
	class BuiltinDialect;			class BuiltinDialect;

	namespace builtin_dialect_detail {			namespace builtin {
				/// Utility read/write functions for types and attributes based on the builtin
				jpienaarUnsubmitted Done Reply Inline Actions I think we should just make this now builtin::detail jpienaar: I think we should just make this now builtin::detail
				/// bytecode encoding.
				Attribute readAttribute(DialectBytecodeReader &reader);
				jpienaarUnsubmitted Done Reply Inline Actions I was semi in between exposing the interface vs just helper methods generated and then the add method below (the add method doesn't add much given full interface here). But I was thinking of different composition then. jpienaar: I was semi in between exposing the interface vs just helper methods generated and then the add…
				LogicalResult writeAttribute(Attribute attribute,
				DialectBytecodeWriter &writer);
				Type readType(DialectBytecodeReader &reader);
				LogicalResult writeType(Type type, DialectBytecodeWriter &writer);

	/// Add the interfaces necessary for encoding the builtin dialect components in			/// Add the interfaces necessary for encoding the builtin dialect components in
	/// bytecode.			/// bytecode.
	void addBytecodeInterface(BuiltinDialect *dialect);			void addBytecodeInterface(BuiltinDialect *dialect);
	} // namespace builtin_dialect_detail			} // namespace builtin
	} // namespace mlir			} // namespace mlir

	#endif // LIB_MLIR_IR_BUILTINDIALECTBYTECODE_H			#endif // LIB_MLIR_IR_BUILTINDIALECTBYTECODE_H
				jpienaarUnsubmitted Done Reply Inline Actions Don't know Mehdi s opinion, but I'd probably put this just in builtin namespace along with dialect. This is rather "public" level API for me. jpienaar: Don't know Mehdi s opinion, but I'd probably put this just in builtin namespace along with…
				mehdi_aminiUnsubmitted Done Reply Inline Actions I agree: in general "detail" namespace aren't meant to contain things to be directly used by clients. But also stepping back: I'm not convinced this entire file should be publicly exposed. We should be able to have a single entry point in MLIR for writing a type or an attribute. mehdi_amini: I agree: in general "detail" namespace aren't meant to contain things to be directly used by…
				mfrancioAuthorUnsubmitted Done Reply Inline Actions I avoided exposing the interface while adding an entry point to read/write types and attributes using the builtin encoding. I used the builtin namespace even though I found out it wasn't used for the builtin dialect - it seems that this dialect lives directly at the mlir level. However, I have the impression that read/write bytecode functions would be best under the builtin namespace to be explicit about the underlying encoding. mfrancio: I avoided exposing the interface while adding an entry point to read/write types and attributes…
				mehdi_aminiUnsubmitted Done Reply Inline Actions I really meant that we should be able to call « writeAttribute » without knowing the dialect to target: I don’t quite get what is special about the built in dialect here? mehdi_amini: I really meant that we should be able to call « writeAttribute » without knowing the dialect to…
				mfrancioAuthorUnsubmitted Done Reply Inline Actions I got what you mean and yes, write functions for attribute and type could be made dialect agnostic - what made me stop is that it's now clear how that would extend to read functions, since the encoding does not contain dialect info, and there is nothing that prevents different dialects to use the same encoding for different things. I found this incongruence a bit confusing, to a point that it seemed suggesting that it would not fit in the current design? For context, see for example Builtin and Quantization (those functions are generated through tablegen) - both dialect encode type 1 and there is no way to disambiguate in a "dialect agnostic" fashion. Builtin: static Type readType(MLIRContext* context, DialectBytecodeReader &reader) { uint64_t kind; if (failed(reader.readVarInt(kind))) return Type(); switch (kind) { case 0: return readIntegerType(context, reader); case 1: return readIndexType(context, reader); Quantization: static Type readType(MLIRContext* context, DialectBytecodeReader &reader) { uint64_t kind; if (failed(reader.readVarInt(kind))) return Type(); switch (kind) { case 1: return readAnyQuantizedType(context, reader); case 2: return readAnyQuantizedTypeWithExpressedType(context, reader); Furthermore, it is reasonable to expect to have more of such conflicts as the MLIR bytecode gains popularity. mfrancio: I got what you mean and yes, write functions for attribute and type could be made dialect…
				mehdi_aminiUnsubmitted Done Reply Inline Actions OK I get it: it is similar to what we do for print/parse textual assembly. The convention there is that every type/attribute class expose a print/parse method, can we align on the same model for bytecode? We wouldn't need to emit the discriminant for which attribute it is when we know which one to emit! (similar to textual ASM) mehdi_amini: OK I get it: it is similar to what we do for print/parse textual assembly. The convention…
				mfrancioAuthorUnsubmitted Done Reply Inline Actions A non-breaking exension of the ASM approach to bytecode does not seem straightforward because of our current bytecode serialization approach. To summarize the textual ASM approach: builtin types/attributes have known tokens that can be used to detect if a type or an attribute is owned by builtin. If it is not, textual ASM uses a special token (!), which triggers emission of an extended type. At parsing, the extended type token is processed first and this triggers the use of the specific dialect parser. Now porting this approach into bytecode would mean changing how types or attributes are encoded. Types and attributes are grouped together when emitted to bytecode. At parsing, we parse first the dialect owning the group, which will determine the encoding (this is done per-group and not per-attribute, as opposed to the ASM case). We could implement an API that works similarly to textual ASM, but unless i am missing some details, integrating it into the existing bytecode format would require some additional work if we want to maintain backwards compatibility, which is probably beyond the scope of the current patch. We could attempt doing this in the future and I would be happy to take the work. mfrancio: A non-breaking exension of the ASM approach to bytecode does not seem straightforward because…
				mehdi_aminiUnsubmitted Done Reply Inline Actions I agree with you, the encoding difference makes it harder here. Seems reasonable. mehdi_amini: I agree with you, the encoding difference makes it harder here. Seems reasonable.

mlir/lib/Bytecode/Reader/BytecodeReader.cpp

Show First 20 Lines • Show All 306 Lines • ▼ Show 20 Lines	LLVM_ATTRIBUTE_NOINLINE LogicalResult parseMultiByteVarInt(uint64_t &result) {
// implementation).		// implementation).
uint32_t numBytes = llvm::countr_zero<uint32_t>(result);		uint32_t numBytes = llvm::countr_zero<uint32_t>(result);
assert(numBytes > 0 && numBytes <= 7 &&		assert(numBytes > 0 && numBytes <= 7 &&
"unexpected number of trailing zeros in varint encoding");		"unexpected number of trailing zeros in varint encoding");

// Parse in the remaining bytes of the value.		// Parse in the remaining bytes of the value.
llvm::support::ulittle64_t resultLE(result);		llvm::support::ulittle64_t resultLE(result);
if (failed(parseBytes(numBytes, reinterpret_cast<uint8_t *>(&resultLE) + 1)))		if (failed(parseBytes(numBytes, reinterpret_cast<uint8_t *>(&resultLE) + 1)))
return failure();		return failure();
		mehdi_aminiUnsubmitted Done Reply Inline Actions Formatting only? mehdi_amini: Formatting only?
		rriddleUnsubmitted Done Reply Inline Actions Was the formatting off here before? rriddle: Was the formatting off here before?

// Shift out the low-order bits that were used to mark how the value was		// Shift out the low-order bits that were used to mark how the value was
// encoded.		// encoded.
result = resultLE >> (numBytes + 1);		result = resultLE >> (numBytes + 1);
return success();		return success();
}		}

/// The current data iterator, and an iterator to the end of the buffer.		/// The current data iterator, and an iterator to the end of the buffer.
▲ Show 20 Lines • Show All 122 Lines • ▼ Show 20 Lines
class DialectReader;		class DialectReader;

/// This struct represents a dialect entry within the bytecode.		/// This struct represents a dialect entry within the bytecode.
struct BytecodeDialect {		struct BytecodeDialect {
/// Load the dialect into the provided context if it hasn't been loaded yet.		/// Load the dialect into the provided context if it hasn't been loaded yet.
/// Returns failure if the dialect couldn't be loaded and the provided		/// Returns failure if the dialect couldn't be loaded and the provided
/// context does not allow unregistered dialects. The provided reader is used		/// context does not allow unregistered dialects. The provided reader is used
/// for error emission if necessary.		/// for error emission if necessary.
LogicalResult load(DialectReader &reader, MLIRContext *ctx);		LogicalResult load(const DialectReader &reader, MLIRContext *ctx);

/// Return the loaded dialect, or nullptr if the dialect is unknown. This can		/// Return the loaded dialect, or nullptr if the dialect is unknown. This can
/// only be called after `load`.		/// only be called after `load`.
Dialect *getLoadedDialect() const {		Dialect *getLoadedDialect() const {
assert(dialect &&		assert(dialect &&
"expected `load` to be invoked before `getLoadedDialect`");		"expected `load` to be invoked before `getLoadedDialect`");
return *dialect;		return *dialect;
}		}
Show All 37 Lines	struct BytecodeOperationName {
/// Whether this operation was registered when the bytecode was produced.		/// Whether this operation was registered when the bytecode was produced.
/// This flag is populated when bytecode version >=kNativePropertiesEncoding.		/// This flag is populated when bytecode version >=kNativePropertiesEncoding.
std::optional<bool> wasRegistered;		std::optional<bool> wasRegistered;
};		};
} // namespace		} // namespace

/// Parse a single dialect group encoded in the byte stream.		/// Parse a single dialect group encoded in the byte stream.
static LogicalResult parseDialectGrouping(		static LogicalResult parseDialectGrouping(
EncodingReader &reader, MutableArrayRef<BytecodeDialect> dialects,		EncodingReader &reader,
		MutableArrayRef<std::unique_ptr<BytecodeDialect>> dialects,
function_ref<LogicalResult(BytecodeDialect *)> entryCallback) {		function_ref<LogicalResult(BytecodeDialect *)> entryCallback) {
// Parse the dialect and the number of entries in the group.		// Parse the dialect and the number of entries in the group.
BytecodeDialect *dialect;		std::unique_ptr<BytecodeDialect> *dialect;
if (failed(parseEntry(reader, dialects, dialect, "dialect")))		if (failed(parseEntry(reader, dialects, dialect, "dialect")))
return failure();		return failure();
uint64_t numEntries;		uint64_t numEntries;
if (failed(reader.parseVarInt(numEntries)))		if (failed(reader.parseVarInt(numEntries)))
return failure();		return failure();

for (uint64_t i = 0; i < numEntries; ++i)		for (uint64_t i = 0; i < numEntries; ++i)
if (failed(entryCallback(dialect)))		if (failed(entryCallback(dialect->get())))
return failure();		return failure();
return success();		return success();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ResourceSectionReader		// ResourceSectionReader
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

namespace {		namespace {
/// This class is used to read the resource section from the bytecode.		/// This class is used to read the resource section from the bytecode.
class ResourceSectionReader {		class ResourceSectionReader {
public:		public:
/// Initialize the resource section reader with the given section data.		/// Initialize the resource section reader with the given section data.
LogicalResult		LogicalResult
initialize(Location fileLoc, const ParserConfig &config,		initialize(Location fileLoc, const ParserConfig &config,
MutableArrayRef<BytecodeDialect> dialects,		MutableArrayRef<std::unique_ptr<BytecodeDialect>> dialects,
StringSectionReader &stringReader, ArrayRef<uint8_t> sectionData,		StringSectionReader &stringReader, ArrayRef<uint8_t> sectionData,
ArrayRef<uint8_t> offsetSectionData, DialectReader &dialectReader,		ArrayRef<uint8_t> offsetSectionData, DialectReader &dialectReader,
const std::shared_ptr<llvm::SourceMgr> &bufferOwnerRef);		const std::shared_ptr<llvm::SourceMgr> &bufferOwnerRef);

/// Parse a dialect resource handle from the resource section.		/// Parse a dialect resource handle from the resource section.
LogicalResult parseResourceHandle(EncodingReader &reader,		LogicalResult parseResourceHandle(EncodingReader &reader,
AsmDialectResourceHandle &result) {		AsmDialectResourceHandle &result) {
return parseEntry(reader, dialectResources, result, "resource handle");		return parseEntry(reader, dialectResources, result, "resource handle");
▲ Show 20 Lines • Show All 133 Lines • ▼ Show 20 Lines	if (!entryReader.empty()) {
"unexpected trailing bytes in resource entry '", key, "'");		"unexpected trailing bytes in resource entry '", key, "'");
}		}
}		}
return success();		return success();
}		}

LogicalResult ResourceSectionReader::initialize(		LogicalResult ResourceSectionReader::initialize(
Location fileLoc, const ParserConfig &config,		Location fileLoc, const ParserConfig &config,
MutableArrayRef<BytecodeDialect> dialects,		MutableArrayRef<std::unique_ptr<BytecodeDialect>> dialects,
StringSectionReader &stringReader, ArrayRef<uint8_t> sectionData,		StringSectionReader &stringReader, ArrayRef<uint8_t> sectionData,
ArrayRef<uint8_t> offsetSectionData, DialectReader &dialectReader,		ArrayRef<uint8_t> offsetSectionData, DialectReader &dialectReader,
const std::shared_ptr<llvm::SourceMgr> &bufferOwnerRef) {		const std::shared_ptr<llvm::SourceMgr> &bufferOwnerRef) {
EncodingReader resourceReader(sectionData, fileLoc);		EncodingReader resourceReader(sectionData, fileLoc);
EncodingReader offsetReader(offsetSectionData, fileLoc);		EncodingReader offsetReader(offsetSectionData, fileLoc);

// Read the number of external resource providers.		// Read the number of external resource providers.
uint64_t numExternalResourceGroups;		uint64_t numExternalResourceGroups;
Show All 32 Lines	for (uint64_t i = 0; i < numExternalResourceGroups; ++i) {

if (failed(parseGroup(handler)))		if (failed(parseGroup(handler)))
return failure();		return failure();
}		}

// Read the dialect resources from the bytecode.		// Read the dialect resources from the bytecode.
MLIRContext *ctx = fileLoc->getContext();		MLIRContext *ctx = fileLoc->getContext();
while (!offsetReader.empty()) {		while (!offsetReader.empty()) {
BytecodeDialect *dialect;		std::unique_ptr<BytecodeDialect> *dialect;
if (failed(parseEntry(offsetReader, dialects, dialect, "dialect")) \|\|		if (failed(parseEntry(offsetReader, dialects, dialect, "dialect")) \|\|
failed(dialect->load(dialectReader, ctx)))		failed((*dialect)->load(dialectReader, ctx)))
return failure();		return failure();
Dialect *loadedDialect = dialect->getLoadedDialect();		Dialect loadedDialect = (dialect)->getLoadedDialect();
if (!loadedDialect) {		if (!loadedDialect) {
return resourceReader.emitError()		return resourceReader.emitError()
<< "dialect '" << dialect->name << "' is unknown";		<< "dialect '" << (*dialect)->name << "' is unknown";
}		}
const auto *handler = dyn_cast<OpAsmDialectInterface>(loadedDialect);		const auto *handler = dyn_cast<OpAsmDialectInterface>(loadedDialect);
if (!handler) {		if (!handler) {
return resourceReader.emitError()		return resourceReader.emitError()
<< "unexpected resources for dialect '" << dialect->name << "'";		<< "unexpected resources for dialect '" << (*dialect)->name << "'";
}		}

// Ensure that each resource is declared before being processed.		// Ensure that each resource is declared before being processed.
auto processResourceKeyFn = [&](StringRef key) -> LogicalResult {		auto processResourceKeyFn = [&](StringRef key) -> LogicalResult {
FailureOr<AsmDialectResourceHandle> handle =		FailureOr<AsmDialectResourceHandle> handle =
handler->declareResource(key);		handler->declareResource(key);
if (failed(handle)) {		if (failed(handle)) {
return resourceReader.emitError()		return resourceReader.emitError()
<< "unknown 'resource' key '" << key << "' for dialect '"		<< "unknown 'resource' key '" << key << "' for dialect '"
<< dialect->name << "'";		<< (*dialect)->name << "'";
}		}
dialectResourceHandleRenamingMap[key] = handler->getResourceKey(*handle);		dialectResourceHandleRenamingMap[key] = handler->getResourceKey(*handle);
dialectResources.push_back(*handle);		dialectResources.push_back(*handle);
return success();		return success();
};		};

// Parse the resources for this dialect. We allow empty resources because we		// Parse the resources for this dialect. We allow empty resources because we
// just treat these as declarations.		// just treat these as declarations.
Show All 26 Lines	struct Entry {
/// The raw data of this entry in the bytecode.		/// The raw data of this entry in the bytecode.
ArrayRef<uint8_t> data;		ArrayRef<uint8_t> data;
};		};
using AttrEntry = Entry<Attribute>;		using AttrEntry = Entry<Attribute>;
using TypeEntry = Entry<Type>;		using TypeEntry = Entry<Type>;

public:		public:
AttrTypeReader(StringSectionReader &stringReader,		AttrTypeReader(StringSectionReader &stringReader,
ResourceSectionReader &resourceReader, Location fileLoc)		ResourceSectionReader &resourceReader,
		const llvm::StringMap<BytecodeDialect *> &dialectsMap,
		Location fileLoc, const ParserConfig &config)
: stringReader(stringReader), resourceReader(resourceReader),		: stringReader(stringReader), resourceReader(resourceReader),
fileLoc(fileLoc) {}		dialectsMap(dialectsMap), fileLoc(fileLoc), parserConfig(config) {}

/// Initialize the attribute and type information within the reader.		/// Initialize the attribute and type information within the reader.
LogicalResult initialize(MutableArrayRef<BytecodeDialect> dialects,		LogicalResult
		initialize(MutableArrayRef<std::unique_ptr<BytecodeDialect>> dialects,
ArrayRef<uint8_t> sectionData,		ArrayRef<uint8_t> sectionData,
ArrayRef<uint8_t> offsetSectionData);		ArrayRef<uint8_t> offsetSectionData);

/// Resolve the attribute or type at the given index. Returns nullptr on		/// Resolve the attribute or type at the given index. Returns nullptr on
/// failure.		/// failure.
Attribute resolveAttribute(size_t index) {		Attribute resolveAttribute(size_t index) {
return resolveEntry(attributes, index, "Attribute");		return resolveEntry(attributes, index, "Attribute");
}		}
Type resolveType(size_t index) { return resolveEntry(types, index, "Type"); }		Type resolveType(size_t index) { return resolveEntry(types, index, "Type"); }

▲ Show 20 Lines • Show All 57 Lines • ▼ Show 20 Lines	private:
/// The string section reader used to resolve string references when parsing		/// The string section reader used to resolve string references when parsing
/// custom encoded attribute/type entries.		/// custom encoded attribute/type entries.
StringSectionReader &stringReader;		StringSectionReader &stringReader;

/// The resource section reader used to resolve resource references when		/// The resource section reader used to resolve resource references when
/// parsing custom encoded attribute/type entries.		/// parsing custom encoded attribute/type entries.
ResourceSectionReader &resourceReader;		ResourceSectionReader &resourceReader;

		/// The map of the loaded dialects used to retrieve dialect information, such
		/// as the dialect version.
		const llvm::StringMap<BytecodeDialect *> &dialectsMap;

/// The set of attribute and type entries.		/// The set of attribute and type entries.
SmallVector<AttrEntry> attributes;		SmallVector<AttrEntry> attributes;
SmallVector<TypeEntry> types;		SmallVector<TypeEntry> types;

/// A location used for error emission.		/// A location used for error emission.
Location fileLoc;		Location fileLoc;

		/// Reference to the parser configuration.
		const ParserConfig &parserConfig;
};		};

class DialectReader : public DialectBytecodeReader {		class DialectReader : public DialectBytecodeReader {
public:		public:
DialectReader(AttrTypeReader &attrTypeReader,		DialectReader(AttrTypeReader &attrTypeReader,
StringSectionReader &stringReader,		StringSectionReader &stringReader,
ResourceSectionReader &resourceReader, EncodingReader &reader)		ResourceSectionReader &resourceReader,
		const llvm::StringMap<BytecodeDialect *> &dialectsMap,
		EncodingReader &reader)
: attrTypeReader(attrTypeReader), stringReader(stringReader),		: attrTypeReader(attrTypeReader), stringReader(stringReader),
resourceReader(resourceReader), reader(reader) {}		resourceReader(resourceReader), dialectsMap(dialectsMap),
		reader(reader) {}

InFlightDiagnostic emitError(const Twine &msg) override {		InFlightDiagnostic emitError(const Twine &msg) const override {
return reader.emitError(msg);		return reader.emitError(msg);
}		}

DialectReader withEncodingReader(EncodingReader &encReader) {		FailureOr<const DialectVersion *>
		getDialectVersion(StringRef dialectName) const override {
		// First check if the dialect is available in the map.
		auto dialectEntry = dialectsMap.find(dialectName);
		if (dialectEntry == dialectsMap.end())
		return failure();
		// If the dialect was found, try to load it. This will trigger reading the
		// bytecode version from the version buffer if it wasn't already processed.
		// Return failure if either of those two actions could not be completed.
		if (failed(dialectEntry->getValue()->load(*this, getLoc().getContext())) \|\|
		dialectEntry->getValue()->loadedVersion.get() == nullptr)
		return failure();
		return dialectEntry->getValue()->loadedVersion.get();
		}

		MLIRContext *getContext() const override { return getLoc().getContext(); }

		DialectReader withEncodingReader(EncodingReader &encReader) const {
return DialectReader(attrTypeReader, stringReader, resourceReader,		return DialectReader(attrTypeReader, stringReader, resourceReader,
encReader);		dialectsMap, encReader);
}		}

Location getLoc() const { return reader.getLoc(); }		Location getLoc() const { return reader.getLoc(); }

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// IR		// IR
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

▲ Show 20 Lines • Show All 86 Lines • ▼ Show 20 Lines	public:
LogicalResult readBool(bool &result) override {		LogicalResult readBool(bool &result) override {
return reader.parseByte(result);		return reader.parseByte(result);
}		}

private:		private:
AttrTypeReader &attrTypeReader;		AttrTypeReader &attrTypeReader;
StringSectionReader &stringReader;		StringSectionReader &stringReader;
ResourceSectionReader &resourceReader;		ResourceSectionReader &resourceReader;
		const llvm::StringMap<BytecodeDialect *> &dialectsMap;
EncodingReader &reader;		EncodingReader &reader;
};		};

/// Wraps the properties section and handles reading properties out of it.		/// Wraps the properties section and handles reading properties out of it.
class PropertiesSectionReader {		class PropertiesSectionReader {
public:		public:
/// Initialize the properties section reader with the given section data.		/// Initialize the properties section reader with the given section data.
LogicalResult initialize(Location fileLoc, ArrayRef<uint8_t> sectionData) {		LogicalResult initialize(Location fileLoc, ArrayRef<uint8_t> sectionData) {
▲ Show 20 Lines • Show All 69 Lines • ▼ Show 20 Lines	private:
/// The properties buffer referenced within the bytecode file.		/// The properties buffer referenced within the bytecode file.
ArrayRef<uint8_t> propertiesBuffers;		ArrayRef<uint8_t> propertiesBuffers;

/// Table of offset in the buffer above.		/// Table of offset in the buffer above.
SmallVector<int64_t> offsetTable;		SmallVector<int64_t> offsetTable;
};		};
} // namespace		} // namespace

LogicalResult		LogicalResult AttrTypeReader::initialize(
AttrTypeReader::initialize(MutableArrayRef<BytecodeDialect> dialects,		MutableArrayRef<std::unique_ptr<BytecodeDialect>> dialects,
ArrayRef<uint8_t> sectionData,		ArrayRef<uint8_t> sectionData, ArrayRef<uint8_t> offsetSectionData) {
ArrayRef<uint8_t> offsetSectionData) {
EncodingReader offsetReader(offsetSectionData, fileLoc);		EncodingReader offsetReader(offsetSectionData, fileLoc);

// Parse the number of attribute and type entries.		// Parse the number of attribute and type entries.
uint64_t numAttributes, numTypes;		uint64_t numAttributes, numTypes;
if (failed(offsetReader.parseVarInt(numAttributes)) \|\|		if (failed(offsetReader.parseVarInt(numAttributes)) \|\|
failed(offsetReader.parseVarInt(numTypes)))		failed(offsetReader.parseVarInt(numTypes)))
return failure();		return failure();
attributes.resize(numAttributes);		attributes.resize(numAttributes);
Show All 35 Lines	LogicalResult AttrTypeReader::initialize(
if (failed(parseEntries(attributes)) \|\| failed(parseEntries(types)))		if (failed(parseEntries(attributes)) \|\| failed(parseEntries(types)))
return failure();		return failure();

// Ensure that we read everything from the section.		// Ensure that we read everything from the section.
if (!offsetReader.empty()) {		if (!offsetReader.empty()) {
return offsetReader.emitError(		return offsetReader.emitError(
"unexpected trailing data in the Attribute/Type offset section");		"unexpected trailing data in the Attribute/Type offset section");
}		}

return success();		return success();
}		}

template <typename T>		template <typename T>
T AttrTypeReader::resolveEntry(SmallVectorImpl<Entry<T>> &entries, size_t index,		T AttrTypeReader::resolveEntry(SmallVectorImpl<Entry<T>> &entries, size_t index,
StringRef entryType) {		StringRef entryType) {
if (index >= entries.size()) {		if (index >= entries.size()) {
emitError(fileLoc) << "invalid " << entryType << " index: " << index;		emitError(fileLoc) << "invalid " << entryType << " index: " << index;
▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	LogicalResult AttrTypeReader::parseAsmEntry(T &result, EncodingReader &reader,
}		}
return success();		return success();
}		}

template <typename T>		template <typename T>
LogicalResult AttrTypeReader::parseCustomEntry(Entry<T> &entry,		LogicalResult AttrTypeReader::parseCustomEntry(Entry<T> &entry,
EncodingReader &reader,		EncodingReader &reader,
StringRef entryType) {		StringRef entryType) {
DialectReader dialectReader(*this, stringReader, resourceReader, reader);		DialectReader dialectReader(*this, stringReader, resourceReader, dialectsMap,
		reader);
if (failed(entry.dialect->load(dialectReader, fileLoc.getContext())))		if (failed(entry.dialect->load(dialectReader, fileLoc.getContext())))
return failure();		return failure();

		if constexpr (std::is_same_v<T, Type>) {
		// Try parsing with callbacks first if available.
		burmakoUnsubmitted Done Reply Inline Actions How do you envision the contract between customized serialization and deserialization? E.g. how does a consumer of bytecode payload know that a specific payload was generated via serializer callbacks and how do they know where to obtain the corresponding deserializer callbacks? burmako: How do you envision the contract between customized serialization and deserialization? E.g. how…
		mfrancioAuthorUnsubmitted Done Reply Inline Actions Since the callbacks are driven by the client, it's up to the client to decide. Assuming you are serializing a module that contains a versioned dialect, I would envision handling such scenario through the dialect version. I don't think upstream dialects should use those callbacks in any way. mfrancio: Since the callbacks are driven by the client, it's up to the client to decide. Assuming you are…
		for (const auto &callback : parserConfig.getTypeBytecodeCallbacks()) {
		if (failed(
		callback->parse(dialectReader, entry.dialect->name, entry.entry)))
		return failure();
		// Early return if parsing was successful.
		if (!!entry.entry)
		return success();

		// Reset the reader if we failed to parse, so we can fall through the
		// other parsing functions.
		reader = EncodingReader(entry.data, reader.getLoc());
		}
		} else {
		// Try parsing with callbacks first if available.
		for (const auto &callback : parserConfig.getAttributeBytecodeCallbacks()) {
		if (failed(
		callback->parse(dialectReader, entry.dialect->name, entry.entry)))
		return failure();
		// Early return if parsing was successful.
		if (!!entry.entry)
		return success();

		// Reset the reader if we failed to parse, so we can fall through the
		// other parsing functions.
		reader = EncodingReader(entry.data, reader.getLoc());
		}
		}

// Ensure that the dialect implements the bytecode interface.		// Ensure that the dialect implements the bytecode interface.
if (!entry.dialect->interface) {		if (!entry.dialect->interface) {
return reader.emitError("dialect '", entry.dialect->name,		return reader.emitError("dialect '", entry.dialect->name,
"' does not implement the bytecode interface");		"' does not implement the bytecode interface");
}		}

// Ask the dialect to parse the entry. If the dialect is versioned, parse
// using the versioned encoding readers.
if (entry.dialect->loadedVersion.get()) {
if constexpr (std::is_same_v<T, Type>)
entry.entry = entry.dialect->interface->readType(
dialectReader, *entry.dialect->loadedVersion);
else
entry.entry = entry.dialect->interface->readAttribute(
dialectReader, *entry.dialect->loadedVersion);

} else {
if constexpr (std::is_same_v<T, Type>)		if constexpr (std::is_same_v<T, Type>)
entry.entry = entry.dialect->interface->readType(dialectReader);		entry.entry = entry.dialect->interface->readType(dialectReader);
else		else
entry.entry = entry.dialect->interface->readAttribute(dialectReader);		entry.entry = entry.dialect->interface->readAttribute(dialectReader);
}
return success(!!entry.entry);		return success(!!entry.entry);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Bytecode Reader		// Bytecode Reader
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// This class is used to read a bytecode buffer and translate it into MLIR.		/// This class is used to read a bytecode buffer and translate it into MLIR.
class mlir::BytecodeReader::Impl {		class mlir::BytecodeReader::Impl {
struct RegionReadState;		struct RegionReadState;
using LazyLoadableOpsInfo =		using LazyLoadableOpsInfo =
std::list<std::pair<Operation *, RegionReadState>>;		std::list<std::pair<Operation *, RegionReadState>>;
using LazyLoadableOpsMap =		using LazyLoadableOpsMap =
DenseMap<Operation *, LazyLoadableOpsInfo::iterator>;		DenseMap<Operation *, LazyLoadableOpsInfo::iterator>;

public:		public:
Impl(Location fileLoc, const ParserConfig &config, bool lazyLoading,		Impl(Location fileLoc, const ParserConfig &config, bool lazyLoading,
llvm::MemoryBufferRef buffer,		llvm::MemoryBufferRef buffer,
const std::shared_ptr<llvm::SourceMgr> &bufferOwnerRef)		const std::shared_ptr<llvm::SourceMgr> &bufferOwnerRef)
: config(config), fileLoc(fileLoc), lazyLoading(lazyLoading),		: config(config), fileLoc(fileLoc), lazyLoading(lazyLoading),
attrTypeReader(stringReader, resourceReader, fileLoc),		attrTypeReader(stringReader, resourceReader, dialectsMap, fileLoc,
		config),
// Use the builtin unrealized conversion cast operation to represent		// Use the builtin unrealized conversion cast operation to represent
// forward references to values that aren't yet defined.		// forward references to values that aren't yet defined.
forwardRefOpState(UnknownLoc::get(config.getContext()),		forwardRefOpState(UnknownLoc::get(config.getContext()),
"builtin.unrealized_conversion_cast", ValueRange(),		"builtin.unrealized_conversion_cast", ValueRange(),
NoneType::get(config.getContext())),		NoneType::get(config.getContext())),
buffer(buffer), bufferOwnerRef(bufferOwnerRef) {}		buffer(buffer), bufferOwnerRef(bufferOwnerRef) {}

/// Read the bytecode defined within `buffer` into the given block.		/// Read the bytecode defined within `buffer` into the given block.
▲ Show 20 Lines • Show All 249 Lines • ▼ Show 20 Lines	private:

/// The version of the bytecode being read.		/// The version of the bytecode being read.
uint64_t version = 0;		uint64_t version = 0;

/// The producer of the bytecode being read.		/// The producer of the bytecode being read.
StringRef producer;		StringRef producer;

/// The table of IR units referenced within the bytecode file.		/// The table of IR units referenced within the bytecode file.
SmallVector<BytecodeDialect> dialects;		SmallVector<std::unique_ptr<BytecodeDialect>> dialects;
		llvm::StringMap<BytecodeDialect *> dialectsMap;
SmallVector<BytecodeOperationName> opNames;		SmallVector<BytecodeOperationName> opNames;

/// The reader used to process resources within the bytecode.		/// The reader used to process resources within the bytecode.
ResourceSectionReader resourceReader;		ResourceSectionReader resourceReader;

/// Worklist of values with custom use-list orders to process before the end		/// Worklist of values with custom use-list orders to process before the end
/// of the parsing.		/// of the parsing.
DenseMap<void *, UseListOrderStorage> valueToUseListMap;		DenseMap<void *, UseListOrderStorage> valueToUseListMap;
▲ Show 20 Lines • Show All 130 Lines • ▼ Show 20 Lines	LogicalResult BytecodeReader::Impl::parseVersion(EncodingReader &reader) {
if (version < bytecode::kLazyLoading)		if (version < bytecode::kLazyLoading)
lazyLoading = false;		lazyLoading = false;
return success();		return success();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Dialect Section		// Dialect Section

LogicalResult BytecodeDialect::load(DialectReader &reader, MLIRContext *ctx) {		LogicalResult BytecodeDialect::load(const DialectReader &reader,
		MLIRContext *ctx) {
if (dialect)		if (dialect)
return success();		return success();
Dialect *loadedDialect = ctx->getOrLoadDialect(name);		Dialect *loadedDialect = ctx->getOrLoadDialect(name);
if (!loadedDialect && !ctx->allowsUnregisteredDialects()) {		if (!loadedDialect && !ctx->allowsUnregisteredDialects()) {
return reader.emitError("dialect '")		return reader.emitError("dialect '")
<< name		<< name
<< "' is unknown. If this is intended, please call "		<< "' is unknown. If this is intended, please call "
"allowUnregisteredDialects() on the MLIRContext, or use "		"allowUnregisteredDialects() on the MLIRContext, or use "
Show All 27 Lines	BytecodeReader::Impl::parseDialectSection(ArrayRef<uint8_t> sectionData) {
// Parse the number of dialects in the section.		// Parse the number of dialects in the section.
uint64_t numDialects;		uint64_t numDialects;
if (failed(sectionReader.parseVarInt(numDialects)))		if (failed(sectionReader.parseVarInt(numDialects)))
return failure();		return failure();
dialects.resize(numDialects);		dialects.resize(numDialects);

// Parse each of the dialects.		// Parse each of the dialects.
for (uint64_t i = 0; i < numDialects; ++i) {		for (uint64_t i = 0; i < numDialects; ++i) {
		dialects[i] = std::make_unique<BytecodeDialect>();
/// Before version kDialectVersioning, there wasn't any versioning available		/// Before version kDialectVersioning, there wasn't any versioning available
/// for dialects, and the entryIdx represent the string itself.		/// for dialects, and the entryIdx represent the string itself.
if (version < bytecode::kDialectVersioning) {		if (version < bytecode::kDialectVersioning) {
if (failed(stringReader.parseString(sectionReader, dialects[i].name)))		if (failed(stringReader.parseString(sectionReader, dialects[i]->name)))
return failure();		return failure();
continue;		continue;
}		}

// Parse ID representing dialect and version.		// Parse ID representing dialect and version.
uint64_t dialectNameIdx;		uint64_t dialectNameIdx;
bool versionAvailable;		bool versionAvailable;
if (failed(sectionReader.parseVarIntWithFlag(dialectNameIdx,		if (failed(sectionReader.parseVarIntWithFlag(dialectNameIdx,
versionAvailable)))		versionAvailable)))
return failure();		return failure();
if (failed(stringReader.parseStringAtIndex(sectionReader, dialectNameIdx,		if (failed(stringReader.parseStringAtIndex(sectionReader, dialectNameIdx,
dialects[i].name)))		dialects[i]->name)))
return failure();		return failure();
if (versionAvailable) {		if (versionAvailable) {
bytecode::Section::ID sectionID;		bytecode::Section::ID sectionID;
if (failed(		if (failed(sectionReader.parseSection(sectionID,
sectionReader.parseSection(sectionID, dialects[i].versionBuffer)))		dialects[i]->versionBuffer)))
return failure();		return failure();
if (sectionID != bytecode::Section::kDialectVersions) {		if (sectionID != bytecode::Section::kDialectVersions) {
emitError(fileLoc, "expected dialect version section");		emitError(fileLoc, "expected dialect version section");
return failure();		return failure();
}		}
}		}
		dialectsMap[dialects[i]->name] = dialects[i].get();
}		}

// Parse the operation names, which are grouped by dialect.		// Parse the operation names, which are grouped by dialect.
auto parseOpName = [&](BytecodeDialect *dialect) {		auto parseOpName = [&](BytecodeDialect *dialect) {
StringRef opName;		StringRef opName;
std::optional<bool> wasRegistered;		std::optional<bool> wasRegistered;
// Prior to version kNativePropertiesEncoding, the information about wheter		// Prior to version kNativePropertiesEncoding, the information about wheter
// an op was registered or not wasn't encoded.		// an op was registered or not wasn't encoded.
Show All 31 Lines	BytecodeReader::Impl::parseOpName(EncodingReader &reader,
if (failed(parseEntry(reader, opNames, opName, "operation name")))		if (failed(parseEntry(reader, opNames, opName, "operation name")))
return failure();		return failure();
wasRegistered = opName->wasRegistered;		wasRegistered = opName->wasRegistered;
// Check to see if this operation name has already been resolved. If we		// Check to see if this operation name has already been resolved. If we
// haven't, load the dialect and build the operation name.		// haven't, load the dialect and build the operation name.
if (!opName->opName) {		if (!opName->opName) {
// Load the dialect and its version.		// Load the dialect and its version.
DialectReader dialectReader(attrTypeReader, stringReader, resourceReader,		DialectReader dialectReader(attrTypeReader, stringReader, resourceReader,
reader);		dialectsMap, reader);
if (failed(opName->dialect->load(dialectReader, getContext())))		if (failed(opName->dialect->load(dialectReader, getContext())))
return failure();		return failure();
// If the opName is empty, this is because we use to accept names such as		// If the opName is empty, this is because we use to accept names such as
// `foo` without any `.` separator. We shouldn't tolerate this in textual		// `foo` without any `.` separator. We shouldn't tolerate this in textual
// format anymore but for now we'll be backward compatible. This can only		// format anymore but for now we'll be backward compatible. This can only
// happen with unregistered dialects.		// happen with unregistered dialects.
if (opName->name.empty()) {		if (opName->name.empty()) {
if (opName->dialect->getLoadedDialect())		if (opName->dialect->getLoadedDialect())
Show All 26 Lines	LogicalResult BytecodeReader::Impl::parseResourceSection(
}		}

// If the resource sections are absent, there is nothing to do.		// If the resource sections are absent, there is nothing to do.
if (!resourceData)		if (!resourceData)
return success();		return success();

// Initialize the resource reader with the resource sections.		// Initialize the resource reader with the resource sections.
DialectReader dialectReader(attrTypeReader, stringReader, resourceReader,		DialectReader dialectReader(attrTypeReader, stringReader, resourceReader,
reader);		dialectsMap, reader);
return resourceReader.initialize(fileLoc, config, dialects, stringReader,		return resourceReader.initialize(fileLoc, config, dialects, stringReader,
resourceData, resourceOffsetData,		resourceData, resourceOffsetData,
dialectReader, bufferOwnerRef);		dialectReader, bufferOwnerRef);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// UseListOrder Helpers		// UseListOrder Helpers

▲ Show 20 Lines • Show All 184 Lines • ▼ Show 20 Lines	BytecodeReader::Impl::parseIRSection(ArrayRef<uint8_t> sectionData,
}		}

// Sort use-lists according to what specified in bytecode.		// Sort use-lists according to what specified in bytecode.
if (failed(processUseLists(*moduleOp)))		if (failed(processUseLists(*moduleOp)))
return reader.emitError(		return reader.emitError(
"parsed use-list orders were invalid and could not be applied");		"parsed use-list orders were invalid and could not be applied");

// Resolve dialect version.		// Resolve dialect version.
for (const BytecodeDialect &byteCodeDialect : dialects) {		for (const std::unique_ptr<BytecodeDialect> &byteCodeDialect : dialects) {
// Parsing is complete, give an opportunity to each dialect to visit the		// Parsing is complete, give an opportunity to each dialect to visit the
// IR and perform upgrades.		// IR and perform upgrades.
if (!byteCodeDialect.loadedVersion)		if (!byteCodeDialect->loadedVersion)
continue;		continue;
if (byteCodeDialect.interface &&		if (byteCodeDialect->interface &&
failed(byteCodeDialect.interface->upgradeFromVersion(		failed(byteCodeDialect->interface->upgradeFromVersion(
moduleOp, byteCodeDialect.loadedVersion)))		moduleOp, byteCodeDialect->loadedVersion)))
return failure();		return failure();
}		}

// Verify that the parsed operations are valid.		// Verify that the parsed operations are valid.
if (config.shouldVerifyAfterParse() && failed(verify(*moduleOp)))		if (config.shouldVerifyAfterParse() && failed(verify(*moduleOp)))
return failure();		return failure();

// Splice the parsed operations over to the provided top-level block.		// Splice the parsed operations over to the provided top-level block.
▲ Show 20 Lines • Show All 136 Lines • ▼ Show 20 Lines	if (!wasRegistered)
"Unexpected missing `wasRegistered` opname flag at "		"Unexpected missing `wasRegistered` opname flag at "
"bytecode version ")		"bytecode version ")
<< version << " with properties.";		<< version << " with properties.";
// When an operation is emitted without being registered, the properties are		// When an operation is emitted without being registered, the properties are
// stored as an attribute. Otherwise the op must implement the bytecode		// stored as an attribute. Otherwise the op must implement the bytecode
// interface and control the serialization.		// interface and control the serialization.
if (wasRegistered) {		if (wasRegistered) {
DialectReader dialectReader(attrTypeReader, stringReader, resourceReader,		DialectReader dialectReader(attrTypeReader, stringReader, resourceReader,
reader);		dialectsMap, reader);
if (failed(		if (failed(
propertiesReader.read(fileLoc, dialectReader, &*opName, opState)))		propertiesReader.read(fileLoc, dialectReader, &*opName, opState)))
return failure();		return failure();
} else {		} else {
// If the operation wasn't registered when it was emitted, the properties		// If the operation wasn't registered when it was emitted, the properties
// was serialized as an attribute.		// was serialized as an attribute.
if (failed(parseAttribute(reader, opState.propertiesAttr)))		if (failed(parseAttribute(reader, opState.propertiesAttr)))
return failure();		return failure();
▲ Show 20 Lines • Show All 329 Lines • Show Last 20 Lines

mlir/lib/Bytecode/Writer/BytecodeWriter.cpp

Show All 12 Lines
#include "mlir/Bytecode/Encoding.h"		#include "mlir/Bytecode/Encoding.h"
#include "mlir/IR/Attributes.h"		#include "mlir/IR/Attributes.h"
#include "mlir/IR/Diagnostics.h"		#include "mlir/IR/Diagnostics.h"
#include "mlir/IR/OpImplementation.h"		#include "mlir/IR/OpImplementation.h"
#include "mlir/Support/LogicalResult.h"		#include "mlir/Support/LogicalResult.h"
#include "llvm/ADT/ArrayRef.h"		#include "llvm/ADT/ArrayRef.h"
#include "llvm/ADT/CachedHashString.h"		#include "llvm/ADT/CachedHashString.h"
#include "llvm/ADT/MapVector.h"		#include "llvm/ADT/MapVector.h"
#include "llvm/ADT/SmallString.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/Support/raw_ostream.h"
#include "llvm/Support/Endian.h"		#include "llvm/Support/Endian.h"
#include <cstddef>		#include "llvm/Support/raw_ostream.h"
#include <cstdint>
#include <cstring>
#include <optional>		#include <optional>
#include <sys/types.h>

#define DEBUG_TYPE "mlir-bytecode-writer"		#define DEBUG_TYPE "mlir-bytecode-writer"

using namespace mlir;		using namespace mlir;
using namespace mlir::bytecode::detail;		using namespace mlir::bytecode::detail;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// BytecodeWriterConfig		// BytecodeWriterConfig
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

struct BytecodeWriterConfig::Impl {		struct BytecodeWriterConfig::Impl {
Impl(StringRef producer) : producer(producer) {}		Impl(StringRef producer) : producer(producer) {}

/// Version to use when writing.		/// Version to use when writing.
/// Note: This only differs from kVersion if a specific version is set.		/// Note: This only differs from kVersion if a specific version is set.
int64_t bytecodeVersion = bytecode::kVersion;		int64_t bytecodeVersion = bytecode::kVersion;

/// The producer of the bytecode.		/// The producer of the bytecode.
StringRef producer;		StringRef producer;

		/// Printer callbacks used to emit custom type and attribute encodings.
		llvm::SmallVector<std::unique_ptr<AsmAttrTypeBytecodeWriter<Attribute>>>
		attributeWriterCallbacks;
		llvm::SmallVector<std::unique_ptr<AsmAttrTypeBytecodeWriter<Type>>>
		typeWriterCallbacks;

/// A collection of non-dialect resource printers.		/// A collection of non-dialect resource printers.
SmallVector<std::unique_ptr<AsmResourcePrinter>> externalResourcePrinters;		SmallVector<std::unique_ptr<AsmResourcePrinter>> externalResourcePrinters;
};		};

BytecodeWriterConfig::BytecodeWriterConfig(StringRef producer)		BytecodeWriterConfig::BytecodeWriterConfig(StringRef producer)
: impl(std::make_unique<Impl>(producer)) {}		: impl(std::make_unique<Impl>(producer)) {}
BytecodeWriterConfig::BytecodeWriterConfig(FallbackAsmResourceMap &map,		BytecodeWriterConfig::BytecodeWriterConfig(FallbackAsmResourceMap &map,
StringRef producer)		StringRef producer)
: BytecodeWriterConfig(producer) {		: BytecodeWriterConfig(producer) {
attachFallbackResourcePrinter(map);		attachFallbackResourcePrinter(map);
}		}
BytecodeWriterConfig::~BytecodeWriterConfig() = default;		BytecodeWriterConfig::~BytecodeWriterConfig() = default;

		llvm::SmallVector<std::unique_ptr<AsmAttrTypeBytecodeWriter<Attribute>>> &
		BytecodeWriterConfig::getAttributeWriterCallbacks() const {
		return impl->attributeWriterCallbacks;
		}

		llvm::SmallVector<std::unique_ptr<AsmAttrTypeBytecodeWriter<Type>>> &
		BytecodeWriterConfig::getTypeWriterCallbacks() const {
		return impl->typeWriterCallbacks;
		}

		void BytecodeWriterConfig::attachAttributeCallback(
		std::unique_ptr<AsmAttrTypeBytecodeWriter<Attribute>> callback) {
		impl->attributeWriterCallbacks.emplace_back(std::move(callback));
		}

		void BytecodeWriterConfig::attachTypeCallback(
		std::unique_ptr<AsmAttrTypeBytecodeWriter<Type>> callback) {
		impl->typeWriterCallbacks.emplace_back(std::move(callback));
		}

void BytecodeWriterConfig::attachResourcePrinter(		void BytecodeWriterConfig::attachResourcePrinter(
std::unique_ptr<AsmResourcePrinter> printer) {		std::unique_ptr<AsmResourcePrinter> printer) {
impl->externalResourcePrinters.emplace_back(std::move(printer));		impl->externalResourcePrinters.emplace_back(std::move(printer));
}		}

void BytecodeWriterConfig::setDesiredBytecodeVersion(int64_t bytecodeVersion) {		void BytecodeWriterConfig::setDesiredBytecodeVersion(int64_t bytecodeVersion) {
impl->bytecodeVersion = bytecodeVersion;		impl->bytecodeVersion = bytecodeVersion;
}		}
▲ Show 20 Lines • Show All 691 Lines • ▼ Show 20 Lines	void BytecodeWriter::writeAttrTypeSection(EncodingEmitter &emitter) {
offsetEmitter.emitVarInt(llvm::size(numberingState.getAttributes()));		offsetEmitter.emitVarInt(llvm::size(numberingState.getAttributes()));
offsetEmitter.emitVarInt(llvm::size(numberingState.getTypes()));		offsetEmitter.emitVarInt(llvm::size(numberingState.getTypes()));

// A functor used to emit an attribute or type entry.		// A functor used to emit an attribute or type entry.
uint64_t prevOffset = 0;		uint64_t prevOffset = 0;
auto emitAttrOrType = [&](auto &entry) {		auto emitAttrOrType = [&](auto &entry) {
auto entryValue = entry.getValue();		auto entryValue = entry.getValue();

// First, try to emit this entry using the dialect bytecode interface.		auto emitAttrOrTypeRawImpl = [&]() -> void {
bool hasCustomEncoding = false;		RawEmitterOstream(attrTypeEmitter) << entryValue;
if (const BytecodeDialectInterface *interface = entry.dialect->interface) {		attrTypeEmitter.emitByte(0);
// The writer used when emitting using a custom bytecode encoding.		};
		auto emitAttrOrTypeImpl = [&]() -> bool {
		// TODO: We don't currently support custom encoded mutable types and
		// attributes.
		if (entryValue.template hasTrait<TypeTrait::IsMutable>() \|\|
		entryValue.template hasTrait<AttributeTrait::IsMutable>()) {
		emitAttrOrTypeRawImpl();
		return false;
		}

DialectWriter dialectWriter(config.bytecodeVersion, attrTypeEmitter,		DialectWriter dialectWriter(config.bytecodeVersion, attrTypeEmitter,
numberingState, stringSection);		numberingState, stringSection);

if constexpr (std::is_same_v<std::decay_t<decltype(entryValue)>, Type>) {		if constexpr (std::is_same_v<std::decay_t<decltype(entryValue)>, Type>) {
// TODO: We don't currently support custom encoded mutable types.		for (const auto &callback : config.typeWriterCallbacks) {
hasCustomEncoding =		if (succeeded(callback->write(entryValue, dialectWriter)))
!entryValue.template hasTrait<TypeTrait::IsMutable>() &&		return true;
succeeded(interface->writeType(entryValue, dialectWriter));		}
		if (const BytecodeDialectInterface *interface =
		entry.dialect->interface) {
		if (succeeded(interface->writeType(entryValue, dialectWriter)))
		return true;
		}
} else {		} else {
// TODO: We don't currently support custom encoded mutable attributes.		for (const auto &callback : config.attributeWriterCallbacks) {
hasCustomEncoding =		if (succeeded(callback->write(entryValue, dialectWriter)))
!entryValue.template hasTrait<AttributeTrait::IsMutable>() &&		return true;
succeeded(interface->writeAttribute(entryValue, dialectWriter));
}		}
		if (const BytecodeDialectInterface *interface =
		entry.dialect->interface) {
		if (succeeded(interface->writeAttribute(entryValue, dialectWriter)))
		return true;
}		}

// If the entry was not emitted using the dialect interface, emit it using
// the textual format.
if (!hasCustomEncoding) {
RawEmitterOstream(attrTypeEmitter) << entryValue;
attrTypeEmitter.emitByte(0);
}		}

		// If the entry was not emitted using a callback or a dialect interface,
		// emit it using the textual format.
		emitAttrOrTypeRawImpl();
		return false;
		};

		bool hasCustomEncoding = emitAttrOrTypeImpl();
		mehdi_aminiUnsubmitted Done Reply Inline Actions Can you extract the above in a lambda "emitAttrOrTypeImpl(..)": you could do a lot of early return which would simplify the control flow and reduce the indentation. The `hasCustomEncoding` boolean should disappear basically. mehdi_amini: Can you extract the above in a lambda "emitAttrOrTypeImpl(..)": you could do a lot of early…

// Record the offset of this entry.		// Record the offset of this entry.
uint64_t curOffset = attrTypeEmitter.size();		uint64_t curOffset = attrTypeEmitter.size();
offsetEmitter.emitVarIntWithFlag(curOffset - prevOffset, hasCustomEncoding);		offsetEmitter.emitVarIntWithFlag(curOffset - prevOffset, hasCustomEncoding);
prevOffset = curOffset;		prevOffset = curOffset;
};		};

// Emit the attribute and type entries for each dialect.		// Emit the attribute and type entries for each dialect.
writeDialectGrouping(offsetEmitter, numberingState.getAttributes(),		writeDialectGrouping(offsetEmitter, numberingState.getAttributes(),
▲ Show 20 Lines • Show All 414 Lines • Show Last 20 Lines

mlir/lib/Bytecode/Writer/IRNumbering.cpp

Show First 20 Lines • Show All 194 Lines • ▼ Show 20 Lines	void IRNumberingState::number(Attribute attr) {
if (OpaqueAttr opaqueAttr = dyn_cast<OpaqueAttr>(attr)) {		if (OpaqueAttr opaqueAttr = dyn_cast<OpaqueAttr>(attr)) {
numbering->dialect = &numberDialect(opaqueAttr.getDialectNamespace());		numbering->dialect = &numberDialect(opaqueAttr.getDialectNamespace());
return;		return;
}		}
numbering->dialect = &numberDialect(&attr.getDialect());		numbering->dialect = &numberDialect(&attr.getDialect());

// If this attribute will be emitted using the bytecode format, perform a		// If this attribute will be emitted using the bytecode format, perform a
// dummy writing to number any nested components.		// dummy writing to number any nested components.
if (const auto *interface = numbering->dialect->interface) {
// TODO: We don't allow custom encodings for mutable attributes right now.		// TODO: We don't allow custom encodings for mutable attributes right now.
if (!attr.hasTrait<AttributeTrait::IsMutable>()) {		if (!attr.hasTrait<AttributeTrait::IsMutable>()) {
		// Try overriding emission with callbacks.
		for (const auto &callback : config.getAttributeWriterCallbacks()) {
		NumberingDialectWriter writer(*this);
		// The client has the ability to override the group name through the
		// callback.
		std::optional<StringRef> groupNameOverride;
		if (succeeded(callback->write(attr, groupNameOverride, writer))) {
		if (groupNameOverride.has_value())
		numbering->dialect = &numberDialect(*groupNameOverride);
		return;
		}
		}

		if (const auto *interface = numbering->dialect->interface) {
NumberingDialectWriter writer(*this);		NumberingDialectWriter writer(*this);
if (succeeded(interface->writeAttribute(attr, writer)))		if (succeeded(interface->writeAttribute(attr, writer)))
return;		return;
}		}
}		}
// If this attribute will be emitted using the fallback, number the nested		// If this attribute will be emitted using the fallback, number the nested
// dialect resources. We don't number everything (e.g. no nested		// dialect resources. We don't number everything (e.g. no nested
// attributes/types), because we don't want to encode things we won't decode		// attributes/types), because we don't want to encode things we won't decode
▲ Show 20 Lines • Show All 131 Lines • ▼ Show 20 Lines	void IRNumberingState::number(Type type) {
if (OpaqueType opaqueType = dyn_cast<OpaqueType>(type)) {		if (OpaqueType opaqueType = dyn_cast<OpaqueType>(type)) {
numbering->dialect = &numberDialect(opaqueType.getDialectNamespace());		numbering->dialect = &numberDialect(opaqueType.getDialectNamespace());
return;		return;
}		}
numbering->dialect = &numberDialect(&type.getDialect());		numbering->dialect = &numberDialect(&type.getDialect());

// If this type will be emitted using the bytecode format, perform a dummy		// If this type will be emitted using the bytecode format, perform a dummy
// writing to number any nested components.		// writing to number any nested components.
if (const auto *interface = numbering->dialect->interface) {
// TODO: We don't allow custom encodings for mutable types right now.		// TODO: We don't allow custom encodings for mutable types right now.
if (!type.hasTrait<TypeTrait::IsMutable>()) {		if (!type.hasTrait<TypeTrait::IsMutable>()) {
		// Try overriding emission with callbacks.
		for (const auto &callback : config.getTypeWriterCallbacks()) {
		NumberingDialectWriter writer(*this);
		// The client has the ability to override the group name through the
		// callback.
		std::optional<StringRef> groupNameOverride;
		if (succeeded(callback->write(type, groupNameOverride, writer))) {
		if (groupNameOverride.has_value())
		numbering->dialect = &numberDialect(*groupNameOverride);
		return;
		}
		}

		// If this attribute will be emitted using the bytecode format, perform a
		// dummy writing to number any nested components.
		if (const auto *interface = numbering->dialect->interface) {
NumberingDialectWriter writer(*this);		NumberingDialectWriter writer(*this);
if (succeeded(interface->writeType(type, writer)))		if (succeeded(interface->writeType(type, writer)))
return;		return;
}		}
}		}
// If this type will be emitted using the fallback, number the nested dialect		// If this type will be emitted using the fallback, number the nested dialect
// resources. We don't number everything (e.g. no nested attributes/types),		// resources. We don't number everything (e.g. no nested attributes/types),
// because we don't want to encode things we won't decode (the textual format		// because we don't want to encode things we won't decode (the textual format
▲ Show 20 Lines • Show All 80 Lines • Show Last 20 Lines

mlir/lib/IR/BuiltinDialect.cpp

//===- BuiltinDialect.cpp - MLIR Builtin Dialect --------------------------===//		//===- BuiltinDialect.cpp - MLIR Builtin Dialect --------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file contains the Builtin dialect that contains all of the attributes,		// This file contains the Builtin dialect that contains all of the attributes,
// operations, and types that are necessary for the validity of the IR.		// operations, and types that are necessary for the validity of the IR.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "mlir/IR/BuiltinDialect.h"		#include "mlir/IR/BuiltinDialect.h"
#include "BuiltinDialectBytecode.h"
#include "mlir/IR/Builders.h"		#include "mlir/IR/Builders.h"
		#include "mlir/IR/BuiltinDialectBytecode.h"
#include "mlir/IR/BuiltinOps.h"		#include "mlir/IR/BuiltinOps.h"
#include "mlir/IR/BuiltinTypes.h"		#include "mlir/IR/BuiltinTypes.h"
#include "mlir/IR/DialectResourceBlobManager.h"		#include "mlir/IR/DialectResourceBlobManager.h"
#include "mlir/IR/IRMapping.h"		#include "mlir/IR/IRMapping.h"
#include "mlir/IR/OpImplementation.h"		#include "mlir/IR/OpImplementation.h"
#include "mlir/IR/PatternMatch.h"		#include "mlir/IR/PatternMatch.h"
#include "mlir/IR/TypeRange.h"		#include "mlir/IR/TypeRange.h"

using namespace mlir;		using namespace mlir;
		using namespace builtin;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// TableGen'erated dialect		// TableGen'erated dialect
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "mlir/IR/BuiltinDialect.cpp.inc"		#include "mlir/IR/BuiltinDialect.cpp.inc"

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 83 Lines • ▼ Show 20 Lines	void BuiltinDialect::initialize() {
registerLocationAttributes();		registerLocationAttributes();
addOperations<		addOperations<
#define GET_OP_LIST		#define GET_OP_LIST
#include "mlir/IR/BuiltinOps.cpp.inc"		#include "mlir/IR/BuiltinOps.cpp.inc"
>();		>();

auto &blobInterface = addInterface<BuiltinBlobManagerInterface>();		auto &blobInterface = addInterface<BuiltinBlobManagerInterface>();
addInterface<BuiltinOpAsmDialectInterface>(blobInterface);		addInterface<BuiltinOpAsmDialectInterface>(blobInterface);
builtin_dialect_detail::addBytecodeInterface(this);		addBytecodeInterface(this);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// ModuleOp		// ModuleOp
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

void ModuleOp::build(OpBuilder &builder, OperationState &state,		void ModuleOp::build(OpBuilder &builder, OperationState &state,
std::optional<StringRef> name) {		std::optional<StringRef> name) {
▲ Show 20 Lines • Show All 104 Lines • Show Last 20 Lines

mlir/lib/IR/BuiltinDialectBytecode.h

This file was moved to mlir/include/mlir/IR/BuiltinDialectBytecode.h.

mlir/lib/IR/BuiltinDialectBytecode.cpp

//===- BuiltinDialectBytecode.cpp - Builtin Bytecode Implementation -------===//		//===- BuiltinDialectBytecode.cpp - Builtin Bytecode Implementation -------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "BuiltinDialectBytecode.h"		#include "mlir/IR/BuiltinDialectBytecode.h"
#include "AttributeDetail.h"		#include "AttributeDetail.h"
#include "mlir/Bytecode/BytecodeImplementation.h"		#include "mlir/Bytecode/BytecodeImplementation.h"
#include "mlir/IR/BuiltinAttributes.h"		#include "mlir/IR/BuiltinAttributes.h"
#include "mlir/IR/BuiltinDialect.h"		#include "mlir/IR/BuiltinDialect.h"
#include "mlir/IR/BuiltinTypes.h"		#include "mlir/IR/BuiltinTypes.h"
#include "mlir/IR/Diagnostics.h"		#include "mlir/IR/Diagnostics.h"
#include "mlir/IR/DialectResourceBlobManager.h"		#include "mlir/IR/DialectResourceBlobManager.h"
#include "llvm/ADT/TypeSwitch.h"		#include "llvm/ADT/TypeSwitch.h"

using namespace mlir;		using namespace mlir;

//===----------------------------------------------------------------------===//
// BuiltinDialectBytecodeInterface
//===----------------------------------------------------------------------===//

namespace {		namespace {

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Utility functions		// Utility functions

// TODO: Move these to separate file.		// TODO: Move these to separate file.

// Returns the bitwidth if known, else return 0.		// Returns the bitwidth if known, else return 0.
▲ Show 20 Lines • Show All 43 Lines • ▼ Show 20 Lines	if (isSplat)
return writer.writeOwnedString(attr.getRawStringData().front());		return writer.writeOwnedString(attr.getRawStringData().front());

for (StringRef str : attr.getRawStringData())		for (StringRef str : attr.getRawStringData())
writer.writeOwnedString(str);		writer.writeOwnedString(str);
}		}

#include "mlir/IR/BuiltinDialectBytecode.cpp.inc"		#include "mlir/IR/BuiltinDialectBytecode.cpp.inc"

		//===----------------------------------------------------------------------===//
		// BuiltinDialectBytecodeInterface
		//===----------------------------------------------------------------------===//

/// This class implements the bytecode interface for the builtin dialect.		/// This class implements the bytecode interface for the builtin dialect.
struct BuiltinDialectBytecodeInterface : public BytecodeDialectInterface {		struct BuiltinDialectBytecodeInterface : public BytecodeDialectInterface {
BuiltinDialectBytecodeInterface(Dialect *dialect)		BuiltinDialectBytecodeInterface(Dialect *dialect)
: BytecodeDialectInterface(dialect) {}		: BytecodeDialectInterface(dialect) {}

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Attributes		// Attributes

		mehdi_aminiUnsubmitted Done Reply Inline Actions I believe that in general we prefer `using namespace` in implementation files, and fully qualify function definition? mehdi_amini: I believe that in general we prefer `using namespace` in implementation files, and fully…
Attribute readAttribute(DialectBytecodeReader &reader) const override {		Attribute readAttribute(DialectBytecodeReader &reader) const override {
return ::readAttribute(getContext(), reader);		return ::readAttribute(getContext(), reader);
}		}

LogicalResult writeAttribute(Attribute attr,		LogicalResult writeAttribute(Attribute attr,
DialectBytecodeWriter &writer) const override {		DialectBytecodeWriter &writer) const override {
return ::writeAttribute(attr, writer);		return ::writeAttribute(attr, writer);
}		}

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Types		// Types

Type readType(DialectBytecodeReader &reader) const override {		Type readType(DialectBytecodeReader &reader) const override {
return ::readType(getContext(), reader);		return ::readType(getContext(), reader);
}		}

LogicalResult writeType(Type type,		LogicalResult writeType(Type type,
DialectBytecodeWriter &writer) const override {		DialectBytecodeWriter &writer) const override {
return ::writeType(type, writer);		return ::writeType(type, writer);
}		}
};		};
} // namespace		} // namespace

void builtin_dialect_detail::addBytecodeInterface(BuiltinDialect *dialect) {		Attribute builtin::readAttribute(DialectBytecodeReader &reader) {
		return ::readAttribute(reader.getContext(), reader);
		}

		LogicalResult builtin::writeAttribute(Attribute attribute,
		DialectBytecodeWriter &writer) {
		return ::writeAttribute(attribute, writer);
		}

		Type builtin::readType(DialectBytecodeReader &reader) {
		return ::readType(reader.getContext(), reader);
		}

		LogicalResult builtin::writeType(Type type, DialectBytecodeWriter &writer) {
		return ::writeType(type, writer);
		}
		rriddleUnsubmitted Not Done Reply Inline Actions Why do we need to expose these at all? As opposed to casting the builtin dialect to DialectBytecodeInterface, and going through the normal path? rriddle: Why do we need to expose these at all? As opposed to casting the builtin dialect to…
		mfrancioAuthorUnsubmitted Done Reply Inline Actions That would require exposing the interface for builtin, which I had originally done in a previous version of the patch :). However, after few rounds of revisions the consensus was that it could have been better to leave the interface as an internal implementation detail and expose single hooks. In previous revisions, I was also asked to try to expose top level entry point for writing type/attributes in a dialect agnostic fashion, similarly to what the textual parser does, but we realized that it was not going to fit in the current design. Hence, exposing those hooks was the closest I could go to that idea, which preserves the original intent of @jpienaar (having the bytecode dialect interface as an internal implementation detail and not exposed). Those functions here are only needed for the tests, but it is not unreasonable to expect clients to use them. Let me know if you are strongly against leaving as is, and I can revise as necessary. mfrancio: That would require exposing the interface for builtin, which I had originally done in a…
		rriddleUnsubmitted Done Reply Inline Actions DialectBytecodeInterface is an already exposed api, you can just do `cast<DialectBytecodeInterface>(type.getDialect())` to get the virtual instance. I don't see which part of that requires exposing anything from the builtin dialect? rriddle: DialectBytecodeInterface is an already exposed api, you can just do…
		mfrancioAuthorUnsubmitted Done Reply Inline Actions I apologize for the confusion - it seems casting directly is not allowed, but I was able to retrieve the virtual instance through getRegisteredInterface<BytecodeDialectInterface>()! For some reason I had assumed at first that type id of the interface was going to be different. Thanks for pointing it out. mfrancio: I apologize for the confusion - it seems casting directly is not allowed, but I was able to…

		void builtin::addBytecodeInterface(BuiltinDialect *dialect) {
dialect->addInterfaces<BuiltinDialectBytecodeInterface>();		dialect->addInterfaces<BuiltinDialectBytecodeInterface>();
}		}

mlir/test/Bytecode/bytecode_callback.mlir

This file was added.

				// RUN: mlir-opt %s --test-bytecode-callback="test-dialect-version=1.2" -verify-diagnostics \| FileCheck %s --check-prefix=VERSION_1_2
				// RUN: mlir-opt %s --test-bytecode-callback="test-dialect-version=2.0" -verify-diagnostics \| FileCheck %s --check-prefix=VERSION_2_0

				func.func @base_test(%arg0 : i32) -> f32 {
				%0 = "test.addi"(%arg0, %arg0) : (i32, i32) -> i32
				%1 = "test.cast"(%0) : (i32) -> f32
				return %1 : f32
				}

				// VERSION_1_2: Overriding IntegerType encoding...
				// VERSION_1_2: Overriding parsing of IntegerType encoding...

				// VERSION_2_0-NOT: Overriding IntegerType encoding...
				// VERSION_2_0-NOT: Overriding parsing of IntegerType encoding...

mlir/test/Bytecode/bytecode_callback_full_override.mlir

This file was added.

				// RUN: not mlir-opt %s -split-input-file --test-bytecode-callback="callback-test=5" 2>&1 \| FileCheck %s

				// CHECK-NOT: failed to read bytecode
				func.func @base_test(%arg0 : i32) -> f32 {
				%0 = "test.addi"(%arg0, %arg0) : (i32, i32) -> i32
				%1 = "test.cast"(%0) : (i32) -> f32
				return %1 : f32
				}

				// -----

				// CHECK-LABEL: error: unknown attribute code: 99
				// CHECK: failed to read bytecode
				func.func @base_test(%arg0 : !test.i32) -> f32 {
				%0 = "test.addi"(%arg0, %arg0) : (!test.i32, !test.i32) -> !test.i32
				%1 = "test.cast"(%0) : (!test.i32) -> f32
				return %1 : f32
				}

mlir/test/Bytecode/bytecode_callback_with_custom_attribute.mlir

This file was added.

				// RUN: mlir-opt %s -split-input-file --test-bytecode-callback="callback-test=3" \| FileCheck %s --check-prefix=TEST_3
				// RUN: mlir-opt %s -split-input-file --test-bytecode-callback="callback-test=4" \| FileCheck %s --check-prefix=TEST_4

				"test.versionedC"() <{attribute = #test.attr_params<42, 24>}> : () -> ()

				// TEST_3: Overriding TestAttrParamsAttr encoding...
				// TEST_3: "test.versionedC"() <{attribute = dense<[42, 24]> : tensor<2xi32>}> : () -> ()

				// -----

				"test.versionedC"() <{attribute = dense<[42, 24]> : tensor<2xi32>}> : () -> ()

				// TEST_4: Overriding parsing of TestAttrParamsAttr encoding...
				// TEST_4: "test.versionedC"() <{attribute = #test.attr_params<42, 24>}> : () -> ()

mlir/test/Bytecode/bytecode_callback_with_custom_type.mlir

This file was added.

				// RUN: mlir-opt %s -split-input-file --test-bytecode-callback="callback-test=1" \| FileCheck %s --check-prefix=TEST_1
				// RUN: mlir-opt %s -split-input-file --test-bytecode-callback="callback-test=2" \| FileCheck %s --check-prefix=TEST_2

				func.func @base_test(%arg0: !test.i32, %arg1: f32) {
				return
				}

				// TEST_1: Overriding TestI32Type encoding...
				// TEST_1: func.func @base_test([[ARG0:%.+]]: i32, [[ARG1:%.+]]: f32) {

				// -----

				func.func @base_test(%arg0: i32, %arg1: f32) {
				return
				}

				// TEST_2: Overriding parsing of TestI32Type encoding...
				// TEST_2: func.func @base_test([[ARG0:%.+]]: !test.i32, [[ARG1:%.+]]: f32) {

mlir/test/Bytecode/invalid/invalid_attr_type_section.mlir

	// This file contains various failure test cases related to the structure of			// This file contains various failure test cases related to the structure of
	// the attribute/type offset section.			// the attribute/type offset section.

	//===--------------------------------------------------------------------===//			//===--------------------------------------------------------------------===//
	// Index			// Index
	//===--------------------------------------------------------------------===//			//===--------------------------------------------------------------------===//

	// RUN: not mlir-opt %S/invalid-attr_type_section-index.mlirbc 2>&1 \| FileCheck %s --check-prefix=INDEX			// RUN: not mlir-opt %S/invalid-attr_type_section-index.mlirbc -allow-unregistered-dialect 2>&1 \| FileCheck %s --check-prefix=INDEX
	// INDEX: invalid Attribute index: 3			// INDEX: invalid Attribute index: 3

	//===--------------------------------------------------------------------===//			//===--------------------------------------------------------------------===//
	// Trailing Data			// Trailing Data
	//===--------------------------------------------------------------------===//			//===--------------------------------------------------------------------===//

	// RUN: not mlir-opt %S/invalid-attr_type_section-trailing_data.mlirbc 2>&1 \| FileCheck %s --check-prefix=TRAILING_DATA			// RUN: not mlir-opt %S/invalid-attr_type_section-trailing_data.mlirbc -allow-unregistered-dialect 2>&1 \| FileCheck %s --check-prefix=TRAILING_DATA
	// TRAILING_DATA: trailing characters found after Attribute assembly format: trailing			// TRAILING_DATA: trailing characters found after Attribute assembly format: trailing

mlir/test/lib/Dialect/Test/TestDialect.h

	//===- TestDialect.h - MLIR Dialect for testing ------------------ C++ --===//			//===- TestDialect.h - MLIR Dialect for testing ------------------ C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file defines a fake 'test' dialect that can be used for testing things			// This file defines a fake 'test' dialect that can be used for testing things
	// that do not have a respective counterpart in the main source directories.			// that do not have a respective counterpart in the main source directories.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef MLIR_TESTDIALECT_H			#ifndef MLIR_TESTDIALECT_H
	#define MLIR_TESTDIALECT_H			#define MLIR_TESTDIALECT_H

	#include "TestTypes.h"
	#include "TestAttributes.h"			#include "TestAttributes.h"
	#include "TestInterfaces.h"			#include "TestInterfaces.h"
				#include "TestTypes.h"
				#include "mlir/Bytecode/BytecodeImplementation.h"
	#include "mlir/Dialect/DLTI/DLTI.h"			#include "mlir/Dialect/DLTI/DLTI.h"
	#include "mlir/Dialect/DLTI/Traits.h"			#include "mlir/Dialect/DLTI/Traits.h"
	#include "mlir/Dialect/Func/IR/FuncOps.h"			#include "mlir/Dialect/Func/IR/FuncOps.h"
	#include "mlir/Dialect/Linalg/IR/Linalg.h"			#include "mlir/Dialect/Linalg/IR/Linalg.h"
	#include "mlir/Dialect/Traits.h"			#include "mlir/Dialect/Traits.h"
	#include "mlir/IR/AsmState.h"			#include "mlir/IR/AsmState.h"
	#include "mlir/IR/BuiltinOps.h"			#include "mlir/IR/BuiltinOps.h"
	#include "mlir/IR/BuiltinTypes.h"			#include "mlir/IR/BuiltinTypes.h"
	Show All 24 Lines
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// TestDialect			// TestDialect
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "TestOpInterfaces.h.inc"			#include "TestOpInterfaces.h.inc"
	#include "TestOpsDialect.h.inc"			#include "TestOpsDialect.h.inc"

	namespace test {			namespace test {

				//===----------------------------------------------------------------------===//
				// TestDialect version utilities
				//===----------------------------------------------------------------------===//

				struct TestDialectVersion : public mlir::DialectVersion {
				TestDialectVersion() = default;
				TestDialectVersion(uint32_t _major, uint32_t _minor)
				: major(_major), minor(_minor){};
				uint32_t major = 2;
				uint32_t minor = 0;
				};

	// Define some classes to exercises the Properties feature.			// Define some classes to exercises the Properties feature.

	struct PropertiesWithCustomPrint {			struct PropertiesWithCustomPrint {
	/// A shared_ptr to a const object is safe: it is equivalent to a value-based			/// A shared_ptr to a const object is safe: it is equivalent to a value-based
	/// member. Here the label will be deallocated when the last operation			/// member. Here the label will be deallocated when the last operation
	/// refering to it is destroyed. However there is no pool-allocation: this is			/// refering to it is destroyed. However there is no pool-allocation: this is
	/// offloaded to the client.			/// offloaded to the client.
	std::shared_ptr<const std::string> label;			std::shared_ptr<const std::string> label;
	▲ Show 20 Lines • Show All 47 Lines • Show Last 20 Lines

mlir/test/lib/Dialect/Test/TestDialect.cpp

	//===- TestDialect.cpp - MLIR Dialect for Testing -------------------------===//			//===- TestDialect.cpp - MLIR Dialect for Testing -------------------------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "TestDialect.h"			#include "TestDialect.h"
	#include "TestAttributes.h"			#include "TestAttributes.h"
	#include "TestInterfaces.h"			#include "TestInterfaces.h"
	#include "TestTypes.h"			#include "TestTypes.h"
	#include "mlir/Bytecode/BytecodeImplementation.h"
	#include "mlir/Dialect/Arith/IR/Arith.h"			#include "mlir/Dialect/Arith/IR/Arith.h"
	#include "mlir/Dialect/Func/IR/FuncOps.h"			#include "mlir/Dialect/Func/IR/FuncOps.h"
	#include "mlir/Dialect/Tensor/IR/Tensor.h"			#include "mlir/Dialect/Tensor/IR/Tensor.h"
	#include "mlir/IR/AsmState.h"			#include "mlir/IR/AsmState.h"
	#include "mlir/IR/BuiltinAttributes.h"			#include "mlir/IR/BuiltinAttributes.h"
	#include "mlir/IR/BuiltinOps.h"			#include "mlir/IR/BuiltinOps.h"
	#include "mlir/IR/Diagnostics.h"			#include "mlir/IR/Diagnostics.h"
	#include "mlir/IR/ExtensibleDialect.h"			#include "mlir/IR/ExtensibleDialect.h"
	▲ Show 20 Lines • Show All 93 Lines • ▼ Show 20 Lines
	static ParseResult customParseProperties(OpAsmParser &parser,			static ParseResult customParseProperties(OpAsmParser &parser,
	PropertiesWithCustomPrint &prop);			PropertiesWithCustomPrint &prop);

	void test::registerTestDialect(DialectRegistry &registry) {			void test::registerTestDialect(DialectRegistry &registry) {
	registry.insert<TestDialect>();			registry.insert<TestDialect>();
	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// TestDialect version utilities
	//===----------------------------------------------------------------------===//

	struct TestDialectVersion : public DialectVersion {
	uint32_t major = 2;
	uint32_t minor = 0;
	};

	//===----------------------------------------------------------------------===//
	// TestDialect Interfaces			// TestDialect Interfaces
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	namespace {			namespace {

	/// Testing the correctness of some traits.			/// Testing the correctness of some traits.
	static_assert(			static_assert(
	llvm::is_detected<OpTrait::has_implicit_terminator_t,			llvm::is_detected<OpTrait::has_implicit_terminator_t,
	SingleBlockImplicitTerminatorOp>::value,			SingleBlockImplicitTerminatorOp>::value,
	"has_implicit_terminator_t does not match SingleBlockImplicitTerminatorOp");			"has_implicit_terminator_t does not match SingleBlockImplicitTerminatorOp");
	static_assert(OpTrait::hasSingleBlockImplicitTerminator<			static_assert(OpTrait::hasSingleBlockImplicitTerminator<
	SingleBlockImplicitTerminatorOp>::value,			SingleBlockImplicitTerminatorOp>::value,
	"hasSingleBlockImplicitTerminator does not match "			"hasSingleBlockImplicitTerminator does not match "
	"SingleBlockImplicitTerminatorOp");			"SingleBlockImplicitTerminatorOp");

	struct TestResourceBlobManagerInterface			struct TestResourceBlobManagerInterface
	: public ResourceBlobManagerDialectInterfaceBase<			: public ResourceBlobManagerDialectInterfaceBase<
	TestDialectResourceBlobHandle> {			TestDialectResourceBlobHandle> {
	using ResourceBlobManagerDialectInterfaceBase<			using ResourceBlobManagerDialectInterfaceBase<
	TestDialectResourceBlobHandle>::ResourceBlobManagerDialectInterfaceBase;			TestDialectResourceBlobHandle>::ResourceBlobManagerDialectInterfaceBase;
	};			};

	namespace {			namespace {
	enum test_encoding { k_attr_params = 0 };			enum test_encoding { k_attr_params = 0, k_test_i32 = 99 };
	}			}

	// Test support for interacting with the Bytecode reader/writer.			// Test support for interacting with the Bytecode reader/writer.
	struct TestBytecodeDialectInterface : public BytecodeDialectInterface {			struct TestBytecodeDialectInterface : public BytecodeDialectInterface {
	using BytecodeDialectInterface::BytecodeDialectInterface;			using BytecodeDialectInterface::BytecodeDialectInterface;
	TestBytecodeDialectInterface(Dialect *dialect)			TestBytecodeDialectInterface(Dialect *dialect)
	: BytecodeDialectInterface(dialect) {}			: BytecodeDialectInterface(dialect) {}

				LogicalResult writeType(Type type,
				DialectBytecodeWriter &writer) const final {
				if (auto concreteType = llvm::dyn_cast<TestI32Type>(type)) {
				writer.writeVarInt(test_encoding::k_test_i32);
				return success();
				}
				return failure();
				}

				Type readType(DialectBytecodeReader &reader) const final {
				uint64_t encoding;
				if (failed(reader.readVarInt(encoding)))
				return Type();
				if (encoding == test_encoding::k_test_i32)
				return TestI32Type::get(getContext());
				return Type();
				}

	LogicalResult writeAttribute(Attribute attr,			LogicalResult writeAttribute(Attribute attr,
	DialectBytecodeWriter &writer) const final {			DialectBytecodeWriter &writer) const final {
	if (auto concreteAttr = llvm::dyn_cast<TestAttrParamsAttr>(attr)) {			if (auto concreteAttr = llvm::dyn_cast<TestAttrParamsAttr>(attr)) {
	writer.writeVarInt(test_encoding::k_attr_params);			writer.writeVarInt(test_encoding::k_attr_params);
	writer.writeVarInt(concreteAttr.getV0());			writer.writeVarInt(concreteAttr.getV0());
	writer.writeVarInt(concreteAttr.getV1());			writer.writeVarInt(concreteAttr.getV1());
	return success();			return success();
	}			}
	return failure();			return failure();
	}			}

	Attribute readAttribute(DialectBytecodeReader &reader,			Attribute readAttribute(DialectBytecodeReader &reader) const final {
	const DialectVersion &version_) const final {			auto versionOr = reader.getDialectVersion("test");
	const auto &version = static_cast<const TestDialectVersion &>(version_);			// Assume current version if not available through the reader.
				const auto version =
				(succeeded(versionOr))
				? reinterpret_cast<const TestDialectVersion >(*versionOr)
				: TestDialectVersion();
	if (version.major < 2)			if (version.major < 2)
	return readAttrOldEncoding(reader);			return readAttrOldEncoding(reader);
	if (version.major == 2 && version.minor == 0)			if (version.major == 2 && version.minor == 0)
	return readAttrNewEncoding(reader);			return readAttrNewEncoding(reader);
	// Forbid reading future versions by returning nullptr.			// Forbid reading future versions by returning nullptr.
	return Attribute();			return Attribute();
	}			}

	▲ Show 20 Lines • Show All 1,756 Lines • Show Last 20 Lines

mlir/test/lib/Dialect/Test/TestOps.td

Show First 20 Lines • Show All 1,305 Lines • ▼ Show 20 Lines

def TestOpWithVariadicResultsAndFolder: TEST_Op<"op_with_variadic_results_and_folder"> {		def TestOpWithVariadicResultsAndFolder: TEST_Op<"op_with_variadic_results_and_folder"> {
let arguments = (ins Variadic<I32>);		let arguments = (ins Variadic<I32>);
let results = (outs Variadic<I32>);		let results = (outs Variadic<I32>);
let hasFolder = 1;		let hasFolder = 1;
}		}

def TestAddIOp : TEST_Op<"addi"> {		def TestAddIOp : TEST_Op<"addi"> {
let arguments = (ins I32:$op1, I32:$op2);		let arguments = (ins AnyTypeOf<[I32, TestI32]>:$op1,
let results = (outs I32);		AnyTypeOf<[I32, TestI32]>:$op2);
		let results = (outs AnyTypeOf<[I32, TestI32]>);
}		}

def TestCommutativeOp : TEST_Op<"op_commutative", [Commutative]> {		def TestCommutativeOp : TEST_Op<"op_commutative", [Commutative]> {
let arguments = (ins I32:$op1, I32:$op2, I32:$op3, I32:$op4);		let arguments = (ins I32:$op1, I32:$op2, I32:$op3, I32:$op4);
let results = (outs I32);		let results = (outs I32);
}		}

def TestLargeCommutativeOp : TEST_Op<"op_large_commutative", [Commutative]> {		def TestLargeCommutativeOp : TEST_Op<"op_large_commutative", [Commutative]> {
▲ Show 20 Lines • Show All 1,986 Lines • ▼ Show 20 Lines	def TestVersionedOpB : TEST_Op<"versionedB"> {
//		//
// We support loading old IR through a custom readAttribute method, see		// We support loading old IR through a custom readAttribute method, see
// `readAttribute()` in `TestBytecodeDialectInterface`		// `readAttribute()` in `TestBytecodeDialectInterface`
let arguments = (ins		let arguments = (ins
TestAttrParams:$attribute		TestAttrParams:$attribute
);		);
}		}

		def TestVersionedOpC : TEST_Op<"versionedC"> {
		let arguments = (ins AnyAttrOf<[TestAttrParams,
		I32ElementsAttr]>:$attribute
		);
		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Test Properties		// Test Properties
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//


// Op with a properties struct defined inline.		// Op with a properties struct defined inline.
def TestOpWithProperties : TEST_Op<"with_properties"> {		def TestOpWithProperties : TEST_Op<"with_properties"> {
let assemblyFormat = "prop-dict attr-dict";		let assemblyFormat = "prop-dict attr-dict";
▲ Show 20 Lines • Show All 89 Lines • Show Last 20 Lines

mlir/test/lib/Dialect/Test/TestTypeDefs.td

	Show First 20 Lines • Show All 363 Lines • ▼ Show 20 Lines

	def TestTypeElseAnchorStruct : Test_Type<"TestTypeElseAnchorStruct"> {			def TestTypeElseAnchorStruct : Test_Type<"TestTypeElseAnchorStruct"> {
	let parameters = (ins OptionalParameter<"std::optional<int>">:$a,			let parameters = (ins OptionalParameter<"std::optional<int>">:$a,
	OptionalParameter<"std::optional<int>">:$b);			OptionalParameter<"std::optional<int>">:$b);
	let mnemonic = "else_anchor_struct";			let mnemonic = "else_anchor_struct";
	let assemblyFormat = "`<` (`?`) : (struct($a, $b)^)? `>`";			let assemblyFormat = "`<` (`?`) : (struct($a, $b)^)? `>`";
	}			}

				def TestI32 : Test_Type<"TestI32"> {
				let mnemonic = "i32";
				}

	#endif // TEST_TYPEDEFS			#endif // TEST_TYPEDEFS

mlir/test/lib/IR/CMakeLists.txt

	# Exclude tests from libMLIR.so			# Exclude tests from libMLIR.so
	add_mlir_library(MLIRTestIR			add_mlir_library(MLIRTestIR
				TestBytecodeCallbacks.cpp
	TestBuiltinAttributeInterfaces.cpp			TestBuiltinAttributeInterfaces.cpp
	TestBuiltinDistinctAttributes.cpp			TestBuiltinDistinctAttributes.cpp
	TestClone.cpp			TestClone.cpp
	TestDiagnostics.cpp			TestDiagnostics.cpp
	TestDominance.cpp			TestDominance.cpp
	TestFunc.cpp			TestFunc.cpp
	TestInterfaces.cpp			TestInterfaces.cpp
	TestMatchers.cpp			TestMatchers.cpp
	Show All 29 Lines

mlir/test/lib/IR/TestBytecodeCallbacks.cpp

This file was added.

				//===- TestBytecodeCallbacks.cpp - Pass to test bytecode callback hooks --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "TestDialect.h"
				#include "mlir/Bytecode/BytecodeReader.h"
				#include "mlir/Bytecode/BytecodeWriter.h"
				#include "mlir/IR/BuiltinDialectBytecode.h"
				#include "mlir/IR/BuiltinOps.h"
				#include "mlir/IR/OperationSupport.h"
				#include "mlir/Parser/Parser.h"
				#include "mlir/Pass/Pass.h"
				#include "llvm/Support/CommandLine.h"
				#include "llvm/Support/MemoryBufferRef.h"
				#include "llvm/Support/raw_ostream.h"
				#include <list>

				using namespace mlir;
				using namespace llvm;

				namespace {
				class TestDialectVersionParser : public cl::parser<test::TestDialectVersion> {
				public:
				TestDialectVersionParser(cl::Option &O)
				: cl::parser<test::TestDialectVersion>(O) {}

				bool parse(cl::Option &O, StringRef /argName/, StringRef arg,
				test::TestDialectVersion &v) {
				long long major, minor;
				if (getAsSignedInteger(arg.split(".").first, 10, major))
				return O.error("Invalid argument '" + arg);
				if (getAsSignedInteger(arg.split(".").second, 10, minor))
				return O.error("Invalid argument '" + arg);
				v = test::TestDialectVersion(major, minor);
				// Returns true on error.
				return false;
				}
				static void print(raw_ostream &os, const test::TestDialectVersion &v) {
				os << v.major << "." << v.minor;
				};
				};

				/// This is a test pass which uses callbacks to encode attributes and types in a
				/// custom fashion.
				struct TestBytecodeCallbackPass
				: public PassWrapper<TestBytecodeCallbackPass, OperationPass<ModuleOp>> {
				MLIR_DEFINE_EXPLICIT_INTERNAL_INLINE_TYPE_ID(TestBytecodeCallbackPass)

				mehdi_aminiUnsubmitted Done Reply Inline Actions Can you write `Type entryValue` and drop the `if constexpr` in the body? mehdi_amini: Can you write `Type entryValue` and drop the `if constexpr` in the body?
				mfrancioAuthorUnsubmitted Done Reply Inline Actions No, the callback needs to compile for both Type entryValue and Attribute entryValue. We could do this with two separate callbacks if you feel it's a bit cumbersome to force the use of auto in the callback signature. mfrancio: No, the callback needs to compile for both Type entryValue and Attribute entryValue. We could…
				mehdi_aminiUnsubmitted Done Reply Inline Actions What about taking a union of type/attr instead of templating this? Right now this adds some cognitive complexity that does not seems necessarily justified to me. Alternatively we could indeed split it in two callback registration, that can be fine as well. mehdi_amini: What about taking a union of type/attr instead of templating this? Right now this adds some…
				mfrancioAuthorUnsubmitted Done Reply Inline Actions Switched to two callbacks - it is a bit more code, but the function signature is cleaner and more explicit. mfrancio: Switched to two callbacks - it is a bit more code, but the function signature is cleaner and…
				StringRef getArgument() const final { return "test-bytecode-callback"; }
				StringRef getDescription() const final {
				return "Test encoding of a dialect type/attributes with a custom callback";
				}
				void getDependentDialects(DialectRegistry &registry) const override {
				registry.insert<test::TestDialect>();
				}
				TestBytecodeCallbackPass() = default;
				TestBytecodeCallbackPass(const TestBytecodeCallbackPass &) {}

				void runOnOperation() override {
				switch (testKind) {
				case (0):
				return runTest0(getOperation());
				mehdi_aminiUnsubmitted Done Reply Inline Actions I'd be interested to see an example where you actually take a `!test.i32` as input and write it as a builtin IntegerType (and show that we can parse it as such), and vice-versa. mehdi_amini: I'd be interested to see an example where you actually take a `!test.i32` as input and write it…
				mfrancioAuthorUnsubmitted Done Reply Inline Actions Test added. Note that it is not possible to parse !test.i32 as a native builtin integer type (hence, without adding a specific callback for it, or without a custom parser that falls back to the builtin integer type parser, which I did not implement for the sake of this test) because the the "owner" of such encoding is still the test dialect. mfrancio: Test added. Note that it is not possible to parse !test.i32 as a native builtin integer type…
				mehdi_aminiUnsubmitted Done Reply Inline Actions I understand we need a callback, but can't you implement the callback in this file to show we can parse !test.i32 as a native builtin integer type? mehdi_amini: I understand we need a callback, but can't you implement the callback in this file to show we…
				mfrancioAuthorUnsubmitted Done Reply Inline Actions I did export the integer type parser code within the callback - but effectively was not very clear. Right now I exported the bytecode dialect interface and used this explicitly to write/read, which helps showcasing the feature. mfrancio: I did export the integer type parser code within the callback - but effectively was not very…
				case (1):
				return runTest1(getOperation());
				case (2):
				return runTest2(getOperation());
				case (3):
				return runTest3(getOperation());
				case (4):
				return runTest4(getOperation());
				case (5):
				return runTest5(getOperation());
				default:
				llvm_unreachable("unhandled test kind for TestBytecodeCallbacks pass");
				}
				}

				mlir::Pass::Option<test::TestDialectVersion, TestDialectVersionParser>
				targetVersion{*this, "test-dialect-version",
				llvm::cl::desc(
				"Specifies the test dialect version to emit and parse"),
				cl::init(test::TestDialectVersion())};

				mlir::Pass::Option<int> testKind{
				*this, "callback-test",
				llvm::cl::desc("Specifies the test kind to execute"), cl::init(0)};

				mfrancioAuthorUnsubmitted Done Reply Inline Actions @mehdi_amini this gives an idea of what we would do on parsing - we would get the dialect version to parse from the version map. Right now though there is no system to specify a version on writing, and this is the closest I could go :). We could post-fix this once the proper API exists if you are ok in leaving the TODO, or I can propose an implementation and finalize the work. mfrancio: @mehdi_amini this gives an idea of what we would do on parsing - we would get the dialect…
				mehdi_aminiUnsubmitted Done Reply Inline Actions LG mehdi_amini: LG
				private:
				void doRoundtripWithConfigs(Operation *op,
				const BytecodeWriterConfig &writeConfig,
				const ParserConfig &parseConfig) {
				std::string bytecode;
				llvm::raw_string_ostream os(bytecode);
				if (failed(writeBytecodeToFile(op, os, writeConfig))) {
				op->emitError() << "failed to write bytecode\n";
				signalPassFailure();
				return;
				}
				auto newModuleOp = parseSourceString(StringRef(bytecode), parseConfig);
				if (!newModuleOp.get()) {
				op->emitError() << "failed to read bytecode\n";
				signalPassFailure();
				return;
				}
				// Print the module to the output stream, so that we can filecheck the
				// result.
				newModuleOp->print(llvm::outs());
				return;
				}

				// Test0: let's assume that versions older than 2.0 were relying on a special
				// integer attribute of the builtin dialect that is now deprecated. Assume
				// that its encoding was made by two varInts, the first was the ID (999) and
				// the second contained width and signedness info. We can emit it using a
				// callback emitting a custom encoding, and parse it back with a custom parser
				// reading the same encoding. Note that the ID 999 does not correspond to a
				// valid integer type in the current encodings of builtin types.
				void runTest0(Operation *op) {
				auto newCtx = std::make_shared<MLIRContext>();
				test::TestDialectVersion targetEmissionVersion = targetVersion;
				BytecodeWriterConfig writeConfig;
				writeConfig.attachTypeCallback(
				[&](Type entryValue, std::optional<StringRef> &name,
				DialectBytecodeWriter &writer) -> LogicalResult {
				// Do not override anything if version less than 2.0.
				if (targetEmissionVersion.major >= 2)
				return failure();

				// For version less than 2.0, override the encoding of IntegerType.
				if (auto type = llvm::dyn_cast<IntegerType>(entryValue)) {
				llvm::outs() << "Overriding IntegerType encoding...\n";
				name = StringLiteral("funky");
				writer.writeVarInt(/* IntegerType */ 999);
				writer.writeVarInt(type.getWidth() << 2 \| type.getSignedness());
				return success();
				}
				return failure();
				});
				newCtx->appendDialectRegistry(op->getContext()->getDialectRegistry());
				newCtx->allowUnregisteredDialects();
				ParserConfig parseConfig(newCtx.get(), /verifyAfterParse=/true);
				parseConfig.attachTypeBytecodeCallback([&](DialectBytecodeReader &reader,
				StringRef dialectName,
				Type &entry) -> LogicalResult {
				// Get test dialect version from the version map.
				auto versionOr = reader.getDialectVersion("test");
				assert(
				succeeded(versionOr) &&
				"expected reader to be able to access the version for test dialect");
				const auto *version =
				reinterpret_cast<const test::TestDialectVersion >(versionOr);

				// TODO: once back-deployment is formally supported,
				// `targetEmissionVersion` will be encoded in the bytecode file, and
				// exposed through the versionMap. Right now though this is not yet
				// supported. For the purpose of the test, just use
				// `targetEmissionVersion`.
				(void)version;
				if (targetEmissionVersion.major >= 2)
				return success();

				if (dialectName != StringLiteral("funky"))
				return success();

				uint64_t encoding;
				if (failed(reader.readVarInt(encoding)) \|\| encoding != 999)
				return success();
				llvm::outs() << "Overriding parsing of IntegerType encoding...\n";
				uint64_t _widthAndSignedness, width;
				IntegerType::SignednessSemantics signedness;
				if (succeeded(reader.readVarInt(_widthAndSignedness)) &&
				((width = _widthAndSignedness >> 2), true) &&
				((signedness = static_cast<IntegerType::SignednessSemantics>(
				_widthAndSignedness & 0x3)),
				true))
				entry = IntegerType::get(reader.getContext(), width, signedness);
				// Return nullopt to fall through the rest of the parsing code path.
				return success();
				});
				doRoundtripWithConfigs(op, writeConfig, parseConfig);
				return;
				}

				// Test1: When writing bytecode, we override the encoding of TestI32Type with
				// the encoding of builtin IntegerType. At parsing, we use such encoding to
				// read the type and assemble the builtin IntegerType.
				void runTest1(Operation *op) {
				BytecodeWriterConfig writeConfig;
				writeConfig.attachTypeCallback(
				[&](Type entryValue, std::optional<StringRef> &name,
				DialectBytecodeWriter &writer) -> LogicalResult {
				// Emit TestIntegerType using the builtin dialect encoding.
				if (llvm::isa<test::TestI32Type>(entryValue)) {
				llvm::outs() << "Overriding TestI32Type encoding...\n";
				auto builtinI32Type =
				IntegerType::get(op->getContext(), 32,
				IntegerType::SignednessSemantics::Signless);
				name = StringLiteral("builtin");
				if (succeeded(builtin::writeType(builtinI32Type, writer)))
				return success();
				}
				return failure();
				});
				// We natively parse the attribute as a builtin, so no callback needed.
				ParserConfig parseConfig(op->getContext(), /verifyAfterParse=/true);
				doRoundtripWithConfigs(op, writeConfig, parseConfig);
				return;
				}

				// Test2: When writing bytecode, we write standard builtin IntegerTypes. At
				// parsing, we use the encoding of IntegerType to intercept all i32. Then,
				// instead of creating i32s, we assemble TestI32Type and return it.
				void runTest2(Operation *op) {
				BytecodeWriterConfig writeConfig;
				ParserConfig parseConfig(op->getContext(), /verifyAfterParse=/true);
				parseConfig.attachTypeBytecodeCallback([&](DialectBytecodeReader &reader,
				StringRef dialectName,
				Type &entry) -> LogicalResult {
				if (dialectName != StringLiteral("builtin"))
				return success();
				Type builtinAttr = builtin::readType(reader);
				if (auto integerType = llvm::dyn_cast_or_null<IntegerType>(builtinAttr)) {
				if (integerType.getWidth() == 32 && integerType.isSignless()) {
				llvm::outs() << "Overriding parsing of TestI32Type encoding...\n";
				entry = test::TestI32Type::get(reader.getContext());
				}
				}
				return success();
				});
				doRoundtripWithConfigs(op, writeConfig, parseConfig);
				return;
				}

				// Test3: When writing bytecode, we override the encoding of
				// TestAttrParamsAttr with the encoding of builtin DenseIntElementsAttr. At
				// parsing, we use such encoding to read the type and assemble the builtin
				// DenseIntElementsAttr.
				void runTest3(Operation *op) {
				auto i32Type = IntegerType::get(op->getContext(), 32,
				IntegerType::SignednessSemantics::Signless);
				BytecodeWriterConfig writeConfig;
				writeConfig.attachAttributeCallback(
				[&](Attribute entryValue, std::optional<StringRef> &name,
				DialectBytecodeWriter &writer) -> LogicalResult {
				// Emit TestIntegerType using the builtin dialect encoding.
				if (auto testParamAttrs =
				llvm::dyn_cast<test::TestAttrParamsAttr>(entryValue)) {
				llvm::outs() << "Overriding TestAttrParamsAttr encoding...\n";
				name = StringLiteral("builtin");
				auto denseAttr = DenseIntElementsAttr::get(
				RankedTensorType::get({2}, i32Type),
				{testParamAttrs.getV0(), testParamAttrs.getV1()});
				if (succeeded(builtin::writeAttribute(denseAttr, writer)))
				return success();
				}
				return failure();
				});
				// We natively parse the attribute as a builtin, so no callback needed.
				ParserConfig parseConfig(op->getContext(), /verifyAfterParse=/false);
				doRoundtripWithConfigs(op, writeConfig, parseConfig);
				return;
				}

				// Test4: When writing bytecode, we write standard builtin
				// DenseIntElementsAttr. At parsing, we use the encoding of
				// DenseIntElementsAttr to intercept all ElementsAttr that have shaped type of
				// <2xi32>. Instead of assembling a DenseIntElementsAttr, we assemble
				// TestAttrParamsAttr and return it.
				void runTest4(Operation *op) {
				auto i32Type = IntegerType::get(op->getContext(), 32,
				IntegerType::SignednessSemantics::Signless);
				BytecodeWriterConfig writeConfig;
				ParserConfig parseConfig(op->getContext(), /verifyAfterParse=/false);
				parseConfig.attachAttributeBytecodeCallback(
				[&](DialectBytecodeReader &reader, StringRef dialectName,
				Attribute &entry) -> LogicalResult {
				// Override only the case where the return type of the builtin reader
				// is an i32 and fall through on all the other cases, since we want to
				// still use TestDialect normal codepath to parse the other types.
				Attribute builtinAttr = builtin::readAttribute(reader);
				if (auto denseAttr =
				llvm::dyn_cast_or_null<DenseIntElementsAttr>(builtinAttr)) {
				if (denseAttr.getType().getShape() == ArrayRef<int64_t>(2) &&
				denseAttr.getElementType() == i32Type) {
				llvm::outs()
				<< "Overriding parsing of TestAttrParamsAttr encoding...\n";
				int v0 = denseAttr.getValues<IntegerAttr>()[0].getInt();
				int v1 = denseAttr.getValues<IntegerAttr>()[1].getInt();
				entry =
				test::TestAttrParamsAttr::get(reader.getContext(), v0, v1);
				}
				}
				return success();
				});
				doRoundtripWithConfigs(op, writeConfig, parseConfig);
				return;
				}

				// Test5: When writing bytecode, we want TestDialect to use nothing else than
				// the builtin types and attributes and take full control of the encoding,
				// returning failure if any type or attribute is not part of builtin.
				void runTest5(Operation *op) {
				BytecodeWriterConfig writeConfig;
				writeConfig.attachAttributeCallback(
				[&](Attribute attr, std::optional<StringRef> &name,
				DialectBytecodeWriter &writer) -> LogicalResult {
				return builtin::writeAttribute(attr, writer);
				});
				writeConfig.attachTypeCallback(
				[&](Type type, std::optional<StringRef> &name,
				DialectBytecodeWriter &writer) -> LogicalResult {
				return builtin::writeType(type, writer);
				});
				ParserConfig parseConfig(op->getContext(), /verifyAfterParse=/false);
				parseConfig.attachAttributeBytecodeCallback(
				[&](DialectBytecodeReader &reader, StringRef dialectName,
				Attribute &entry) -> LogicalResult {
				Attribute builtinAttr = builtin::readAttribute(reader);
				if (!builtinAttr)
				return failure();
				entry = builtinAttr;
				return success();
				});
				parseConfig.attachTypeBytecodeCallback([&](DialectBytecodeReader &reader,
				StringRef dialectName,
				Type &entry) -> LogicalResult {
				Type builtinType = builtin::readType(reader);
				if (!builtinType) {
				return failure();
				}
				entry = builtinType;
				return success();
				});
				doRoundtripWithConfigs(op, writeConfig, parseConfig);
				return;
				}
				};
				} // namespace

				namespace mlir {
				void registerTestBytecodeCallbackPasses() {
				PassRegistration<TestBytecodeCallbackPass>();
				}
				} // namespace mlir

mlir/tools/mlir-opt/mlir-opt.cpp

Show All 37 Lines
void registerLoopLikeInterfaceTestPasses();		void registerLoopLikeInterfaceTestPasses();
void registerShapeFunctionTestPasses();		void registerShapeFunctionTestPasses();
void registerSideEffectTestPasses();		void registerSideEffectTestPasses();
void registerSliceAnalysisTestPass();		void registerSliceAnalysisTestPass();
void registerSymbolTestPasses();		void registerSymbolTestPasses();
void registerRegionTestPasses();		void registerRegionTestPasses();
void registerTestAffineDataCopyPass();		void registerTestAffineDataCopyPass();
void registerTestAffineReifyValueBoundsPass();		void registerTestAffineReifyValueBoundsPass();
		void registerTestBytecodeCallbackPasses();
void registerTestDecomposeAffineOpPass();		void registerTestDecomposeAffineOpPass();
void registerTestAffineLoopUnswitchingPass();		void registerTestAffineLoopUnswitchingPass();
void registerTestAllReduceLoweringPass();		void registerTestAllReduceLoweringPass();
void registerTestFunc();		void registerTestFunc();
void registerTestGpuMemoryPromotionPass();		void registerTestGpuMemoryPromotionPass();
void registerTestLoopPermutationPass();		void registerTestLoopPermutationPass();
void registerTestMatchers();		void registerTestMatchers();
void registerTestOperationEqualPass();		void registerTestOperationEqualPass();
▲ Show 20 Lines • Show All 105 Lines • ▼ Show 20 Lines	void registerTestPasses() {
registerSliceAnalysisTestPass();		registerSliceAnalysisTestPass();
registerSymbolTestPasses();		registerSymbolTestPasses();
registerRegionTestPasses();		registerRegionTestPasses();
registerTestAffineDataCopyPass();		registerTestAffineDataCopyPass();
registerTestAffineReifyValueBoundsPass();		registerTestAffineReifyValueBoundsPass();
registerTestDecomposeAffineOpPass();		registerTestDecomposeAffineOpPass();
registerTestAffineLoopUnswitchingPass();		registerTestAffineLoopUnswitchingPass();
registerTestAllReduceLoweringPass();		registerTestAllReduceLoweringPass();
		registerTestBytecodeCallbackPasses();
registerTestFunc();		registerTestFunc();
registerTestGpuMemoryPromotionPass();		registerTestGpuMemoryPromotionPass();
registerTestLoopPermutationPass();		registerTestLoopPermutationPass();
registerTestMatchers();		registerTestMatchers();
registerTestOperationEqualPass();		registerTestOperationEqualPass();
registerTestPrintDefUsePass();		registerTestPrintDefUsePass();
registerTestPrintInvalidPass();		registerTestPrintInvalidPass();
registerTestPrintNestingPass();		registerTestPrintNestingPass();
▲ Show 20 Lines • Show All 104 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Expose callbacks for encoding of types/attributesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 542263

mlir/include/mlir/Bytecode/BytecodeImplementation.h

mlir/include/mlir/Bytecode/BytecodeWriter.h

mlir/include/mlir/IR/AsmState.h

mlir/include/mlir/IR/BuiltinDialectBytecode.h

mlir/lib/Bytecode/Reader/BytecodeReader.cpp

mlir/lib/Bytecode/Writer/BytecodeWriter.cpp

mlir/lib/Bytecode/Writer/IRNumbering.cpp

mlir/lib/IR/BuiltinDialect.cpp

mlir/lib/IR/BuiltinDialectBytecode.h

mlir/lib/IR/BuiltinDialectBytecode.cpp

mlir/test/Bytecode/bytecode_callback.mlir

mlir/test/Bytecode/bytecode_callback_full_override.mlir

mlir/test/Bytecode/bytecode_callback_with_custom_attribute.mlir

mlir/test/Bytecode/bytecode_callback_with_custom_type.mlir

mlir/test/Bytecode/invalid/invalid_attr_type_section.mlir

mlir/test/lib/Dialect/Test/TestDialect.h

mlir/test/lib/Dialect/Test/TestDialect.cpp

mlir/test/lib/Dialect/Test/TestOps.td

mlir/test/lib/Dialect/Test/TestTypeDefs.td

mlir/test/lib/IR/CMakeLists.txt

mlir/test/lib/IR/TestBytecodeCallbacks.cpp

mlir/tools/mlir-opt/mlir-opt.cpp

Expose callbacks for encoding of types/attributes
ClosedPublic