This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
mlir/
-
include/mlir/Tools/mlir-translate/
-
mlir/
-
Tools/
-
mlir-translate/
1
MlirTranslateMain.h
-
lib/Tools/mlir-translate/
-
Tools/
-
mlir-translate/
-
MlirTranslateMain.cpp

Differential D120970

[mlirTranslateMain] Add a customization callback.
ClosedPublic

Authored by lattner on Mar 3 2022, 9:11 PM.

Download Raw Diff

Details

Reviewers

rriddle
jpienaar

Commits

rGf18d6af7e972: [mlirTranslateMain] Add a customization callback.

Summary

mlir-translate and related tools currently have a fixed set
of flags that are built into Translation.cpp. This works for
simple cases, but some clients want to change the default
globally (e.g. default to allowing unregistered dialects
without a command line flag), or support dialect-independent
translations without having those translations register every
conceivable dialect they could be used with (breaking
modularity).

This approach could also be applied to mlirOptMain to reduce
the significant number of flags it has accumulated.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

lattner created this revision.Mar 3 2022, 9:11 PM

Herald added a project: Restricted Project. · View Herald TranscriptMar 3 2022, 9:11 PM

Herald added subscribers: sdasgup3, wenzhicui, wrengr and 19 others. · View Herald Transcript

lattner requested review of this revision.Mar 3 2022, 9:11 PM

Herald added a project: Restricted Project. · View Herald TranscriptMar 3 2022, 9:11 PM

Herald added subscribers: stephenneuendorffer, nicolasvasilache. · View Herald Transcript

lattner added a reviewer: rriddle.Mar 3 2022, 9:12 PM

Harbormaster completed remote builds in B152529: Diff 412924.Mar 3 2022, 9:22 PM

This approach could also be applied to mlirOptMain to reduce the significant number of flags it has accumulated.

There is value in explicit options: this provides a restricted interface instead of an open one with unbounded access to the context. We leveraged the fact that we have control over the context over time in mlirOptMain to change the behavior as we saw fit.

mlir/include/mlir/Translation.h
109 ↗	(On Diff #412924)

I think I'm fine with this in general, but we should also be wary about encouraging users to do the wrong thing. I would say that we should make certain context-related things parameters (e.g. the DialectRegistry) when we want to enforce/encourage them to do what is expected.

mlir/include/mlir/Translation.h
108 ↗	(On Diff #412924)	I would add a DialectRegistry parameter here. We really want users to be driving adding dialects through that, the customization callback I would view for non-registry related things.

This revision is now accepted and ready to land.Mar 4 2022, 1:14 PM

Ok, I agree the dialect registration is the most critical thing that is missing.

That said, I have a translate tool that always wants to accept unregistered dialects. It is super annoying that every testcase has to specify -allow-unregistered-dialect. We have a way to do this for mlir-opt, because it takes a ton of flags.... but is it really better to feed a ton of arguments into these tools? For reference MlirOptMain has grown to this beast:

LogicalResult MlirOptMain(llvm::raw_ostream &outputStream,
                          std::unique_ptr<llvm::MemoryBuffer> buffer,
                          const PassPipelineCLParser &passPipeline,
                          DialectRegistry &registry, bool splitInputFile,
                          bool verifyDiagnostics, bool verifyPasses,
                          bool allowUnregisteredDialects,
                          bool preloadDialectsInContext = false);

with several overloads, because this is unmanagable for most clients.

Coming back to this patch, if we change the callback to take a registry instead of an MLIRContext, then we'll have to go down this same path for mlirTranslateMain: adding a bool allowUnregisteredDialects parameter... and I assume other clients will want others over time. Is this actually a better design?

In my opinion, the goal of constraining the interface is not actually as important as having a clean interface and allowing people using MLIR to get stuff done. We cannot prevent all "abuse" after all. I don't think we can even define what "abuse" means.

WDYT?

In D120970#3362660, @lattner wrote:
Ok, I agree the dialect registration is the most critical thing that is missing.

That said, I have a translate tool that always wants to accept unregistered dialects. It is super annoying that every testcase has to specify -allow-unregistered-dialect. We have a way to do this for mlir-opt, because it takes a ton of flags.... but is it really better to feed a ton of arguments into these tools? For reference MlirOptMain has grown to this beast:
LogicalResult MlirOptMain(llvm::raw_ostream &outputStream,
                          std::unique_ptr<llvm::MemoryBuffer> buffer,
                          const PassPipelineCLParser &passPipeline,
                          DialectRegistry &registry, bool splitInputFile,
                          bool verifyDiagnostics, bool verifyPasses,
                          bool allowUnregisteredDialects,
                          bool preloadDialectsInContext = false);
with several overloads, because this is unmanagable for most clients.

Not really sure you would do differently here (other than a config struct?), most of those are specific to mlir-opt; i.e. not configuring the context (as a side note preloadDialectsInContext is also something we want to get rid of).

Coming back to this patch, if we change the callback to take a registry instead of an MLIRContext, then we'll have to go down this same path for mlirTranslateMain: adding a bool allowUnregisteredDialects parameter... and I assume other clients will want others over time. Is this actually a better design?

In my opinion, the goal of constraining the interface is not actually as important as having a clean interface and allowing people using MLIR to get stuff done. We cannot prevent all "abuse" after all. I don't think we can even define what "abuse" means.

WDYT?

I never suggested that the callback would take a DialectRegistry, MlirTranslateMain should always take a registry. The callback could still take an MLIRContext, but clients should be populating the DialectRegistry that gets passed to MlirTranslateMain, not manually adding dialects to the MLIRContext.

I agree with the sentiments here (enable getting things done + pushing folks to using supported APIs), and this seems reasonable. These for me are meant for creating exploration/testing tools, so shouldn't be as locked down as a dedicated tool (and dedicated end-to-end tools should probably not be using this either), so SGTM.

mlir/lib/Translation/Translation.cpp
161 ↗	(On Diff #412924)	This is one that hit me recently, having the init in here meant I couldn't reuse another helper (and is different than what we do for mlir-opt's main helper). Not relevant to this change beyond reminding me of it.

In D120970#3362660, @lattner wrote:

That said, I have a translate tool that always wants to accept unregistered dialects. It is super annoying that every testcase has to specify -allow-unregistered-dialect.

If this is the only use case for the callback, then can we just add a flag (or a config struct if you prefer)? Since this API hasn't changed "forever", in absence of other motivation it seems extreme to me to expose the entire context where we need a bool.

But more importantly I'm not convinced that this is the right API to change actually: it is unlikely that "a translate tool that always wants to accept unregistered dialects" and instead more likely that a specific translation wants this.
This is also why the DialectRegistry isn't provided through this API, but is part of the individual translation registrations.
For example see here: https://github.com/llvm/llvm-project/blob/main/mlir/lib/Target/SPIRV/TranslateRegistration.cpp#L114-L116
how the Spirv translation registers the spirv dialect.
I would expect that the property of "allowing allow-unregistered-dialect" would be at the same level, and so conveys from this registration instead.

Below I'm just providing my counter point to your overall argument on a restricted vs open API, less relevant for this current patch:

We have a way to do this for mlir-opt, because it takes a ton of flags.... but is it really better to feed a ton of arguments into these tools?  For reference `MlirOptMain` has grown to this beast:

LogicalResult MlirOptMain(llvm::raw_ostream &outputStream,
                          std::unique_ptr<llvm::MemoryBuffer> buffer,
                          const PassPipelineCLParser &passPipeline,
                          DialectRegistry &registry, bool splitInputFile,
                          bool verifyDiagnostics, bool verifyPasses,
                          bool allowUnregisteredDialects,
                          bool preloadDialectsInContext = false);

Exactly two flags here are about the MLIRContext (the last two), and the last one is even deprecated! It's hardly "a ton of flags" IMO.

The other flags are unfortunate but also quite specific to the behavior of mlir-opt, I would be fine with a config struct here as well (on the model of OpPrintingFlags) to collect splitInputFile, verifyDiagnostics, verifyPasses, ...

In my opinion, the goal of constraining the interface is not actually as important as having a clean interface and allowing people using MLIR to get stuff done.

I disagree here: constraining the interface is important to me and has proven to be very valuable exactly in the case of mlir-opt in the past!
More importantly: I don't see how replacing the 2 flags (one deprecated) with a callback would make the mlirOptMain interface "cleaner".
Finally: it has to be elaborated about how the current mlirOptMain interface does not allow "to get stuff done".

We cannot prevent all "abuse" after all.

We can't prevent it all, but talking in the extreme isn't an argument IMO: we can always make API "easy to use, hard to misuse" without being absolute.
APIs conveys important information to users about the intent of what we support. I argue that this is even more critical in a project like LLVM with unstable APIs: because we want to be able to evolve them, having "closed" or "restricted" API is much more resilient to changes: it both support our job as maintainer (easier to think about invariants when refactoring a "restricted" API, in particular when you don't see all the clients) and as users (less likely to be broken or be told "you're relying on an unsupported implementation detail of the API").

I don't think we can even define what "abuse" means.

We design the system with a specific mental model, in general the way I'm using "abuse" is when people work around the invariants we're using to keep the system consistent in the design, but that we can't easily enforce (through assertions or others).

Update for review comments and merge to mainline.

This adopts function_ref, and adds a dialect registry to make
the interface more explicit.

This still LGTM. Making it easier for tool authors that have weird configs, while also encouraging the right thing (re using the registry instead of adding things directly to the context), seems fine. (and is also aligned with mlir-opt).

mlir/include/mlir/Tools/mlir-translate/MlirTranslateMain.h
39	We can likely drop the llvm:: here.

Harbormaster completed remote builds in B153947: Diff 414874.Mar 12 2022, 1:08 PM

remove unneeded llvm:: qualifier.

Thank you for the discussion and review!

This revision was landed with ongoing or failed builds.Mar 12 2022, 1:18 PM

Closed by commit rGf18d6af7e972: [mlirTranslateMain] Add a customization callback. (authored by lattner). · Explain Why

This revision was automatically updated to reflect the committed changes.

lattner added a commit: rGf18d6af7e972: [mlirTranslateMain] Add a customization callback..

Harbormaster completed remote builds in B153949: Diff 414876.Mar 12 2022, 1:35 PM

In D120970#3363161, @mehdi_amini wrote:

In D120970#3362660, @lattner wrote:

That said, I have a translate tool that always wants to accept unregistered dialects. It is super annoying that every testcase has to specify -allow-unregistered-dialect.

If this is the only use case for the callback, then can we just add a flag (or a config struct if you prefer)? Since this API hasn't changed "forever", in absence of other motivation it seems extreme to me to expose the entire context where we need a bool.

But more importantly I'm not convinced that this is the right API to change actually: it is unlikely that "a translate tool that always wants to accept unregistered dialects" and instead more likely that a specific translation wants this.
This is also why the DialectRegistry isn't provided through this API, but is part of the individual translation registrations.
For example see here: https://github.com/llvm/llvm-project/blob/main/mlir/lib/Target/SPIRV/TranslateRegistration.cpp#L114-L116
how the Spirv translation registers the spirv dialect.
I would expect that the property of "allowing allow-unregistered-dialect" would be at the same level, and so conveys from this registration instead.

Surprised to see this landing: you haven't addressed my comment as far as I can tell?

Surprised to see this landing: you haven't addressed my comment as far as I can tell?

I'm sorry, I didn't mean to land while ignoring your comment! I thought I addressed this on the discourse thread but I can see that it isn't completely clear. I'll follow-up over there, I didn't mean to steam roll you Mehdi!

lattner added a reverting change: D121668: Revert "[mlirTranslateMain] Add a customization callback.".Mar 14 2022, 10:03 PM

lattner added a reverting change: rG2ef95efb414e: Revert "[mlirTranslateMain] Add a customization callback.".Mar 14 2022, 10:05 PM

Revision Contents

Path

Size

mlir/

include/

mlir/

Tools/

mlir-translate/

MlirTranslateMain.h

16 lines

lib/

Tools/

mlir-translate/

MlirTranslateMain.cpp

24 lines

Diff 414877

mlir/include/mlir/Tools/mlir-translate/MlirTranslateMain.h

	//===- MlirTranslateMain.h - MLIR Translation Driver main -------- C++ --===//			//===- MlirTranslateMain.h - MLIR Translation Driver main -------- C++ --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// Main entry function for mlir-translate for when built as standalone binary.			// Main entry function for mlir-translate for when built as standalone binary.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef MLIR_TOOLS_MLIRTRANSLATE_MLIRTRANSLATEMAIN_H			#ifndef MLIR_TOOLS_MLIRTRANSLATE_MLIRTRANSLATEMAIN_H
	#define MLIR_TOOLS_MLIRTRANSLATE_MLIRTRANSLATEMAIN_H			#define MLIR_TOOLS_MLIRTRANSLATE_MLIRTRANSLATEMAIN_H

				#include "mlir/IR/Dialect.h"
	#include "mlir/Support/LogicalResult.h"			#include "mlir/Support/LogicalResult.h"
				#include "llvm/ADT/STLFunctionalExtras.h"
	#include "llvm/ADT/StringRef.h"			#include "llvm/ADT/StringRef.h"

	namespace mlir {			namespace mlir {

	/// Translate to/from an MLIR module from/to an external representation (e.g.			/// Translate to/from an MLIR module from/to an external representation (e.g.
	/// LLVM IR, SPIRV binary, ...). This is the entry point for the implementation			/// LLVM IR, SPIRV binary, ...). This is the entry point for the implementation
	/// of tools like `mlir-translate`. The translation to perform is parsed from			/// of tools like `mlir-translate`. The translation to perform is parsed from
	/// the command line. The `toolName` argument is used for the header displayed			/// the command line. The `toolName` argument is used for the header displayed
	/// by `--help`.			/// by `--help`.
	LogicalResult mlirTranslateMain(int argc, char **argv, StringRef toolName);			///
				/// Dialect translation typically registers the dialects produced or returned
				/// by the translation itself, but some translation testing tools may want
				/// additional dialects registered so the .mlir parser can read them. In this
				/// case, `extraDialects` may be specified with additional dialects to use.
				///
				/// The client may specify a "customization" function if they'd like, which
				/// is invoked when an MLIRContext is set up, allowing custom settings.
				LogicalResult
				mlirTranslateMain(int argc, char **argv, StringRef toolName,
				const DialectRegistry &extraDialects = DialectRegistry(),
				function_ref<void(MLIRContext &)> customization = {});
				rriddleUnsubmitted Not Done Reply Inline Actions We can likely drop the llvm:: here. rriddle: We can likely drop the llvm:: here.
	} // namespace mlir			} // namespace mlir

	#endif // MLIR_TOOLS_MLIRTRANSLATE_MLIRTRANSLATEMAIN_H			#endif // MLIR_TOOLS_MLIRTRANSLATE_MLIRTRANSLATEMAIN_H

mlir/lib/Tools/mlir-translate/MlirTranslateMain.cpp

Show All 16 Lines
#include "mlir/Tools/mlir-translate/Translation.h"		#include "mlir/Tools/mlir-translate/Translation.h"
#include "llvm/Support/InitLLVM.h"		#include "llvm/Support/InitLLVM.h"
#include "llvm/Support/SourceMgr.h"		#include "llvm/Support/SourceMgr.h"
#include "llvm/Support/ToolOutputFile.h"		#include "llvm/Support/ToolOutputFile.h"

using namespace mlir;		using namespace mlir;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Translation Parser		// mlir-translate tool driver
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

LogicalResult mlir::mlirTranslateMain(int argc, char **argv,		LogicalResult
llvm::StringRef toolName) {		mlir::mlirTranslateMain(int argc, char **argv, llvm::StringRef toolName,
		const DialectRegistry &extraDialects,
		llvm::function_ref<void(MLIRContext &)> customization) {

static llvm::cl::opt<std::string> inputFilename(		static llvm::cl::opt<std::string> inputFilename(
llvm::cl::Positional, llvm::cl::desc("<input file>"),		llvm::cl::Positional, llvm::cl::desc("<input file>"),
llvm::cl::init("-"));		llvm::cl::init("-"));

static llvm::cl::opt<std::string> outputFilename(		static llvm::cl::opt<std::string> outputFilename(
"o", llvm::cl::desc("Output filename"), llvm::cl::value_desc("filename"),		"o", llvm::cl::desc("Output filename"), llvm::cl::value_desc("filename"),
llvm::cl::init("-"));		llvm::cl::init("-"));
Show All 37 Lines	if (!output) {
llvm::errs() << errorMessage << "\n";		llvm::errs() << errorMessage << "\n";
return failure();		return failure();
}		}

// Processes the memory buffer with a new MLIRContext.		// Processes the memory buffer with a new MLIRContext.
auto processBuffer = [&](std::unique_ptr<llvm::MemoryBuffer> ownedBuffer,		auto processBuffer = [&](std::unique_ptr<llvm::MemoryBuffer> ownedBuffer,
raw_ostream &os) {		raw_ostream &os) {
MLIRContext context;		MLIRContext context;

		// If the client wanted to register additional dialects, go ahead and add
		// them to our context.
		context.appendDialectRegistry(extraDialects);

		// If a customization callback was provided, apply it to the MLIRContext.
		// This could add dialects to the registry or change context defaults.
		if (customization)
		customization(context);

		// If command line flags were used to customize the context, apply their
		// settings.
		if (allowUnregisteredDialects.getNumOccurrences())
context.allowUnregisteredDialects(allowUnregisteredDialects);		context.allowUnregisteredDialects(allowUnregisteredDialects);
context.printOpOnDiagnostic(!verifyDiagnostics);		context.printOpOnDiagnostic(!verifyDiagnostics);

llvm::SourceMgr sourceMgr;		llvm::SourceMgr sourceMgr;
sourceMgr.AddNewSourceBuffer(std::move(ownedBuffer), SMLoc());		sourceMgr.AddNewSourceBuffer(std::move(ownedBuffer), SMLoc());

if (!verifyDiagnostics) {		if (!verifyDiagnostics) {
SourceMgrDiagnosticHandler sourceMgrHandler(sourceMgr, &context);		SourceMgrDiagnosticHandler sourceMgrHandler(sourceMgr, &context);
return (*translationRequested)(sourceMgr, os, &context);		return (*translationRequested)(sourceMgr, os, &context);
}		}

Show All 19 Lines