
Details

Reviewers

ftynse
mehdi_amini
rriddle
nicolasvasilache

Commits

rG4433e52e69b1: [mlir] Fix circular dialect initialization

Summary

This change fixes a bug where a dialect is initialized multiple times. This triggers an assertion when the ops of the dialect are registered (error: operation named ... is already registered).

This bug can be triggered as follows:

Dialect A depends on dialect B (as per ADialect.td).

Somewhere there is an extension of dialect B that depends on dialect A (e.g., it defines external models that create ops from dialect A). E.g.:

registry.addExtension(+[](MLIRContext *ctx, BDialect *dialect) {
  BDialectOp::attachInterface ...
  ctx->loadDialect<ADialect>();
});

When dialect A is loaded, its initialize function is called twice:

ADialect::ADialect()
   |     |
   |     v
   |   ADialect::initialize()
   v
getOrLoadDialect<BDialect>()
   |
   v
(load extension of BDialect)
   |
   v
ctx->loadDialect<ADialect>()  // user wrote this in the extension
   |
   v
getOrLoadDialect<ADialect>()  // the dialect is not "fully" loaded yet
   |
   v
ADialect::ADialect()
   |
   v
ADialect::initialize()
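The call chain above can be reproduced outside MLIR with a small sketch. Everything in it is hypothetical and only mimics the pre-fix control flow: `ToyContext`, `ToyDialect`, `getOrLoad`, and the string-keyed extension table are stand-ins, not MLIR APIs, and the counter plays the role of `ADialect::initialize()`.

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <memory>
#include <string>
#include <utility>

// Hypothetical stand-ins; none of these types are MLIR APIs.
struct ToyDialect {
  virtual ~ToyDialect() = default;
};

struct ToyContext {
  using Ctor = std::function<std::unique_ptr<ToyDialect>()>;

  std::map<std::string, std::unique_ptr<ToyDialect>> loaded;
  std::map<std::string, std::function<void(ToyContext &)>> extensions;
  int initializeCalls = 0; // stands in for calls to ADialect::initialize()

  // Pre-fix behavior: the entry is inserted only after ctor() has returned,
  // so a nested request for the same name does not find it.
  ToyDialect *getOrLoad(const std::string &name, const Ctor &ctor) {
    auto it = loaded.find(name);
    if (it != loaded.end())
      return it->second.get();
    std::unique_ptr<ToyDialect> created = ctor(); // may re-enter getOrLoad
    // If a nested call already inserted `name`, the new object is dropped.
    auto result = loaded.emplace(name, std::move(created));
    assert(result.first->second && "dialect ctor failed");
    // Apply the extension registered for this dialect, if any.
    auto ext = extensions.find(name);
    if (ext != extensions.end())
      ext->second(*this);
    return result.first->second.get();
  }
};

struct BDialect : ToyDialect {};

struct ADialect : ToyDialect {
  explicit ADialect(ToyContext &ctx) {
    // A depends on B, so B is loaded from A's constructor.
    ctx.getOrLoad("b", [] { return std::unique_ptr<ToyDialect>(new BDialect()); });
    ++ctx.initializeCalls;
  }
};

// Loads A with an extension of B that loads A again; returns how often the
// stand-in for initialize() ran.
int loadAWithCircularExtension() {
  ToyContext ctx;
  auto makeA = [&ctx] { return std::unique_ptr<ToyDialect>(new ADialect(ctx)); };
  ctx.extensions["b"] = [makeA](ToyContext &c) { c.getOrLoad("a", makeA); };
  ctx.getOrLoad("a", makeA);
  return ctx.initializeCalls;
}
```

In this sketch `loadAWithCircularExtension()` returns 2: the nested request constructs a second `ADialect` while the first constructor is still running, and both instances run the initialization step.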

An example of a dialect extension that depends on other dialects is Dialect/Tensor/Transforms/BufferizableOpInterfaceImpl.cpp. That particular dialect extension does not trigger this bug. (It would trigger this bug if the SCF dialect depended on the Tensor dialect.)

This change introduces a new dialect state: dialects that are currently being loaded. Like dialects that are already fully loaded (and initialized), dialects that are in the process of being loaded are not loaded a second time.
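A minimal sketch of the "currently loading" state, under the same assumptions as any toy model: `GuardedContext`, `load`, `getOrLoad`, and `isLoading` are hypothetical names chosen to resemble `loadDialect`, `getOrLoadDialect`, and `isDialectLoading`, and the set of names approximates the `loadingDialects` member from the diff. It is not the MLIR implementation.

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <memory>
#include <set>
#include <string>
#include <utility>

// Hypothetical stand-ins; none of these types are MLIR APIs.
struct ToyDialect {
  virtual ~ToyDialect() = default;
};

struct GuardedContext {
  using Ctor = std::function<std::unique_ptr<ToyDialect>()>;

  std::map<std::string, std::unique_ptr<ToyDialect>> loaded;
  std::set<std::string> loading; // dialects whose constructor is still running
  std::map<std::string, std::function<void(GuardedContext &)>> extensions;
  int initializeCalls = 0;

  bool isLoading(const std::string &name) const {
    return loading.count(name) != 0;
  }

  // Does nothing while `name` is still being constructed.
  void load(const std::string &name, const Ctor &ctor) {
    if (!isLoading(name))
      getOrLoad(name, ctor);
  }

  ToyDialect *getOrLoad(const std::string &name, const Ctor &ctor) {
    auto it = loaded.find(name);
    if (it != loaded.end())
      return it->second.get();
    loading.insert(name);
    std::unique_ptr<ToyDialect> created = ctor();
    auto result = loaded.emplace(name, std::move(created));
    assert(result.first->second && "dialect ctor failed");
    loading.erase(name);
    auto ext = extensions.find(name);
    if (ext != extensions.end())
      ext->second(*this);
    return result.first->second.get();
  }
};

struct BDialect : ToyDialect {};

struct ADialect : ToyDialect {
  explicit ADialect(GuardedContext &ctx) {
    // Dependent dialects are requested through load(), not getOrLoad().
    ctx.load("b", [] { return std::unique_ptr<ToyDialect>(new BDialect()); });
    ++ctx.initializeCalls;
  }
};

// Same circular setup as in the summary; returns the initialization count.
int loadAWithLoadingSet() {
  GuardedContext ctx;
  auto makeA = [&ctx] { return std::unique_ptr<ToyDialect>(new ADialect(ctx)); };
  ctx.extensions["b"] = [makeA](GuardedContext &c) { c.load("a", makeA); };
  ctx.load("a", makeA);
  return ctx.initializeCalls;
}
```

Here `loadAWithLoadingSet()` returns 1, because the request issued by the extension of B sees that A is still loading and returns without constructing a second instance.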

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

springerm created this revision. Oct 25 2022, 7:35 AM

Herald added a reviewer: rriddle. Oct 25 2022, 7:35 AM

Herald added a project: Restricted Project.

Herald added subscribers: zero9178, bzcheeseman, sdasgup3 and 18 others.

springerm requested review of this revision. Oct 25 2022, 7:35 AM

Herald added a reviewer: nicolasvasilache. Oct 25 2022, 7:35 AM

Herald added a project: Restricted Project.

Herald added subscribers: stephenneuendorffer, nicolasvasilache.

Note: I am not an expert on dialect loading/initialization. I can't tell whether this is the right way to fix the bug or not.

Harbormaster completed remote builds in B194177: Diff 470487. Oct 25 2022, 8:56 AM

mehdi_amini added inline comments. Oct 25 2022, 10:01 AM

mlir/tools/mlir-tblgen/DialectGen.cpp
262	I think this requires much more explanation: call out the recursive initialization through get-dependent. But more importantly, I don't actually understand how `initialize()` gets called twice. Where is the second call coming from? You mention in the description that there is a call to `initialize()` from `ctx->loadDialect<ADialect>()`, but how does this call `initialize()`? It may call another constructor, but then we'd have the constructor called twice? That does not seem right, so I'm missing something :)

springerm added inline comments. Oct 26 2022, 12:41 AM

mlir/tools/mlir-tblgen/DialectGen.cpp
262	Yes, the same constructor is getting called twice. When `ADialect` is loaded for the first time, an instance of `ADialect` is created, so the constructor is called. But the `ADialect` instance is not yet added to the list. That happens only after the constructor has finished executing:

	std::unique_ptr<Dialect> &dialect =
	    impl.loadedDialects.insert({dialectNamespace, ctor()}).first->second;
	assert(dialect && "dialect ctor failed");

	The constructor (`ADialect::ADialect`) then indirectly calls `ctx->loadDialect<ADialect>()`, which calls `getOrLoadDialect` again. But we are still in the process of constructing the other `ADialect` instance, so it has not been added to `loadedDialects` yet. Therefore we create a second instance.

	Note: `impl.loadedDialects.insert` is called twice. The second `insert` fails; i.e., it returns the already inserted `ADialect` instance, so one of the two `ADialect` objects is discarded.

springerm added inline comments. Oct 26 2022, 12:49 AM

mlir/tools/mlir-tblgen/DialectGen.cpp
262	> so one of the two ADialect objects is discarded
	This makes me wonder, are dialect objects allowed to have state? In that case, this change is wrong (as it may discard state). However, it's still not ideal that `ADialect` is created twice....
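The "second insert fails and the object is discarded" behavior can be checked in isolation. This is a sketch with `std::map` standing in for the `loadedDialects` map; `Tracked` and `Counters` are hypothetical helpers that only count constructions and destructions.

```cpp
#include <cassert>
#include <map>
#include <memory>
#include <string>
#include <utility>

// Hypothetical bookkeeping types used only for this illustration.
struct Counters {
  int constructed = 0;
  int destroyed = 0;
};

struct Tracked {
  explicit Tracked(Counters &c) : counters(c) { ++counters.constructed; }
  ~Tracked() { ++counters.destroyed; }
  Counters &counters;
};

// Returns true if the object built for the second insert was constructed
// even though the key existed, and was destroyed once the insert failed.
bool secondInsertDiscardsObject() {
  Counters counters;
  std::map<std::string, std::unique_ptr<Tracked>> loaded;

  auto first = loaded.insert(std::make_pair(
      std::string("a"), std::unique_ptr<Tracked>(new Tracked(counters))));
  assert(first.second && "first insert must succeed");

  // The argument is fully evaluated before insert() looks at the map.
  auto second = loaded.insert(std::make_pair(
      std::string("a"), std::unique_ptr<Tracked>(new Tracked(counters))));

  return !second.second && second.first == first.first &&
         counters.constructed == 2 && counters.destroyed == 1;
}
```

If the discarded object carried state, that state would be lost, which is the concern raised in the comment above.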

a better fix: do not load/create the same dialect multiple times

I uploaded a better fix that avoids creating two instances of ADialect in the first place.

springerm edited the summary of this revision. Oct 26 2022, 1:44 AM

Harbormaster completed remote builds in B194359: Diff 470742. Oct 26 2022, 2:15 AM

On a technical level, the issue is that there's no entry in the dialect map when the second call is issued. Maybe we can proactively add a null pointer to that map before calling the constructor to work around this? It will result in the second getOrLoad call returning a null pointer. Does it happen only in extensions? If so, it may be okay as long as we document that extensions must not attempt to load dialects and use them; instead, they should actually "extend" the dialect they want to use (since extensions may depend on multiple dialects).

On the conceptual level, loading dialects in extensions lets us create cycles in the dialect dependency graph that are not visible in the dialect itself. I wonder if we can trigger a similar cycle without extensions, by having two mutually dependent dialects. If so, I'd go with whatever solution addresses both problems.

mlir/lib/IR/MLIRContext.cpp
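178	Suggested edit: change "This is a list of dialect that are currently in the process of loading." to "This is a list of dialects that are currently in the process of loading."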

mehdi_amini added inline comments. Oct 26 2022, 10:35 AM

mlir/lib/IR/MLIRContext.cpp
180	I find it a bit annoying to have this separate data structure. What about Alex's suggestion of inserting a nullptr in the loadedDialects map?
mlir/tools/mlir-tblgen/DialectGen.cpp
262	> Yes, the same constructor is getting called twice.
	Right, let's avoid that :)
	> This makes me wonder, are dialect objects allowed to have state?
	Yes.

address comments

Update: Using nullptr to indicate "dialect loading" and also keeping isDialectLoading for error handling (report_fatal_error).
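A rough sketch of the nullptr-sentinel variant described in this update, not the landed code. `SentinelContext` and its members are hypothetical, `std::map` stands in for the dialect map, and a thrown `std::logic_error` stands in for `report_fatal_error`.

```cpp
#include <cassert>
#include <functional>
#include <map>
#include <memory>
#include <stdexcept>
#include <string>

// Hypothetical stand-ins; none of these types are MLIR APIs.
struct ToyDialect {
  virtual ~ToyDialect() = default;
};

struct SentinelContext {
  using Ctor = std::function<std::unique_ptr<ToyDialect>()>;

  std::map<std::string, std::unique_ptr<ToyDialect>> loaded;
  std::map<std::string, std::function<void(SentinelContext &)>> extensions;
  int initializeCalls = 0;

  // An entry that exists but is still null marks a dialect that is loading.
  bool isLoading(const std::string &name) const {
    auto it = loaded.find(name);
    return it != loaded.end() && it->second == nullptr;
  }

  void load(const std::string &name, const Ctor &ctor) {
    if (!isLoading(name))
      getOrLoad(name, ctor);
  }

  ToyDialect *getOrLoad(const std::string &name, const Ctor &ctor) {
    auto it = loaded.find(name);
    if (it != loaded.end()) {
      if (it->second == nullptr)
        throw std::logic_error("dialect '" + name + "' is still loading");
      return it->second.get();
    }
    // Reserve the slot with a null pointer before running the constructor.
    auto slot = loaded.emplace(name, std::unique_ptr<ToyDialect>()).first;
    slot->second = ctor();
    assert(slot->second && "dialect ctor failed");
    auto ext = extensions.find(name);
    if (ext != extensions.end())
      ext->second(*this);
    return slot->second.get();
  }
};

struct BDialect : ToyDialect {};

struct ADialect : ToyDialect {
  explicit ADialect(SentinelContext &ctx) {
    ctx.load("b", [] { return std::unique_ptr<ToyDialect>(new BDialect()); });
    ++ctx.initializeCalls;
  }
};

// Extension of B calls load("a"): the recursive request is skipped.
int loadAWithSentinel() {
  SentinelContext ctx;
  auto makeA = [&ctx] { return std::unique_ptr<ToyDialect>(new ADialect(ctx)); };
  ctx.extensions["b"] = [makeA](SentinelContext &c) { c.load("a", makeA); };
  ctx.load("a", makeA);
  return ctx.initializeCalls;
}

// Extension of B calls getOrLoad("a"): there is no dialect to return yet.
bool getOrLoadWhileLoadingThrows() {
  SentinelContext ctx;
  auto makeA = [&ctx] { return std::unique_ptr<ToyDialect>(new ADialect(ctx)); };
  ctx.extensions["b"] = [makeA](SentinelContext &c) { c.getOrLoad("a", makeA); };
  try {
    ctx.load("a", makeA);
  } catch (const std::logic_error &) {
    return true;
  }
  return false;
}
```

The sketch needs no separate "loading" set; the null entry carries that information. The error path covers the case where a caller asks for the dialect pointer while construction is still in progress.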

Harbormaster completed remote builds in B194567: Diff 471037. Oct 27 2022, 12:54 AM

ftynse accepted this revision. Oct 27 2022, 1:44 AM

ftynse added inline comments.

mlir/lib/IR/MLIRContext.cpp
473	Nit: I'd expand this auto.
476	I'd rather do `== nullptr` to better match with the comment.

This revision is now accepted and ready to land. Oct 27 2022, 1:44 AM

springerm marked 3 inline comments as done. Oct 27 2022, 2:24 AM

address comments

This revision was landed with ongoing or failed builds. Oct 27 2022, 2:50 AM

Closed by commit rG4433e52e69b1: [mlir] Fix circular dialect initialization (authored by springerm).

This revision was automatically updated to reflect the committed changes.

springerm added a commit: rG4433e52e69b1: [mlir] Fix circular dialect initialization.

Harbormaster completed remote builds in B194597: Diff 471084. Oct 27 2022, 4:06 AM

rriddle added inline comments. Oct 27 2022, 10:22 AM

mlir/include/mlir/IR/MLIRContext.h
101–102	This doesn't look like something that should be publicly exposed.
mlir/lib/IR/MLIRContext.cpp
446–448	Can you avoid the double lookup here? It'd be nicer to just assign `std::unique_ptr<Dialect> &dialect = impl.loadedDialects[dialectNamespace];` first, and then call the constructor. The lookup should already default initialize it to null.

springerm mentioned this in D136923: [mlir] Do not expose MLIRContext::isDialectLoading. Oct 28 2022, 1:24 AM

springerm mentioned this in rG69b9e03572d7: [mlir] Do not expose MLIRContext::isDialectLoading. Oct 31 2022, 1:08 AM
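The single-lookup suggestion from the last inline comment relies on `operator[]` value-initializing a missing entry. A minimal sketch with `std::map` and a hypothetical `ToyDialect` type (the real map is a `DenseMap`):

```cpp
#include <cassert>
#include <map>
#include <memory>
#include <string>

// Hypothetical stand-in; not an MLIR type.
struct ToyDialect {
  explicit ToyDialect(int id) : id(id) {}
  int id;
};

// Returns true if operator[] created a null entry that was then filled in
// place through the same reference, without a second lookup.
bool fillSlotWithSingleLookup() {
  std::map<std::string, std::unique_ptr<ToyDialect>> loaded;
  std::unique_ptr<ToyDialect> &slot = loaded["a"]; // creates a null entry
  bool startedNull = (slot == nullptr);
  slot.reset(new ToyDialect(7));
  assert(loaded.size() == 1 && "no second entry may be created");
  return startedNull && loaded.at("a")->id == 7;
}
```

One caveat when carrying this over: `std::map` keeps references to elements valid across insertions, whereas a hash table that rehashes on growth does not. If the constructor can insert further entries into the same table, a reference taken before the constructor call may need to be looked up again afterwards.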

Diff 470742

mlir/include/mlir/IR/MLIRContext.h

@@ 92 lines hidden @@ public:
   T *getOrLoadDialect() {
     return static_cast<T *>(
         getOrLoadDialect(T::getDialectNamespace(), TypeID::get<T>(), [this]() {
           std::unique_ptr<T> dialect(new T(this));
           return dialect;
         }));
   }

+  /// Return true if the given dialect is currently loading.
+  bool isDialectLoading(StringRef dialectNamespace);
+
   /// Load a dialect in the context.
   template <typename Dialect>
   void loadDialect() {
+    // Do not load the dialect if it is currently loading. This can happen if a
+    // dialect initializer triggers loading the same dialect recursively.
+    if (!isDialectLoading(Dialect::getDialectNamespace()))
       getOrLoadDialect<Dialect>();
   }

   /// Load a list dialects in the context.
   template <typename Dialect, typename OtherDialect, typename... MoreDialects>
   void loadDialect() {
-    getOrLoadDialect<Dialect>();
+    loadDialect<Dialect>();
     loadDialect<OtherDialect, MoreDialects...>();
   }

   /// Get (or create) a dynamic dialect for the given name.
   DynamicDialect *
   getOrLoadDynamicDialect(StringRef dialectNamespace,
                           function_ref<void(DynamicDialect *)> ctor);

@@ 135 lines hidden @@

mlir/lib/IR/MLIRContext.cpp

@@ 169 lines hidden @@ #endif
   /// destruction with the context.
   std::unique_ptr<llvm::ThreadPool> ownedThreadPool;
   /// This is a list of dialects that are created referring to this context.
   /// The MLIRContext owns the objects.
   DenseMap<StringRef, std::unique_ptr<Dialect>> loadedDialects;
   DialectRegistry dialectsRegistry;
+  /// This is a list of dialect that are currently in the process of loading.
+  /// I.e., their constructor/initializer is still executing.
+  DenseSet<StringRef> loadingDialects;
   ftynse (inline, Done): suggested edit, "a list of dialect" -> "a list of dialects".
   /// An allocator used for AbstractAttribute and AbstractType objects.
   llvm::BumpPtrAllocator abstractDialectSymbolAllocator;
   /// This is a mapping from operation name to the operation info describing it.
   llvm::StringMap<OperationName::Impl> operations;
   /// A vector of operation info specifically for registered operations.
   llvm::StringMap<RegisteredOperationName> registeredOperations;

@@ 238 lines hidden @@ LLVM_DEBUG(llvm::dbgs()
                << "Load new dialect in Context " << dialectNamespace << "\n");
 #ifndef NDEBUG
     if (impl.multiThreadedExecutionContext != 0)
       llvm::report_fatal_error(
           "Loading a dialect (" + dialectNamespace +
           ") while in a multi-threaded execution context (maybe "
           "the PassManager): this can indicate a "
           "missing `dependentDialects` in a pass for example.");
+    if (impl.loadingDialects.contains(dialectNamespace))
+      llvm::report_fatal_error(
+          "Loading (and getting) a dialect (" + dialectNamespace +
+          ") while the same dialect is still loading: use loadDialect instead "
+          "of getOrLoadDialect.");
 #endif
+    auto it = impl.loadingDialects.insert(dialectNamespace);
     std::unique_ptr<Dialect> &dialect =
         impl.loadedDialects.insert({dialectNamespace, ctor()}).first->second;
     assert(dialect && "dialect ctor failed");
+    impl.loadingDialects.erase(it.first);
     // Refresh all the identifiers dialect field, this catches cases where a
     // dialect may be loaded after identifier prefixed with this dialect name
     // were already created.
     auto stringAttrsIt = impl.dialectReferencingStrAttrs.find(dialectNamespace);
     if (stringAttrsIt != impl.dialectReferencingStrAttrs.end()) {
       for (StringAttrStorage *storage : stringAttrsIt->second)
         storage->referencedDialect = dialect.get();
       impl.dialectReferencingStrAttrs.erase(stringAttrsIt);
     }
     // Apply any extensions to this newly loaded dialect.
     impl.dialectsRegistry.applyExtensions(dialect.get());
     return dialect.get();
   }
   // Abort if dialect with namespace has already been registered.
   std::unique_ptr<Dialect> &dialect = dialectIt->second;
   if (dialect->getTypeID() != dialectID)
     llvm::report_fatal_error("a dialect with namespace '" + dialectNamespace +
                              "' has already been registered");
   return dialect.get();
 }
+bool MLIRContext::isDialectLoading(StringRef dialectNamespace) {
+  return getImpl().loadingDialects.contains(dialectNamespace);
+}
 DynamicDialect *MLIRContext::getOrLoadDynamicDialect(
     StringRef dialectNamespace, function_ref<void(DynamicDialect *)> ctor) {
   auto &impl = getImpl();
   // Get the correct insertion position sorted by namespace.
   auto dialectIt = impl.loadedDialects.find(dialectNamespace);
   if (dialectIt != impl.loadedDialects.end()) {
     if (auto dynDialect = dyn_cast<DynamicDialect>(dialectIt->second.get()))
       return dynDialect;
@@ 606 lines hidden @@

mlir/tools/mlir-tblgen/DialectGen.cpp

@@ 101 lines hidden @@ public:
   static constexpr ::llvm::StringLiteral getDialectNamespace() {
     return ::llvm::StringLiteral("{1}");
   }
 )";

 /// Registration for a single dependent dialect: to be inserted in the ctor
 /// above for each dependent dialect.
 const char *const dialectRegistrationTemplate = R"(
-    getContext()->getOrLoadDialect<{0}>();
+    getContext()->loadDialect<{0}>();
 )";

 /// The code block for the attribute parser/printer hooks.
 static const char *const attrParserDecl = R"(
   /// Parse an attribute registered to this dialect.
   ::mlir::Attribute parseAttribute(::mlir::DialectAsmParser &parser,
                                    ::mlir::Type type) const override;

@@ 135 lines hidden @@
 /// {1}: initialization code that is emitted in the ctor body before calling
 ///      initialize().
 /// {2}: The dialect parent class.
 static const char *const dialectConstructorStr = R"(
 {0}::{0}(::mlir::MLIRContext *context)
     : ::mlir::{2}(getDialectNamespace(), context, ::mlir::TypeID::get<{0}>()) {{
   {1}
   initialize();
 }
 )";

 /// The code block to generate a default desturctor definition.
 ///
 /// {0}: The name of the dialect class.
 static const char *const dialectDestructorStr = R"(
 {0}::~{0}() = default;

@@ 62 lines hidden @@

This is an archive of the discontinued LLVM Phabricator instance.

[mlir] Fix circular dialect initialization
Closed, Public
