This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/ExecutionEngine/Orc/
-
llvm/
-
ExecutionEngine/
-
Orc/
-
DebugObjectManagerPlugin.h
-
lib/ExecutionEngine/Orc/
-
ExecutionEngine/
-
Orc/
6/6
DebugObjectManagerPlugin.cpp

Differential D98785

[Orc] Fix pending debug object tracking in DebugObjectManagerPlugin
ClosedPublic

Authored by sgraenitz on Mar 17 2021, 8:49 AM.

Download Raw Diff

Details

Reviewers

lhames

Commits

rGc154cddabd71: [Orc] Fix tracking of pending debug objects in DebugObjectManagerPlugin

Summary

There can be multiple MaterializationResponsibilitys in-flight for a single ResourceKey. Hence, pending debug objects must be tracked by MaterializationResponsibility and not by ResourceKey.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

sgraenitz created this revision.Mar 17 2021, 8:49 AM

Herald added a subscriber: hiraditya. · View Herald TranscriptMar 17 2021, 8:49 AM

sgraenitz requested review of this revision.Mar 17 2021, 8:49 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 17 2021, 8:49 AM

This is hard to reproduce with lli, because single-threaded execution with LLLazyJIT will implicitly sequence materialization. LLJIT instead materializes module dependencies recursively, which makes this issue easy to reproduce. Unfortunately, we have no LLJIT tool that I could use for a test. It's not the first time this limitation comes up for me and I wonder if it's worth adding an orc-greedy kind to lli. What do you think?

llvm/lib/ExecutionEngine/Orc/DebugObjectManagerPlugin.cpp
515	Can we guaranteed that affected in-flight MRs are still alive at this point? Running the code as is works, but I wonder if it's a robust assumption.
523	It's a different issue, but I realized that this might be a dangerous use of resource keys. They are only guaranteed to be valid within the lambda passed into `withResourceKeyDo()` right?

Harbormaster completed remote builds in B94251: Diff 331278.Mar 17 2021, 9:42 AM

lhames added inline comments.Mar 17 2021, 6:22 PM

llvm/lib/ExecutionEngine/Orc/DebugObjectManagerPlugin.cpp
515	We can guarantee that no MR is trying to register resources with the key at the point that `notifyRemovingResources` is called, provided that keys are never leaked from `withResourceKeyDo` -- see below. :)
523	Yep. I missed this on my first read through. You should always use `MaterializationResponsibility::withResourceKeyDo` to create associations between resources and keys, and never leak the key from inside `withResourceKeyDo`. This system guarantees that the association never ends up in a race with resource removal: Any race becomes ordered as either: ASSOCIATE then REMOVE (normal removal case), or REMOVE then ASSOCIATE (in which case `withResourceKey` do will return a `ResourceTrackerDefunct` error and not run your lambda).

This is hard to reproduce with lli, because single-threaded execution with LLLazyJIT will implicitly sequence materialization. LLJIT instead materializes module dependencies recursively, which makes this issue easy to reproduce. Unfortunately, we have no LLJIT tool that I could use for a test. It's not the first time this limitation comes up for me and I wonder if it's worth adding an orc-greedy kind to lli. What do you think?

I'm 100% in favor of an -orc-greedy mode for lli. The only reason it doesn't have one is that I've been short on time.

If it were built I would say that it should just be -jit-kind=orc (with -jit-kind=orc-lazy remaining as it is), and that it should be the new default. In that case all MCJIT test cases would just have to be updated to explicitly specify -jit-kind=mcjit.

Pending resources are irrelevant in both, notifyTransferringResources() and notifyRemovingResources(). They are guaranteed to be either finalizd in notifyEmitted() or discarded in notifyFailed().

Thanks for elaborating.

llvm/lib/ExecutionEngine/Orc/DebugObjectManagerPlugin.cpp
515	It turned out that I don't need to worry about pending debug objects here, which renders my question irrelevant.
523	Fixing this in D98863

sgraenitz marked 2 inline comments as done.Mar 18 2021, 6:34 AM

Harbormaster completed remote builds in B94443: Diff 331537.Mar 18 2021, 6:55 AM

sgraenitz mentioned this in rGe1579894d205: [lli] Add Orc greedy mode as -jit-kind=orc.Mar 18 2021, 3:19 PM

LGTM.

This revision is now accepted and ready to land.Mar 22 2021, 9:26 AM

Closed by commit rGc154cddabd71: [Orc] Fix tracking of pending debug objects in DebugObjectManagerPlugin (authored by sgraenitz). · Explain WhyMar 22 2021, 9:48 AM

This revision was automatically updated to reflect the committed changes.

sgraenitz added a commit: rGc154cddabd71: [Orc] Fix tracking of pending debug objects in DebugObjectManagerPlugin.

Revision Contents

Path

Size

llvm/

include/

llvm/

ExecutionEngine/

Orc/

DebugObjectManagerPlugin.h

2 lines

lib/

ExecutionEngine/

Orc/

DebugObjectManagerPlugin.cpp

66 lines

Diff 332337

llvm/include/llvm/ExecutionEngine/Orc/DebugObjectManagerPlugin.h

Show First 20 Lines • Show All 64 Lines • ▼ Show 20 Lines	public:
void modifyPassConfig(MaterializationResponsibility &MR,		void modifyPassConfig(MaterializationResponsibility &MR,
jitlink::LinkGraph &LG,		jitlink::LinkGraph &LG,
jitlink::PassConfiguration &PassConfig) override;		jitlink::PassConfiguration &PassConfig) override;

private:		private:
ExecutionSession &ES;		ExecutionSession &ES;

using OwnedDebugObject = std::unique_ptr<DebugObject>;		using OwnedDebugObject = std::unique_ptr<DebugObject>;
std::map<ResourceKey, OwnedDebugObject> PendingObjs;		std::map<MaterializationResponsibility *, OwnedDebugObject> PendingObjs;
std::map<ResourceKey, std::vector<OwnedDebugObject>> RegisteredObjs;		std::map<ResourceKey, std::vector<OwnedDebugObject>> RegisteredObjs;

std::mutex PendingObjsLock;		std::mutex PendingObjsLock;
std::mutex RegisteredObjsLock;		std::mutex RegisteredObjsLock;

std::unique_ptr<DebugObjectRegistrar> Target;		std::unique_ptr<DebugObjectRegistrar> Target;
};		};

} // namespace orc		} // namespace orc
} // namespace llvm		} // namespace llvm

#endif // LLVM_EXECUTIONENGINE_ORC_DEBUGOBJECTMANAGERPLUGIN_H		#endif // LLVM_EXECUTIONENGINE_ORC_DEBUGOBJECTMANAGERPLUGIN_H

llvm/lib/ExecutionEngine/Orc/DebugObjectManagerPlugin.cpp

Show First 20 Lines • Show All 401 Lines • ▼ Show 20 Lines	DebugObjectManagerPlugin::DebugObjectManagerPlugin(
ExecutionSession &ES, std::unique_ptr<DebugObjectRegistrar> Target)		ExecutionSession &ES, std::unique_ptr<DebugObjectRegistrar> Target)
: ES(ES), Target(std::move(Target)) {}		: ES(ES), Target(std::move(Target)) {}

DebugObjectManagerPlugin::~DebugObjectManagerPlugin() = default;		DebugObjectManagerPlugin::~DebugObjectManagerPlugin() = default;

void DebugObjectManagerPlugin::notifyMaterializing(		void DebugObjectManagerPlugin::notifyMaterializing(
MaterializationResponsibility &MR, LinkGraph &G, JITLinkContext &Ctx,		MaterializationResponsibility &MR, LinkGraph &G, JITLinkContext &Ctx,
MemoryBufferRef ObjBuffer) {		MemoryBufferRef ObjBuffer) {
assert(PendingObjs.count(getResourceKey(MR)) == 0 &&		std::lock_guard<std::mutex> Lock(PendingObjsLock);
		assert(PendingObjs.count(&MR) == 0 &&
"Cannot have more than one pending debug object per "		"Cannot have more than one pending debug object per "
"MaterializationResponsibility");		"MaterializationResponsibility");

std::lock_guard<std::mutex> Lock(PendingObjsLock);
if (auto DebugObj = createDebugObjectFromBuffer(ES, G, Ctx, ObjBuffer)) {		if (auto DebugObj = createDebugObjectFromBuffer(ES, G, Ctx, ObjBuffer)) {
// Not all link artifacts allow debugging.		// Not all link artifacts allow debugging.
if (*DebugObj != nullptr) {		if (*DebugObj != nullptr)
ResourceKey Key = getResourceKey(MR);		PendingObjs[&MR] = std::move(*DebugObj);
PendingObjs[Key] = std::move(*DebugObj);
}
} else {		} else {
ES.reportError(DebugObj.takeError());		ES.reportError(DebugObj.takeError());
}		}
}		}

void DebugObjectManagerPlugin::modifyPassConfig(		void DebugObjectManagerPlugin::modifyPassConfig(
MaterializationResponsibility &MR, LinkGraph &G,		MaterializationResponsibility &MR, LinkGraph &G,
PassConfiguration &PassConfig) {		PassConfiguration &PassConfig) {
// Not all link artifacts have associated debug objects.		// Not all link artifacts have associated debug objects.
std::lock_guard<std::mutex> Lock(PendingObjsLock);		std::lock_guard<std::mutex> Lock(PendingObjsLock);
auto It = PendingObjs.find(getResourceKey(MR));		auto It = PendingObjs.find(&MR);
if (It == PendingObjs.end())		if (It == PendingObjs.end())
return;		return;

DebugObject &DebugObj = *It->second;		DebugObject &DebugObj = *It->second;
if (DebugObj.has(Requirement::ReportFinalSectionLoadAddresses)) {		if (DebugObj.has(Requirement::ReportFinalSectionLoadAddresses)) {
PassConfig.PostAllocationPasses.push_back(		PassConfig.PostAllocationPasses.push_back(
[&DebugObj](LinkGraph &Graph) -> Error {		[&DebugObj](LinkGraph &Graph) -> Error {
for (const Section &GraphSection : Graph.sections())		for (const Section &GraphSection : Graph.sections())
DebugObj.reportSectionTargetMemoryRange(GraphSection.getName(),		DebugObj.reportSectionTargetMemoryRange(GraphSection.getName(),
SectionRange(GraphSection));		SectionRange(GraphSection));
return Error::success();		return Error::success();
});		});
}		}
}		}

Error DebugObjectManagerPlugin::notifyEmitted(		Error DebugObjectManagerPlugin::notifyEmitted(
MaterializationResponsibility &MR) {		MaterializationResponsibility &MR) {
ResourceKey Key = getResourceKey(MR);

std::lock_guard<std::mutex> Lock(PendingObjsLock);		std::lock_guard<std::mutex> Lock(PendingObjsLock);
auto It = PendingObjs.find(Key);		auto It = PendingObjs.find(&MR);
if (It == PendingObjs.end())		if (It == PendingObjs.end())
return Error::success();		return Error::success();

DebugObject *UnownedDebugObj = It->second.release();		DebugObject *UnownedDebugObj = It->second.release();
PendingObjs.erase(It);		PendingObjs.erase(It);

// During finalization the debug object is registered with the target.		// During finalization the debug object is registered with the target.
// Materialization must wait for this process to finish. Otherwise we might		// Materialization must wait for this process to finish. Otherwise we might
// start running code before the debugger processed the corresponding debug		// start running code before the debugger processed the corresponding debug
// info.		// info.
std::promise<MSVCPError> FinalizePromise;		std::promise<MSVCPError> FinalizePromise;
std::future<MSVCPError> FinalizeErr = FinalizePromise.get_future();		std::future<MSVCPError> FinalizeErr = FinalizePromise.get_future();

// FIXME: We released ownership of the DebugObject, so we can easily capture		// FIXME: We released ownership of the DebugObject, so we can easily capture
// the raw pointer in the continuation function, which re-owns it immediately.		// the raw pointer in the continuation function, which re-owns it immediately.
if (UnownedDebugObj)		if (UnownedDebugObj)
UnownedDebugObj->finalizeAsync(		UnownedDebugObj->finalizeAsync(
[this, Key, UnownedDebugObj,		[this, UnownedDebugObj, &MR,
&FinalizePromise](Expected<sys::MemoryBlock> TargetMem) {		&FinalizePromise](Expected<sys::MemoryBlock> TargetMem) {
std::unique_ptr<DebugObject> ReownedDebugObj(UnownedDebugObj);		std::unique_ptr<DebugObject> ReownedDebugObj(UnownedDebugObj);
if (!TargetMem) {		if (!TargetMem) {
FinalizePromise.set_value(TargetMem.takeError());		FinalizePromise.set_value(TargetMem.takeError());
return;		return;
}		}
if (Error Err = Target->registerDebugObject(*TargetMem)) {		if (Error Err = Target->registerDebugObject(*TargetMem)) {
FinalizePromise.set_value(std::move(Err));		FinalizePromise.set_value(std::move(Err));
return;		return;
}		}

// Registration successful, notifyEmitted() can return now and		// Registration successful, notifyEmitted() can return now and
// materialization can finish.		// materialization can finish.
FinalizePromise.set_value(Error::success());		FinalizePromise.set_value(Error::success());

		ResourceKey Key = getResourceKey(MR);
std::lock_guard<std::mutex> Lock(RegisteredObjsLock);		std::lock_guard<std::mutex> Lock(RegisteredObjsLock);
RegisteredObjs[Key].push_back(std::move(ReownedDebugObj));		RegisteredObjs[Key].push_back(std::move(ReownedDebugObj));
});		});

return FinalizeErr.get();		return FinalizeErr.get();
}		}

Error DebugObjectManagerPlugin::notifyFailed(		Error DebugObjectManagerPlugin::notifyFailed(
MaterializationResponsibility &MR) {		MaterializationResponsibility &MR) {
std::lock_guard<std::mutex> Lock(PendingObjsLock);		std::lock_guard<std::mutex> Lock(PendingObjsLock);
PendingObjs.erase(getResourceKey(MR));		PendingObjs.erase(&MR);
return Error::success();		return Error::success();
}		}

void DebugObjectManagerPlugin::notifyTransferringResources(ResourceKey DstKey,		void DebugObjectManagerPlugin::notifyTransferringResources(ResourceKey DstKey,
ResourceKey SrcKey) {		ResourceKey SrcKey) {
{		// Debug objects are stored by ResourceKey only after registration.
		// Thus, pending objects don't need to be updated here.
std::lock_guard<std::mutex> Lock(RegisteredObjsLock);		std::lock_guard<std::mutex> Lock(RegisteredObjsLock);
auto SrcIt = RegisteredObjs.find(SrcKey);		auto SrcIt = RegisteredObjs.find(SrcKey);
if (SrcIt != RegisteredObjs.end()) {		if (SrcIt != RegisteredObjs.end()) {
// Resources from distinct MaterializationResponsibilitys can get merged		// Resources from distinct MaterializationResponsibilitys can get merged
// after emission, so we can have multiple debug objects per resource key.		// after emission, so we can have multiple debug objects per resource key.
for (std::unique_ptr<DebugObject> &DebugObj : SrcIt->second)		for (std::unique_ptr<DebugObject> &DebugObj : SrcIt->second)
RegisteredObjs[DstKey].push_back(std::move(DebugObj));		RegisteredObjs[DstKey].push_back(std::move(DebugObj));
RegisteredObjs.erase(SrcIt);		RegisteredObjs.erase(SrcIt);
}		}
}		}
{
std::lock_guard<std::mutex> Lock(PendingObjsLock);
auto SrcIt = PendingObjs.find(SrcKey);
if (SrcIt != PendingObjs.end()) {
assert(PendingObjs.count(DstKey) == 0 &&
"Cannot have more than one pending debug object per "
"MaterializationResponsibility");
PendingObjs[DstKey] = std::move(SrcIt->second);
PendingObjs.erase(SrcIt);
}
}
}

Error DebugObjectManagerPlugin::notifyRemovingResources(ResourceKey K) {		Error DebugObjectManagerPlugin::notifyRemovingResources(ResourceKey Key) {
{		// Removing the resource for a pending object fails materialization, so they
		// get cleaned up in the notifyFailed() handler.
std::lock_guard<std::mutex> Lock(RegisteredObjsLock);		std::lock_guard<std::mutex> Lock(RegisteredObjsLock);
		sgraenitzAuthorUnsubmitted Done Reply Inline Actions Can we guaranteed that affected in-flight MRs are still alive at this point? Running the code as is works, but I wonder if it's a robust assumption. sgraenitz: Can we guaranteed that affected in-flight MRs are still alive at this point? Running the code…
		lhamesUnsubmitted Done Reply Inline Actions We can guarantee that no MR is trying to register resources with the key at the point that `notifyRemovingResources` is called, provided that keys are never leaked from `withResourceKeyDo` -- see below. :) lhames: We can guarantee that no MR is trying to register resources with the key at the point that…
		sgraenitzAuthorUnsubmitted Done Reply Inline Actions It turned out that I don't need to worry about pending debug objects here, which renders my question irrelevant. sgraenitz: It turned out that I don't need to worry about pending debug objects here, which renders my…
RegisteredObjs.erase(K);		RegisteredObjs.erase(Key);
// TODO: Implement unregister notifications.
}
std::lock_guard<std::mutex> Lock(PendingObjsLock);
PendingObjs.erase(K);

		// TODO: Implement unregister notifications.
return Error::success();		return Error::success();
}		}

} // namespace orc		} // namespace orc
} // namespace llvm		} // namespace llvm
		sgraenitzAuthorUnsubmitted Done Reply Inline Actions It's a different issue, but I realized that this might be a dangerous use of resource keys. They are only guaranteed to be valid within the lambda passed into `withResourceKeyDo()` right? sgraenitz: It's a different issue, but I realized that this might be a dangerous use of resource keys.
		lhamesUnsubmitted Done Reply Inline Actions Yep. I missed this on my first read through. You should always use `MaterializationResponsibility::withResourceKeyDo` to create associations between resources and keys, and never leak the key from inside `withResourceKeyDo`. This system guarantees that the association never ends up in a race with resource removal: Any race becomes ordered as either: ASSOCIATE then REMOVE (normal removal case), or REMOVE then ASSOCIATE (in which case `withResourceKey` do will return a `ResourceTrackerDefunct` error and not run your lambda). lhames: Yep. I missed this on my first read through. You should always use…
		sgraenitzAuthorUnsubmitted Done Reply Inline Actions Fixing this in D98863 sgraenitz: Fixing this in D98863