This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lld/MachO/
-
MachO/
-
Writer.cpp
-
llvm/
-
include/llvm/Support/
-
llvm/
-
Support/
-
TimeProfiler.h
-
lib/
-
LTO/
-
LTO.cpp
-
Support/
4/9
ThreadPool.cpp
-
TimeProfiler.cpp

Differential D118550

[Support] Have ThreadPool initialize a TimeTraceProfiler per thread
AbandonedPublic

Authored by int3 on Jan 29 2022, 8:51 PM.

Download Raw Diff

Details

Reviewers

anton-afanasyev
russell.gallop
mehdi_amini
MaskRay

Group Reviewers

Restricted Project

Summary

This makes the profiler easier to use; we no longer need to
remember to initialize it every time we fork. It also means that we
initialize the profiler at most once per thread instead of once per
task.

TODO: we should probably implement this for Parallel.h as well, but
perhaps only once we have a use case to test it with.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	60,080 ms	x64 debian > AddressSanitizer-x86_64-linux.TestCases::scariness_score_test.cpp
	820 ms	x64 debian > LLVM.CodeGen/AMDGPU::amdpal-callable.ll

Event Timeline

int3 created this revision.Jan 29 2022, 8:51 PM

Herald added a project: Restricted Project. · View Herald TranscriptJan 29 2022, 8:51 PM

Herald added subscribers: ormris, dexonsmith, steven_wu, hiraditya. · View Herald Transcript

int3 requested review of this revision.Jan 29 2022, 8:51 PM

Herald added a project: Restricted Project. · View Herald TranscriptJan 29 2022, 8:51 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B146499: Diff 404327.Jan 29 2022, 10:14 PM

Hi @int3 ,

Thanks for the patch. When I originally added the multi-threaded support for the time profiler I considered adding the initialize/finish thread into the threadpool itself (llvm/lib/Support/ThreadPool.cpp). In the end I thought it was a bit messy there so just added it into LTO.cpp. With the increased use of multi-threading I think it would be worth reconsidering putting this into ThreadPool.cpp so we don't need to add this everywhere we want to trace multithreaded code. I think that the RAII code you propose will help this fit into ThreadPool.cpp neatly. For all I know there may be other problems with doing that, but I think it will be neater now this is used in more places. What do you think about trying that?

Regards
Russ

Yeah, I think that's probably a good idea. It took me a while to figure out that initializing the profiler on a per-thread basis was required; it was not at all obvious why my profiler events had started disappearing while I was parallelizing lld. Having it done automatically would've saved me some headache.

Another advantage would be that we could actually initialize the profiler once per thread, instead of once per task.

One drawback I guess is that there isn't a good way to specify the ProcName for these threads any more. But perhaps the ProcName was never really that relevant, and the spawned threads could just initialize it with the empty string?

In D118550#3283693, @int3 wrote:

Yeah, I think that's probably a good idea. It took me a while to figure out that initializing the profiler on a per-thread basis was required; it was not at all obvious why my profiler events had started disappearing while I was parallelizing lld. Having it done automatically would've saved me some headache.

Yes, it might help anyone working to further parallelise lld (as long as the scopes are there).

Another advantage would be that we could actually initialize the profiler once per thread, instead of once per task.

That's a good point.

One drawback I guess is that there isn't a good way to specify the ProcName for these threads any more. But perhaps the ProcName was never really that relevant, and the spawned threads could just initialize it with the empty string?

ProcName was originally intended to be clang/lld etc. at the top level, so I'm not sure it's very useful in the way it's used in threads at the moment. This could be empty, or (e.g.) "Thread 1" etc.

alternative approach

Harbormaster completed remote builds in B146711: Diff 404630.Jan 31 2022, 2:25 PM

russell.gallop added a reviewer: mehdi_amini.Feb 1 2022, 3:08 AM

In D118550#3284934, @int3 wrote:

alternative approach

Thanks.

I think this now needs different/extra reviewer(s) for the ThreadPool change. Added @mehdi_amini.

ThreadPool changes looks fine overall.

The ThreadPool implementation so far does not have any dependency on global state, it is slightly annoying to me to introduce some here.

llvm/lib/Support/ThreadPool.cpp
57–58	Can you handle this with RAII? llvm::make_scope_exit outside of the loop?

scope_exit

int3 planned changes to this revision.Feb 1 2022, 1:40 PM

int3 requested review of this revision.

Harbormaster completed remote builds in B147012: Diff 405086.Feb 1 2022, 4:46 PM

ping

rebase

Harbormaster completed remote builds in B147745: Diff 406147.Feb 4 2022, 8:51 PM

@mehdi_amini is this good to go? I think the test failures are spurious (but not 100% sure)

(There are visible changes around the name associated with the tracer in lld that someone familiar with this in lld should approve.)

llvm/lib/Support/ThreadPool.cpp
37	It seems that we'll always use the instance initialized in the thread that calls "grow". Also, this instance has to be setup before the call to grow, and the thread can't reinitialize it for the lifetime duration of the ThreadPool if I understand correctly. I'm not sure this makes sense in the full generality of the ThreadPool API?

In D118550#3302487, @mehdi_amini wrote:

(There are visible changes around the name associated with the tracer in lld that someone familiar with this in lld should approve.)

Maybe add @MaskRay.

@int3, apologies if my suggestion of adding into ThreadPool has made this more complicated! Your original change may be okay as a quicker fix, moving to ThreadPool could be a follow up.

llvm/lib/Support/ThreadPool.cpp
37	I'm not sure this makes sense in the full generality of the ThreadPool API? Do we agree that time tracing all threads used by the ThreadPool is desirable and worth pursuing?

mehdi_amini added inline comments.Feb 9 2022, 2:00 AM

llvm/lib/Support/ThreadPool.cpp
37	This could be a useful feature to have a tracing feature for the ThreadPool. I'm not sure about: The expected behavior in terms of threading (with respect to the thread pool creation, the growing of the pool, or the enqueuing of a task). The current `TimeTraceProfiler` which is non-trivially coupled to some global state, making this all harder to reason about.

(There are visible changes around the name associated with the tracer in lld that someone familiar with this in lld should approve.)

I think they're pretty safe. But yeah, I will ping the other LLD people about that once we've settled on the core ThreadPool changes.

@int3, apologies if my suggestion of adding into ThreadPool has made this more complicated! Your original change may be okay as a quicker fix, moving to ThreadPool could be a follow up.

No worries & no rush; I'm happy to hash this out :) Was just a bit busy for the last couple of days.

llvm/lib/Support/ThreadPool.cpp
37	It seems that we'll always use the instance initialized in the thread that calls "grow". Actually we are just copying the value of its TimeTraceGranularity member and using that to initialize a new thread-local instance. I could have just copied the TimeTraceGranularity value itself, but I figured this was a slightly nicer abstraction -- if in the future we add more fields to TimeTraceProfilerInstance, we can keep the same initialization method signature. You are right that this makes things a little less general though. In particular, there is no way to have different granularities per profiler instance -- every thread must use the same value. IDK if that's an issue... after all, the places where the TimeProfiler is getting used ATM don't take advantage of this flexibility. the thread can't reinitialize it for the lifetime duration of the ThreadPool if I understand correctly I don't think there's a use case for reinitializing it... That said I didn't write the TimeProfiler, so maybe @russell.gallop can confirm.

int3 added a reviewer: MaskRay.Feb 9 2022, 5:11 PM

int3 added inline comments.Feb 14 2022, 11:36 PM

llvm/lib/Support/ThreadPool.cpp
37	bump -- @russell.gallop, could you chime in here?

russell.gallop added inline comments.Feb 15 2022, 7:42 AM

llvm/lib/Support/ThreadPool.cpp
37	You are right that this makes things a little less general though. In particular, there is no way to have different granularities per profiler instance -- every thread must use the same value. IDK if that's an issue... after all, the places where the TimeProfiler is getting used ATM don't take advantage of this flexibility. Yes, I don't imagine it is very useful to have different granularities. I don't think there's a use case for reinitializing it... That said I didn't write the TimeProfiler, so maybe @russell.gallop can confirm. I'm not sure I follow the use of the Thread Pool where this would be required... I think @anton-afanasyev added the time profiler, I extended for LLD tracing, but only really with the ThinLTO threading case in mind. I imagine that the profiler could be re-engineered to meet what the thread pool can do, but I'm not really aware of what the "full generality of the thread pool" is so can't really say more than that. I don't think I'll have time to do this myself. Let me know if you don't have time and I can ask if there is someone around here who can take a look at this.

mehdi_amini added inline comments.Feb 15 2022, 10:16 AM

llvm/lib/Support/ThreadPool.cpp
37	The ThreadPool isn't a "global" thing, but the profiler is. It seems to me that there is a mismatch in terms of lifetime / lifecycle that creeps up here and does not make it a natural fit for the ThreadPool to know directly about the profiler. For example a MLIRContext (similar to LLVMContext) owns a ThreadPool, you may through the lifetime of a process create and destroy multiple ThreadPool. The thread which creates the ThreadPool isn't necessarily the one that will enqueue work to the pool. In MLIR for example, the pool will get work enqueued when a pass manager is executed: what does it mean in terms of profiler instance? The kind of things that can be surprising with the implementation in this patch is that a sequence like: Create a ThreadPool Setup profiler Grow the pool to more thread schedule would lead to only some threads having the profiler enabled (the ones that grew post setup). More simpler "gotcha" is: Create a ThreadPool (indirectly, from the user point of it is "initialize the compiler" or "Create a MLIRContext") Setup profiler run this wouldn't get any thread other that the current one having profiler setup. However many construct like parallel_for will schedule iterations in the pool but also use the current thread to run work, so work executed in the current thread would get different behavior from the work setup in the pool. Variant of this behavior can include: Setup profiler Create a ThreadPool (indirectly, from the user point of it is "initialize the compiler" or "Create a MLIRContext") `parallel_for` from another thread This time the parallel iteration in the thread pool would have the profiler setup, but not the ones directly executed in the calling thread.

int3 planned changes to this revision.Feb 18 2022, 8:46 AM

int3 added inline comments.

llvm/lib/Support/ThreadPool.cpp
37	got it, I see why you are uncomfortable with the change now. I will revert to the earlier non-ThreadPool-invasive version of this diff. Thanks for the insight!

(Just wanted to say I know ld.lld --time-trace has problems. For example if I run

ld.lld @response.txt --threads=8 --time-trace -o clang
jq -r '.traceEvents[] | select((.name|contains("Total")) or (.name|contains("Write"))) | "\(.dur/1000000) \(.name) \(.args)"' < clang.time-trace

I can sometimes find missing trace events. But I haven't had time to look into this. Sorry!)

int3 abandoned this revision.Mar 16 2022, 2:32 PM

Herald added a project: Restricted Project. · View Herald TranscriptMar 16 2022, 2:32 PM

koops mentioned this in D123235: [OpenMP] atomic compare fail : Parser & AST support.May 14 2022, 4:47 AM

Revision Contents

Path

Size

lld/

MachO/

Writer.cpp

8 lines

llvm/

include/

llvm/

Support/

TimeProfiler.h

4 lines

lib/

LTO/

LTO.cpp

5 lines

Support/

ThreadPool.cpp

11 lines

TimeProfiler.cpp

5 lines

Diff 406147

lld/MachO/Writer.cpp

Show First 20 Lines • Show All 1,127 Lines • ▼ Show 20 Lines	template <class LP> void Writer::run() {
// After this point, we create no new segments; HOWEVER, we might		// After this point, we create no new segments; HOWEVER, we might
// yet create branch-range extension thunks for architectures whose		// yet create branch-range extension thunks for architectures whose
// hardware call instructions have limited range, e.g., ARM(64).		// hardware call instructions have limited range, e.g., ARM(64).
// The thunks are created as InputSections interspersed among		// The thunks are created as InputSections interspersed among
// the ordinary __TEXT,_text InputSections.		// the ordinary __TEXT,_text InputSections.
sortSegmentsAndSections();		sortSegmentsAndSections();
createLoadCommands<LP>();		createLoadCommands<LP>();
finalizeAddresses();		finalizeAddresses();
threadPool.async([&] {		threadPool.async(writeMapFile);
if (LLVM_ENABLE_THREADS && config->timeTraceEnabled)
timeTraceProfilerInitialize(config->timeTraceGranularity, "writeMapFile");
writeMapFile();
if (LLVM_ENABLE_THREADS && config->timeTraceEnabled)
timeTraceProfilerFinishThread();
});
finalizeLinkEditSegment();		finalizeLinkEditSegment();
writeOutputFile();		writeOutputFile();
}		}

template <class LP> void macho::writeResult() { Writer().run<LP>(); }		template <class LP> void macho::writeResult() { Writer().run<LP>(); }

void macho::resetWriter() { LCDylib::resetInstanceCount(); }		void macho::resetWriter() { LCDylib::resetInstanceCount(); }

Show All 37 Lines

llvm/include/llvm/Support/TimeProfiler.h

	Show All 19 Lines
	TimeTraceProfiler *getTimeTraceProfilerInstance();			TimeTraceProfiler *getTimeTraceProfilerInstance();

	/// Initialize the time trace profiler.			/// Initialize the time trace profiler.
	/// This sets up the global \p TimeTraceProfilerInstance			/// This sets up the global \p TimeTraceProfilerInstance
	/// variable to be the profiler instance.			/// variable to be the profiler instance.
	void timeTraceProfilerInitialize(unsigned TimeTraceGranularity,			void timeTraceProfilerInitialize(unsigned TimeTraceGranularity,
	StringRef ProcName);			StringRef ProcName);

				/// Initialize the time trace profiler with the same settings as Profiler.
				void timeTraceProfilerInitialize(const TimeTraceProfiler &Profiler,
				StringRef ProcName);

	/// Cleanup the time trace profiler, if it was initialized.			/// Cleanup the time trace profiler, if it was initialized.
	void timeTraceProfilerCleanup();			void timeTraceProfilerCleanup();

	/// Finish a time trace profiler running on a worker thread.			/// Finish a time trace profiler running on a worker thread.
	void timeTraceProfilerFinishThread();			void timeTraceProfilerFinishThread();

	/// Is the time trace profiler enabled, i.e. initialized?			/// Is the time trace profiler enabled, i.e. initialized?
	inline bool timeTraceProfilerEnabled() {			inline bool timeTraceProfilerEnabled() {
	▲ Show 20 Lines • Show All 60 Lines • Show Last 20 Lines

llvm/lib/LTO/LTO.cpp

Show First 20 Lines • Show All 1,262 Lines • ▼ Show 20 Lines	Error start(
BackendThreadPool.async(		BackendThreadPool.async(
[=](BitcodeModule BM, ModuleSummaryIndex &CombinedIndex,		[=](BitcodeModule BM, ModuleSummaryIndex &CombinedIndex,
const FunctionImporter::ImportMapTy &ImportList,		const FunctionImporter::ImportMapTy &ImportList,
const FunctionImporter::ExportSetTy &ExportList,		const FunctionImporter::ExportSetTy &ExportList,
const std::map<GlobalValue::GUID, GlobalValue::LinkageTypes>		const std::map<GlobalValue::GUID, GlobalValue::LinkageTypes>
&ResolvedODR,		&ResolvedODR,
const GVSummaryMapTy &DefinedGlobals,		const GVSummaryMapTy &DefinedGlobals,
MapVector<StringRef, BitcodeModule> &ModuleMap) {		MapVector<StringRef, BitcodeModule> &ModuleMap) {
if (LLVM_ENABLE_THREADS && Conf.TimeTraceEnabled)
timeTraceProfilerInitialize(Conf.TimeTraceGranularity,
"thin backend");
Error E = runThinLTOBackendThread(		Error E = runThinLTOBackendThread(
AddStream, Cache, Task, BM, CombinedIndex, ImportList, ExportList,		AddStream, Cache, Task, BM, CombinedIndex, ImportList, ExportList,
ResolvedODR, DefinedGlobals, ModuleMap);		ResolvedODR, DefinedGlobals, ModuleMap);
if (E) {		if (E) {
std::unique_lock<std::mutex> L(ErrMu);		std::unique_lock<std::mutex> L(ErrMu);
if (Err)		if (Err)
Err = joinErrors(std::move(*Err), std::move(E));		Err = joinErrors(std::move(*Err), std::move(E));
else		else
Err = std::move(E);		Err = std::move(E);
}		}
if (LLVM_ENABLE_THREADS && Conf.TimeTraceEnabled)
timeTraceProfilerFinishThread();
},		},
BM, std::ref(CombinedIndex), std::ref(ImportList), std::ref(ExportList),		BM, std::ref(CombinedIndex), std::ref(ImportList), std::ref(ExportList),
std::ref(ResolvedODR), std::ref(DefinedGlobals), std::ref(ModuleMap));		std::ref(ResolvedODR), std::ref(DefinedGlobals), std::ref(ModuleMap));
return Error::success();		return Error::success();
}		}

Error wait() override {		Error wait() override {
BackendThreadPool.wait();		BackendThreadPool.wait();
▲ Show 20 Lines • Show All 339 Lines • Show Last 20 Lines

llvm/lib/Support/ThreadPool.cpp

	Show All 9 Lines
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "llvm/Support/ThreadPool.h"			#include "llvm/Support/ThreadPool.h"

	#include "llvm/Config/llvm-config.h"			#include "llvm/Config/llvm-config.h"

	#if LLVM_ENABLE_THREADS			#if LLVM_ENABLE_THREADS
				#include "llvm/ADT/ScopeExit.h"
	#include "llvm/Support/Threading.h"			#include "llvm/Support/Threading.h"
				#include "llvm/Support/TimeProfiler.h"
	#else			#else
	#include "llvm/Support/raw_ostream.h"			#include "llvm/Support/raw_ostream.h"
	#endif			#endif

	using namespace llvm;			using namespace llvm;

	#if LLVM_ENABLE_THREADS			#if LLVM_ENABLE_THREADS

	ThreadPool::ThreadPool(ThreadPoolStrategy S)			ThreadPool::ThreadPool(ThreadPoolStrategy S)
	: Strategy(S), MaxThreadCount(S.compute_thread_count()) {}			: Strategy(S), MaxThreadCount(S.compute_thread_count()) {}

	void ThreadPool::grow(int requested) {			void ThreadPool::grow(int requested) {
	std::unique_lock<std::mutex> LockGuard(ThreadsLock);			std::unique_lock<std::mutex> LockGuard(ThreadsLock);
	if (Threads.size() >= MaxThreadCount)			if (Threads.size() >= MaxThreadCount)
	return; // Already hit the max thread pool size.			return; // Already hit the max thread pool size.
	int newThreadCount = std::min<int>(requested, MaxThreadCount);			int newThreadCount = std::min<int>(requested, MaxThreadCount);
				TimeTraceProfiler *MainThreadProfiler = getTimeTraceProfilerInstance();
				mehdi_aminiUnsubmitted Not Done Reply Inline Actions It seems that we'll always use the instance initialized in the thread that calls "grow". Also, this instance has to be setup before the call to grow, and the thread can't reinitialize it for the lifetime duration of the ThreadPool if I understand correctly. I'm not sure this makes sense in the full generality of the ThreadPool API? mehdi_amini: It seems that we'll always use the instance initialized in the thread that calls "grow". Also…
				russell.gallopUnsubmitted Not Done Reply Inline Actions I'm not sure this makes sense in the full generality of the ThreadPool API? Do we agree that time tracing all threads used by the ThreadPool is desirable and worth pursuing? russell.gallop: > I'm not sure this makes sense in the full generality of the ThreadPool API? Do we agree that…
				mehdi_aminiUnsubmitted Not Done Reply Inline Actions This could be a useful feature to have a tracing feature for the ThreadPool. I'm not sure about: The expected behavior in terms of threading (with respect to the thread pool creation, the growing of the pool, or the enqueuing of a task). The current `TimeTraceProfiler` which is non-trivially coupled to some global state, making this all harder to reason about. mehdi_amini: This could be a useful feature to have a tracing feature for the ThreadPool. I'm not sure about…
				int3AuthorUnsubmitted Done Reply Inline Actions It seems that we'll always use the instance initialized in the thread that calls "grow". Actually we are just copying the value of its TimeTraceGranularity member and using that to initialize a new thread-local instance. I could have just copied the TimeTraceGranularity value itself, but I figured this was a slightly nicer abstraction -- if in the future we add more fields to TimeTraceProfilerInstance, we can keep the same initialization method signature. You are right that this makes things a little less general though. In particular, there is no way to have different granularities per profiler instance -- every thread must use the same value. IDK if that's an issue... after all, the places where the TimeProfiler is getting used ATM don't take advantage of this flexibility. the thread can't reinitialize it for the lifetime duration of the ThreadPool if I understand correctly I don't think there's a use case for reinitializing it... That said I didn't write the TimeProfiler, so maybe @russell.gallop can confirm. int3: > It seems that we'll always use the instance initialized in the thread that calls "grow".
				int3AuthorUnsubmitted Done Reply Inline Actions bump -- @russell.gallop, could you chime in here? int3: bump -- @russell.gallop, could you chime in here?
				russell.gallopUnsubmitted Not Done Reply Inline Actions You are right that this makes things a little less general though. In particular, there is no way to have different granularities per profiler instance -- every thread must use the same value. IDK if that's an issue... after all, the places where the TimeProfiler is getting used ATM don't take advantage of this flexibility. Yes, I don't imagine it is very useful to have different granularities. I don't think there's a use case for reinitializing it... That said I didn't write the TimeProfiler, so maybe @russell.gallop can confirm. I'm not sure I follow the use of the Thread Pool where this would be required... I think @anton-afanasyev added the time profiler, I extended for LLD tracing, but only really with the ThinLTO threading case in mind. I imagine that the profiler could be re-engineered to meet what the thread pool can do, but I'm not really aware of what the "full generality of the thread pool" is so can't really say more than that. I don't think I'll have time to do this myself. Let me know if you don't have time and I can ask if there is someone around here who can take a look at this. russell.gallop: > You are right that this makes things a little less general though. In particular, there is no…
				mehdi_aminiUnsubmitted Not Done Reply Inline Actions The ThreadPool isn't a "global" thing, but the profiler is. It seems to me that there is a mismatch in terms of lifetime / lifecycle that creeps up here and does not make it a natural fit for the ThreadPool to know directly about the profiler. For example a MLIRContext (similar to LLVMContext) owns a ThreadPool, you may through the lifetime of a process create and destroy multiple ThreadPool. The thread which creates the ThreadPool isn't necessarily the one that will enqueue work to the pool. In MLIR for example, the pool will get work enqueued when a pass manager is executed: what does it mean in terms of profiler instance? The kind of things that can be surprising with the implementation in this patch is that a sequence like: Create a ThreadPool Setup profiler Grow the pool to more thread schedule would lead to only some threads having the profiler enabled (the ones that grew post setup). More simpler "gotcha" is: Create a ThreadPool (indirectly, from the user point of it is "initialize the compiler" or "Create a MLIRContext") Setup profiler run this wouldn't get any thread other that the current one having profiler setup. However many construct like parallel_for will schedule iterations in the pool but also use the current thread to run work, so work executed in the current thread would get different behavior from the work setup in the pool. Variant of this behavior can include: Setup profiler Create a ThreadPool (indirectly, from the user point of it is "initialize the compiler" or "Create a MLIRContext") `parallel_for` from another thread This time the parallel iteration in the thread pool would have the profiler setup, but not the ones directly executed in the calling thread. mehdi_amini: The ThreadPool isn't a "global" thing, but the profiler is. It seems to me that there is a…
				int3AuthorUnsubmitted Done Reply Inline Actions got it, I see why you are uncomfortable with the change now. I will revert to the earlier non-ThreadPool-invasive version of this diff. Thanks for the insight! int3: got it, I see why you are uncomfortable with the change now. I will revert to the earlier non…
	while (static_cast<int>(Threads.size()) < newThreadCount) {			while (static_cast<int>(Threads.size()) < newThreadCount) {
	int ThreadID = Threads.size();			int ThreadID = Threads.size();
	Threads.emplace_back([this, ThreadID] {			Threads.emplace_back([this, MainThreadProfiler, ThreadID] {
				if (MainThreadProfiler)
				timeTraceProfilerInitialize(*MainThreadProfiler, "worker");
				auto ProfilerFinish = make_scope_exit([] {
				if (timeTraceProfilerEnabled())
				timeTraceProfilerFinishThread();
				});
	Strategy.apply_thread_strategy(ThreadID);			Strategy.apply_thread_strategy(ThreadID);
	while (true) {			while (true) {
	std::function<void()> Task;			std::function<void()> Task;
	{			{
	std::unique_lock<std::mutex> LockGuard(QueueLock);			std::unique_lock<std::mutex> LockGuard(QueueLock);
	// Wait for tasks to be pushed in the queue			// Wait for tasks to be pushed in the queue
	QueueCondition.wait(LockGuard,			QueueCondition.wait(LockGuard,
	[&] { return !EnableFlag \|\| !Tasks.empty(); });			[&] { return !EnableFlag \|\| !Tasks.empty(); });
	// Exit condition			// Exit condition
	if (!EnableFlag && Tasks.empty())			if (!EnableFlag && Tasks.empty())
	return;			return;
	// Yeah, we have a task, grab it and release the lock on the queue			// Yeah, we have a task, grab it and release the lock on the queue
				mehdi_aminiUnsubmitted Done Reply Inline Actions Can you handle this with RAII? llvm::make_scope_exit outside of the loop? mehdi_amini: Can you handle this with RAII? llvm::make_scope_exit outside of the loop?

	// We first need to signal that we are active before popping the queue			// We first need to signal that we are active before popping the queue
	// in order for wait() to properly detect that even if the queue is			// in order for wait() to properly detect that even if the queue is
	// empty, there is still a task in flight.			// empty, there is still a task in flight.
	++ActiveThreads;			++ActiveThreads;
	Task = std::move(Tasks.front());			Task = std::move(Tasks.front());
	Tasks.pop();			Tasks.pop();
	}			}
	▲ Show 20 Lines • Show All 73 Lines • Show Last 20 Lines

llvm/lib/Support/TimeProfiler.cpp

	Show First 20 Lines • Show All 261 Lines • ▼ Show 20 Lines
	void llvm::timeTraceProfilerInitialize(unsigned TimeTraceGranularity,			void llvm::timeTraceProfilerInitialize(unsigned TimeTraceGranularity,
	StringRef ProcName) {			StringRef ProcName) {
	assert(TimeTraceProfilerInstance == nullptr &&			assert(TimeTraceProfilerInstance == nullptr &&
	"Profiler should not be initialized");			"Profiler should not be initialized");
	TimeTraceProfilerInstance = new TimeTraceProfiler(			TimeTraceProfilerInstance = new TimeTraceProfiler(
	TimeTraceGranularity, llvm::sys::path::filename(ProcName));			TimeTraceGranularity, llvm::sys::path::filename(ProcName));
	}			}

				void llvm::timeTraceProfilerInitialize(const TimeTraceProfiler &Profiler,
				StringRef ProcName) {
				timeTraceProfilerInitialize(Profiler.TimeTraceGranularity, ProcName);
				}

	// Removes all TimeTraceProfilerInstances.			// Removes all TimeTraceProfilerInstances.
	// Called from main thread.			// Called from main thread.
	void llvm::timeTraceProfilerCleanup() {			void llvm::timeTraceProfilerCleanup() {
	delete TimeTraceProfilerInstance;			delete TimeTraceProfilerInstance;
	TimeTraceProfilerInstance = nullptr;			TimeTraceProfilerInstance = nullptr;
	std::lock_guard<std::mutex> Lock(Mu);			std::lock_guard<std::mutex> Lock(Mu);
	for (auto TTP : ThreadTimeTraceProfilerInstances)			for (auto TTP : ThreadTimeTraceProfilerInstances)
	delete TTP;			delete TTP;
	▲ Show 20 Lines • Show All 53 Lines • Show Last 20 Lines