This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Support/
-
llvm/
-
Support/
4/6
ThreadPool.h
-
lib/Support/
-
Support/
2/3
ThreadPool.cpp

Differential D115019

ThreadPool: grow the pool only as needed
ClosedPublic

Authored by Benoit on Dec 2 2021, 6:47 PM.

Download Raw Diff

Details

Reviewers

mehdi_amini

Commits

rGb28f317c8156: Fix build for ThreadPool when using -DLLVM_ENABLE_THREADS=OFF
rG728b982bb2ae: ThreadPool: grow the pool only as needed

Summary

On my 96-core cloudtop 'machine', it seems unnecessary to always start
96 threads upfront... particularly as the ThreadPool is created even
with -mlir-disable-threading. Things like the resuling spew in GDB and
the obfuscated output of (gdb) info threads are my motivation here,
but it probably also doesn't hurt for at least some efficiency metrics to
avoid creating many threads upfront.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

Benoit created this revision.Dec 2 2021, 6:47 PM

Herald added subscribers: Chia-hungDuan, dexonsmith, rriddle, hiraditya. · View Herald TranscriptDec 2 2021, 6:47 PM

Benoit requested review of this revision.Dec 2 2021, 6:47 PM

Herald added a project: Restricted Project. · View Herald TranscriptDec 2 2021, 6:47 PM

Herald added subscribers: llvm-commits, stephenneuendorffer. · View Herald Transcript

Benoit added a reviewer: mehdi_amini.Dec 2 2021, 6:48 PM

off by one mistake in max number of worker threads

Benoit added inline comments.Dec 2 2021, 7:05 PM

llvm/lib/Support/ThreadPool.cpp
24	Here actually I'm really not sure: what should be the max number of threads in `Threads`: should it be `Strategy.compute_thread_count()` or should it be that minus one to account for the main thread? it looks like the existing code was doing the former but (as actually confirmed by GDB) that meant it had one more thread running than the detected hardware concurrency. Which could be fine if we expect that the main thread would be idle most of the time? What was the intent here?

Benoit added inline comments.Dec 2 2021, 7:11 PM

llvm/include/llvm/Support/ThreadPool.h
69–72	Please decide what we want here? The immediate problem that led me to add +1 here was that when Threads was empty, I was now returning 0, and the caller was clearly expecting at least 1: At mlir/include/mlir/IR/Threading.h:75 size_t numActions = std::min(numElements, threadPool.getThreadCount()); SmallVector<std::shared_future<void>> threadFutures; threadFutures.reserve(numActions - 1); for (unsigned i = 1; i < numActions; ++i) threadFutures.emplace_back(threadPool.async(processFn));

Got it, getThreadCount() was really meant to return the max number of threads, not the current number. Those two numbers used to be the same before this patch.

Benoit added inline comments.Dec 2 2021, 7:42 PM

llvm/include/llvm/Support/ThreadPool.h
69–72	Self-replying: this call site really shows that `getThreadCount` was expected to return the potential max number of threads, which incidentally was equal to the current number of threads before this diff, but that's what is changing here. To avoid breaking existing users, I reverted `getThreadCount` to this behavior (I believe that the earlier state of this diff would have effectively kept the number of threads to 1. Now I've checked in GDB that we do create a dozen threads for a simple lit test).
llvm/lib/Support/ThreadPool.cpp
24	Got it, nevermind, updated this diff (see reply to the other comment thread).

particularly as the ThreadPool is created even with -mlir-disable-threading.

I'm fairly sure I fixed this one or two months ago, at least the C++ API allows to setup a project without starting a ThreadPool at all. This was a bottleneck in TensorFlow for some small MLIR work we're doing there.

There might be some similar plumbing we could do in how we handle --mlir-disable-threading with mlir-opt as well, since it is a testing tool we haven't tried to "optimize" this kind of things (you're making a good case for it though!).

That said, I like this change to lazily create thread as we go: so that even in cases where I want parallelism, I won't create more threads than actually needed!

llvm/lib/Support/ThreadPool.cpp
27	Nit: remove trivial braces

mehdi_amini added inline comments.Dec 2 2021, 8:02 PM

llvm/include/llvm/Support/ThreadPool.h
71	I'm not fond of keeping the API name as-is with a new semantics. What about removing it and using two APIs instead: getMaxThreadCount() getAvailableThreadCount() => return Threads.size()

Harbormaster completed remote builds in B137287: Diff 391531.Dec 2 2021, 8:14 PM

Address review comments.

In D115019#3168840, @mehdi_amini wrote:

I'm fairly sure I fixed this one or two months ago, at least the C++ API allows to setup a project without starting a ThreadPool at all. This was a bottleneck in TensorFlow for some small MLIR work we're doing there.

There might be some similar plumbing we could do in how we handle --mlir-disable-threading with mlir-opt as well, since it is a testing tool we haven't tried to "optimize" this kind of things (you're making a good case for it though!).

Indeed, it is in mlir-opt that I was observing that behavior this week.

llvm/include/llvm/Support/ThreadPool.h
71	There are 3 call sites in one Clang file, and 2 call sites in 2 MLIR files. That's a little more than I feel save updating all in one shot. Are you OK to keep the current name for now, and defer to a follow-up? At least the behavior isn't changing, these existing users are still going to get effectively the same result value. For the new methods: I agree that the current `getThreadCount` should get renamed `getMaxThreadCount`. I don't feel comfortable with `getAvailableThreadCount() => return Threads.size()` because that would conflict with the meaning of "available" in the existing member `AvailableThreads`. How about `getCurrentThreadCount` ?

mehdi_amini added inline comments.Dec 2 2021, 8:24 PM

llvm/include/llvm/Support/ThreadPool.h
149	Nit: it won't have any impact right now, but can you guard this declaration with LLVM_ENABLE_THREADS? That would lead to compile time failures instead of link-time failures if someone changes something incorrectly in the future.

LG, thanks!

llvm/include/llvm/Support/ThreadPool.h
71	getCurrentThreadCount() LGTM

This revision is now accepted and ready to land.Dec 2 2021, 8:26 PM

review comment

In D115019#3168875, @mehdi_amini wrote:

LG, thanks!

Thanks for the quick review! I don't have permissions to push by myself, FWIW.
I'm also running this through google global presubmits tonight... let's wait until tomorrow to submit this.

Sure, ping me tomorrow!

Harbormaster completed remote builds in B137292: Diff 391536.Dec 2 2021, 9:21 PM

Wait a bit more before submitting... we're still debugging CI failures.

fix preexisting race condition in isWorkerThread found by MLIR tests with TSan

This should be good to submit now.

This revision was landed with ongoing or failed builds.Dec 3 2021, 1:48 PM

Closed by commit rG728b982bb2ae: ThreadPool: grow the pool only as needed (authored by Benoit, committed by mehdi_amini). · Explain Why

This revision was automatically updated to reflect the committed changes.

mehdi_amini added a commit: rG728b982bb2ae: ThreadPool: grow the pool only as needed.

Harbormaster completed remote builds in B137423: Diff 391723.Dec 3 2021, 2:06 PM

Looks like this is breaking the buildbot:
https://lab.llvm.org/buildbot/#/builders/37/builds/8903

salient log extract:

[127/558] Building CXX object lib/Support/CMakeFiles/LLVMSupport.dir/ThreadPool.cpp.o
FAILED: lib/Support/CMakeFiles/LLVMSupport.dir/ThreadPool.cpp.o 
/b/sanitizer-x86_64-linux/build/llvm_build64/bin/clang++  -DGTEST_HAS_RTTI=0 -D_GNU_SOURCE -D__STDC_CONSTANT_MACROS -D__STDC_FORMAT_MACROS -D__STDC_LIMIT_MACROS -Ilib/Support -I/b/sanitizer-x86_64-linux/build/llvm-project/llvm/lib/Support -Iinclude -I/b/sanitizer-x86_64-linux/build/llvm-project/llvm/include -fPIC -fno-semantic-interposition -fvisibility-inlines-hidden -Werror=date-time -Werror=unguarded-availability-new -Wall -Wextra -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wmissing-field-initializers -pedantic -Wno-long-long -Wc++98-compat-extra-semi -Wimplicit-fallthrough -Wcovered-switch-default -Wno-noexcept-type -Wnon-virtual-dtor -Wdelete-non-virtual-dtor -Wsuggest-override -Wstring-conversion -Wmisleading-indentation -fdiagnostics-color -ffunction-sections -fdata-sections -Werror=global-constructors -m32 -fPIC -flto -Os -g0 -DNDEBUG -fno-rtti -fno-exceptions -nostdinc++ -I/b/sanitizer-x86_64-linux/build/symbolizer_build32/symbolizer/zlib -isystem /b/sanitizer-x86_64-linux/build/symbolizer_build32/symbolizer/libcxx/include/x86_64-unknown-linux-gnu/c++/v1 -isystem /b/sanitizer-x86_64-linux/build/symbolizer_build32/symbolizer/libcxx/include/c++/v1 -Wno-error=global-constructors   -std=c++14  -fno-exceptions -fno-rtti -MD -MT lib/Support/CMakeFiles/LLVMSupport.dir/ThreadPool.cpp.o -MF lib/Support/CMakeFiles/LLVMSupport.dir/ThreadPool.cpp.o.d -o lib/Support/CMakeFiles/LLVMSupport.dir/ThreadPool.cpp.o -c /b/sanitizer-x86_64-linux/build/llvm-project/llvm/lib/Support/ThreadPool.cpp
/b/sanitizer-x86_64-linux/build/llvm-project/llvm/lib/Support/ThreadPool.cpp:97:13: error: redefinition of 'ThreadPool'
ThreadPool::ThreadPool(ThreadPoolStrategy S)
            ^
/b/sanitizer-x86_64-linux/build/llvm-project/llvm/include/llvm/Support/ThreadPool.h:43:3: note: previous definition is here
  ThreadPool(ThreadPoolStrategy S = hardware_concurrency())
  ^
/b/sanitizer-x86_64-linux/build/llvm-project/llvm/lib/Support/ThreadPool.cpp:98:7: error: initializer 'ThreadCount' does not name a non-static data member or base class; did you mean the member 'MaxThreadCount'?
    : ThreadCount(S.compute_thread_count()) {
      ^~~~~~~~~~~
      MaxThreadCount
/b/sanitizer-x86_64-linux/build/llvm-project/llvm/include/llvm/Support/ThreadPool.h:177:18: note: 'MaxThreadCount' declared here
  const unsigned MaxThreadCount;
                 ^
/b/sanitizer-x86_64-linux/build/llvm-project/llvm/lib/Support/ThreadPool.cpp:99:7: error: use of undeclared identifier 'ThreadCount'
  if (ThreadCount != 1) {
      ^
/b/sanitizer-x86_64-linux/build/llvm-project/llvm/lib/Support/ThreadPool.cpp:100:56: error: use of undeclared identifier 'ThreadCount'; did you mean 'MaxThreadCount'?
    errs() << "Warning: request a ThreadPool with " << ThreadCount
                                                       ^~~~~~~~~~~
                                                       MaxThreadCount
/b/sanitizer-x86_64-linux/build/llvm-project/llvm/include/llvm/Support/ThreadPool.h:177:18: note: 'MaxThreadCount' declared here
  const unsigned MaxThreadCount;
                 ^
4 errors generated.

mehdi_amini added a commit: rGb28f317c8156: Fix build for ThreadPool when using -DLLVM_ENABLE_THREADS=OFF.Dec 3 2021, 6:23 PM

Sorry! Pushed a fix in b28f317c8156 and will monitor the bot tonight.

https://lab.llvm.org/buildbot/ is broken for me right now though (502 Bad Gateway), does it work for you?

In D115019#3171261, @mehdi_amini wrote:

Sorry! Pushed a fix in b28f317c8156 and will monitor the bot tonight.

Sorry, all, for the breakage, and thanks for handling it while I was away!!
It looks like I never paid attention to the not-LLVM_ENABLE_THREADS path.

Revision Contents

Path

Size

llvm/

include/

llvm/

Support/

ThreadPool.h

26 lines

lib/

Support/

ThreadPool.cpp

85 lines

Diff 391531

llvm/include/llvm/Support/ThreadPool.h

Show All 34 Lines
/// The pool keeps a vector of threads alive, waiting on a condition variable		/// The pool keeps a vector of threads alive, waiting on a condition variable
/// for some work to become available.		/// for some work to become available.
class ThreadPool {		class ThreadPool {
public:		public:
/// Construct a pool using the hardware strategy \p S for mapping hardware		/// Construct a pool using the hardware strategy \p S for mapping hardware
/// execution resources (threads, cores, CPUs)		/// execution resources (threads, cores, CPUs)
/// Defaults to using the maximum execution resources in the system, but		/// Defaults to using the maximum execution resources in the system, but
/// accounting for the affinity mask.		/// accounting for the affinity mask.
ThreadPool(ThreadPoolStrategy S = hardware_concurrency());		ThreadPool(ThreadPoolStrategy S = hardware_concurrency())
		: Strategy(S), MaxThreadCount(S.compute_thread_count()) {}

/// Blocking destructor: the pool will wait for all the threads to complete.		/// Blocking destructor: the pool will wait for all the threads to complete.
~ThreadPool();		~ThreadPool();

/// Asynchronous submission of a task to the pool. The returned future can be		/// Asynchronous submission of a task to the pool. The returned future can be
/// used to wait for the task to finish and is non-blocking on destruction.		/// used to wait for the task to finish and is non-blocking on destruction.
template <typename Function, typename... Args>		template <typename Function, typename... Args>
inline auto async(Function &&F, Args &&...ArgList) {		inline auto async(Function &&F, Args &&...ArgList) {
auto Task =		auto Task =
std::bind(std::forward<Function>(F), std::forward<Args>(ArgList)...);		std::bind(std::forward<Function>(F), std::forward<Args>(ArgList)...);
return async(std::move(Task));		return async(std::move(Task));
}		}

/// Asynchronous submission of a task to the pool. The returned future can be		/// Asynchronous submission of a task to the pool. The returned future can be
/// used to wait for the task to finish and is non-blocking on destruction.		/// used to wait for the task to finish and is non-blocking on destruction.
template <typename Func>		template <typename Func>
auto async(Func &&F) -> std::shared_future<decltype(F())> {		auto async(Func &&F) -> std::shared_future<decltype(F())> {
return asyncImpl(std::function<decltype(F())()>(std::forward<Func>(F)));		return asyncImpl(std::function<decltype(F())()>(std::forward<Func>(F)));
}		}

/// Blocking wait for all the threads to complete and the queue to be empty.		/// Blocking wait for all the threads to complete and the queue to be empty.
/// It is an error to try to add new tasks while blocking on this call.		/// It is an error to try to add new tasks while blocking on this call.
void wait();		void wait();

unsigned getThreadCount() const { return ThreadCount; }		// Returns maximum number of worker threads in the pool (not the current
		// number of threads).
		unsigned getThreadCount() const { return MaxThreadCount; }
		mehdi_aminiUnsubmitted Not Done Reply Inline Actions I'm not fond of keeping the API name as-is with a new semantics. What about removing it and using two APIs instead: getMaxThreadCount() getAvailableThreadCount() => return Threads.size() mehdi_amini: I'm not fond of keeping the API name as-is with a new semantics. What about removing it and…
		BenoitAuthorUnsubmitted Done Reply Inline Actions There are 3 call sites in one Clang file, and 2 call sites in 2 MLIR files. That's a little more than I feel save updating all in one shot. Are you OK to keep the current name for now, and defer to a follow-up? At least the behavior isn't changing, these existing users are still going to get effectively the same result value. For the new methods: I agree that the current `getThreadCount` should get renamed `getMaxThreadCount`. I don't feel comfortable with `getAvailableThreadCount() => return Threads.size()` because that would conflict with the meaning of "available" in the existing member `AvailableThreads`. How about `getCurrentThreadCount` ? Benoit: There are 3 call sites in one Clang file, and 2 call sites in 2 MLIR files. That's a little…
		mehdi_aminiUnsubmitted Not Done Reply Inline Actions getCurrentThreadCount() LGTM mehdi_amini: getCurrentThreadCount() LGTM

		BenoitAuthorUnsubmitted Done Reply Inline Actions Please decide what we want here? The immediate problem that led me to add +1 here was that when Threads was empty, I was now returning 0, and the caller was clearly expecting at least 1: At mlir/include/mlir/IR/Threading.h:75 size_t numActions = std::min(numElements, threadPool.getThreadCount()); SmallVector<std::shared_future<void>> threadFutures; threadFutures.reserve(numActions - 1); for (unsigned i = 1; i < numActions; ++i) threadFutures.emplace_back(threadPool.async(processFn)); Benoit: Please decide what we want here? The immediate problem that led me to add +1 here was that when…
		BenoitAuthorUnsubmitted Done Reply Inline Actions Self-replying: this call site really shows that `getThreadCount` was expected to return the potential max number of threads, which incidentally was equal to the current number of threads before this diff, but that's what is changing here. To avoid breaking existing users, I reverted `getThreadCount` to this behavior (I believe that the earlier state of this diff would have effectively kept the number of threads to 1. Now I've checked in GDB that we do create a dozen threads for a simple lit test). Benoit: Self-replying: this call site really shows that `getThreadCount` was expected to return the…
/// Returns true if the current thread is a worker thread of this thread pool.		/// Returns true if the current thread is a worker thread of this thread pool.
bool isWorkerThread() const;		bool isWorkerThread() const;

private:		private:
/// Helpers to create a promise and a callable wrapper of \p Task that sets		/// Helpers to create a promise and a callable wrapper of \p Task that sets
/// the result of the promise. Returns the callable and a future to access the		/// the result of the promise. Returns the callable and a future to access the
/// result.		/// result.
template <typename ResTy>		template <typename ResTy>
Show All 32 Lines	#if LLVM_ENABLE_THREADS

{		{
// Lock the queue and push the new task		// Lock the queue and push the new task
std::unique_lock<std::mutex> LockGuard(QueueLock);		std::unique_lock<std::mutex> LockGuard(QueueLock);

// Don't allow enqueueing after disabling the pool		// Don't allow enqueueing after disabling the pool
assert(EnableFlag && "Queuing a thread during ThreadPool destruction");		assert(EnableFlag && "Queuing a thread during ThreadPool destruction");
Tasks.push(std::move(R.first));		Tasks.push(std::move(R.first));
		grow();
}		}
QueueCondition.notify_one();		QueueCondition.notify_one();
return R.second.share();		return R.second.share();

#else // LLVM_ENABLE_THREADS Disabled		#else // LLVM_ENABLE_THREADS Disabled

// Get a Future with launch::deferred execution using std::async		// Get a Future with launch::deferred execution using std::async
auto Future = std::async(std::launch::deferred, std::move(Task)).share();		auto Future = std::async(std::launch::deferred, std::move(Task)).share();
// Wrap the future so that both ThreadPool::wait() can operate and the		// Wrap the future so that both ThreadPool::wait() can operate and the
// returned future can be sync'ed on.		// returned future can be sync'ed on.
Tasks.push([Future]() { Future.get(); });		Tasks.push([Future]() { Future.get(); });
return Future;		return Future;
#endif		#endif
}		}

		// Maybe create a new thread and add it to Threads.
		//
		// Requirements:
		// * this->QueueLock should be owned by the calling thread prior to
		// calling this function. It will neither lock it nor unlock it.
		// Calling this function without owning QueueLock would result in data
		// races as this function reads Tasks and ActiveThreads.
		// * this->Tasks should be populated with any pending tasks. This function
		// uses Tasks.size() to determine whether it needs to create a new thread.
		// * this->ActiveThreads should be up to date as it is also used to
		// determine whether to create a new thread.
		void grow();

		mehdi_aminiUnsubmitted Done Reply Inline Actions Nit: it won't have any impact right now, but can you guard this declaration with LLVM_ENABLE_THREADS? That would lead to compile time failures instead of link-time failures if someone changes something incorrectly in the future. mehdi_amini: Nit: it won't have any impact right now, but can you guard this declaration with…
/// Threads in flight		/// Threads in flight
std::vector<llvm::thread> Threads;		std::vector<llvm::thread> Threads;

/// Tasks waiting for execution in the pool.		/// Tasks waiting for execution in the pool.
std::queue<std::function<void()>> Tasks;		std::queue<std::function<void()>> Tasks;

/// Locking and signaling for accessing the Tasks queue.		/// Locking and signaling for accessing the Tasks queue.
std::mutex QueueLock;		std::mutex QueueLock;
std::condition_variable QueueCondition;		std::condition_variable QueueCondition;

/// Signaling for job completion		/// Signaling for job completion
std::condition_variable CompletionCondition;		std::condition_variable CompletionCondition;

/// Keep track of the number of thread actually busy		/// Keep track of the number of thread actually busy
unsigned ActiveThreads = 0;		unsigned ActiveThreads = 0;

#if LLVM_ENABLE_THREADS // avoids warning for unused variable		#if LLVM_ENABLE_THREADS // avoids warning for unused variable
/// Signal for the destruction of the pool, asking thread to exit.		/// Signal for the destruction of the pool, asking thread to exit.
bool EnableFlag = true;		bool EnableFlag = true;
#endif		#endif

unsigned ThreadCount;		const ThreadPoolStrategy Strategy;

		/// Maximum number of threads to potentially grow this pool to.
		const unsigned MaxThreadCount;
};		};
}		}

#endif // LLVM_SUPPORT_THREADPOOL_H		#endif // LLVM_SUPPORT_THREADPOOL_H

llvm/lib/Support/ThreadPool.cpp

	Show All 14 Lines
	#include "llvm/Config/llvm-config.h"			#include "llvm/Config/llvm-config.h"
	#include "llvm/Support/Threading.h"			#include "llvm/Support/Threading.h"
	#include "llvm/Support/raw_ostream.h"			#include "llvm/Support/raw_ostream.h"

	using namespace llvm;			using namespace llvm;

	#if LLVM_ENABLE_THREADS			#if LLVM_ENABLE_THREADS

	ThreadPool::ThreadPool(ThreadPoolStrategy S)			void ThreadPool::grow() {
	: ThreadCount(S.compute_thread_count()) {			if (Threads.size() >= MaxThreadCount) {
				BenoitAuthorUnsubmitted Done Reply Inline Actions Here actually I'm really not sure: what should be the max number of threads in `Threads`: should it be `Strategy.compute_thread_count()` or should it be that minus one to account for the main thread? it looks like the existing code was doing the former but (as actually confirmed by GDB) that meant it had one more thread running than the detected hardware concurrency. Which could be fine if we expect that the main thread would be idle most of the time? What was the intent here? Benoit: Here actually I'm really not sure: what should be the max number of threads in `Threads`…
				BenoitAuthorUnsubmitted Done Reply Inline Actions Got it, nevermind, updated this diff (see reply to the other comment thread). Benoit: Got it, nevermind, updated this diff (see reply to the other comment thread).
	// Create ThreadCount threads that will loop forever, wait on QueueCondition			// Already hit the max thread pool size.
	// for tasks to be queued or the Pool to be destroyed.			return;
	Threads.reserve(ThreadCount);			}
				mehdi_aminiUnsubmitted Not Done Reply Inline Actions Nit: remove trivial braces mehdi_amini: Nit: remove trivial braces
	for (unsigned ThreadID = 0; ThreadID < ThreadCount; ++ThreadID) {			if (ActiveThreads + Tasks.size() <= Threads.size()) {
	Threads.emplace_back([S, ThreadID, this] {			// We have enough threads for now.
	S.apply_thread_strategy(ThreadID);			return;
				}
				int ThreadID = Threads.size();
				Threads.emplace_back([this, ThreadID] {
				Strategy.apply_thread_strategy(ThreadID);
	while (true) {			while (true) {
	std::function<void()> Task;			std::function<void()> Task;
	{			{
	std::unique_lock<std::mutex> LockGuard(QueueLock);			std::unique_lock<std::mutex> LockGuard(QueueLock);
	// Wait for tasks to be pushed in the queue			// Wait for tasks to be pushed in the queue
	QueueCondition.wait(LockGuard,			QueueCondition.wait(LockGuard,
	[&] { return !EnableFlag \|\| !Tasks.empty(); });			[&] { return !EnableFlag \|\| !Tasks.empty(); });
	// Exit condition			// Exit condition
	if (!EnableFlag && Tasks.empty())			if (!EnableFlag && Tasks.empty())
	return;			return;
	// Yeah, we have a task, grab it and release the lock on the queue			// Yeah, we have a task, grab it and release the lock on the queue

	// We first need to signal that we are active before popping the queue			// We first need to signal that we are active before popping the queue
	// in order for wait() to properly detect that even if the queue is			// in order for wait() to properly detect that even if the queue is
	// empty, there is still a task in flight.			// empty, there is still a task in flight.
	++ActiveThreads;			++ActiveThreads;
	Task = std::move(Tasks.front());			Task = std::move(Tasks.front());
	Tasks.pop();			Tasks.pop();
	}			}
	// Run the task we just grabbed			// Run the task we just grabbed
	Task();			Task();

	bool Notify;			bool Notify;
	{			{
	// Adjust `ActiveThreads`, in case someone waits on ThreadPool::wait()			// Adjust `ActiveThreads`, in case someone waits on ThreadPool::wait()
	std::lock_guard<std::mutex> LockGuard(QueueLock);			std::lock_guard<std::mutex> LockGuard(QueueLock);
	--ActiveThreads;			--ActiveThreads;
	Notify = workCompletedUnlocked();			Notify = workCompletedUnlocked();
	}			}
	// Notify task completion if this is the last active thread, in case			// Notify task completion if this is the last active thread, in case
	// someone waits on ThreadPool::wait().			// someone waits on ThreadPool::wait().
	if (Notify)			if (Notify)
	CompletionCondition.notify_all();			CompletionCondition.notify_all();
	}			}
	});			});
	}			}
	}

	void ThreadPool::wait() {			void ThreadPool::wait() {
	// Wait for all threads to complete and the queue to be empty			// Wait for all threads to complete and the queue to be empty
	std::unique_lock<std::mutex> LockGuard(QueueLock);			std::unique_lock<std::mutex> LockGuard(QueueLock);
	CompletionCondition.wait(LockGuard, [&] { return workCompletedUnlocked(); });			CompletionCondition.wait(LockGuard, [&] { return workCompletedUnlocked(); });
	}			}

	bool ThreadPool::isWorkerThread() const {			bool ThreadPool::isWorkerThread() const {
	▲ Show 20 Lines • Show All 41 Lines • Show Last 20 Lines