Diff 396114

mlir/lib/Transforms/Inliner.cpp

Show First 20 Lines • Show All 657 Lines • ▼ Show 20 Lines

if (!region->getParentOp()->hasTrait<OpTrait::IsIsolatedFromAbove>())

continue;

nodesToVisit.push_back(node);

}

if (nodesToVisit.empty())

return success();

// Optimize each of the nodes within the SCC in parallel.

if (failed(optimizeSCCAsync(nodesToVisit, context)))

return failure();

// Recompute the uses held by each of the nodes.

for (CallGraphNode *node : nodesToVisit)

useList.recomputeUses(node, cg);

return success();

}

LogicalResult

InlinerPass::optimizeSCCAsync(MutableArrayRef<CallGraphNode *> nodesToVisit,

MLIRContext *ctx) {

// Ensure that there are enough pipeline maps for the optimizer to run in

// parallel. Note: The number of pass managers here needs to remain constant

// to prevent issues with pass instrumentations that rely on having the same

// pass manager for the main thread.

mehdi_aminiUnsubmitted

Not Done

Patch LGTM, but I don't quite get this comment right now.

mehdi_amini: Patch LGTM, but I don't quite get this comment right now.

mehdi_aminiUnsubmitted

Not Done

(I meant that I don't get the existing comment in the codebase)

mehdi_amini: (I meant that I don't get the existing comment in the codebase)

stellaraccidentAuthorUnsubmitted

Done

How about:

"We must maintain a fixed pool of pass managers which is at least as large as the maximum parallelism of the failableParallelForEach below."

I don't understand the instrumentation/main thread connection myself.

stellaraccident: How about: "We must maintain a fixed pool of pass managers which is at least as large as the…

stellaraccidentAuthorUnsubmitted

Done

I was trying to indicate that there is an action at a distance constraint, but I couldn't come up with a better way to say it so just removed.

stellaraccident: I was trying to indicate that there is an action at a distance constraint, but I couldn't come…

stellaraccidentAuthorUnsubmitted

Done

I went ahead and kept the Note. I don't know quite what it is trying to convey but seems important to understand in a followup.

stellaraccident: I went ahead and kept the Note. I don't know quite what it is trying to convey but seems…

mehdi_aminiUnsubmitted

Not Done

Yeah it's the part about "The number of pass managers here needs to remain constant" that puzzles me, since we resize it right below.

mehdi_amini: Yeah it's the part about "The number of pass managers here needs to remain constant" that…

size_t numThreads = llvm::hardware_concurrency().compute_thread_count();

// Note that this lining up is dependent on failableParallelForEach using

// the context thread pool under the covers (the thread count must be

// consistent).

llvm::ThreadPool &threadPool = ctx->getThreadPool();

size_t numThreads = threadPool.getThreadCount();

if (opPipelines.size() < numThreads) {

// Reserve before resizing so that we can use a reference to the first

// element.

opPipelines.reserve(numThreads);

opPipelines.resize(numThreads, opPipelines.front());

}

// Ensure an analysis manager has been constructed for each of the nodes.

// This prevents thread races when running the nested pipelines.

for (CallGraphNode *node : nodesToVisit)

getAnalysisManager().nest(node->getCallableRegion()->getParentOp());

// An atomic failure variable for the async executors.

std::vector<std::atomic<bool>> activePMs(opPipelines.size());

std::fill(activePMs.begin(), activePMs.end(), false);

return failableParallelForEach(ctx, nodesToVisit, [&](CallGraphNode *node) {

// Find a pass manager for this operation.

auto it = llvm::find_if(activePMs, [](std::atomic<bool> &isActive) {

bool expectedInactive = false;

return isActive.compare_exchange_strong(expectedInactive, true);

});

assert(it != activePMs.end() &&

"could not find active pass manager for thread");

mehdi_aminiUnsubmitted

Done

assert(it != activePMs.end() &&

- "could not find active pass manager for thread");

+ "could not find inactive pass manager for thread");

unsigned pmIndex = it - activePMs.begin();

mehdi_amini:

mehdi_aminiUnsubmitted

Not Done

(did you miss this one?)

mehdi_amini: (did you miss this one?)

stellaraccidentAuthorUnsubmitted

Done

It shows as changed in my snapshot.

stellaraccident: It shows as changed in my snapshot.

unsigned pmIndex = it - activePMs.begin();

// Optimize this callable node.

LogicalResult result = optimizeCallable(node, opPipelines[pmIndex]);

// Reset the active bit for this pass manager.

activePMs[pmIndex].store(false);

return result;

▲ Show 20 Lines • Show All 71 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[mlir] Use thread-pool's notion of thread count instead of requerying system.
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 396114

mlir/lib/Transforms/Inliner.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[mlir] Use thread-pool's notion of thread count instead of requerying system.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 396114

mlir/lib/Transforms/Inliner.cpp

[mlir] Use thread-pool's notion of thread count instead of requerying system.
ClosedPublic