This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clangd/global-symbol-builder/
-
global-symbol-builder/
4
GlobalSymbolBuilderMain.cpp

Differential D45478

[clangd] Merge symbols in global-sym-builder on the go
AbandonedPublic

Authored by ilya-biryukov on Apr 10 2018, 2:12 AM.

Download Raw Diff

Details

Reviewers

hokein
sammccall
klimek

Summary

This avoids storing intermediate symbols in memory, most of which are
duplicates.
The resulting .yaml file is ~120MB, while intermediate symbols takes
more than 20GB.

Diff Detail

Repository

rCTE Clang Tools Extra

Build Status

Buildable 16927
Build 16927: arc lint + arc unit

Event Timeline

ilya-biryukov created this revision.Apr 10 2018, 2:12 AM

Herald added subscribers: MaskRay, ioeric, jkorous-apple. · View Herald TranscriptApr 10 2018, 2:12 AM

Harbormaster completed remote builds in B16927: Diff 141810.Apr 10 2018, 2:12 AM

Thanks for digging it out!

In upstream, we use InMemoryToolResults which saves all duplicated "std::string"s into the memory, I think we could optimize InMemoryToolResults by using Arena to keep the memory low, I will try it to see whether it can reduce the memory. A bonus point is that we don't need to change any client code.

Is this patch still relevant after haojian's string deduplication?

clangd/global-symbol-builder/GlobalSymbolBuilderMain.cpp
53	Seems reasonably likely we would actually have contention here? merging per-thread (combiner) and then globally at the end (reducer) might be the way to go (might be significantly faster). But not sure how big the impact is.

In D45478#1064027, @sammccall wrote:

Is this patch still relevant after haojian's string deduplication?

Apparently it does. It has an advantage of distributing the work more evenly between the program runs.
Currently the tool reports progress based on the number of files it parsed. But then it takes a lot of time to actually merge all the symbols at the end and the progress is not reported during that time.

The perfect behavior would be to remove duplicates, not just intern them :-) That shouldn't be too hard and it probably even makes sense to have a ToolExecutor that does that.

clangd/global-symbol-builder/GlobalSymbolBuilderMain.cpp
53	I haven't seen any noticeable performance degradation after this patch. There should be contention, but it doesn't seem to matter much, as parsing still takes way more time. Still makes sense to rewrite the code in order to avoid contention, will do that.

Herald added a subscriber: jkorous. · View Herald TranscriptApr 19 2018, 5:14 AM

In D45478#1071983, @ilya-biryukov wrote:

In D45478#1064027, @sammccall wrote:

Is this patch still relevant after haojian's string deduplication?

Apparently it does. It has an advantage of distributing the work more evenly between the program runs.
Currently the tool reports progress based on the number of files it parsed. But then it takes a lot of time to actually merge all the symbols at the end and the progress is not reported during that time.

The perfect behavior would be to remove duplicates, not just intern them :-) That shouldn't be too hard and it probably even makes sense to have a ToolExecutor that does that.

OK, can we sync up offline with @hokein and decide what to do here? I don't think it makes sense to have both these solutions to the same issue, but I don't have a strong opinion on exactly what we do.
(Other than clearly the right solution is to have a proper MR framework <half-trolling>)

ilya-biryukov added inline comments.Apr 19 2018, 6:00 AM

clangd/global-symbol-builder/GlobalSymbolBuilderMain.cpp
53	Looking at the traces, the threads spend ~6% of `mergeSymbols` time between `Lock(Mut)` and `for(const Sym...`. Improving it might be a good idea, but from my point of view it's not too bad to commit it as is. Doing it properly is a little complicated in the current setup, since we don't have knobs to create per-thread structs and merge them later.

I think this implementation will have problems in a "real" multi-machine MR framework. The lifetime of the Merger is the whole program, with output only coming at the end.
With N workers, each will store >1/N of the symbols (more because of overlap), and the streaming nature of a MR is important to its scaling.

As you know, we rely on this code in such a framework :-)
I would be much happier if we could express these constraints/patterns in the open-source code, I think the most promising approach would be to define a more complete clang mapreduce API in Tooling.
But that's way out of scope in this patch. I'm not really sure what to suggest here, happy to talk through it more tomorrow.

clangd/global-symbol-builder/GlobalSymbolBuilderMain.cpp
56	consider calling this `consume`, and the member in SymbolIndexActionFactory etc `Sink`, to emphasize the data flow

This landed as a different change.

Herald added subscribers: kadircet, arphaman. · View Herald TranscriptNov 26 2018, 7:24 AM

Revision Contents

Path

Size

clangd/

global-symbol-builder/

GlobalSymbolBuilderMain.cpp

87 lines

Diff 141810

clangd/global-symbol-builder/GlobalSymbolBuilderMain.cpp

Show All 25 Lines
#include "clang/Tooling/Execution.h"		#include "clang/Tooling/Execution.h"
#include "clang/Tooling/Tooling.h"		#include "clang/Tooling/Tooling.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"
#include "llvm/Support/Path.h"		#include "llvm/Support/Path.h"
#include "llvm/Support/Signals.h"		#include "llvm/Support/Signals.h"
#include "llvm/Support/ThreadPool.h"		#include "llvm/Support/ThreadPool.h"
#include "llvm/Support/YAMLTraits.h"		#include "llvm/Support/YAMLTraits.h"
		#include <mutex>

using namespace llvm;		using namespace llvm;
using namespace clang::tooling;		using namespace clang::tooling;
using clang::clangd::SymbolSlab;		using clang::clangd::SymbolSlab;

namespace clang {		namespace clang {
namespace clangd {		namespace clangd {
namespace {		namespace {

static llvm::cl::opt<std::string> AssumedHeaderDir(		static llvm::cl::opt<std::string> AssumedHeaderDir(
"assume-header-dir",		"assume-header-dir",
llvm::cl::desc("The index includes header that a symbol is defined in. "		llvm::cl::desc("The index includes header that a symbol is defined in. "
"If the absolute path cannot be determined (e.g. an "		"If the absolute path cannot be determined (e.g. an "
"in-memory VFS) then the relative path is resolved against "		"in-memory VFS) then the relative path is resolved against "
"this directory, which must be absolute. If this flag is "		"this directory, which must be absolute. If this flag is "
"not given, such headers will have relative paths."),		"not given, such headers will have relative paths."),
llvm::cl::init(""));		llvm::cl::init(""));

		/// Combines occurrences of the same symbols across translation units.
		sammccallUnsubmitted Not Done Reply Inline Actions Seems reasonably likely we would actually have contention here? merging per-thread (combiner) and then globally at the end (reducer) might be the way to go (might be significantly faster). But not sure how big the impact is. sammccall: Seems reasonably likely we would actually have contention here? merging per-thread (combiner)…
		ilya-biryukovAuthorUnsubmitted Not Done Reply Inline Actions I haven't seen any noticeable performance degradation after this patch. There should be contention, but it doesn't seem to matter much, as parsing still takes way more time. Still makes sense to rewrite the code in order to avoid contention, will do that. ilya-biryukov: I haven't seen any noticeable performance degradation after this patch. There should be…
		ilya-biryukovAuthorUnsubmitted Not Done Reply Inline Actions Looking at the traces, the threads spend ~6% of `mergeSymbols` time between `Lock(Mut)` and `for(const Sym...`. Improving it might be a good idea, but from my point of view it's not too bad to commit it as is. Doing it properly is a little complicated in the current setup, since we don't have knobs to create per-thread structs and merge them later. ilya-biryukov: Looking at the traces, the threads spend ~6% of `mergeSymbols` time between `Lock(Mut)` and…
		class SymbolMerger {
		public:
		void mergeSymbols(const SymbolSlab &Symbols) {
		sammccallUnsubmitted Not Done Reply Inline Actions consider calling this `consume`, and the member in SymbolIndexActionFactory etc `Sink`, to emphasize the data flow sammccall: consider calling this `consume`, and the member in SymbolIndexActionFactory etc `Sink`, to…
		std::lock_guard<std::mutex> Lock(Mut);
		for (const Symbol &Sym : Symbols) {
		if (const auto *Existing = UniqueSymbols.find(Sym.ID)) {
		Symbol::Details Scratch;
		UniqueSymbols.insert(mergeSymbol(*Existing, Sym, &Scratch));
		} else {
		UniqueSymbols.insert(Sym);
		}
		}
		}

		SymbolSlab build() && {
		std::lock_guard<std::mutex> Lock(Mut);
		return std::move(UniqueSymbols).build();
		}

		private:
		std::mutex Mut;
		SymbolSlab::Builder UniqueSymbols;
		};

class SymbolIndexActionFactory : public tooling::FrontendActionFactory {		class SymbolIndexActionFactory : public tooling::FrontendActionFactory {
public:		public:
SymbolIndexActionFactory(tooling::ExecutionContext *Ctx) : Ctx(Ctx) {}		SymbolIndexActionFactory(SymbolMerger &Merger) : Merger(Merger) {}

clang::FrontendAction *create() override {		clang::FrontendAction *create() override {
// Wraps the index action and reports collected symbols to the execution		// Wraps the index action and reports collected symbols to the execution
// context at the end of each translation unit.		// context at the end of each translation unit.
class WrappedIndexAction : public WrapperFrontendAction {		class WrappedIndexAction : public WrapperFrontendAction {
public:		public:
WrappedIndexAction(std::shared_ptr<SymbolCollector> C,		WrappedIndexAction(SymbolMerger &Merger,
		std::shared_ptr<SymbolCollector> C,
std::unique_ptr<CanonicalIncludes> Includes,		std::unique_ptr<CanonicalIncludes> Includes,
const index::IndexingOptions &Opts,		const index::IndexingOptions &Opts)
tooling::ExecutionContext *Ctx)
: WrapperFrontendAction(		: WrapperFrontendAction(
index::createIndexingAction(C, Opts, nullptr)),		index::createIndexingAction(C, Opts, nullptr)),
Ctx(Ctx), Collector(C), Includes(std::move(Includes)),		Merger(Merger), Collector(C), Includes(std::move(Includes)),
PragmaHandler(collectIWYUHeaderMaps(this->Includes.get())) {}		PragmaHandler(collectIWYUHeaderMaps(this->Includes.get())) {}

std::unique_ptr<ASTConsumer>		std::unique_ptr<ASTConsumer>
CreateASTConsumer(CompilerInstance &CI, StringRef InFile) override {		CreateASTConsumer(CompilerInstance &CI, StringRef InFile) override {
CI.getPreprocessor().addCommentHandler(PragmaHandler.get());		CI.getPreprocessor().addCommentHandler(PragmaHandler.get());
return WrapperFrontendAction::CreateASTConsumer(CI, InFile);		return WrapperFrontendAction::CreateASTConsumer(CI, InFile);
}		}

void EndSourceFileAction() override {		void EndSourceFileAction() override {
WrapperFrontendAction::EndSourceFileAction();		WrapperFrontendAction::EndSourceFileAction();

auto Symbols = Collector->takeSymbols();		Merger.mergeSymbols(Collector->takeSymbols());
for (const auto &Sym : Symbols) {
std::string IDStr;
llvm::raw_string_ostream OS(IDStr);
OS << Sym.ID;
Ctx->reportResult(OS.str(), SymbolToYAML(Sym));
}
}		}

private:		private:
tooling::ExecutionContext *Ctx;		SymbolMerger &Merger;
std::shared_ptr<SymbolCollector> Collector;		std::shared_ptr<SymbolCollector> Collector;
std::unique_ptr<CanonicalIncludes> Includes;		std::unique_ptr<CanonicalIncludes> Includes;
std::unique_ptr<CommentHandler> PragmaHandler;		std::unique_ptr<CommentHandler> PragmaHandler;
};		};

index::IndexingOptions IndexOpts;		index::IndexingOptions IndexOpts;
IndexOpts.SystemSymbolFilter =		IndexOpts.SystemSymbolFilter =
index::IndexingOptions::SystemSymbolFilterKind::All;		index::IndexingOptions::SystemSymbolFilterKind::All;
IndexOpts.IndexFunctionLocals = false;		IndexOpts.IndexFunctionLocals = false;
auto CollectorOpts = SymbolCollector::Options();		auto CollectorOpts = SymbolCollector::Options();
CollectorOpts.FallbackDir = AssumedHeaderDir;		CollectorOpts.FallbackDir = AssumedHeaderDir;
CollectorOpts.CollectIncludePath = true;		CollectorOpts.CollectIncludePath = true;
CollectorOpts.CountReferences = true;		CollectorOpts.CountReferences = true;
auto Includes = llvm::make_unique<CanonicalIncludes>();		auto Includes = llvm::make_unique<CanonicalIncludes>();
addSystemHeadersMapping(Includes.get());		addSystemHeadersMapping(Includes.get());
CollectorOpts.Includes = Includes.get();		CollectorOpts.Includes = Includes.get();
return new WrappedIndexAction(		return new WrappedIndexAction(
std::make_shared<SymbolCollector>(std::move(CollectorOpts)),		Merger, std::make_shared<SymbolCollector>(std::move(CollectorOpts)),
std::move(Includes), IndexOpts, Ctx);		std::move(Includes), IndexOpts);
}		}

tooling::ExecutionContext *Ctx;		SymbolMerger &Merger;
};		};

// Combine occurrences of the same symbol across translation units.
SymbolSlab mergeSymbols(tooling::ToolResults *Results) {
SymbolSlab::Builder UniqueSymbols;
llvm::BumpPtrAllocator Arena;
Symbol::Details Scratch;
Results->forEachResult([&](llvm::StringRef Key, llvm::StringRef Value) {
Arena.Reset();
llvm::yaml::Input Yin(Value, &Arena);
auto Sym = clang::clangd::SymbolFromYAML(Yin, Arena);
clang::clangd::SymbolID ID;
Key >> ID;
if (const auto *Existing = UniqueSymbols.find(ID))
UniqueSymbols.insert(mergeSymbol(*Existing, Sym, &Scratch));
else
UniqueSymbols.insert(Sym);
});
return std::move(UniqueSymbols).build();
}

} // namespace		} // namespace
} // namespace clangd		} // namespace clangd
} // namespace clang		} // namespace clang

int main(int argc, const char **argv) {		int main(int argc, const char **argv) {
llvm::sys::PrintStackTraceOnErrorSignal(argv[0]);		llvm::sys::PrintStackTraceOnErrorSignal(argv[0]);

const char* Overview =		const char* Overview =
Show All 9 Lines	int main(int argc, const char **argv) {
}		}

if (!clang::clangd::AssumedHeaderDir.empty() &&		if (!clang::clangd::AssumedHeaderDir.empty() &&
!llvm::sys::path::is_absolute(clang::clangd::AssumedHeaderDir)) {		!llvm::sys::path::is_absolute(clang::clangd::AssumedHeaderDir)) {
llvm::errs() << "--assume-header-dir must be an absolute path.\n";		llvm::errs() << "--assume-header-dir must be an absolute path.\n";
return 1;		return 1;
}		}

// Map phase: emit symbols found in each translation unit.		clang::clangd::SymbolMerger Merger;
		// Emit and merge symbols found in each translation unit.
auto Err = Executor->get()->execute(		auto Err = Executor->get()->execute(
llvm::make_unique<clang::clangd::SymbolIndexActionFactory>(		llvm::make_unique<clang::clangd::SymbolIndexActionFactory>(Merger));
Executor->get()->getExecutionContext()));		if (Err)
if (Err) {
llvm::errs() << llvm::toString(std::move(Err)) << "\n";		llvm::errs() << llvm::toString(std::move(Err)) << "\n";
}		// Get resulting symbols.
		auto UniqueSymbols = std::move(Merger).build();
// Reduce phase: combine symbols using the ID as a key.
auto UniqueSymbols =
clang::clangd::mergeSymbols(Executor->get()->getToolResults());

// Output phase: emit YAML for result symbols.		// Output phase: emit YAML for result symbols.
SymbolsToYAML(UniqueSymbols, llvm::outs());		SymbolsToYAML(UniqueSymbols, llvm::outs());
return 0;		return 0;
}		}