Details

Reviewers

friss
JDevlieghere

Summary

Generating the dSYM file is too slow when the project is big.
Debugging shows that MainBinarySymbolAddresses.size() is 2,643k.

I replaced the linear lookup with a map to speed it up.
In my project it now takes 2.5 minutes instead of the 30 minutes it took before.
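
The change described in the summary can be sketched roughly as follows. This is a minimal illustration that uses standard-library containers as stand-ins for the LLVM types in the patch; `NameToAddr`, `AddrToNames`, `namesForLinear`, and `buildReverseIndex` are hypothetical names chosen for the sketch, not the dsymutil code.

```cpp
#include <cstdint>
#include <string>
#include <unordered_map>
#include <vector>

// Stand-in for StringMap<uint64_t> MainBinarySymbolAddresses (assumption).
using NameToAddr = std::unordered_map<std::string, uint64_t>;
// Stand-in for the address-to-names map the patch adds (assumption).
using AddrToNames = std::unordered_map<uint64_t, std::vector<std::string>>;

// Old behaviour: every query walks all symbols, so each lookup is O(N).
std::vector<std::string> namesForLinear(const NameToAddr &Symbols,
                                        uint64_t Addr) {
  std::vector<std::string> Names;
  for (const auto &Entry : Symbols)
    if (Entry.second == Addr)
      Names.push_back(Entry.first);
  return Names;
}

// New behaviour: a single O(N) pass builds the reverse index; afterwards each
// query is a hash lookup.
AddrToNames buildReverseIndex(const NameToAddr &Symbols) {
  AddrToNames Index;
  for (const auto &Entry : Symbols)
    Index[Entry.second].push_back(Entry.first);
  return Index;
}
```

With millions of symbols and one query per debug-map entry, the linear version does work proportional to (queries × symbols), which is presumably where the reported 30 minutes went.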

Diff Detail

Event Timeline

C-_-fan created this revision. Mar 4 2022, 9:07 AM

Herald added a project: Restricted Project. Mar 4 2022, 9:07 AM

Herald added subscribers: dexonsmith, hiraditya.

C-_-fan requested review of this revision. Mar 4 2022, 9:07 AM

Herald added a project: Restricted Project. Mar 4 2022, 9:07 AM

Herald added a subscriber: llvm-commits.

Harbormaster completed remote builds in B152612: Diff 413031. Mar 4 2022, 9:39 AM

This only becomes a problem when you have a huge number of public symbols. I bet the majority of these symbols are weak exports from linkonce_odr functions. Could you change your project to use something like -fvisibility=hidden to hide symbols by default? Doing that has the added benefit of saving the dynamic loader a bunch of work and improving launch time.
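
For readers unfamiliar with the flag, here is a small sketch of what "hidden by default" looks like in source. The macro `MYLIB_EXPORT` and the two function names are hypothetical; the `visibility` attribute and `-fvisibility=hidden` are the standard GCC/Clang spellings.

```cpp
// Build with: clang++ -fvisibility=hidden ...
// MYLIB_EXPORT is a hypothetical macro name used only in this sketch.
#if defined(__GNUC__) || defined(__clang__)
#define MYLIB_EXPORT __attribute__((visibility("default")))
#else
#define MYLIB_EXPORT
#endif

// An inline function like this is typically emitted with linkonce_odr
// linkage. With -fvisibility=hidden it is no longer an exported symbol.
inline int internalHelper(int X) { return X * 2; }

// Only explicitly annotated entry points stay public.
MYLIB_EXPORT int publicEntryPoint(int X) { return internalHelper(X) + 1; }
```

Whether this shrinks the symbol table enough to matter depends on how many of the project's 2,643k symbols are inline or template instantiations, which the thread does not state.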

I ran some benchmarks on clang, and indeed, this patch actually slows things down:

Baseline:

Time (mean ± σ):     86.446 s ±  0.274 s    [User: 124.955 s, System: 4.702 s]
Range (min … max):   86.075 s … 86.932 s    10 runs

With this patch:

Time (mean ± σ):     88.012 s ±  0.502 s    [User: 127.594 s, System: 5.156 s]
Range (min … max):   87.397 s … 89.001 s    10 runs

llvm/lib/Support/StringRef.cpp
199–209 ↗	(On Diff #413031)	I'm not sure I understand what this is trying to achieve. Changes to support should go in a separate patch with its own description/motivation and unit test coverage.

lebedev.ri added a subscriber: lebedev.ri. Mar 4 2022, 12:35 PM

lebedev.ri added inline comments.

llvm/tools/dsymutil/MachODebugMapParser.cpp
58	`std::unordered_map` has bad performance characteristics. Does it help if this is instead a SmallDenseMap of SmallVectors?

C-_-fan added inline comments. Mar 4 2022, 5:09 PM

llvm/lib/Support/StringRef.cpp
199–209 ↗	(On Diff #413031)	Thank you for your review and comment. In the profile results, `rfind` takes a large proportion of the time because `substr(I, N)` is slow. If `substr(I, N)` is equal to `Str`, then the character at `Data + I` must be equal to the first character of `Str`, so that character can be checked first.
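
The first-character check described in this reply can be sketched as below. This is an illustration on `std::string_view`, not the LLVM `StringRef::rfind` code or the abandoned patch; `rfindWithPrecheck` is a hypothetical name.

```cpp
#include <cstddef>
#include <string_view>

// Hypothetical helper: reverse substring search with a cheap first-character
// rejection before the full comparison.
std::size_t rfindWithPrecheck(std::string_view Haystack,
                              std::string_view Needle) {
  const std::size_t N = Needle.size();
  if (N > Haystack.size())
    return std::string_view::npos;
  if (N == 0)
    return Haystack.size(); // Same convention as std::string_view::rfind.
  // Walk candidate start positions from the right-most one down to 0.
  for (std::size_t I = Haystack.size() - N + 1; I != 0;) {
    --I;
    // A match at I must start with Needle's first character.
    if (Haystack[I] != Needle[0])
      continue;
    if (Haystack.substr(I, N) == Needle)
      return I;
  }
  return std::string_view::npos;
}
```

The pre-check only skips work; it does not change which index is returned.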
llvm/tools/dsymutil/MachODebugMapParser.cpp
58	Thank you for your review and comment. I'll update and test later.

I have abandoned the rfind change, and replaced std::unordered_map<uint64_t, std::vector<StringRef>> with SmallDenseMap<uint64_t, SmallVector<StringRef>>.

In my test it now takes more time because the rfind speedup was rolled back, but SmallDenseMap and SmallVector are faster than unordered_map and vector.

I will try -fvisibility=hidden next week.

Please review it again. If it still cannot be merged, how can I close this revision? Or will it close automatically after a timeout?

Thanks again.

Harbormaster completed remote builds in B152723: Diff 413209. Mar 5 2022, 5:48 AM

C-_-fan updated this revision to Diff 413359. Mar 7 2022, 12:26 AM

Harbormaster completed remote builds in B152850: Diff 413359. Mar 7 2022, 1:10 AM

Could I find a threshold to divide the code into two branches? One branch would process normal projects and the other would process very large projects.
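
One way the two-branch idea could look is sketched below. The class `SymbolNameLookup` and its `Threshold` parameter are hypothetical and use standard-library containers; the thread does not say what threshold value would be appropriate.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical wrapper, not part of dsymutil: picks a strategy once, based on
// how many symbols the main binary has.
class SymbolNameLookup {
public:
  using Symbol = std::pair<std::string, uint64_t>; // (name, address)

  SymbolNameLookup(std::vector<Symbol> Syms, std::size_t Threshold)
      : Symbols(std::move(Syms)), UseIndex(Symbols.size() > Threshold) {
    if (!UseIndex)
      return; // Small input: keep the plain scan and skip building the map.
    for (const Symbol &S : Symbols)
      Index[S.second].push_back(S.first);
  }

  bool usesIndex() const { return UseIndex; }

  std::vector<std::string> namesFor(uint64_t Addr) const {
    if (UseIndex) {
      auto It = Index.find(Addr);
      return It == Index.end() ? std::vector<std::string>() : It->second;
    }
    std::vector<std::string> Names;
    for (const Symbol &S : Symbols)
      if (S.second == Addr)
        Names.push_back(S.first);
    return Names;
  }

private:
  std::vector<Symbol> Symbols; // Declared first: UseIndex reads its size.
  bool UseIndex;
  std::unordered_map<uint64_t, std::vector<std::string>> Index;
};
```

Both branches return the same names in the same order, so the choice affects only cost, not results.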

C-_-fan updated this revision to Diff 413374. Mar 7 2022, 1:39 AM

Harbormaster completed remote builds in B152856: Diff 413374. Mar 7 2022, 2:24 AM

C-_-fan updated this revision to Diff 413701. Mar 7 2022, 9:40 PM

C-_-fan removed subscribers: lebedev.ri, hiraditya, dexonsmith.

Harbormaster completed remote builds in B153080: Diff 413701. Mar 7 2022, 10:28 PM

I just came in to comment on the API change (seems like ArrayRef would be better!) but I spotted a couple of other things.

(I don't have much context on the logic here in dsymutil, so these are just suggestions; probably @JDevlieghere or @lebedev.ri will need to LGTM; but thought I'd share what I noticed.)

llvm/tools/dsymutil/MachODebugMapParser.cpp
58	You could refine this a bit I think:

Firstly, storing `SmallVector<StringRef>` on the heap is a bit questionable, since `SmallVector` includes non-trivial small storage. Probably you want `SmallVector<StringRef, 1>` to customize the small storage for the common case (assuming 1 symbol per address is the common case?).

Secondly, `StringRef` is the size of two pointers, but you could use a `StringMapEntry<uint64_t> *` into `MainBinarySymbolAddresses` to halve the storage. You'd want to update:

    if (Extern)
      MainBinarySymbolAddresses[Name] = Addr;
    else
      MainBinarySymbolAddresses.try_emplace(Name, Addr);

to:

    auto I = MainBinarySymbolAddresses.try_emplace(Name, Addr);
    // Replace the na
    if (Extern && !I.second)
      I.first->second = Addr;
    if (Extern || I.second)
      MainBinaryAddressses2Names[Addr].push_back(&I.first);

Writing that suggested code, I have reproduced what I think is a logic bug, where the API now returns something different than before your patch. See my other inline comment.

Thirdly, I think you could probably do something complicated with a flat `SmallVector` and augment the data in `MainBinarySymbolAddresses` with two `uint32_t`s to indicate the range within it. More complicated, but probably less memory overall, faster for accessing, and maybe faster for building (depending on various characteristics). Here is the basic model:

- Change MainBinarySymbolAddresses to store a struct that has an `uint32_t` index; maybe called `SymbolAddressData`; index starts out as `-1U`.
- During the loop, build up `SmallVector<StringMapEntry<SymbolAddressData> *>` as you go, only pushing back on insertions.
- Use a stable sort to make the vector ordered by pointed-address, so aliases are adjacent to each other.
- Do a pass through the vector. For each address:
  - If there's only one symbol with that address, filter it out, and leave `SymbolAddressData` pointing at `-1U`.
  - Else, update each `SymbolAddressData` to point at the first index into the vector with that address.
- Update `getMainBinarySymbolNames()`:
  - If the index is `-1U`, return itself.
  - Else, iterate through the vector starting from the saved index.

Probably not worth doing, but if there are concerns about memory overhead, this would reduce the overhead significantly.

(Maybe a simpler alternative would be to skip caching the index in the StringMap. Instead, do a `std::lower_bound()` to find the first symbol with the same address. If that first symbol is `end()` or has the wrong address, then you have the `-1U` case. This might be faster overall, depending on the workload, especially if the common case is "one symbol per address", since the vector will end up being pretty small after filtering and the binary search will be trivially fast.)
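
The simpler `std::lower_bound()` alternative from the parenthetical could be sketched like this. It is a rough model using standard-library containers rather than `StringMapEntry` pointers; `Symbol`, `buildAliasTable`, and `aliasesAt` are hypothetical names, and an empty result stands for the `-1U` case in which the caller falls back to the single name it already has.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <string>
#include <utility>
#include <vector>

using Symbol = std::pair<uint64_t, std::string>; // (address, name)

// Sort by address (stable, so aliases keep their insertion order) and drop
// every address that has only one symbol.
std::vector<Symbol> buildAliasTable(std::vector<Symbol> Symbols) {
  std::stable_sort(
      Symbols.begin(), Symbols.end(),
      [](const Symbol &L, const Symbol &R) { return L.first < R.first; });
  std::vector<Symbol> Aliases;
  for (std::size_t Begin = 0; Begin < Symbols.size();) {
    std::size_t End = Begin + 1;
    while (End < Symbols.size() && Symbols[End].first == Symbols[Begin].first)
      ++End;
    if (End - Begin > 1) // Keep only addresses shared by several symbols.
      Aliases.insert(Aliases.end(), Symbols.begin() + Begin,
                     Symbols.begin() + End);
    Begin = End;
  }
  return Aliases;
}

// Binary search for the first entry at Addr, then collect the adjacent run.
// An empty result means at most one symbol has this address.
std::vector<std::string> aliasesAt(const std::vector<Symbol> &Aliases,
                                   uint64_t Addr) {
  auto It = std::lower_bound(
      Aliases.begin(), Aliases.end(), Addr,
      [](const Symbol &S, uint64_t A) { return S.first < A; });
  std::vector<std::string> Names;
  for (; It != Aliases.end() && It->first == Addr; ++It)
    Names.push_back(It->second);
  return Names;
}
```

If most addresses have a single symbol, the filtered table stays small, which is the case the comment argues makes the binary search cheap.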
83	An API like this should probably return an `ArrayRef<StringRef>` (immutable; doesn't leak the internal storage type) rather than `SmallVector<StringRef>`. Although my other inline comments point in a different direction...
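
A rough sketch of the read-only-view idea follows. `NameRange` is a hypothetical, minimal stand-in for `llvm::ArrayRef` written with the standard library so the example is self-contained, and `AddressIndex` is likewise invented for illustration. The lookup uses `find`, so unlike an `operator[]`-based accessor it does not insert an empty entry for an address that is missing.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Minimal read-only view over contiguous names (hypothetical stand-in for
// llvm::ArrayRef<StringRef>).
struct NameRange {
  const std::string *Data = nullptr;
  std::size_t Size = 0;
  const std::string *begin() const { return Data; }
  const std::string *end() const { return Data + Size; }
  std::size_t size() const { return Size; }
  bool empty() const { return Size == 0; }
};

class AddressIndex {
public:
  void add(uint64_t Addr, std::string Name) {
    Map[Addr].push_back(std::move(Name));
  }

  // Const lookup: callers cannot mutate the storage, the container type is
  // not part of the signature, and a miss leaves the map untouched.
  NameRange namesFor(uint64_t Addr) const {
    auto It = Map.find(Addr);
    if (It == Map.end())
      return {};
    return {It->second.data(), It->second.size()};
  }

  std::size_t addressCount() const { return Map.size(); }

private:
  std::unordered_map<uint64_t, std::vector<std::string>> Map;
};
```

The returned view is only valid until the next `add`, the usual caveat for non-owning views.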
594–595	It's not obvious to me that `MachODebugMapParser::getMainBinarySymbolNames()` is going to return the same things as before. If this replaces a different `Addr`, then there will be two different `Addrs` that have this symbol.

I'm not sure if the precise, existing behaviour is important, but if it is, I think you'll need a second pass through. I think something like this would work (outside/after the `for` loop that's building up these data structures):

    for (auto &AliasList : MainBinarySymbolAddress2Name) {
      AliasList.second.erase(
          llvm::remove_if(AliasList.second,
                          [&](StringMapEntry<uint64_t> *Entry) {
                            return Entry->second != AliasList.first;
                          }),
          AliasList.second.end());
    }
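
The stale-alias problem and the suggested second pass can be reproduced in a small standalone model. This sketch uses standard-library containers and stores names rather than `StringMapEntry` pointers; `addSymbol` and `pruneStaleAliases` are hypothetical names, and `addSymbol` is meant to mirror the insertion rule shown in Diff 413701.

```cpp
#include <algorithm>
#include <cstdint>
#include <string>
#include <unordered_map>
#include <vector>

using NameToAddr = std::unordered_map<std::string, uint64_t>;
using AddrToNames = std::unordered_map<uint64_t, std::vector<std::string>>;

// Extern symbols override an existing address; other symbols only insert.
void addSymbol(NameToAddr &Addrs, AddrToNames &Names, const std::string &Name,
               uint64_t Addr, bool Extern) {
  if (Extern) {
    Addrs[Name] = Addr;
    Names[Addr].push_back(Name); // The old address may still list Name.
  } else if (Addrs.try_emplace(Name, Addr).second) {
    Names[Addr].push_back(Name);
  }
}

// Second pass: drop every name whose current address differs from the key
// under which it is listed.
void pruneStaleAliases(const NameToAddr &Addrs, AddrToNames &Names) {
  for (auto &AliasList : Names) {
    auto &List = AliasList.second;
    List.erase(std::remove_if(List.begin(), List.end(),
                              [&](const std::string &Name) {
                                return Addrs.at(Name) != AliasList.first;
                              }),
               List.end());
  }
}
```

Adding a non-extern `_foo` at one address and then an extern `_foo` at another leaves `_foo` listed under both until `pruneStaleAliases` runs, which is the behaviour difference the comment points out.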

Diff 413701

llvm/tools/dsymutil/MachODebugMapParser.cpp

//===- tools/dsymutil/MachODebugMapParser.cpp - Parse STABS debug maps ----===//		//===- tools/dsymutil/MachODebugMapParser.cpp - Parse STABS debug maps ----===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "BinaryHolder.h"		#include "BinaryHolder.h"
#include "DebugMap.h"		#include "DebugMap.h"
#include "MachOUtils.h"		#include "MachOUtils.h"
		#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/Optional.h"		#include "llvm/ADT/Optional.h"
#include "llvm/Object/MachO.h"		#include "llvm/Object/MachO.h"
#include "llvm/Support/Path.h"		#include "llvm/Support/Path.h"
#include "llvm/Support/WithColor.h"		#include "llvm/Support/WithColor.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include <vector>		#include <vector>

namespace {		namespace {
Show All 26 Lines	private:
SmallVector<StringRef, 1> Archs;		SmallVector<StringRef, 1> Archs;
std::string PathPrefix;		std::string PathPrefix;
bool PaperTrailWarnings;		bool PaperTrailWarnings;

/// Owns the MemoryBuffer for the main binary.		/// Owns the MemoryBuffer for the main binary.
BinaryHolder BinHolder;		BinaryHolder BinHolder;
/// Map of the binary symbol addresses.		/// Map of the binary symbol addresses.
StringMap<uint64_t> MainBinarySymbolAddresses;		StringMap<uint64_t> MainBinarySymbolAddresses;

		/// Binary symbol addresses to names map, to speedup
		/// `getMainBinarySymbolNames`;
		SmallDenseMap<uint64_t, SmallVector<StringRef>> MainBinaryAddresses2NamesMap;

StringRef MainBinaryStrings;		StringRef MainBinaryStrings;
/// The constructed DebugMap.		/// The constructed DebugMap.
std::unique_ptr<DebugMap> Result;		std::unique_ptr<DebugMap> Result;
/// List of common symbols that need to be added to the debug map.		/// List of common symbols that need to be added to the debug map.
std::vector<std::string> CommonSymbols;		std::vector<std::string> CommonSymbols;

/// Map of the currently processed object file symbol addresses.		/// Map of the currently processed object file symbol addresses.
StringMap<Optional<uint64_t>> CurrentObjectAddresses;		StringMap<Optional<uint64_t>> CurrentObjectAddresses;
/// Element of the debug map corresponding to the current object file.		/// Element of the debug map corresponding to the current object file.
DebugMapObject *CurrentDebugMapObject;		DebugMapObject *CurrentDebugMapObject;

/// Holds function info while function scope processing.		/// Holds function info while function scope processing.
const char *CurrentFunctionName;		const char *CurrentFunctionName;
uint64_t CurrentFunctionAddress;		uint64_t CurrentFunctionAddress;

std::unique_ptr<DebugMap> parseOneBinary(const MachOObjectFile &MainBinary,		std::unique_ptr<DebugMap> parseOneBinary(const MachOObjectFile &MainBinary,
StringRef BinaryPath);		StringRef BinaryPath);

void		void
switchToNewDebugMapObject(StringRef Filename,		switchToNewDebugMapObject(StringRef Filename,
sys::TimePoint<std::chrono::seconds> Timestamp);		sys::TimePoint<std::chrono::seconds> Timestamp);
void resetParserState();		void resetParserState();
uint64_t getMainBinarySymbolAddress(StringRef Name);		uint64_t getMainBinarySymbolAddress(StringRef Name);
std::vector<StringRef> getMainBinarySymbolNames(uint64_t Value);		SmallVector<StringRef> &getMainBinarySymbolNames(uint64_t Value);
void loadMainBinarySymbols(const MachOObjectFile &MainBinary);		void loadMainBinarySymbols(const MachOObjectFile &MainBinary);
void loadCurrentObjectFileSymbols(const object::MachOObjectFile &Obj);		void loadCurrentObjectFileSymbols(const object::MachOObjectFile &Obj);
void handleStabSymbolTableEntry(uint32_t StringIndex, uint8_t Type,		void handleStabSymbolTableEntry(uint32_t StringIndex, uint8_t Type,
uint8_t SectionIndex, uint16_t Flags,		uint8_t SectionIndex, uint16_t Flags,
uint64_t Value);		uint64_t Value);

template <typename STEType> void handleStabDebugMapEntry(const STEType &STE) {		template <typename STEType> void handleStabDebugMapEntry(const STEType &STE) {
handleStabSymbolTableEntry(STE.n_strx, STE.n_type, STE.n_sect, STE.n_desc,		handleStabSymbolTableEntry(STE.n_strx, STE.n_type, STE.n_sect, STE.n_desc,
▲ Show 20 Lines • Show All 440 Lines • ▼ Show 20 Lines
uint64_t MachODebugMapParser::getMainBinarySymbolAddress(StringRef Name) {		uint64_t MachODebugMapParser::getMainBinarySymbolAddress(StringRef Name) {
auto Sym = MainBinarySymbolAddresses.find(Name);		auto Sym = MainBinarySymbolAddresses.find(Name);
if (Sym == MainBinarySymbolAddresses.end())		if (Sym == MainBinarySymbolAddresses.end())
return 0;		return 0;
return Sym->second;		return Sym->second;
}		}

/// Get all symbol names in the main binary for the given value.		/// Get all symbol names in the main binary for the given value.
std::vector<StringRef>		SmallVector<StringRef> &MachODebugMapParser::getMainBinarySymbolNames(uint64_t Value) {
		Lint: Pre-merge checks: clang-format: please reformat the code
		-SmallVector<StringRef> &MachODebugMapParser::getMainBinarySymbolNames(uint64_t Value) {
		+SmallVector<StringRef> &
		+MachODebugMapParser::getMainBinarySymbolNames(uint64_t Value) {
MachODebugMapParser::getMainBinarySymbolNames(uint64_t Value) {		return MainBinaryAddresses2NamesMap[Value];
std::vector<StringRef> Names;
for (const auto &Entry : MainBinarySymbolAddresses) {
if (Entry.second == Value)
Names.push_back(Entry.first());
}
return Names;
}		}

/// Load the interesting main binary symbols' addresses into		/// Load the interesting main binary symbols' addresses into
/// MainBinarySymbolAddresses.		/// MainBinarySymbolAddresses.
void MachODebugMapParser::loadMainBinarySymbols(		void MachODebugMapParser::loadMainBinarySymbols(
const MachOObjectFile &MainBinary) {		const MachOObjectFile &MainBinary) {
section_iterator Section = MainBinary.section_end();		section_iterator Section = MainBinary.section_end();
MainBinarySymbolAddresses.clear();		MainBinarySymbolAddresses.clear();
		MainBinaryAddresses2NamesMap.clear();

for (const auto &Sym : MainBinary.symbols()) {		for (const auto &Sym : MainBinary.symbols()) {
Expected<SymbolRef::Type> TypeOrErr = Sym.getType();		Expected<SymbolRef::Type> TypeOrErr = Sym.getType();
if (!TypeOrErr) {		if (!TypeOrErr) {
// TODO: Actually report errors helpfully.		// TODO: Actually report errors helpfully.
consumeError(TypeOrErr.takeError());		consumeError(TypeOrErr.takeError());
continue;		continue;
}		}
SymbolRef::Type Type = *TypeOrErr;		SymbolRef::Type Type = *TypeOrErr;
Show All 25 Lines	if (!NameOrErr) {
// TODO: Actually report errors helpfully.		// TODO: Actually report errors helpfully.
consumeError(NameOrErr.takeError());		consumeError(NameOrErr.takeError());
continue;		continue;
}		}
StringRef Name = *NameOrErr;		StringRef Name = *NameOrErr;
if (Name.size() == 0 || Name[0] == '\0')		if (Name.size() == 0 || Name[0] == '\0')
continue;		continue;
// Override only if the new key is global.		// Override only if the new key is global.
if (Extern)		if (Extern) {
MainBinarySymbolAddresses[Name] = Addr;		MainBinarySymbolAddresses[Name] = Addr;
else		getMainBinarySymbolNames(Addr).push_back(Name);
MainBinarySymbolAddresses.try_emplace(Name, Addr);		} else {
		if (MainBinarySymbolAddresses.try_emplace(Name, Addr).second) {
		getMainBinarySymbolNames(Addr).push_back(Name);
		}
		}
}		}
}		}

namespace llvm {		namespace llvm {
namespace dsymutil {		namespace dsymutil {
llvm::ErrorOr<std::vector<std::unique_ptr<DebugMap>>>		llvm::ErrorOr<std::vector<std::unique_ptr<DebugMap>>>
parseDebugMap(llvm::IntrusiveRefCntPtr<llvm::vfs::FileSystem> VFS,		parseDebugMap(llvm::IntrusiveRefCntPtr<llvm::vfs::FileSystem> VFS,
StringRef InputFile, ArrayRef<std::string> Archs,		StringRef InputFile, ArrayRef<std::string> Archs,
Show All 18 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Speedup dsymutil when working with big project.
Needs Review · Public
