Diff 405818

llvm/docs/CommandGuide/llvm-symbolizer.rst

Show First 20 Lines • Show All 177 Lines • ▼ Show 20 Lines .. option:: --adjust-vma <offset>

This can be used to perform lookups as if the object were relocated by the This can be used to perform lookups as if the object were relocated by the

offset. offset.

.. option:: --basenames, -s .. option:: --basenames, -s

Print just the file's name without any directories, instead of the Print just the file's name without any directories, instead of the

absolute path. absolute path.

.. option:: --build-id

Look up the object using the given build ID, specified as a hexadecimal

string. Mutually exclusive with :option:`--obj`.

phosekUnsubmitted

Done

I wonder whether it wouldn't be less error prone to diagnose the case when user uses both --obj and --build-id as an error?

phosek: I wonder whether it wouldn't be less error prone to diagnose the case when user uses both `…

jhendersonUnsubmitted

Done

+1: I think it should be an error if a user specifies both, unless you have a good use-case for it.

jhenderson: +1: I think it should be an error if a user specifies both, unless you have a good use-case for…

jhendersonUnsubmitted

Done

Look up the object using the given build ID, specified as a hexadecimal

- string. Mutually exclusive with the --obj family.

+ string. Mutually exclusive with :option:`--obj`.

.. _llvm-symbolizer-opt-C:

I'd delete "family" since "-e" etc are just aliases, and therefore it's implied.

jhenderson: I'd delete "family" since "-e" etc are just aliases, and therefore it's implied.

.. _llvm-symbolizer-opt-C: .. _llvm-symbolizer-opt-C:

.. option:: --demangle, -C .. option:: --demangle, -C

Print demangled function names, if the names are mangled (e.g. the mangled Print demangled function names, if the names are mangled (e.g. the mangled

name `_Z3bazv` becomes `baz()`, whilst the non-mangled name `foz` is printed name `_Z3bazv` becomes `baz()`, whilst the non-mangled name `foz` is printed

as is). Defaults to true. as is). Defaults to true.

Show All 33 Lines

.. option:: --no-demangle .. option:: --no-demangle

Don't print demangled function names. Don't print demangled function names.

.. option:: --obj <path>, --exe, -e .. option:: --obj <path>, --exe, -e

Path to object file to be symbolized. If ``-`` is specified, read the object Path to object file to be symbolized. If ``-`` is specified, read the object

directly from the standard input stream. directly from the standard input stream. Mutually exclusive with

:option:`--build-id`.

jhendersonUnsubmitted

Done

Path to object file to be symbolized. If ``-`` is specified, read the object

- directly from the standard input stream. Mutually exclusive with --build-id.

+ directly from the standard input stream. Mutually exclusive with :option:`--build-id`.

.. _llvm-symbolizer-opt-output-style:

jhenderson:

.. _llvm-symbolizer-opt-output-style: .. _llvm-symbolizer-opt-output-style:

.. option:: --output-style <LLVM|GNU|JSON> .. option:: --output-style <LLVM|GNU|JSON>

Specify the preferred output style. Defaults to ``LLVM``. When the output Specify the preferred output style. Defaults to ``LLVM``. When the output

style is set to ``GNU``, the tool follows the style of GNU's **addr2line**. style is set to ``GNU``, the tool follows the style of GNU's **addr2line**.

The differences from the ``LLVM`` style are: The differences from the ``LLVM`` style are:

▲ Show 20 Lines • Show All 209 Lines • Show Last 20 Lines

llvm/include/llvm/DebugInfo/Symbolize/Symbolize.h

//===- Symbolize.h ----------------------------------------------- C++ --===//		//===- Symbolize.h ----------------------------------------------- C++ --===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// Header for LLVM symbolization library.		// Header for LLVM symbolization library.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_DEBUGINFO_SYMBOLIZE_SYMBOLIZE_H		#ifndef LLVM_DEBUGINFO_SYMBOLIZE_SYMBOLIZE_H
#define LLVM_DEBUGINFO_SYMBOLIZE_SYMBOLIZE_H		#define LLVM_DEBUGINFO_SYMBOLIZE_SYMBOLIZE_H

		#include "llvm/ADT/StringMap.h"
#include "llvm/DebugInfo/Symbolize/SymbolizableModule.h"		#include "llvm/DebugInfo/Symbolize/SymbolizableModule.h"
#include "llvm/Object/Binary.h"		#include "llvm/Object/Binary.h"
#include "llvm/Object/ELFObjectFile.h"		#include "llvm/Object/ELFObjectFile.h"
#include "llvm/Object/ObjectFile.h"		#include "llvm/Object/ObjectFile.h"
#include "llvm/Support/Error.h"		#include "llvm/Support/Error.h"
#include <algorithm>		#include <algorithm>
#include <cstdint>		#include <cstdint>
#include <map>		#include <map>
Show All 28 Lines	public:
};		};

LLVMSymbolizer() = default;		LLVMSymbolizer() = default;
LLVMSymbolizer(const Options &Opts) : Opts(Opts) {}		LLVMSymbolizer(const Options &Opts) : Opts(Opts) {}

~LLVMSymbolizer() { flush(); }		~LLVMSymbolizer() { flush(); }

// Overloads accepting ObjectFile does not support COFF currently		// Overloads accepting ObjectFile does not support COFF currently
Expected<DILineInfo> symbolizeCode(const ObjectFile &Obj,		Expected<DILineInfo> symbolizeCode(const ObjectFile &Obj,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);
Expected<DILineInfo> symbolizeCode(const std::string &ModuleName,		Expected<DILineInfo> symbolizeCode(const std::string &ModuleName,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);
		Expected<DILineInfo> symbolizeCode(ArrayRef<uint8_t> BuildID,
		object::SectionedAddress ModuleOffset);
Expected<DIInliningInfo>		Expected<DIInliningInfo>
		jhendersonUnsubmitted Not Done Reply Inline Actions Not necessarily something that you need to do right now, but nonetheless worth thinking about: it's clear by adding a third variation of each of these methods that some sort of refactoring would be appropriate to simplify extensibility of this code. In this case, I'd probably make each of the different input types (object, module name, build ID) a separate class, with the relevant methods attached. `symbolizeCode` would then just become something like: template <typename T> Expected<DILineInfo> symbolizeCode(const T &Input, object::SectionedAddress ModuleOffset) { return Input.symbolizeCode(ModuleOffset, ...); } and so on. Adding a new input kind wouldn't then require modifying this class again, keeping this class stable, and just adding new classes for each input kind. jhenderson: Not necessarily something that you need to do right now, but nonetheless worth thinking about…
		mysterymathAuthorUnsubmitted Done Reply Inline Actions Yeah, this occurred to me too. I'll take a stab at this afterwards and see if I can design a clean abstraction. mysterymath: Yeah, this occurred to me too. I'll take a stab at this afterwards and see if I can design a…
symbolizeInlinedCode(const ObjectFile &Obj,		symbolizeInlinedCode(const ObjectFile &Obj,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);
Expected<DIInliningInfo>		Expected<DIInliningInfo>
symbolizeInlinedCode(const std::string &ModuleName,		symbolizeInlinedCode(const std::string &ModuleName,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);
		Expected<DIInliningInfo>
		symbolizeInlinedCode(ArrayRef<uint8_t> BuildID,
		object::SectionedAddress ModuleOffset);

Expected<DIGlobal> symbolizeData(const ObjectFile &Obj,		Expected<DIGlobal> symbolizeData(const ObjectFile &Obj,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);
Expected<DIGlobal> symbolizeData(const std::string &ModuleName,		Expected<DIGlobal> symbolizeData(const std::string &ModuleName,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);
		Expected<DIGlobal> symbolizeData(ArrayRef<uint8_t> BuildID,
		object::SectionedAddress ModuleOffset);
Expected<std::vector<DILocal>>		Expected<std::vector<DILocal>>
symbolizeFrame(const ObjectFile &Obj, object::SectionedAddress ModuleOffset);		symbolizeFrame(const ObjectFile &Obj, object::SectionedAddress ModuleOffset);
Expected<std::vector<DILocal>>		Expected<std::vector<DILocal>>
symbolizeFrame(const std::string &ModuleName,		symbolizeFrame(const std::string &ModuleName,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);
		Expected<std::vector<DILocal>>
		symbolizeFrame(ArrayRef<uint8_t> BuildID,
		object::SectionedAddress ModuleOffset);
void flush();		void flush();

static std::string		static std::string
DemangleName(const std::string &Name,		DemangleName(const std::string &Name,
const SymbolizableModule *DbiModuleDescriptor);		const SymbolizableModule *DbiModuleDescriptor);

private:		private:
		bool getOrFindDebugBinary(const ArrayRef<uint8_t> BuildID,
		std::string &Result);

// Bundles together object file with code/data and object file with		// Bundles together object file with code/data and object file with
// corresponding debug info. These objects can be the same.		// corresponding debug info. These objects can be the same.
using ObjectPair = std::pair<const ObjectFile , const ObjectFile >;		using ObjectPair = std::pair<const ObjectFile , const ObjectFile >;

template <typename T>		template <typename T>
Expected<DILineInfo>		Expected<DILineInfo>
symbolizeCodeCommon(const T &ModuleSpecifier,		symbolizeCodeCommon(const T &ModuleSpecifier,
object::SectionedAddress ModuleOffset);		object::SectionedAddress ModuleOffset);
Show All 11 Lines	private:

/// Returns a SymbolizableModule or an error if loading debug info failed.		/// Returns a SymbolizableModule or an error if loading debug info failed.
/// Only one attempt is made to load a module, and errors during loading are		/// Only one attempt is made to load a module, and errors during loading are
/// only reported once. Subsequent calls to get module info for a module that		/// only reported once. Subsequent calls to get module info for a module that
/// failed to load will return nullptr.		/// failed to load will return nullptr.
Expected<SymbolizableModule *>		Expected<SymbolizableModule *>
getOrCreateModuleInfo(const std::string &ModuleName);		getOrCreateModuleInfo(const std::string &ModuleName);
Expected<SymbolizableModule *> getOrCreateModuleInfo(const ObjectFile &Obj);		Expected<SymbolizableModule *> getOrCreateModuleInfo(const ObjectFile &Obj);
		Expected<SymbolizableModule *>
		getOrCreateModuleInfo(ArrayRef<uint8_t> BuildID);

Expected<SymbolizableModule *>		Expected<SymbolizableModule *>
createModuleInfo(const ObjectFile *Obj, std::unique_ptr<DIContext> Context,		createModuleInfo(const ObjectFile *Obj, std::unique_ptr<DIContext> Context,
StringRef ModuleName);		StringRef ModuleName);

ObjectFile *lookUpDsymFile(const std::string &Path,		ObjectFile *lookUpDsymFile(const std::string &Path,
const MachOObjectFile *ExeObj,		const MachOObjectFile *ExeObj,
const std::string &ArchName);		const std::string &ArchName);
Show All 11 Lines	private:
/// Return a pointer to object file at specified path, for a specified		/// Return a pointer to object file at specified path, for a specified
/// architecture (e.g. if path refers to a Mach-O universal binary, only one		/// architecture (e.g. if path refers to a Mach-O universal binary, only one
/// object file from it will be returned).		/// object file from it will be returned).
Expected<ObjectFile *> getOrCreateObject(const std::string &Path,		Expected<ObjectFile *> getOrCreateObject(const std::string &Path,
const std::string &ArchName);		const std::string &ArchName);

std::map<std::string, std::unique_ptr<SymbolizableModule>, std::less<>>		std::map<std::string, std::unique_ptr<SymbolizableModule>, std::less<>>
Modules;		Modules;
		StringMap<std::string> BuildIDPaths;

/// Contains cached results of getOrCreateObjectPair().		/// Contains cached results of getOrCreateObjectPair().
std::map<std::pair<std::string, std::string>, ObjectPair>		std::map<std::pair<std::string, std::string>, ObjectPair>
ObjectPairForPathArch;		ObjectPairForPathArch;

/// Contains parsed binary for each path, or parsing error.		/// Contains parsed binary for each path, or parsing error.
std::map<std::string, OwningBinary<Binary>> BinaryForPath;		std::map<std::string, OwningBinary<Binary>> BinaryForPath;

Show All 12 Lines

llvm/lib/DebugInfo/Symbolize/Symbolize.cpp

Show First 20 Lines • Show All 75 Lines • ▼ Show 20 Lines

} }

Expected<DILineInfo> Expected<DILineInfo>

LLVMSymbolizer::symbolizeCode(const std::string &ModuleName, LLVMSymbolizer::symbolizeCode(const std::string &ModuleName,

object::SectionedAddress ModuleOffset) { object::SectionedAddress ModuleOffset) {

return symbolizeCodeCommon(ModuleName, ModuleOffset); return symbolizeCodeCommon(ModuleName, ModuleOffset);

} }

Expected<DILineInfo>

LLVMSymbolizer::symbolizeCode(ArrayRef<uint8_t> BuildID,

object::SectionedAddress ModuleOffset) {

return symbolizeCodeCommon(BuildID, ModuleOffset);

}

template <typename T> template <typename T>

Expected<DIInliningInfo> LLVMSymbolizer::symbolizeInlinedCodeCommon( Expected<DIInliningInfo> LLVMSymbolizer::symbolizeInlinedCodeCommon(

const T &ModuleSpecifier, object::SectionedAddress ModuleOffset) { const T &ModuleSpecifier, object::SectionedAddress ModuleOffset) {

auto InfoOrErr = getOrCreateModuleInfo(ModuleSpecifier); auto InfoOrErr = getOrCreateModuleInfo(ModuleSpecifier);

if (!InfoOrErr) if (!InfoOrErr)

return InfoOrErr.takeError(); return InfoOrErr.takeError();

SymbolizableModule *Info = *InfoOrErr; SymbolizableModule *Info = *InfoOrErr;

Show All 27 Lines

} }

Expected<DIInliningInfo> Expected<DIInliningInfo>

LLVMSymbolizer::symbolizeInlinedCode(const std::string &ModuleName, LLVMSymbolizer::symbolizeInlinedCode(const std::string &ModuleName,

object::SectionedAddress ModuleOffset) { object::SectionedAddress ModuleOffset) {

return symbolizeInlinedCodeCommon(ModuleName, ModuleOffset); return symbolizeInlinedCodeCommon(ModuleName, ModuleOffset);

} }

Expected<DIInliningInfo>

LLVMSymbolizer::symbolizeInlinedCode(ArrayRef<uint8_t> BuildID,

object::SectionedAddress ModuleOffset) {

return symbolizeInlinedCodeCommon(BuildID, ModuleOffset);

}

template <typename T> template <typename T>

Expected<DIGlobal> Expected<DIGlobal>

LLVMSymbolizer::symbolizeDataCommon(const T &ModuleSpecifier, LLVMSymbolizer::symbolizeDataCommon(const T &ModuleSpecifier,

object::SectionedAddress ModuleOffset) { object::SectionedAddress ModuleOffset) {

auto InfoOrErr = getOrCreateModuleInfo(ModuleSpecifier); auto InfoOrErr = getOrCreateModuleInfo(ModuleSpecifier);

if (!InfoOrErr) if (!InfoOrErr)

return InfoOrErr.takeError(); return InfoOrErr.takeError();

Show All 23 Lines

} }

Expected<DIGlobal> Expected<DIGlobal>

LLVMSymbolizer::symbolizeData(const std::string &ModuleName, LLVMSymbolizer::symbolizeData(const std::string &ModuleName,

object::SectionedAddress ModuleOffset) { object::SectionedAddress ModuleOffset) {

return symbolizeDataCommon(ModuleName, ModuleOffset); return symbolizeDataCommon(ModuleName, ModuleOffset);

} }

Expected<DIGlobal>

LLVMSymbolizer::symbolizeData(ArrayRef<uint8_t> BuildID,

object::SectionedAddress ModuleOffset) {

return symbolizeDataCommon(BuildID, ModuleOffset);

}

template <typename T> template <typename T>

Expected<std::vector<DILocal>> Expected<std::vector<DILocal>>

LLVMSymbolizer::symbolizeFrameCommon(const T &ModuleSpecifier, LLVMSymbolizer::symbolizeFrameCommon(const T &ModuleSpecifier,

object::SectionedAddress ModuleOffset) { object::SectionedAddress ModuleOffset) {

auto InfoOrErr = getOrCreateModuleInfo(ModuleSpecifier); auto InfoOrErr = getOrCreateModuleInfo(ModuleSpecifier);

if (!InfoOrErr) if (!InfoOrErr)

return InfoOrErr.takeError(); return InfoOrErr.takeError();

Show All 19 Lines

} }

Expected<std::vector<DILocal>> Expected<std::vector<DILocal>>

LLVMSymbolizer::symbolizeFrame(const std::string &ModuleName, LLVMSymbolizer::symbolizeFrame(const std::string &ModuleName,

object::SectionedAddress ModuleOffset) { object::SectionedAddress ModuleOffset) {

return symbolizeFrameCommon(ModuleName, ModuleOffset); return symbolizeFrameCommon(ModuleName, ModuleOffset);

} }

Expected<std::vector<DILocal>>

LLVMSymbolizer::symbolizeFrame(ArrayRef<uint8_t> BuildID,

object::SectionedAddress ModuleOffset) {

return symbolizeFrameCommon(BuildID, ModuleOffset);

}

void LLVMSymbolizer::flush() { void LLVMSymbolizer::flush() {

ObjectForUBPathAndArch.clear(); ObjectForUBPathAndArch.clear();

BinaryForPath.clear(); BinaryForPath.clear();

ObjectPairForPathArch.clear(); ObjectPairForPathArch.clear();

Modules.clear(); Modules.clear();

BuildIDPaths.clear();

} }

namespace { namespace {

// For Path="/path/to/foo" and Basename="foo" assume that debug info is in // For Path="/path/to/foo" and Basename="foo" assume that debug info is in

// /path/to/foo.dSYM/Contents/Resources/DWARF/foo. // /path/to/foo.dSYM/Contents/Resources/DWARF/foo.

// For Path="/path/to/bar.dSYM" and Basename="foo" assume that debug info is in // For Path="/path/to/bar.dSYM" and Basename="foo" assume that debug info is in

// /path/to/bar.dSYM/Contents/Resources/DWARF/foo. // /path/to/bar.dSYM/Contents/Resources/DWARF/foo.

std::string getDarwinDWARFResourceForPath(const std::string &Path, std::string getDarwinDWARFResourceForPath(const std::string &Path,

const std::string &Basename) { const std::string &Basename) {

SmallString<16> ResourceName = StringRef(Path); SmallString<16> ResourceName = StringRef(Path);

if (sys::path::extension(Path) != ".dSYM") { if (sys::path::extension(Path) != ".dSYM") {

▲ Show 20 Lines • Show All 127 Lines • ▼ Show 20 Lines Optional<ArrayRef<uint8_t>> getBuildID(const ELFObjectFileBase *Obj) {

else if (auto *O = dyn_cast<ELFObjectFile<ELF64LE>>(Obj)) else if (auto *O = dyn_cast<ELFObjectFile<ELF64LE>>(Obj))

BuildID = getBuildID(O->getELFFile()); BuildID = getBuildID(O->getELFFile());

else if (auto *O = dyn_cast<ELFObjectFile<ELF64BE>>(Obj)) else if (auto *O = dyn_cast<ELFObjectFile<ELF64BE>>(Obj))

BuildID = getBuildID(O->getELFFile()); BuildID = getBuildID(O->getELFFile());

else else

llvm_unreachable("unsupported file format"); llvm_unreachable("unsupported file format");

return BuildID; return BuildID;

} }

} // end anonymous namespace

static StringRef getBuildIDStr(ArrayRef<uint8_t> BuildID) {

return StringRef(reinterpret_cast<const char *>(BuildID.data()),

BuildID.size());

}

bool LLVMSymbolizer::getOrFindDebugBinary(const ArrayRef<uint8_t> BuildID,

std::string &Result) {

StringRef BuildIDStr = getBuildIDStr(BuildID);

auto I = BuildIDPaths.find(BuildIDStr);

if (I != BuildIDPaths.end()) {

// If an error was recorded.

jhendersonUnsubmitted

Done

if (I != BuildIDPaths.end()) {

- // If an error was recorded

+ // If an error was recorded.

if (I->second.empty())

jhenderson:

if (I->second.empty())

return false;

Result = I->second;

return true;

}

bool findDebugBinary(const std::vector<std::string> &DebugFileDirectory,

const ArrayRef<uint8_t> BuildID, std::string &Result) {

auto getDebugPath = [&](StringRef Directory) { auto getDebugPath = [&](StringRef Directory) {

jhendersonUnsubmitted

Done

Don't think you want this blank line.

jhenderson: Don't think you want this blank line.

SmallString<128> Path{Directory}; SmallString<128> Path{Directory};

sys::path::append(Path, ".build-id", sys::path::append(Path, ".build-id",

llvm::toHex(BuildID[0], /*LowerCase=*/true), llvm::toHex(BuildID[0], /*LowerCase=*/true),

llvm::toHex(BuildID.slice(1), /*LowerCase=*/true)); llvm::toHex(BuildID.slice(1), /*LowerCase=*/true));

Path += ".debug"; Path += ".debug";

return Path; return Path;

}; };

if (DebugFileDirectory.empty()) { auto recordPath = [&](StringRef Path) {

Result = Path.str();

auto InsertResult = BuildIDPaths.insert({BuildIDStr, Result});

assert(InsertResult.second);

};

if (Opts.DebugFileDirectory.empty()) {

SmallString<128> Path = getDebugPath( SmallString<128> Path = getDebugPath(

#if defined(__NetBSD__) #if defined(__NetBSD__)

// Try /usr/libdata/debug/.build-id/../... // Try /usr/libdata/debug/.build-id/../...

"/usr/libdata/debug" "/usr/libdata/debug"

#else #else

// Try /usr/lib/debug/.build-id/../... // Try /usr/lib/debug/.build-id/../...

"/usr/lib/debug" "/usr/lib/debug"

#endif #endif

); );

if (llvm::sys::fs::exists(Path)) { if (llvm::sys::fs::exists(Path)) {

Result = std::string(Path.str()); recordPath(Path);

return true; return true;

} }

} else { } else {

for (const auto &Directory : DebugFileDirectory) { for (const auto &Directory : Opts.DebugFileDirectory) {

// Try <debug-file-directory>/.build-id/../... // Try <debug-file-directory>/.build-id/../...

SmallString<128> Path = getDebugPath(Directory); SmallString<128> Path = getDebugPath(Directory);

if (llvm::sys::fs::exists(Path)) { if (llvm::sys::fs::exists(Path)) {

Result = std::string(Path.str()); recordPath(Path);

return true; return true;

} }

// Try debuginfod client cache and known servers. // Try debuginfod client cache and known servers.

Expected<std::string> PathOrErr = getCachedOrDownloadDebuginfo(BuildID); Expected<std::string> PathOrErr = getCachedOrDownloadDebuginfo(BuildID);

if (!PathOrErr) { if (!PathOrErr) {

consumeError(PathOrErr.takeError()); consumeError(PathOrErr.takeError());

// Record that an error occurred.

recordPath("");

phosekUnsubmitted

Not Done

IIUC this slightly changes the semantics. Currently, even if we failed to download a file (for example because of the intermittent network error), we would try again on then next symbolization request. With this change, we would cache the error result (that is, an empty string) after the first attempt and never retry. Maybe it would be better to drop this line to preserve the current semantics?

phosek: IIUC this slightly changes the semantics. Currently, even if we failed to download a file (for…

mysterymathAuthorUnsubmitted

Done

This is slightly tricky: we should retry the network request if it failed previously, but we probably shouldn't spam the console with repeated error messages for the build ID, since that differs from the way errors work for paths (one per path).

I've tried to keep the error behavior while still retrying the lookup under the hood. If a retry succeeds, the data becomes available for all future symbolization request; otherwise, the error is silenced.

mysterymath: This is slightly tricky: we should retry the network request if it failed previously, but we…

phosekUnsubmitted

Done

Personally I'd prefer getting an error on every unsuccessful attempt; that may be more spammy but avoids hiding potentially useful information from the user. If the errors go to stderr, user can always redirect it to /dev/null if needed. Either solution is fine with me though.

phosek: Personally I'd prefer getting an error on every unsuccessful attempt; that may be more spammy…

return false; return false;

} }

Result = *PathOrErr; recordPath(*PathOrErr);

return true; return true;

} }

} // end anonymous namespace

ObjectFile *LLVMSymbolizer::lookUpDsymFile(const std::string &ExePath, ObjectFile *LLVMSymbolizer::lookUpDsymFile(const std::string &ExePath,

const MachOObjectFile *MachExeObj, const MachOObjectFile *MachExeObj,

const std::string &ArchName) { const std::string &ArchName) {

// On Darwin we may find DWARF in separate object file in // On Darwin we may find DWARF in separate object file in

// resource directory. // resource directory.

std::vector<std::string> DsymPaths; std::vector<std::string> DsymPaths;

StringRef Filename = sys::path::filename(ExePath); StringRef Filename = sys::path::filename(ExePath);

DsymPaths.push_back( DsymPaths.push_back(

▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines ObjectFile *LLVMSymbolizer::lookUpBuildIDObject(const std::string &Path,

const ELFObjectFileBase *Obj, const ELFObjectFileBase *Obj,

const std::string &ArchName) { const std::string &ArchName) {

auto BuildID = getBuildID(Obj); auto BuildID = getBuildID(Obj);

if (!BuildID) if (!BuildID)

return nullptr; return nullptr;

if (BuildID->size() < 2) if (BuildID->size() < 2)

return nullptr; return nullptr;

std::string DebugBinaryPath; std::string DebugBinaryPath;

if (!findDebugBinary(Opts.DebugFileDirectory, *BuildID, DebugBinaryPath)) if (!getOrFindDebugBinary(*BuildID, DebugBinaryPath))

return nullptr; return nullptr;

auto DbgObjOrErr = getOrCreateObject(DebugBinaryPath, ArchName); auto DbgObjOrErr = getOrCreateObject(DebugBinaryPath, ArchName);

if (!DbgObjOrErr) { if (!DbgObjOrErr) {

consumeError(DbgObjOrErr.takeError()); consumeError(DbgObjOrErr.takeError());

return nullptr; return nullptr;

} }

return DbgObjOrErr.get(); return DbgObjOrErr.get();

} }

Expected<LLVMSymbolizer::ObjectPair> Expected<LLVMSymbolizer::ObjectPair>

LLVMSymbolizer::getOrCreateObjectPair(const std::string &Path, LLVMSymbolizer::getOrCreateObjectPair(const std::string &Path,

const std::string &ArchName) { const std::string &ArchName) {

auto I = ObjectPairForPathArch.find(std::make_pair(Path, ArchName)); auto I = ObjectPairForPathArch.find(std::make_pair(Path, ArchName));

if (I != ObjectPairForPathArch.end()) if (I != ObjectPairForPathArch.end())

return I->second; return I->second;

auto ObjOrErr = getOrCreateObject(Path, ArchName); auto ObjOrErr = getOrCreateObject(Path, ArchName);

if (!ObjOrErr) { if (!ObjOrErr) {

ObjectPairForPathArch.emplace(std::make_pair(Path, ArchName), ObjectPairForPathArch.emplace(std::make_pair(Path, ArchName),

ObjectPair(nullptr, nullptr)); ObjectPair(nullptr, nullptr));

return ObjOrErr.takeError(); return ObjOrErr.takeError();

} }

ObjectFile *Obj = ObjOrErr.get(); ObjectFile *Obj = ObjOrErr.get();

assert(Obj != nullptr); assert(Obj != nullptr);

MaskRayUnsubmitted

Not Done

There was a -Wunused-variable warning in -DLLVM_ENABLE_ASSERTIONS=off builds. Fixed by f8701a30f648dd07d472c108f90fa675c1198c46

MaskRay: There was a -Wunused-variable warning in -DLLVM_ENABLE_ASSERTIONS=off builds. Fixed by…

ObjectFile *DbgObj = nullptr; ObjectFile *DbgObj = nullptr;

if (auto MachObj = dyn_cast<const MachOObjectFile>(Obj)) if (auto MachObj = dyn_cast<const MachOObjectFile>(Obj))

DbgObj = lookUpDsymFile(Path, MachObj, ArchName); DbgObj = lookUpDsymFile(Path, MachObj, ArchName);

else if (auto ELFObj = dyn_cast<const ELFObjectFileBase>(Obj)) else if (auto ELFObj = dyn_cast<const ELFObjectFileBase>(Obj))

DbgObj = lookUpBuildIDObject(Path, ELFObj, ArchName); DbgObj = lookUpBuildIDObject(Path, ELFObj, ArchName);

if (!DbgObj) if (!DbgObj)

DbgObj = lookUpDebuglinkObject(Path, Obj, ArchName); DbgObj = lookUpDebuglinkObject(Path, Obj, ArchName);

▲ Show 20 Lines • Show All 123 Lines • ▼ Show 20 Lines LLVMSymbolizer::getOrCreateModuleInfo(const ObjectFile &Obj) {

if (I != Modules.end()) if (I != Modules.end())

return I->second.get(); return I->second.get();

std::unique_ptr<DIContext> Context = DWARFContext::create(Obj); std::unique_ptr<DIContext> Context = DWARFContext::create(Obj);

// FIXME: handle COFF object with PDB info to use PDBContext // FIXME: handle COFF object with PDB info to use PDBContext

return createModuleInfo(&Obj, std::move(Context), ObjName); return createModuleInfo(&Obj, std::move(Context), ObjName);

} }

Expected<SymbolizableModule *>

LLVMSymbolizer::getOrCreateModuleInfo(ArrayRef<uint8_t> BuildID) {

StringRef BuildIDStr = getBuildIDStr(BuildID);

auto I = BuildIDPaths.find(BuildIDStr);

if (I != BuildIDPaths.end() && I->second.empty()) {

// An error has already been reported.

return nullptr;

}

std::string Path;

if (!getOrFindDebugBinary(BuildID, Path)) {

// An error has not yet been reported.

return createStringError(errc::no_such_file_or_directory,

Twine("could not find build ID '") +

toHex(BuildID) + "'");

}

return getOrCreateModuleInfo(Path);

}

namespace { namespace {

// Undo these various manglings for Win32 extern "C" functions: // Undo these various manglings for Win32 extern "C" functions:

// cdecl - _foo // cdecl - _foo

// stdcall - _foo@12 // stdcall - _foo@12

// fastcall - @foo@12 // fastcall - @foo@12

// vectorcall - foo@@12 // vectorcall - foo@@12

// These are all different linkage names for 'foo'. // These are all different linkage names for 'foo'.

▲ Show 20 Lines • Show All 51 Lines • Show Last 20 Lines

llvm/test/tools/llvm-symbolizer/debuginfod-bad-build-id.test

This file was added.

				RUN: not llvm-symbolizer --build-id=not_a_hex_string 0x1234 2>&1 \| FileCheck %s

				CHECK: --build-id=: expected a build ID, but got 'not_a_hex_string'

llvm/test/tools/llvm-symbolizer/debuginfod-build-id-and-obj.test

This file was added.

				RUN: not llvm-symbolizer --build-id=abc --obj=bad 0x1234 2>&1 \| FileCheck %s

				CHECK: error: cannot specify both --build-id and --obj

llvm/test/tools/llvm-symbolizer/debuginfod-missing-build-id.test

This file was added.

				RUN: llvm-symbolizer --build-id=abad 0x1234 0x5678 2>&1 \| FileCheck %s

				CHECK: LLVMSymbolizer: error reading file: could not find build ID 'ABAD'
				CHECK: ??
				CHECK: ??:0:0
				CHECK-NOT: LLVMSymbolizer
				CHECK: ??
				CHECK: ??:0:0

llvm/test/tools/llvm-symbolizer/debuginfod.test

	Show All 19 Lines
	RUN: llvm-objcopy --keep-section=.debug_info %p/Inputs/addr.exe \			RUN: llvm-objcopy --keep-section=.debug_info %p/Inputs/addr.exe \
	RUN: %t/llvmcache-9800707741016212219			RUN: %t/llvmcache-9800707741016212219

	# The symbolizer should call the debuginfod client library, which finds the			# The symbolizer should call the debuginfod client library, which finds the
	# debuginfo placed in the cache, enabling symbolization of the address.			# debuginfo placed in the cache, enabling symbolization of the address.
	RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \			RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \
	RUN: --obj=%t/addr.exe 0x40054d \| FileCheck %s --check-prefix=FOUND			RUN: --obj=%t/addr.exe 0x40054d \| FileCheck %s --check-prefix=FOUND
	FOUND: {{[/\]+}}tmp{{[/\]+}}x.c:14:0			FOUND: {{[/\]+}}tmp{{[/\]+}}x.c:14:0

				# This should also work if the build ID is provided.
				RUN: env DEBUGINFOD_CACHE_PATH=%t llvm-symbolizer \
				RUN: --build-id=127da749021c1fc1a58cba734a1f542cbe2b7ce4 0x40054d \| \
				RUN: FileCheck %s --check-prefix=FOUND

llvm/tools/llvm-symbolizer/Opts.td

	Show All 15 Lines
	def grp_mach_o : OptionGroup<"kind">,			def grp_mach_o : OptionGroup<"kind">,
	HelpText<"llvm-symbolizer Mach-O Specific Options">;			HelpText<"llvm-symbolizer Mach-O Specific Options">;

	def addresses : F<"addresses", "Show address before line information">;			def addresses : F<"addresses", "Show address before line information">;
	defm adjust_vma			defm adjust_vma
	: Eq<"adjust-vma", "Add specified offset to object file addresses">,			: Eq<"adjust-vma", "Add specified offset to object file addresses">,
	MetaVarName<"<offset>">;			MetaVarName<"<offset>">;
	def basenames : Flag<["--"], "basenames">, HelpText<"Strip directory names from paths">;			def basenames : Flag<["--"], "basenames">, HelpText<"Strip directory names from paths">;
				defm build_id : Eq<"build-id", "Build ID used to look up the object file">;
	defm debug_file_directory : Eq<"debug-file-directory", "Path to directory where to look for debug files">, MetaVarName<"<dir>">;			defm debug_file_directory : Eq<"debug-file-directory", "Path to directory where to look for debug files">, MetaVarName<"<dir>">;
	defm default_arch			defm default_arch
	: Eq<"default-arch", "Default architecture (for multi-arch objects)">,			: Eq<"default-arch", "Default architecture (for multi-arch objects)">,
	Group<grp_mach_o>;			Group<grp_mach_o>;
	defm demangle : B<"demangle", "Demangle function names", "Don't demangle function names">;			defm demangle : B<"demangle", "Demangle function names", "Don't demangle function names">;
	def functions : F<"functions", "Print function name for a given address">;			def functions : F<"functions", "Print function name for a given address">;
	def functions_EQ : Joined<["--"], "functions=">, HelpText<"Print function name for a given address">, Values<"none,short,linkage">;			def functions_EQ : Joined<["--"], "functions=">, HelpText<"Print function name for a given address">, Values<"none,short,linkage">;
	def help : F<"help", "Display this help">;			def help : F<"help", "Display this help">;
	▲ Show 20 Lines • Show All 49 Lines • Show Last 20 Lines

llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp

Show All 9 Lines

// tuples (module name, module offset) to code locations (function name, // tuples (module name, module offset) to code locations (function name,

// file, line number, column number). It is targeted for compiler-rt tools // file, line number, column number). It is targeted for compiler-rt tools

// (especially AddressSanitizer and ThreadSanitizer) that can use it // (especially AddressSanitizer and ThreadSanitizer) that can use it

// to symbolize stack traces in their error reports. // to symbolize stack traces in their error reports.

// //

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

#include "Opts.inc" #include "Opts.inc"

#include "llvm/ADT/StringExtras.h"

#include "llvm/ADT/StringRef.h" #include "llvm/ADT/StringRef.h"

#include "llvm/Config/config.h" #include "llvm/Config/config.h"

#include "llvm/DebugInfo/Symbolize/DIPrinter.h" #include "llvm/DebugInfo/Symbolize/DIPrinter.h"

#include "llvm/DebugInfo/Symbolize/Symbolize.h" #include "llvm/DebugInfo/Symbolize/Symbolize.h"

#include "llvm/Debuginfod/HTTPClient.h" #include "llvm/Debuginfod/HTTPClient.h"

#include "llvm/Option/Arg.h" #include "llvm/Option/Arg.h"

#include "llvm/Option/ArgList.h" #include "llvm/Option/ArgList.h"

#include "llvm/Option/Option.h" #include "llvm/Option/Option.h"

▲ Show 20 Lines • Show All 71 Lines • ▼ Show 20 Lines

enum class OutputStyle { LLVM, GNU, JSON }; enum class OutputStyle { LLVM, GNU, JSON };

enum class Command { enum class Command {

Code, Code,

Data, Data,

Frame, Frame,

}; };

static bool parseCommand(StringRef BinaryName, bool IsAddr2Line, static bool parseCommand(StringRef BinaryName, ArrayRef<uint8_t> BuildID,

StringRef InputString, Command &Cmd, bool IsAddr2Line, StringRef InputString, Command &Cmd,

std::string &ModuleName, uint64_t &ModuleOffset) { std::string &ModuleName, uint64_t &ModuleOffset) {

const char kDelimiters[] = " \n\r"; const char kDelimiters[] = " \n\r";

ModuleName = ""; ModuleName = "";

if (InputString.consume_front("CODE ")) { if (InputString.consume_front("CODE ")) {

Cmd = Command::Code; Cmd = Command::Code;

} else if (InputString.consume_front("DATA ")) { } else if (InputString.consume_front("DATA ")) {

Cmd = Command::Data; Cmd = Command::Data;

} else if (InputString.consume_front("FRAME ")) { } else if (InputString.consume_front("FRAME ")) {

Cmd = Command::Frame; Cmd = Command::Frame;

} else { } else {

// If no cmd, assume it's CODE. // If no cmd, assume it's CODE.

Cmd = Command::Code; Cmd = Command::Code;

} }

const char *Pos = InputString.data(); const char *Pos = InputString.data();

// Skip delimiters and parse input filename (if needed). // Skip delimiters and parse input filename (if needed).

if (BinaryName.empty()) { if (BinaryName.empty() && BuildID.empty()) {

Pos += strspn(Pos, kDelimiters); Pos += strspn(Pos, kDelimiters);

if (*Pos == '"' || *Pos == '\'') { if (*Pos == '"' || *Pos == '\'') {

char Quote = *Pos; char Quote = *Pos;

Pos++; Pos++;

const char *End = strchr(Pos, Quote); const char *End = strchr(Pos, Quote);

if (!End) if (!End)

return false; return false;

ModuleName = std::string(Pos, End - Pos); ModuleName = std::string(Pos, End - Pos);

Show All 12 Lines static bool parseCommand(StringRef BinaryName, ArrayRef<uint8_t> BuildID,

StringRef Offset(Pos, OffsetLength); StringRef Offset(Pos, OffsetLength);

// GNU addr2line assumes the offset is hexadecimal and allows a redundant // GNU addr2line assumes the offset is hexadecimal and allows a redundant

// "0x" or "0X" prefix; do the same for compatibility. // "0x" or "0X" prefix; do the same for compatibility.

if (IsAddr2Line) if (IsAddr2Line)

Offset.consume_front("0x") || Offset.consume_front("0X"); Offset.consume_front("0x") || Offset.consume_front("0X");

return !Offset.getAsInteger(IsAddr2Line ? 16 : 0, ModuleOffset); return !Offset.getAsInteger(IsAddr2Line ? 16 : 0, ModuleOffset);

} }

static void symbolizeInput(const opt::InputArgList &Args, uint64_t AdjustVMA, template <typename T>

bool IsAddr2Line, OutputStyle Style, void executeCommand(StringRef ModuleName, const T &ModuleSpec, Command Cmd,

StringRef InputString, LLVMSymbolizer &Symbolizer, uint64_t Offset, uint64_t AdjustVMA, bool ShouldInline,

OutputStyle Style, LLVMSymbolizer &Symbolizer,

DIPrinter &Printer) { DIPrinter &Printer) {

Command Cmd;

std::string ModuleName;

uint64_t Offset = 0;

if (!parseCommand(Args.getLastArgValue(OPT_obj_EQ), IsAddr2Line,

StringRef(InputString), Cmd, ModuleName, Offset)) {

Printer.printInvalidCommand({ModuleName, None}, InputString);

return;

}

uint64_t AdjustedOffset = Offset - AdjustVMA; uint64_t AdjustedOffset = Offset - AdjustVMA;

object::SectionedAddress Address = {AdjustedOffset,

object::SectionedAddress::UndefSection};

if (Cmd == Command::Data) { if (Cmd == Command::Data) {

Expected<DIGlobal> ResOrErr = Symbolizer.symbolizeData( Expected<DIGlobal> ResOrErr = Symbolizer.symbolizeData(ModuleSpec, Address);

ModuleName, {AdjustedOffset, object::SectionedAddress::UndefSection});

print({ModuleName, Offset}, ResOrErr, Printer); print({ModuleName, Offset}, ResOrErr, Printer);

} else if (Cmd == Command::Frame) { } else if (Cmd == Command::Frame) {

Expected<std::vector<DILocal>> ResOrErr = Symbolizer.symbolizeFrame( Expected<std::vector<DILocal>> ResOrErr =

ModuleName, {AdjustedOffset, object::SectionedAddress::UndefSection}); Symbolizer.symbolizeFrame(ModuleSpec, Address);

print({ModuleName, Offset}, ResOrErr, Printer); print({ModuleName, Offset}, ResOrErr, Printer);

} else if (Args.hasFlag(OPT_inlines, OPT_no_inlines, !IsAddr2Line)) { } else if (ShouldInline) {

Expected<DIInliningInfo> ResOrErr = Symbolizer.symbolizeInlinedCode( Expected<DIInliningInfo> ResOrErr =

ModuleName, {AdjustedOffset, object::SectionedAddress::UndefSection}); Symbolizer.symbolizeInlinedCode(ModuleSpec, Address);

print({ModuleName, Offset}, ResOrErr, Printer); print({ModuleName, Offset}, ResOrErr, Printer);

} else if (Style == OutputStyle::GNU) { } else if (Style == OutputStyle::GNU) {

// With PrintFunctions == FunctionNameKind::LinkageName (default) // With PrintFunctions == FunctionNameKind::LinkageName (default)

// and UseSymbolTable == true (also default), Symbolizer.symbolizeCode() // and UseSymbolTable == true (also default), Symbolizer.symbolizeCode()

// may override the name of an inlined function with the name of the topmost // may override the name of an inlined function with the name of the topmost

// caller function in the inlining chain. This contradicts the existing // caller function in the inlining chain. This contradicts the existing

// behavior of addr2line. Symbolizer.symbolizeInlinedCode() overrides only // behavior of addr2line. Symbolizer.symbolizeInlinedCode() overrides only

// the topmost function, which suits our needs better. // the topmost function, which suits our needs better.

Expected<DIInliningInfo> ResOrErr = Symbolizer.symbolizeInlinedCode( Expected<DIInliningInfo> ResOrErr =

ModuleName, {AdjustedOffset, object::SectionedAddress::UndefSection}); Symbolizer.symbolizeInlinedCode(ModuleSpec, Address);

Expected<DILineInfo> Res0OrErr = Expected<DILineInfo> Res0OrErr =

!ResOrErr !ResOrErr

? Expected<DILineInfo>(ResOrErr.takeError()) ? Expected<DILineInfo>(ResOrErr.takeError())

: ((ResOrErr->getNumberOfFrames() == 0) ? DILineInfo() : ((ResOrErr->getNumberOfFrames() == 0) ? DILineInfo()

: ResOrErr->getFrame(0)); : ResOrErr->getFrame(0));

print({ModuleName, Offset}, Res0OrErr, Printer); print({ModuleName, Offset}, Res0OrErr, Printer);

} else { } else {

Expected<DILineInfo> ResOrErr = Symbolizer.symbolizeCode( Expected<DILineInfo> ResOrErr =

ModuleName, {AdjustedOffset, object::SectionedAddress::UndefSection}); Symbolizer.symbolizeCode(ModuleSpec, Address);

print({ModuleName, Offset}, ResOrErr, Printer); print({ModuleName, Offset}, ResOrErr, Printer);

} }

static void symbolizeInput(const opt::InputArgList &Args,

ArrayRef<uint8_t> BuildID, uint64_t AdjustVMA,

bool IsAddr2Line, OutputStyle Style,

StringRef InputString, LLVMSymbolizer &Symbolizer,

DIPrinter &Printer) {

Command Cmd;

std::string ModuleName;

uint64_t Offset = 0;

if (!parseCommand(Args.getLastArgValue(OPT_obj_EQ), BuildID, IsAddr2Line,

StringRef(InputString), Cmd, ModuleName, Offset)) {

Printer.printInvalidCommand({ModuleName, None}, InputString);

return;

}

bool ShouldInline = Args.hasFlag(OPT_inlines, OPT_no_inlines, !IsAddr2Line);

if (!BuildID.empty()) {

assert(ModuleName.empty());

std::string BuildIDStr = toHex(BuildID);

executeCommand(BuildIDStr, BuildID, Cmd, Offset, AdjustVMA, ShouldInline,

Style, Symbolizer, Printer);

} else {

executeCommand(ModuleName, ModuleName, Cmd, Offset, AdjustVMA, ShouldInline,

Style, Symbolizer, Printer);

}

static void printHelp(StringRef ToolName, const SymbolizerOptTable &Tbl, static void printHelp(StringRef ToolName, const SymbolizerOptTable &Tbl,

raw_ostream &OS) { raw_ostream &OS) {

const char HelpText[] = " [options] addresses..."; const char HelpText[] = " [options] addresses...";

Tbl.printHelp(OS, (ToolName + HelpText).str().c_str(), Tbl.printHelp(OS, (ToolName + HelpText).str().c_str(),

ToolName.str().c_str()); ToolName.str().c_str());

// TODO Replace this with OptTable API once it adds extrahelp support. // TODO Replace this with OptTable API once it adds extrahelp support.

OS << "\nPass @FILE as argument to read options from FILE.\n"; OS << "\nPass @FILE as argument to read options from FILE.\n";

} }

▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines static FunctionNameKind decideHowToPrintFunctions(const opt::InputArgList &Args,

if (const opt::Arg *A = Args.getLastArg(OPT_functions_EQ)) if (const opt::Arg *A = Args.getLastArg(OPT_functions_EQ))

return StringSwitch<FunctionNameKind>(A->getValue()) return StringSwitch<FunctionNameKind>(A->getValue())

.Case("none", FunctionNameKind::None) .Case("none", FunctionNameKind::None)

.Case("short", FunctionNameKind::ShortName) .Case("short", FunctionNameKind::ShortName)

.Default(FunctionNameKind::LinkageName); .Default(FunctionNameKind::LinkageName);

return IsAddr2Line ? FunctionNameKind::None : FunctionNameKind::LinkageName; return IsAddr2Line ? FunctionNameKind::None : FunctionNameKind::LinkageName;

} }

SmallVector<uint8_t> parseBuildIDArg(const opt::InputArgList &Args, int ID) {

phosekUnsubmitted

Done

This is only invoked from one location so I'd consider inlining it.

phosek: This is only invoked from one location so I'd consider inlining it.

if (const opt::Arg *A = Args.getLastArg(ID)) {

StringRef V(A->getValue());

std::string Bytes;

if (!tryGetFromHex(V, Bytes)) {

errs() << A->getSpelling() + ": expected a build ID, but got '" + V +

"'\n";

exit(1);

}

ArrayRef<uint8_t> BuildID(reinterpret_cast<const uint8_t *>(Bytes.data()),

Bytes.size());

return SmallVector<uint8_t>(BuildID.begin(), BuildID.end());

}

return {};

}

int main(int argc, char **argv) { int main(int argc, char **argv) {

InitLLVM X(argc, argv); InitLLVM X(argc, argv);

// The HTTPClient must be initialized for use by the debuginfod client. // The HTTPClient must be initialized for use by the debuginfod client.

HTTPClient::initialize(); HTTPClient::initialize();

sys::InitializeCOMRAII COM(sys::COMThreadingMode::MultiThreaded); sys::InitializeCOMRAII COM(sys::COMThreadingMode::MultiThreaded);

bool IsAddr2Line = sys::path::stem(argv[0]).contains("addr2line"); bool IsAddr2Line = sys::path::stem(argv[0]).contains("addr2line");

BumpPtrAllocator A; BumpPtrAllocator A;

▲ Show 20 Lines • Show All 53 Lines • ▼ Show 20 Lines if (const opt::Arg *A = Args.getLastArg(OPT_output_style_EQ)) {

if (strcmp(A->getValue(), "GNU") == 0) if (strcmp(A->getValue(), "GNU") == 0)

Style = OutputStyle::GNU; Style = OutputStyle::GNU;

else if (strcmp(A->getValue(), "JSON") == 0) else if (strcmp(A->getValue(), "JSON") == 0)

Style = OutputStyle::JSON; Style = OutputStyle::JSON;

else else

Style = OutputStyle::LLVM; Style = OutputStyle::LLVM;

} }

SmallVector<uint8_t> BuildID = parseBuildIDArg(Args, OPT_build_id_EQ);

if (!BuildID.empty() && !Args.getLastArgValue(OPT_obj_EQ).empty()) {

phosekUnsubmitted

Not Done

It might be more efficient to check if both --obj and --build-id was specified before parsing --build-id and allocating memory for build ID, but it only makes a difference for the error path so it's not particularly important.

phosek: It might be more efficient to check if both `--obj` and `--build-id` was specified before…

phosekUnsubmitted

Done

Style = OutputStyle::LLVM;

}

- if (!Args.getLastArgValue(OPT_build_id_EQ).empty() &&

- !Args.getLastArgValue(OPT_obj_EQ).empty()) {

+ if (!Args.hasArg(OPT_build_id_EQ) &&

+ !Args.hasArg(OPT_obj_EQ)) {

errs() << "error: cannot specify both --build-id and --obj\n";

I'd use hasArg.

phosek: I'd use `hasArg`.

mysterymathAuthorUnsubmitted

Done

There's a use to having a value that means "flag unset": you can unset flags via concatenation, which can help when using utilities in scripts. E.g., an environment variable is set up with a ton of flags outside your control, and you want to remove exactly one of them. This can require either some tricky string manipulation or just appending "--obj= ", depending on the semantics of '--obj'.

That being said, I did a quick random sampling of flags in LLVM, and it's wildly variable whether or not a given flag offers this property. At least some of that is probably due to GCC backcompat concerns, but it's definitely not a property a user can rely on across the board, so doesn't seem to be much point in making more flags behave that way.

mysterymath: There's a use to having a value that means "flag unset": you can unset flags via concatenation…

errs() << "error: cannot specify both --build-id and --obj\n";

return EXIT_FAILURE;

}

LLVMSymbolizer Symbolizer(Opts); LLVMSymbolizer Symbolizer(Opts);

std::unique_ptr<DIPrinter> Printer; std::unique_ptr<DIPrinter> Printer;

if (Style == OutputStyle::GNU) if (Style == OutputStyle::GNU)

Printer = std::make_unique<GNUPrinter>(outs(), errs(), Config); Printer = std::make_unique<GNUPrinter>(outs(), errs(), Config);

else if (Style == OutputStyle::JSON) else if (Style == OutputStyle::JSON)

Printer = std::make_unique<JSONPrinter>(outs(), Config); Printer = std::make_unique<JSONPrinter>(outs(), Config);

else else

Printer = std::make_unique<LLVMPrinter>(outs(), errs(), Config); Printer = std::make_unique<LLVMPrinter>(outs(), errs(), Config);

std::vector<std::string> InputAddresses = Args.getAllArgValues(OPT_INPUT); std::vector<std::string> InputAddresses = Args.getAllArgValues(OPT_INPUT);

if (InputAddresses.empty()) { if (InputAddresses.empty()) {

const int kMaxInputStringLength = 1024; const int kMaxInputStringLength = 1024;

char InputString[kMaxInputStringLength]; char InputString[kMaxInputStringLength];

while (fgets(InputString, sizeof(InputString), stdin)) { while (fgets(InputString, sizeof(InputString), stdin)) {

// Strip newline characters. // Strip newline characters.

jhendersonUnsubmitted

Done

This is untested.

Also, the error format is not consistent with the LLVM coding standards or what llvm-symbolizer prints when there's a missing file, as far as I can see.

jhenderson: This is untested. Also, the error format is not consistent with the LLVM coding standards or…

mysterymathAuthorUnsubmitted

Done

There were some considerable semantic divergences between --obj and --build-id; the former would return EXIT_SUCCESS and keep going, producing only one error, while the latter would immediately exit with a nonzero status.

The current approach also doesn't extend well to handling multiple build IDs specified on a line-by-line basis, which is a near-term TODO.

Accordingly, I've reworked this functionality to take place inside the Symbolizer, rather than in the wrapping tool. This makes BuildIDs a first-class mechanism that can be used to specify modules; it thus intrinsically has similar error behavior to the other ways of specifying modules. This will also make it easy to build syntax into the stdin mechanism to switch out the module specifier type.

mysterymath: There were some considerable semantic divergences between --obj and --build-id; the former…

std::string StrippedInputString(InputString); std::string StrippedInputString(InputString);

llvm::erase_if(StrippedInputString, llvm::erase_if(StrippedInputString,

[](char c) { return c == '\r' || c == '\n'; }); [](char c) { return c == '\r' || c == '\n'; });

symbolizeInput(Args, AdjustVMA, IsAddr2Line, Style, StrippedInputString, symbolizeInput(Args, BuildID, AdjustVMA, IsAddr2Line, Style,

Symbolizer, *Printer); StrippedInputString, Symbolizer, *Printer);

outs().flush(); outs().flush();

} }

} else { } else {

Printer->listBegin(); Printer->listBegin();

for (StringRef Address : InputAddresses) for (StringRef Address : InputAddresses)

symbolizeInput(Args, AdjustVMA, IsAddr2Line, Style, Address, Symbolizer, symbolizeInput(Args, BuildID, AdjustVMA, IsAddr2Line, Style, Address,

*Printer); Symbolizer, *Printer);

Printer->listEnd(); Printer->listEnd();

} }

return 0; return 0;

} }

This is an archive of the discontinued LLVM Phabricator instance.

[Symbolizer] Add Build ID flag to llvm-symbolizer.
ClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 405818

llvm/docs/CommandGuide/llvm-symbolizer.rst

llvm/include/llvm/DebugInfo/Symbolize/Symbolize.h

llvm/lib/DebugInfo/Symbolize/Symbolize.cpp

llvm/test/tools/llvm-symbolizer/debuginfod-bad-build-id.test

llvm/test/tools/llvm-symbolizer/debuginfod-build-id-and-obj.test

llvm/test/tools/llvm-symbolizer/debuginfod-missing-build-id.test

llvm/test/tools/llvm-symbolizer/debuginfod.test

llvm/tools/llvm-symbolizer/Opts.td

llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[Symbolizer] Add Build ID flag to llvm-symbolizer.ClosedPublic

Details

Diff Detail

Unit TestsFailed

Event Timeline

Revision Contents

Diff 405818

llvm/docs/CommandGuide/llvm-symbolizer.rst

llvm/include/llvm/DebugInfo/Symbolize/Symbolize.h

llvm/lib/DebugInfo/Symbolize/Symbolize.cpp

llvm/test/tools/llvm-symbolizer/debuginfod-bad-build-id.test

llvm/test/tools/llvm-symbolizer/debuginfod-build-id-and-obj.test

llvm/test/tools/llvm-symbolizer/debuginfod-missing-build-id.test

llvm/test/tools/llvm-symbolizer/debuginfod.test

llvm/tools/llvm-symbolizer/Opts.td

llvm/tools/llvm-symbolizer/llvm-symbolizer.cpp

[Symbolizer] Add Build ID flag to llvm-symbolizer.
ClosedPublic