This is an archive of the discontinued LLVM Phabricator instance.

LTO: Reduce memory consumption by creating an in-memory symbol table for InputFiles. NFCI.
Closed · Public

Authored by pcc on Mar 24 2017, 11:14 PM.

Details

Reviewers

tejohnson
mehdi_amini

Commits

rGd9717aa0e420: LTO: Reduce memory consumption by creating an in-memory symbol table for…
rLLD299168: LTO: Reduce memory consumption by creating an in-memory symbol table for…
rL299168: LTO: Reduce memory consumption by creating an in-memory symbol table for…

Summary

Introduce symbol table data structures that can be potentially written to
disk, have the LTO library build those data structures using temporarily
constructed modules and redirect the LTO library implementation to go through
those data structures. This allows us to remove the LLVMContext and Modules
owned by InputFile.
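
As an illustration of what "data structures that can be potentially written to disk" can look like, here is a minimal sketch; it is not code from this patch. Records refer to strings by offset into a shared string table instead of by pointer, so neither buffer contains addresses and both could in principle be copied or written out verbatim. The names `StrRef`, `SymbolRecord`, `TableBuilder` and `readName` are hypothetical.

```cpp
#include <cassert>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical, simplified analogue of an offset-based symbol record.
// A string is identified by its offset into a NUL-terminated string table,
// so the record holds no pointers and is position independent.
struct StrRef {
  uint32_t Offset;
};

struct SymbolRecord {
  StrRef Name;
  uint32_t Flags;
};

// Accumulates a string table and a symbol table side by side.
class TableBuilder {
public:
  std::vector<char> Strtab;
  std::vector<SymbolRecord> Symtab;

  // Appends S (plus a terminating NUL) and returns a reference to it.
  StrRef addString(const std::string &S) {
    StrRef R{static_cast<uint32_t>(Strtab.size())};
    Strtab.insert(Strtab.end(), S.begin(), S.end());
    Strtab.push_back('\0');
    return R;
  }

  void addSymbol(const std::string &Name, uint32_t Flags) {
    Symtab.push_back(SymbolRecord{addString(Name), Flags});
  }
};

// Resolves a record's name against the string table it was built with.
inline std::string readName(const std::vector<char> &Strtab,
                            const SymbolRecord &Sym) {
  assert(Sym.Name.Offset < Strtab.size() && "offset outside string table");
  return std::string(Strtab.data() + Sym.Name.Offset);
}
```

Because a record only stores an offset, `readName` gives the same answer against a byte-for-byte copy of the string table, which is the property that would make a later switch to an on-disk format possible.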

With this change I measured a peak memory consumption decrease from 5.4GB to
2.8GB in a no-op incremental ThinLTO link of Chromium on Linux. The impact on
memory consumption is larger in COFF linkers where we are currently forced to
materialize all metadata in order to read linker options. Peak memory consumption
linking a large piece of Chromium for Windows with full LTO and debug info decreases
from >64GB (OOM) to 15GB.

Part of PR27551.

Diff Detail

Repository: rL LLVM

Event Timeline

pcc created this revision. Mar 24 2017, 11:14 PM

Herald added subscribers: aprantl, mgorny. Mar 24 2017, 11:14 PM

> With this change I measured a peak memory consumption decrease from 5.4GB to 2.8GB in a no-op incremental ThinLTO link of Chromium on Linux, and total time elapsed decreases from ~61s to ~48s.

That seems to indicate an issue where we don't release resources early enough in the process though. We could also look into this independently of the symbol table work.

Just a couple of notes.

llvm/lib/LTO/LTO.cpp
515 ↗	(On Diff #93032)	This may appear to not be NFC but it looks like this part has already been moved to the backend.
601 ↗	(On Diff #93032)	This is not actually NFC but a fix for a bug found by a colleague. I'll split it out into a separate change later.

rafael added inline comments. Mar 27 2017, 8:38 AM

lld/COFF/InputFiles.cpp
347 ↗	(On Diff #93032)	These small utility predicates are awesome. Would you mind committing them first with the current implementation? It would be a nice improvement and make this patch easier to read.
llvm/include/llvm/Object/IRSymtab.h
31 ↗	(On Diff #93032)	why do you use this for Symbol? If we are storing something by value it is better to just use a uint32_t.

getting phab to send email.

pcc added a subscriber: inglorion. Mar 27 2017, 12:52 PM

pcc added inline comments.

lld/COFF/InputFiles.cpp
347 ↗	(On Diff #93032)	Will do
llvm/include/llvm/Object/IRSymtab.h
31 ↗	(On Diff #93032)	If we eventually want to write these data structures out to files I guess we will need to specify an endianness. The plan I have in mind is:
- define data structures that can be potentially written to files, but use them only as an in-memory format to start with (that is what this patch does)
- potentially improve them in-tree (e.g. by sharing the string table between the module and the symbol table)
- flip the switch and start writing them to files
llvm/lib/LTO/LTO.cpp
601 ↗	(On Diff #93032)	Correction: this is NFC. The bug is elsewhere. I'll let @inglorion send out a fix in due course.
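
Returning to the endianness point in the IRSymtab.h comment above, a minimal sketch may help. The patch itself uses `support::ulittle32_t` from `llvm/Support/Endian.h`; the `LittleWord` type and the `checkLayout` helper below are hypothetical standalone stand-ins, written only to show why a fixed byte order matters for a format that may later be written to files.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a fixed-endianness 32-bit word. The bytes are
// stored least significant first regardless of the host byte order.
struct LittleWord {
  unsigned char Bytes[4];

  LittleWord() : Bytes{0, 0, 0, 0} {}
  LittleWord(uint32_t V) { *this = V; }

  // Splits the host-order value into little-endian bytes.
  LittleWord &operator=(uint32_t V) {
    for (int I = 0; I != 4; ++I)
      Bytes[I] = static_cast<unsigned char>((V >> (8 * I)) & 0xff);
    return *this;
  }

  // Reassembles the host-order value from the stored bytes.
  operator uint32_t() const {
    uint32_t V = 0;
    for (int I = 0; I != 4; ++I)
      V |= static_cast<uint32_t>(Bytes[I]) << (8 * I);
    return V;
  }
};

static_assert(sizeof(LittleWord) == 4, "must have no padding");

// Sanity check usable from a test: the low byte always comes first.
inline void checkLayout() {
  LittleWord W(0x01020304u);
  assert(W.Bytes[0] == 0x04 && W.Bytes[3] == 0x01);
}
```

A plain `uint32_t` field would have a host-dependent byte layout, so a table produced on one machine could be misread on another; storing explicit bytes avoids that at the cost of a conversion on each access.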

Introduce a symbol reader API

Harbormaster completed remote builds in B5186: Diff 93504. Mar 30 2017, 11:01 AM

pcc edited the summary of this revision. Mar 30 2017, 11:02 AM

I removed the performance claims from the commit message because they were based on bad measurements (too much variance, insufficient runs). My most recent measurements do show a small perf improvement of about 3%, but it may not be statistically significant, and that isn't the most important thing about this change anyway.

This looks good to me. Since it is a big addition please make sure Mehdi or Teresa LGTM it too.

Thanks!

I'm fine with the approach. I didn't review in great detail but I can trust @rafael.

llvm/include/llvm/Object/IRSymtab.h
15 ↗	(On Diff #93032)	I think it should be mentioned that the table contains the information for multiple modules. This is non-intuitive when someone thinks about a "symbol table for IR".
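
The multi-module layout mentioned in this comment can be pictured with a small sketch. It assumes a flat list of symbols plus one half-open [begin, end) index pair per module, similar in spirit to the `ModuleSymIndices` member in this patch's LTO.h; the names `FileSymbols`, `addModule` and `moduleSymbols` are hypothetical, and symbols are reduced to plain strings.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical model of one bitcode file's symbol table: all symbols live in
// a single flat vector, and each module owns a half-open [begin, end) slice.
struct FileSymbols {
  std::vector<std::string> Symbols;
  std::vector<std::pair<std::size_t, std::size_t>> ModuleRanges;

  // Appends one module's symbols and records the slice they occupy.
  void addModule(const std::vector<std::string> &ModSyms) {
    std::size_t Begin = Symbols.size();
    Symbols.insert(Symbols.end(), ModSyms.begin(), ModSyms.end());
    ModuleRanges.emplace_back(Begin, Symbols.size());
  }

  // Returns a copy of the symbols belonging to the I'th module.
  std::vector<std::string> moduleSymbols(std::size_t I) const {
    assert(I < ModuleRanges.size() && "module index out of range");
    const auto &R = ModuleRanges[I];
    return std::vector<std::string>(Symbols.begin() + R.first,
                                    Symbols.begin() + R.second);
  }
};
```

Iterating `Symbols` directly yields the whole file's table, while `moduleSymbols(I)` yields only one module's slice; a module with no symbols simply gets an empty range.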

It's a lot of code but I skimmed most of it and read a few parts in more detail. A few questions/comments.

llvm/include/llvm/Object/IRSymtab.h
123 ↗	(On Diff #93504)	The name "write" here seems unexpected to me, since we aren't writing to disk, for example. The client does a "write" which involves a Writer class, followed by a Reader, when together both are needed to essentially "read" the symbols from Modules. Maybe "buildSymbolTable" or something like that. The Writer is more like a Builder.
llvm/lib/LTO/LTO.cpp
506 ↗	(On Diff #93504)	Before this was embedded within symbol_iterator. What is the impact of moving it here - we only want to skip these symbols in the regular LTO case? It's a little confusing to me that we have two symbol iterations going on in the below loop, one over the InputFile::Symbols and one over the ModuleSymbolTable - why do we need two data structures of symbols now?
llvm/lib/Object/IRSymtab.cpp
24 ↗	(On Diff #93504)	Needs some comments.

Address review comments

pcc added inline comments. Mar 30 2017, 3:02 PM

llvm/include/llvm/Object/IRSymtab.h
123 ↗	(On Diff #93504)	Renamed to "build" (likewise to "Builder").
llvm/lib/LTO/LTO.cpp
506 ↗	(On Diff #93504)	> Before this was embedded within symbol_iterator. What is the impact of moving it here - we only want to skip these symbols in the regular LTO case?

We need to skip the same set of symbols in the module as we do in the InputFile's symbol table. The logic in Skip() duplicates the condition on line 361 of LTO.cpp where we only add global/non-format-specific symbols to the InputFile. It may be worth eliminating some of the duplication here, but I'd have to think about it more carefully.

> It's a little confusing to me that we have two symbol iterations going on in the below loop, one over the InputFile::Symbols and one over the ModuleSymbolTable - why do we need two data structures of symbols now?

This function needs some attributes from the symbol table as well as direct access to module symbols for the IRMover. Because symbol table symbols were previously implemented in terms of module symbols, the symbol table list also served the purpose of providing access to module symbols. This change separated the symbol table from the module, so now we have three lists: resolution, symbol table symbols and module symbols. In principle the latter two should contain the same information, so strictly we only need resolution and module symbols, but sometimes it's more convenient to pull symbol attributes out of the symbol table. I agree that this is a little confusing, so I've left a comment at the top of this block.
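
The lockstep walk described in this reply can be sketched roughly as follows. This is a simplified model under stated assumptions, not the code in LTO.cpp: resolutions are reduced to plain `int` values, and `ModuleSym`, `shouldSkip` and `pairWithResolutions` are hypothetical names. The idea is that the module lists every symbol, the filtered lists only cover global, non-format-specific ones, and the same predicate has to be applied while walking the module so that the lists stay aligned.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical module-level symbol: the module lists every symbol, including
// ones that never reach the linker-visible symbol table.
struct ModuleSym {
  std::string Name;
  bool Global;
  bool FormatSpecific;
};

// Mirrors the filter described above: only global, non-format-specific
// symbols appear in the symbol table.
inline bool shouldSkip(const ModuleSym &S) {
  return !S.Global || S.FormatSpecific;
}

// Walks the module symbols and the (already filtered) resolution list in
// lockstep. Returns (name, resolution) pairs, or an empty vector if the two
// lists do not line up.
inline std::vector<std::pair<std::string, int>>
pairWithResolutions(const std::vector<ModuleSym> &ModSyms,
                    const std::vector<int> &Resolutions) {
  std::vector<std::pair<std::string, int>> Out;
  std::size_t ResI = 0;
  for (const ModuleSym &S : ModSyms) {
    if (shouldSkip(S))
      continue; // No resolution was recorded for skipped symbols.
    if (ResI == Resolutions.size())
      return {}; // Fewer resolutions than visible symbols.
    Out.emplace_back(S.Name, Resolutions[ResI++]);
  }
  if (ResI != Resolutions.size())
    return {}; // Leftover resolutions: the lists were out of sync.
  assert(Out.size() == Resolutions.size());
  return Out;
}
```

If the skip predicate used here ever diverged from the one used when the filtered list was built, the pairing would silently shift by one, which is why the duplication mentioned above is worth keeping an eye on.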

LGTM

This revision is now accepted and ready to land. Mar 30 2017, 6:27 PM

Closed by commit rL299168: LTO: Reduce memory consumption by creating an in-memory symbol table for… (authored by pcc). Mar 30 2017, 8:05 PM

This revision was automatically updated to reflect the committed changes.

tejohnson mentioned this in D32061: [wip] Bitcode: Write the irsymtab to disk. Apr 18 2017, 9:37 AM

pcc added inline comments. Jun 6 2017, 3:26 PM

llvm/include/llvm/Object/IRSymtab.h
123 ↗	(On Diff #93504)	Every time I go into this code it bugs me a little that we have a reader and a builder (as opposed to a writer). I hate to bring up a bikeshedding topic, and it's not a big deal I guess, but would you mind if I rename this back to write/Writer? In support of my position I point to the many Writer classes we have in LLVM [1] that are not necessarily writing to disk. [1] http://llvm-cs.pcc.me.uk/?q=writer

tejohnson added inline comments. Jun 7 2017, 2:18 PM

llvm/include/llvm/Object/IRSymtab.h
123 ↗	(On Diff #93504)	Given where this is going (towards writing those directly to disk), I'm less concerned with the builder vs writer name so this seems ok to me.

Revision Contents

Path	Size
lld/trunk/COFF/InputFiles.cpp	5 lines
lld/trunk/ELF/InputFiles.cpp	8 lines
llvm/trunk/include/llvm/LTO/LTO.h	205 lines
llvm/trunk/include/llvm/Object/IRSymtab.h	298 lines
llvm/trunk/lib/LTO/LTO.cpp	216 lines
llvm/trunk/lib/Object/CMakeLists.txt	1 line
llvm/trunk/lib/Object/IRSymtab.cpp	228 lines
llvm/trunk/tools/gold/gold-plugin.cpp	4 lines

Diff 93578

lld/trunk/COFF/InputFiles.cpp

[349 lines not shown; context: if (ObjSym.isUndefined()) {]
Sym = Symtab->addCommon(this, SymName, ObjSym.getCommonSize());		Sym = Symtab->addCommon(this, SymName, ObjSym.getCommonSize());
} else if (ObjSym.isWeak() && ObjSym.isIndirect()) {		} else if (ObjSym.isWeak() && ObjSym.isIndirect()) {
// Weak external.		// Weak external.
Sym = Symtab->addUndefined(SymName, this, true);		Sym = Symtab->addUndefined(SymName, this, true);
std::string Fallback = ObjSym.getCOFFWeakExternalFallback();		std::string Fallback = ObjSym.getCOFFWeakExternalFallback();
SymbolBody *Alias = Symtab->addUndefined(Saver.save(Fallback));		SymbolBody *Alias = Symtab->addUndefined(Saver.save(Fallback));
checkAndSetWeakAlias(Symtab, this, Sym->body(), Alias);		checkAndSetWeakAlias(Symtab, this, Sym->body(), Alias);
} else {		} else {
Expected<int> ComdatIndex = ObjSym.getComdatIndex();		bool IsCOMDAT = ObjSym.getComdatIndex() != -1;
bool IsCOMDAT = ComdatIndex && *ComdatIndex != -1;
Sym = Symtab->addRegular(this, SymName, IsCOMDAT);		Sym = Symtab->addRegular(this, SymName, IsCOMDAT);
}		}
SymbolBodies.push_back(Sym->body());		SymbolBodies.push_back(Sym->body());
}		}
Directives = check(Obj->getLinkerOpts());		Directives = Obj->getCOFFLinkerOpts();
}		}

MachineTypes BitcodeFile::getMachineType() {		MachineTypes BitcodeFile::getMachineType() {
Expected<std::string> ET = getBitcodeTargetTriple(MB);		Expected<std::string> ET = getBitcodeTargetTriple(MB);
if (!ET)		if (!ET)
return IMAGE_FILE_MACHINE_UNKNOWN;		return IMAGE_FILE_MACHINE_UNKNOWN;
switch (Triple(*ET).getArch()) {		switch (Triple(*ET).getArch()) {
case Triple::x86_64:		case Triple::x86_64:
[32 lines not shown]

lld/trunk/ELF/InputFiles.cpp

[812 lines not shown; context: static Symbol *createBitcodeSymbol(const std::vector<bool> &KeptComdats,]
BitcodeFile *F) {		BitcodeFile *F) {
StringRef NameRef = Saver.save(ObjSym.getName());		StringRef NameRef = Saver.save(ObjSym.getName());
uint32_t Binding = ObjSym.isWeak() ? STB_WEAK : STB_GLOBAL;		uint32_t Binding = ObjSym.isWeak() ? STB_WEAK : STB_GLOBAL;

uint8_t Type = ObjSym.isTLS() ? STT_TLS : STT_NOTYPE;		uint8_t Type = ObjSym.isTLS() ? STT_TLS : STT_NOTYPE;
uint8_t Visibility = mapVisibility(ObjSym.getVisibility());		uint8_t Visibility = mapVisibility(ObjSym.getVisibility());
bool CanOmitFromDynSym = ObjSym.canBeOmittedFromSymbolTable();		bool CanOmitFromDynSym = ObjSym.canBeOmittedFromSymbolTable();

int C = check(ObjSym.getComdatIndex(), F->LogName);		int C = ObjSym.getComdatIndex();
if (C != -1 && !KeptComdats[C])		if (C != -1 && !KeptComdats[C])
return Symtab<ELFT>::X->addUndefined(NameRef, /*IsLocal=*/false, Binding,		return Symtab<ELFT>::X->addUndefined(NameRef, /*IsLocal=*/false, Binding,
Visibility, Type, CanOmitFromDynSym,		Visibility, Type, CanOmitFromDynSym,
F);		F);

if (ObjSym.isUndefined())		if (ObjSym.isUndefined())
return Symtab<ELFT>::X->addUndefined(NameRef, /*IsLocal=*/false, Binding,		return Symtab<ELFT>::X->addUndefined(NameRef, /*IsLocal=*/false, Binding,
Visibility, Type, CanOmitFromDynSym,		Visibility, Type, CanOmitFromDynSym,
[20 lines not shown; context: void BitcodeFile::parse(DenseSet<CachedHashStringRef> &ComdatGroups) {]
// taken into consideration at LTO time (which very likely causes undefined		// taken into consideration at LTO time (which very likely causes undefined
// symbols later in the link stage).		// symbols later in the link stage).
MemoryBufferRef MBRef(MB.getBuffer(),		MemoryBufferRef MBRef(MB.getBuffer(),
Saver.save(ArchiveName + MB.getBufferIdentifier() +		Saver.save(ArchiveName + MB.getBufferIdentifier() +
utostr(OffsetInArchive)));		utostr(OffsetInArchive)));
Obj = check(lto::InputFile::create(MBRef), this->LogName);		Obj = check(lto::InputFile::create(MBRef), this->LogName);

std::vector<bool> KeptComdats;		std::vector<bool> KeptComdats;
for (StringRef S : Obj->getComdatTable()) {		for (StringRef S : Obj->getComdatTable())
StringRef N = Saver.save(S);		KeptComdats.push_back(ComdatGroups.insert(CachedHashStringRef(S)).second);
KeptComdats.push_back(ComdatGroups.insert(CachedHashStringRef(N)).second);
}

for (const lto::InputFile::Symbol &ObjSym : Obj->symbols())		for (const lto::InputFile::Symbol &ObjSym : Obj->symbols())
Symbols.push_back(createBitcodeSymbol<ELFT>(KeptComdats, ObjSym, this));		Symbols.push_back(createBitcodeSymbol<ELFT>(KeptComdats, ObjSym, this));
}		}

template <template <class> class T>		template <template <class> class T>
static InputFile *createELFFile(MemoryBufferRef MB) {		static InputFile *createELFFile(MemoryBufferRef MB) {
unsigned char Size;		unsigned char Size;
[172 lines not shown]

llvm/trunk/include/llvm/LTO/LTO.h

[18 lines not shown]
#include "llvm/ADT/MapVector.h"		#include "llvm/ADT/MapVector.h"
#include "llvm/ADT/StringMap.h"		#include "llvm/ADT/StringMap.h"
#include "llvm/ADT/StringSet.h"		#include "llvm/ADT/StringSet.h"
#include "llvm/CodeGen/Analysis.h"		#include "llvm/CodeGen/Analysis.h"
#include "llvm/IR/DiagnosticInfo.h"		#include "llvm/IR/DiagnosticInfo.h"
#include "llvm/IR/ModuleSummaryIndex.h"		#include "llvm/IR/ModuleSummaryIndex.h"
#include "llvm/LTO/Config.h"		#include "llvm/LTO/Config.h"
#include "llvm/Linker/IRMover.h"		#include "llvm/Linker/IRMover.h"
#include "llvm/Object/ModuleSymbolTable.h"		#include "llvm/Object/IRSymtab.h"
#include "llvm/Support/Error.h"		#include "llvm/Support/Error.h"
#include "llvm/Support/ToolOutputFile.h"		#include "llvm/Support/ToolOutputFile.h"
#include "llvm/Support/thread.h"		#include "llvm/Support/thread.h"
#include "llvm/Target/TargetOptions.h"		#include "llvm/Target/TargetOptions.h"
#include "llvm/Transforms/IPO/FunctionImport.h"		#include "llvm/Transforms/IPO/FunctionImport.h"

namespace llvm {		namespace llvm {

[38 lines not shown]
Expected<std::unique_ptr<tool_output_file>>		Expected<std::unique_ptr<tool_output_file>>
setupOptimizationRemarks(LLVMContext &Context, StringRef LTORemarksFilename,		setupOptimizationRemarks(LLVMContext &Context, StringRef LTORemarksFilename,
bool LTOPassRemarksWithHotness, int Count = -1);		bool LTOPassRemarksWithHotness, int Count = -1);

class LTO;		class LTO;
struct SymbolResolution;		struct SymbolResolution;
class ThinBackendProc;		class ThinBackendProc;

/// An input file. This is a wrapper for ModuleSymbolTable that exposes only the		/// An input file. This is a symbol table wrapper that only exposes the
/// information that an LTO client should need in order to do symbol resolution.		/// information that an LTO client should need in order to do symbol resolution.
class InputFile {		class InputFile {
		public:
		class Symbol;

		private:
// FIXME: Remove LTO class friendship once we have bitcode symbol tables.		// FIXME: Remove LTO class friendship once we have bitcode symbol tables.
friend LTO;		friend LTO;
InputFile() = default;		InputFile() = default;

// FIXME: Remove the LLVMContext once we have bitcode symbol tables.		std::vector<BitcodeModule> Mods;
LLVMContext Ctx;		SmallVector<char, 0> Strtab;
struct InputModule;		std::vector<Symbol> Symbols;
std::vector<InputModule> Mods;
ModuleSymbolTable SymTab;

std::vector<StringRef> Comdats;		// [begin, end) for each module
DenseMap<const Comdat *, unsigned> ComdatMap;		std::vector<std::pair<size_t, size_t>> ModuleSymIndices;

		StringRef SourceFileName, COFFLinkerOpts;
		std::vector<StringRef> ComdatTable;

public:		public:
~InputFile();		~InputFile();

/// Create an InputFile.		/// Create an InputFile.
static Expected<std::unique_ptr<InputFile>> create(MemoryBufferRef Object);		static Expected<std::unique_ptr<InputFile>> create(MemoryBufferRef Object);

class symbol_iterator;		/// The purpose of this class is to only expose the symbol information that an
		/// LTO client should need in order to do symbol resolution.
/// This is a wrapper for ArrayRef<ModuleSymbolTable::Symbol>::iterator that		class Symbol : irsymtab::Symbol {
/// exposes only the information that an LTO client should need in order to do
/// symbol resolution.
///
/// This object is ephemeral; it is only valid as long as an iterator obtained
/// from symbols() refers to it.
class Symbol {
friend symbol_iterator;
friend LTO;		friend LTO;

ArrayRef<ModuleSymbolTable::Symbol>::iterator I;
const ModuleSymbolTable &SymTab;
const InputFile *File;
uint32_t Flags;
SmallString<64> Name;

bool shouldSkip() {
return !(Flags & object::BasicSymbolRef::SF_Global) ||
(Flags & object::BasicSymbolRef::SF_FormatSpecific);
}

void skip() {
ArrayRef<ModuleSymbolTable::Symbol>::iterator E = SymTab.symbols().end();
while (I != E) {
Flags = SymTab.getSymbolFlags(*I);
if (!shouldSkip())
break;
++I;
}
if (I == E)
return;

Name.clear();
{
raw_svector_ostream OS(Name);
SymTab.printSymbolName(OS, *I);
}
}

bool isGV() const { return I->is<GlobalValue *>(); }
GlobalValue *getGV() const { return I->get<GlobalValue *>(); }

public:		public:
Symbol(ArrayRef<ModuleSymbolTable::Symbol>::iterator I,		Symbol(const irsymtab::Symbol &S) : irsymtab::Symbol(S) {}
const ModuleSymbolTable &SymTab, const InputFile *File)
: I(I), SymTab(SymTab), File(File) {
skip();
}

bool isUndefined() const {
return Flags & object::BasicSymbolRef::SF_Undefined;
}
bool isCommon() const { return Flags & object::BasicSymbolRef::SF_Common; }
bool isWeak() const { return Flags & object::BasicSymbolRef::SF_Weak; }
bool isIndirect() const {
return Flags & object::BasicSymbolRef::SF_Indirect;
}

/// For COFF weak externals, returns the name of the symbol that is used
/// as a fallback if the weak external remains undefined.
std::string getCOFFWeakExternalFallback() const {
assert((Flags & object::BasicSymbolRef::SF_Weak) &&
(Flags & object::BasicSymbolRef::SF_Indirect) &&
"symbol is not a weak external");
std::string Name;
raw_string_ostream OS(Name);
SymTab.printSymbolName(
OS,
cast<GlobalValue>(
cast<GlobalAlias>(getGV())->getAliasee()->stripPointerCasts()));
OS.flush();
return Name;
}

/// Returns the mangled name of the global.		using irsymtab::Symbol::isUndefined;
StringRef getName() const { return Name; }		using irsymtab::Symbol::isCommon;
		using irsymtab::Symbol::isWeak;
GlobalValue::VisibilityTypes getVisibility() const {		using irsymtab::Symbol::isIndirect;
if (isGV())		using irsymtab::Symbol::getName;
return getGV()->getVisibility();		using irsymtab::Symbol::getVisibility;
return GlobalValue::DefaultVisibility;		using irsymtab::Symbol::canBeOmittedFromSymbolTable;
}		using irsymtab::Symbol::isTLS;
bool canBeOmittedFromSymbolTable() const {		using irsymtab::Symbol::getComdatIndex;
return isGV() && llvm::canBeOmittedFromSymbolTable(getGV());		using irsymtab::Symbol::getCommonSize;
}		using irsymtab::Symbol::getCommonAlignment;
bool isTLS() const {		using irsymtab::Symbol::getCOFFWeakExternalFallback;
// FIXME: Expose a thread-local flag for module asm symbols.
return isGV() && getGV()->isThreadLocal();
}

// Returns the index of the comdat this symbol is in or -1 if the symbol
// is not in a comdat.
// FIXME: We have to return Expected<int> because aliases point to an
// arbitrary ConstantExpr and that might not actually be a constant. That
// means we might not be able to find what an alias is aliased to and
// so find its comdat.
Expected<int> getComdatIndex() const;

uint64_t getCommonSize() const {
assert(Flags & object::BasicSymbolRef::SF_Common);
if (!isGV())
return 0;
return getGV()->getParent()->getDataLayout().getTypeAllocSize(
getGV()->getType()->getElementType());
}
unsigned getCommonAlignment() const {
assert(Flags & object::BasicSymbolRef::SF_Common);
if (!isGV())
return 0;
return getGV()->getAlignment();
}
};

class symbol_iterator {
Symbol Sym;

public:
symbol_iterator(ArrayRef<ModuleSymbolTable::Symbol>::iterator I,
const ModuleSymbolTable &SymTab, const InputFile *File)
: Sym(I, SymTab, File) {}

symbol_iterator &operator++() {
++Sym.I;
Sym.skip();
return *this;
}

symbol_iterator operator++(int) {
symbol_iterator I = *this;
++*this;
return I;
}

const Symbol &operator*() const { return Sym; }
const Symbol *operator->() const { return &Sym; }

bool operator!=(const symbol_iterator &Other) const {
return Sym.I != Other.Sym.I;
}
};		};

/// A range over the symbols in this InputFile.		/// A range over the symbols in this InputFile.
iterator_range<symbol_iterator> symbols() {		ArrayRef<Symbol> symbols() const { return Symbols; }
return llvm::make_range(
symbol_iterator(SymTab.symbols().begin(), SymTab, this),
symbol_iterator(SymTab.symbols().end(), SymTab, this));
}

/// Returns linker options specified in the input file.		/// Returns linker options specified in the input file.
Expected<std::string> getLinkerOpts();		StringRef getCOFFLinkerOpts() const { return COFFLinkerOpts; }

/// Returns the path to the InputFile.		/// Returns the path to the InputFile.
StringRef getName() const;		StringRef getName() const;

/// Returns the source file path specified at compile time.		/// Returns the source file path specified at compile time.
StringRef getSourceFileName() const;		StringRef getSourceFileName() const { return SourceFileName; }

// Returns a table with all the comdats used by this file.		// Returns a table with all the comdats used by this file.
ArrayRef<StringRef> getComdatTable() const { return Comdats; }		ArrayRef<StringRef> getComdatTable() const { return ComdatTable; }

private:		private:
iterator_range<symbol_iterator> module_symbols(InputModule &IM);		ArrayRef<Symbol> module_symbols(unsigned I) const {
		const auto &Indices = ModuleSymIndices[I];
		return {Symbols.data() + Indices.first, Symbols.data() + Indices.second};
		}
};		};

/// This class wraps an output stream for a native object. Most clients should		/// This class wraps an output stream for a native object. Most clients should
/// just be able to return an instance of this base class from the stream		/// just be able to return an instance of this base class from the stream
/// callback, but if a client needs to perform some action after the stream is		/// callback, but if a client needs to perform some action after the stream is
/// written to, that can be done by deriving from this class and overriding the		/// written to, that can be done by deriving from this class and overriding the
/// destructor.		/// destructor.
class NativeObjectStream {		class NativeObjectStream {
[171 lines not shown; context: enum : unsigned {]
/// The RegularLTO partition		/// The RegularLTO partition
RegularLTO = 0,		RegularLTO = 0,
};		};
};		};

// Global mapping from mangled symbol names to resolutions.		// Global mapping from mangled symbol names to resolutions.
StringMap<GlobalResolution> GlobalResolutions;		StringMap<GlobalResolution> GlobalResolutions;

void addSymbolToGlobalRes(SmallPtrSet<GlobalValue *, 8> &Used,		void addSymbolToGlobalRes(const InputFile::Symbol &Sym, SymbolResolution Res,
const InputFile::Symbol &Sym, SymbolResolution Res,
unsigned Partition);		unsigned Partition);

// These functions take a range of symbol resolutions [ResI, ResE) and consume		// These functions take a range of symbol resolutions [ResI, ResE) and consume
// the resolutions used by a single input module by incrementing ResI. After		// the resolutions used by a single input module by incrementing ResI. After
// these functions return, [ResI, ResE) will refer to the resolution range for		// these functions return, [ResI, ResE) will refer to the resolution range for
// the remaining modules in the InputFile.		// the remaining modules in the InputFile.
Error addModule(InputFile &Input, InputFile::InputModule &IM,		Error addModule(InputFile &Input, unsigned ModI,
const SymbolResolution *&ResI, const SymbolResolution *ResE);		const SymbolResolution *&ResI, const SymbolResolution *ResE);
Error addRegularLTO(BitcodeModule BM, const SymbolResolution *&ResI,		Error addRegularLTO(BitcodeModule BM,
		ArrayRef<InputFile::Symbol> Syms,
		const SymbolResolution *&ResI,
const SymbolResolution *ResE);		const SymbolResolution *ResE);
Error addThinLTO(BitcodeModule BM, Module &M,		Error addThinLTO(BitcodeModule BM, ArrayRef<InputFile::Symbol> Syms,
iterator_range<InputFile::symbol_iterator> Syms,
const SymbolResolution *&ResI, const SymbolResolution *ResE);		const SymbolResolution *&ResI, const SymbolResolution *ResE);

Error runRegularLTO(AddStreamFn AddStream);		Error runRegularLTO(AddStreamFn AddStream);
Error runThinLTO(AddStreamFn AddStream, NativeObjectCache Cache,		Error runThinLTO(AddStreamFn AddStream, NativeObjectCache Cache,
bool HasRegularLTO);		bool HasRegularLTO);

mutable bool CalledGetMaxTasks = false;		mutable bool CalledGetMaxTasks = false;
};		};
[22 lines not shown]

llvm/trunk/include/llvm/Object/IRSymtab.h

				//===- IRSymtab.h - data definitions for IR symbol tables -------*- C++ -*-===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// This file contains data definitions and a reader and builder for a symbol
				// table for LLVM IR. Its purpose is to allow linkers and other consumers of
				// bitcode files to efficiently read the symbol table for symbol resolution
				// purposes without needing to construct a module in memory.
				//
				// As with most object files the symbol table has two parts: the symbol table
				// itself and a string table which is referenced by the symbol table.
				//
				// A symbol table corresponds to a single bitcode file, which may consist of
				// multiple modules, so symbol tables may likewise contain symbols for multiple
				// modules.
				//
				//===----------------------------------------------------------------------===//

				#ifndef LLVM_OBJECT_IRSYMTAB_H
				#define LLVM_OBJECT_IRSYMTAB_H

				#include "llvm/ADT/ArrayRef.h"
				#include "llvm/IR/GlobalValue.h"
				#include "llvm/Object/SymbolicFile.h"
				#include "llvm/Support/Endian.h"

				namespace llvm {
				namespace irsymtab {
				namespace storage {

				// The data structures in this namespace define the low-level serialization
				// format. Clients that just want to read a symbol table should use the
				// irsymtab::Reader class.

				typedef support::ulittle32_t Word;

				/// A reference to a string in the string table.
				struct Str {
				Word Offset;
				StringRef get(StringRef Strtab) const {
				return Strtab.data() + Offset;
				}
				};

				/// A reference to a range of objects in the symbol table.
				template <typename T> struct Range {
				Word Offset, Size;
				ArrayRef<T> get(StringRef Symtab) const {
				return {reinterpret_cast<const T *>(Symtab.data() + Offset), Size};
				}
				};

				/// Describes the range of a particular module's symbols within the symbol
				/// table.
				struct Module {
				Word Begin, End;
				};

				/// This is equivalent to an IR comdat.
				struct Comdat {
				Str Name;
				};

				/// Contains the information needed by linkers for symbol resolution, as well as
				/// by the LTO implementation itself.
				struct Symbol {
				/// The mangled symbol name.
				Str Name;

				/// The unmangled symbol name, or the empty string if this is not an IR
				/// symbol.
				Str IRName;

				/// The index into Header::Comdats, or -1 if not a comdat member.
				Word ComdatIndex;

				Word Flags;
				enum FlagBits {
				FB_visibility, // 2 bits
				FB_undefined = FB_visibility + 2,
				FB_weak,
				FB_common,
				FB_indirect,
				FB_used,
				FB_tls,
				FB_may_omit,
				FB_global,
				FB_format_specific,
				FB_unnamed_addr,
				};

				/// The index into the Uncommon table, or -1 if this symbol does not have an
				/// Uncommon.
				Word UncommonIndex;
				};

				/// This data structure contains rarely used symbol fields and is optionally
				/// referenced by a Symbol.
				struct Uncommon {
				Word CommonSize, CommonAlign;

				/// COFF-specific: the name of the symbol that a weak external resolves to
				/// if not defined.
				Str COFFWeakExternFallbackName;
				};

				struct Header {
				Range<Module> Modules;
				Range<Comdat> Comdats;
				Range<Symbol> Symbols;
				Range<Uncommon> Uncommons;

				Str SourceFileName;

				/// COFF-specific: linker directives.
				Str COFFLinkerOpts;
				};

				}

				/// Fills in Symtab and Strtab with a valid symbol and string table for Mods.
				Error build(ArrayRef<Module *> Mods, SmallVector<char, 0> &Symtab,
				SmallVector<char, 0> &Strtab);

				/// This represents a symbol that has been read from a storage::Symbol and
				/// possibly a storage::Uncommon.
				struct Symbol {
				// Copied from storage::Symbol.
				StringRef Name, IRName;
				int ComdatIndex;
				uint32_t Flags;

				// Copied from storage::Uncommon.
				uint32_t CommonSize, CommonAlign;
				StringRef COFFWeakExternFallbackName;

				/// Returns the mangled symbol name.
				StringRef getName() const { return Name; }

				/// Returns the unmangled symbol name, or the empty string if this is not an
				/// IR symbol.
				StringRef getIRName() const { return IRName; }

				/// Returns the index into the comdat table (see Reader::getComdatTable()), or
				/// -1 if not a comdat member.
				int getComdatIndex() const { return ComdatIndex; }

				using S = storage::Symbol;
				GlobalValue::VisibilityTypes getVisibility() const {
				return GlobalValue::VisibilityTypes((Flags >> S::FB_visibility) & 3);
				}
				bool isUndefined() const { return (Flags >> S::FB_undefined) & 1; }
				bool isWeak() const { return (Flags >> S::FB_weak) & 1; }
				bool isCommon() const { return (Flags >> S::FB_common) & 1; }
				bool isIndirect() const { return (Flags >> S::FB_indirect) & 1; }
				bool isUsed() const { return (Flags >> S::FB_used) & 1; }
				bool isTLS() const { return (Flags >> S::FB_tls) & 1; }
				bool canBeOmittedFromSymbolTable() const {
				return (Flags >> S::FB_may_omit) & 1;
				}
				bool isGlobal() const { return (Flags >> S::FB_global) & 1; }
				bool isFormatSpecific() const { return (Flags >> S::FB_format_specific) & 1; }
				bool isUnnamedAddr() const { return (Flags >> S::FB_unnamed_addr) & 1; }

				size_t getCommonSize() const {
				assert(isCommon());
				return CommonSize;
				}
				uint32_t getCommonAlignment() const {
				assert(isCommon());
				return CommonAlign;
				}

				/// COFF-specific: for weak externals, returns the name of the symbol that is
				/// used as a fallback if the weak external remains undefined.
				StringRef getCOFFWeakExternalFallback() const {
				assert(isWeak() && isIndirect());
				return COFFWeakExternFallbackName;
				}
				};

				/// This class can be used to read a Symtab and Strtab produced by
				/// irsymtab::build.
				class Reader {
				StringRef Symtab, Strtab;

				ArrayRef<storage::Module> Modules;
				ArrayRef<storage::Comdat> Comdats;
				ArrayRef<storage::Symbol> Symbols;
				ArrayRef<storage::Uncommon> Uncommons;

				StringRef str(storage::Str S) const { return S.get(Strtab); }
				template <typename T> ArrayRef<T> range(storage::Range<T> R) const {
				return R.get(Symtab);
				}
				const storage::Header &header() const {
				return *reinterpret_cast<const storage::Header *>(Symtab.data());
				}

				public:
				class SymbolRef;

				Reader() = default;
				Reader(StringRef Symtab, StringRef Strtab) : Symtab(Symtab), Strtab(Strtab) {
				Modules = range(header().Modules);
				Comdats = range(header().Comdats);
				Symbols = range(header().Symbols);
				Uncommons = range(header().Uncommons);
				}

				typedef iterator_range<object::content_iterator<SymbolRef>> symbol_range;

				/// Returns the symbol table for the entire bitcode file.
				/// The symbols enumerated by this method are ephemeral, but they can be
				/// copied into an irsymtab::Symbol object.
				symbol_range symbols() const;

				/// Returns a slice of the symbol table for the I'th module in the file.
				/// The symbols enumerated by this method are ephemeral, but they can be
				/// copied into an irsymtab::Symbol object.
				symbol_range module_symbols(unsigned I) const;

				/// Returns the source file path specified at compile time.
				StringRef getSourceFileName() const { return str(header().SourceFileName); }

				/// Returns a table with all the comdats used by this file.
				std::vector<StringRef> getComdatTable() const {
				std::vector<StringRef> ComdatTable;
				ComdatTable.reserve(Comdats.size());
				for (auto C : Comdats)
				ComdatTable.push_back(str(C.Name));
				return ComdatTable;
				}

				/// COFF-specific: returns linker options specified in the input file.
				StringRef getCOFFLinkerOpts() const { return str(header().COFFLinkerOpts); }
				};

				/// Ephemeral symbols produced by Reader::symbols() and
				/// Reader::module_symbols().
				class Reader::SymbolRef : public Symbol {
				const storage::Symbol *SymI, *SymE;
				const Reader *R;

				public:
				SymbolRef(const storage::Symbol *SymI, const storage::Symbol *SymE,
				const Reader *R)
				: SymI(SymI), SymE(SymE), R(R) {
				read();
				}

				void read() {
				if (SymI == SymE)
				return;

				Name = R->str(SymI->Name);
				IRName = R->str(SymI->IRName);
				ComdatIndex = SymI->ComdatIndex;
				Flags = SymI->Flags;

				uint32_t UncI = SymI->UncommonIndex;
				if (UncI != -1u) {
				const storage::Uncommon &Unc = R->Uncommons[UncI];
				CommonSize = Unc.CommonSize;
				CommonAlign = Unc.CommonAlign;
				COFFWeakExternFallbackName = R->str(Unc.COFFWeakExternFallbackName);
				}
				}
				void moveNext() {
				++SymI;
				read();
				}

				bool operator==(const SymbolRef &Other) const { return SymI == Other.SymI; }
				};

				inline Reader::symbol_range Reader::symbols() const {
				return {SymbolRef(Symbols.begin(), Symbols.end(), this),
				SymbolRef(Symbols.end(), Symbols.end(), this)};
				}

				inline Reader::symbol_range Reader::module_symbols(unsigned I) const {
				const storage::Module &M = Modules[I];
				const storage::Symbol *MBegin = Symbols.begin() + M.Begin,
				*MEnd = Symbols.begin() + M.End;
				return {SymbolRef(MBegin, MEnd, this), SymbolRef(MEnd, MEnd, this)};
				}

				}

				}

				#endif
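
The accessors above all follow one pattern: each boolean property occupies a single bit of `Flags`, and visibility is shifted in as a wider field (see the `FB_visibility` shift in IRSymtab.cpp). A minimal standalone sketch of that packing follows. The enum values, the 2-bit width of the visibility field, and the `PackedSymbol` type are assumptions made for illustration; they are not taken from the header.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical bit layout, named after the FB_* constants in the diff.
// The actual positions used by storage::Symbol are not shown in this excerpt.
enum FlagBits : unsigned {
  FB_visibility = 0, // assumed to be a 2-bit field
  FB_undefined = 2,
  FB_weak,
  FB_common,
};

struct PackedSymbol {
  uint32_t Flags = 0;

  // Set a single-bit property.
  void set(FlagBits Bit) { Flags |= 1u << Bit; }
  // Store a small integer in the multi-bit visibility field.
  void setVisibility(unsigned V) { Flags |= (V & 3u) << FB_visibility; }

  // Readers shift the field down and mask off everything else.
  bool isUndefined() const { return (Flags >> FB_undefined) & 1; }
  bool isWeak() const { return (Flags >> FB_weak) & 1; }
  bool isCommon() const { return (Flags >> FB_common) & 1; }
  unsigned getVisibility() const { return (Flags >> FB_visibility) & 3; }
};
```

Packing the properties into one word keeps each stored symbol record small and trivially copyable, which matters if the table is later written to disk as the summary proposes.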

llvm/trunk/lib/LTO/LTO.cpp

[299 lines not shown]
// as external and non-exported values as internal.		// as external and non-exported values as internal.
void llvm::thinLTOInternalizeAndPromoteInIndex(		void llvm::thinLTOInternalizeAndPromoteInIndex(
ModuleSummaryIndex &Index,		ModuleSummaryIndex &Index,
function_ref<bool(StringRef, GlobalValue::GUID)> isExported) {		function_ref<bool(StringRef, GlobalValue::GUID)> isExported) {
for (auto &I : Index)		for (auto &I : Index)
thinLTOInternalizeAndPromoteGUID(I.second, I.first, isExported);		thinLTOInternalizeAndPromoteGUID(I.second, I.first, isExported);
}		}

struct InputFile::InputModule {
BitcodeModule BM;
std::unique_ptr<Module> Mod;

// The range of ModuleSymbolTable entries for this input module.
size_t SymBegin, SymEnd;
};

// Requires a destructor for std::vector<InputModule>.		// Requires a destructor for std::vector<InputModule>.
InputFile::~InputFile() = default;		InputFile::~InputFile() = default;

Expected<std::unique_ptr<InputFile>> InputFile::create(MemoryBufferRef Object) {		Expected<std::unique_ptr<InputFile>> InputFile::create(MemoryBufferRef Object) {
std::unique_ptr<InputFile> File(new InputFile);		std::unique_ptr<InputFile> File(new InputFile);

ErrorOr<MemoryBufferRef> BCOrErr =		ErrorOr<MemoryBufferRef> BCOrErr =
IRObjectFile::findBitcodeInMemBuffer(Object);		IRObjectFile::findBitcodeInMemBuffer(Object);
if (!BCOrErr)		if (!BCOrErr)
return errorCodeToError(BCOrErr.getError());		return errorCodeToError(BCOrErr.getError());

Expected<std::vector<BitcodeModule>> BMsOrErr =		Expected<std::vector<BitcodeModule>> BMsOrErr =
getBitcodeModuleList(*BCOrErr);		getBitcodeModuleList(*BCOrErr);
if (!BMsOrErr)		if (!BMsOrErr)
return BMsOrErr.takeError();		return BMsOrErr.takeError();

if (BMsOrErr->empty())		if (BMsOrErr->empty())
return make_error<StringError>("Bitcode file does not contain any modules",		return make_error<StringError>("Bitcode file does not contain any modules",
inconvertibleErrorCode());		inconvertibleErrorCode());

// Create an InputModule for each module in the InputFile, and add it to the		File->Mods = *BMsOrErr;
// ModuleSymbolTable.
		LLVMContext Ctx;
		std::vector<Module *> Mods;
		std::vector<std::unique_ptr<Module>> OwnedMods;
for (auto BM : *BMsOrErr) {		for (auto BM : *BMsOrErr) {
Expected<std::unique_ptr<Module>> MOrErr =		Expected<std::unique_ptr<Module>> MOrErr =
BM.getLazyModule(File->Ctx, /*ShouldLazyLoadMetadata*/ true,		BM.getLazyModule(Ctx, /*ShouldLazyLoadMetadata*/ true,
/*IsImporting*/ false);		/*IsImporting*/ false);
if (!MOrErr)		if (!MOrErr)
return MOrErr.takeError();		return MOrErr.takeError();

size_t SymBegin = File->SymTab.symbols().size();		if ((*MOrErr)->getDataLayoutStr().empty())
File->SymTab.addModule(MOrErr->get());		return make_error<StringError>("input module has no datalayout",
size_t SymEnd = File->SymTab.symbols().size();

for (const auto &C : (*MOrErr)->getComdatSymbolTable()) {
auto P = File->ComdatMap.insert(
std::make_pair(&C.second, File->Comdats.size()));
assert(P.second);
(void)P;
File->Comdats.push_back(C.first());
}

File->Mods.push_back({BM, std::move(*MOrErr), SymBegin, SymEnd});
}

return std::move(File);
}

Expected<int> InputFile::Symbol::getComdatIndex() const {
if (!isGV())
return -1;
const GlobalObject *GO = getGV()->getBaseObject();
if (!GO)
return make_error<StringError>("Unable to determine comdat of alias!",
inconvertibleErrorCode());		inconvertibleErrorCode());
if (const Comdat *C = GO->getComdat()) {
auto I = File->ComdatMap.find(C);
assert(I != File->ComdatMap.end());
return I->second;
}
return -1;
}

Expected<std::string> InputFile::getLinkerOpts() {		Mods.push_back(MOrErr->get());
std::string LinkerOpts;		OwnedMods.push_back(std::move(*MOrErr));
raw_string_ostream LOS(LinkerOpts);
// Extract linker options from module metadata.
for (InputModule &Mod : Mods) {
std::unique_ptr<Module> &M = Mod.Mod;
if (auto E = M->materializeMetadata())
return std::move(E);
if (Metadata *Val = M->getModuleFlag("Linker Options")) {
MDNode *LinkerOptions = cast<MDNode>(Val);
for (const MDOperand &MDOptions : LinkerOptions->operands())
for (const MDOperand &MDOption : cast<MDNode>(MDOptions)->operands())
LOS << " " << cast<MDString>(MDOption)->getString();
}
}		}

// Synthesize export flags for symbols with dllexport storage.		SmallVector<char, 0> Symtab;
const Triple TT(Mods[0].Mod->getTargetTriple());		if (Error E = irsymtab::build(Mods, Symtab, File->Strtab))
Mangler M;		return std::move(E);
for (const ModuleSymbolTable::Symbol &Sym : SymTab.symbols())
if (auto GV = Sym.dyn_cast<GlobalValue>())
emitLinkerFlagsForGlobalCOFF(LOS, GV, TT, M);
LOS.flush();
return LinkerOpts;
}

StringRef InputFile::getName() const {		irsymtab::Reader R({Symtab.data(), Symtab.size()},
return Mods[0].BM.getModuleIdentifier();		{File->Strtab.data(), File->Strtab.size()});
		File->SourceFileName = R.getSourceFileName();
		File->COFFLinkerOpts = R.getCOFFLinkerOpts();
		File->ComdatTable = R.getComdatTable();

		for (unsigned I = 0; I != Mods.size(); ++I) {
		size_t Begin = File->Symbols.size();
		for (const irsymtab::Reader::SymbolRef &Sym : R.module_symbols(I))
		// Skip symbols that are irrelevant to LTO. Note that this condition needs
		// to match the one in Skip() in LTO::addRegularLTO().
		if (Sym.isGlobal() && !Sym.isFormatSpecific())
		File->Symbols.push_back(Sym);
		File->ModuleSymIndices.push_back({Begin, File->Symbols.size()});
}		}

StringRef InputFile::getSourceFileName() const {		return std::move(File);
return Mods[0].Mod->getSourceFileName();
}		}

iterator_range<InputFile::symbol_iterator>		StringRef InputFile::getName() const {
InputFile::module_symbols(InputModule &IM) {		return Mods[0].getModuleIdentifier();
return llvm::make_range(
symbol_iterator(SymTab.symbols().data() + IM.SymBegin, SymTab, this),
symbol_iterator(SymTab.symbols().data() + IM.SymEnd, SymTab, this));
}		}

LTO::RegularLTOState::RegularLTOState(unsigned ParallelCodeGenParallelismLevel,		LTO::RegularLTOState::RegularLTOState(unsigned ParallelCodeGenParallelismLevel,
Config &Conf)		Config &Conf)
: ParallelCodeGenParallelismLevel(ParallelCodeGenParallelismLevel),		: ParallelCodeGenParallelismLevel(ParallelCodeGenParallelismLevel),
Ctx(Conf) {}		Ctx(Conf) {}

LTO::ThinLTOState::ThinLTOState(ThinBackend Backend) : Backend(Backend) {		LTO::ThinLTOState::ThinLTOState(ThinBackend Backend) : Backend(Backend) {
if (!Backend)		if (!Backend)
this->Backend =		this->Backend =
createInProcessThinBackend(llvm::heavyweight_hardware_concurrency());		createInProcessThinBackend(llvm::heavyweight_hardware_concurrency());
}		}

LTO::LTO(Config Conf, ThinBackend Backend,		LTO::LTO(Config Conf, ThinBackend Backend,
unsigned ParallelCodeGenParallelismLevel)		unsigned ParallelCodeGenParallelismLevel)
: Conf(std::move(Conf)),		: Conf(std::move(Conf)),
RegularLTO(ParallelCodeGenParallelismLevel, this->Conf),		RegularLTO(ParallelCodeGenParallelismLevel, this->Conf),
ThinLTO(std::move(Backend)) {}		ThinLTO(std::move(Backend)) {}

// Requires a destructor for MapVector<BitcodeModule>.		// Requires a destructor for MapVector<BitcodeModule>.
LTO::~LTO() = default;		LTO::~LTO() = default;

// Add the given symbol to the GlobalResolutions map, and resolve its partition.		// Add the given symbol to the GlobalResolutions map, and resolve its partition.
void LTO::addSymbolToGlobalRes(SmallPtrSet<GlobalValue *, 8> &Used,		void LTO::addSymbolToGlobalRes(const InputFile::Symbol &Sym,
const InputFile::Symbol &Sym,
SymbolResolution Res, unsigned Partition) {		SymbolResolution Res, unsigned Partition) {
GlobalValue *GV = Sym.isGV() ? Sym.getGV() : nullptr;

auto &GlobalRes = GlobalResolutions[Sym.getName()];		auto &GlobalRes = GlobalResolutions[Sym.getName()];
if (GV) {		GlobalRes.UnnamedAddr &= Sym.isUnnamedAddr();
GlobalRes.UnnamedAddr &= GV->hasGlobalUnnamedAddr();
if (Res.Prevailing)		if (Res.Prevailing)
GlobalRes.IRName = GV->getName();		GlobalRes.IRName = Sym.getIRName();
}
// Set the partition to external if we know it is used elsewhere, e.g.		// Set the partition to external if we know it is used elsewhere, e.g.
// it is visible to a regular object, is referenced from llvm.compiler_used,		// it is visible to a regular object, is referenced from llvm.compiler_used,
// or was already recorded as being referenced from a different partition.		// or was already recorded as being referenced from a different partition.
if (Res.VisibleToRegularObj || (GV && Used.count(GV)) ||		if (Res.VisibleToRegularObj || Sym.isUsed() ||
(GlobalRes.Partition != GlobalResolution::Unknown &&		(GlobalRes.Partition != GlobalResolution::Unknown &&
GlobalRes.Partition != Partition)) {		GlobalRes.Partition != Partition)) {
GlobalRes.Partition = GlobalResolution::External;		GlobalRes.Partition = GlobalResolution::External;
} else		} else
// First recorded reference, save the current partition.		// First recorded reference, save the current partition.
GlobalRes.Partition = Partition;		GlobalRes.Partition = Partition;

// Flag as visible outside of ThinLTO if visible from a regular object or		// Flag as visible outside of ThinLTO if visible from a regular object or
[27 lines not shown]
Error LTO::add(std::unique_ptr<InputFile> Input,		Error LTO::add(std::unique_ptr<InputFile> Input,
ArrayRef<SymbolResolution> Res) {		ArrayRef<SymbolResolution> Res) {
assert(!CalledGetMaxTasks);		assert(!CalledGetMaxTasks);

if (Conf.ResolutionFile)		if (Conf.ResolutionFile)
writeToResolutionFile(*Conf.ResolutionFile, Input.get(), Res);		writeToResolutionFile(*Conf.ResolutionFile, Input.get(), Res);

const SymbolResolution *ResI = Res.begin();		const SymbolResolution *ResI = Res.begin();
for (InputFile::InputModule &IM : Input->Mods)		for (unsigned I = 0; I != Input->Mods.size(); ++I)
if (Error Err = addModule(*Input, IM, ResI, Res.end()))		if (Error Err = addModule(*Input, I, ResI, Res.end()))
return Err;		return Err;

assert(ResI == Res.end());		assert(ResI == Res.end());
return Error::success();		return Error::success();
}		}

Error LTO::addModule(InputFile &Input, InputFile::InputModule &IM,		Error LTO::addModule(InputFile &Input, unsigned ModI,
const SymbolResolution *&ResI,		const SymbolResolution *&ResI,
const SymbolResolution *ResE) {		const SymbolResolution *ResE) {
// FIXME: move to backend		Expected<bool> HasThinLTOSummary = Input.Mods[ModI].hasSummary();
Module &M = *IM.Mod;

if (M.getDataLayoutStr().empty())
return make_error<StringError>("input module has no datalayout",
inconvertibleErrorCode());

if (!Conf.OverrideTriple.empty())
M.setTargetTriple(Conf.OverrideTriple);
else if (M.getTargetTriple().empty())
M.setTargetTriple(Conf.DefaultTriple);

Expected<bool> HasThinLTOSummary = IM.BM.hasSummary();
if (!HasThinLTOSummary)		if (!HasThinLTOSummary)
return HasThinLTOSummary.takeError();		return HasThinLTOSummary.takeError();

		auto ModSyms = Input.module_symbols(ModI);
if (*HasThinLTOSummary)		if (*HasThinLTOSummary)
return addThinLTO(IM.BM, M, Input.module_symbols(IM), ResI, ResE);		return addThinLTO(Input.Mods[ModI], ModSyms, ResI, ResE);
else		else
return addRegularLTO(IM.BM, ResI, ResE);		return addRegularLTO(Input.Mods[ModI], ModSyms, ResI, ResE);
}		}

// Add a regular LTO object to the link.		// Add a regular LTO object to the link.
Error LTO::addRegularLTO(BitcodeModule BM, const SymbolResolution *&ResI,		Error LTO::addRegularLTO(BitcodeModule BM,
		ArrayRef<InputFile::Symbol> Syms,
		const SymbolResolution *&ResI,
const SymbolResolution *ResE) {		const SymbolResolution *ResE) {
if (!RegularLTO.CombinedModule) {		if (!RegularLTO.CombinedModule) {
RegularLTO.CombinedModule =		RegularLTO.CombinedModule =
llvm::make_unique<Module>("ld-temp.o", RegularLTO.Ctx);		llvm::make_unique<Module>("ld-temp.o", RegularLTO.Ctx);
RegularLTO.Mover = llvm::make_unique<IRMover>(*RegularLTO.CombinedModule);		RegularLTO.Mover = llvm::make_unique<IRMover>(*RegularLTO.CombinedModule);
}		}
Expected<std::unique_ptr<Module>> MOrErr =		Expected<std::unique_ptr<Module>> MOrErr =
BM.getLazyModule(RegularLTO.Ctx, /*ShouldLazyLoadMetadata*/ true,		BM.getLazyModule(RegularLTO.Ctx, /*ShouldLazyLoadMetadata*/ true,
/*IsImporting*/ false);		/*IsImporting*/ false);
if (!MOrErr)		if (!MOrErr)
return MOrErr.takeError();		return MOrErr.takeError();

Module &M = **MOrErr;		Module &M = **MOrErr;
if (Error Err = M.materializeMetadata())		if (Error Err = M.materializeMetadata())
return Err;		return Err;
UpgradeDebugInfo(M);		UpgradeDebugInfo(M);

ModuleSymbolTable SymTab;		ModuleSymbolTable SymTab;
SymTab.addModule(&M);		SymTab.addModule(&M);

SmallPtrSet<GlobalValue *, 8> Used;
collectUsedGlobalVariables(M, Used, /*CompilerUsed*/ false);

std::vector<GlobalValue *> Keep;		std::vector<GlobalValue *> Keep;

for (GlobalVariable &GV : M.globals())		for (GlobalVariable &GV : M.globals())
if (GV.hasAppendingLinkage())		if (GV.hasAppendingLinkage())
Keep.push_back(&GV);		Keep.push_back(&GV);

DenseSet<GlobalObject *> AliasedGlobals;		DenseSet<GlobalObject *> AliasedGlobals;
for (auto &GA : M.aliases())		for (auto &GA : M.aliases())
if (GlobalObject *GO = GA.getBaseObject())		if (GlobalObject *GO = GA.getBaseObject())
AliasedGlobals.insert(GO);		AliasedGlobals.insert(GO);

for (const InputFile::Symbol &Sym :		// In this function we need IR GlobalValues matching the symbols in Syms
make_range(InputFile::symbol_iterator(SymTab.symbols().begin(), SymTab,		// (which is not backed by a module), so we need to enumerate them in the same
nullptr),		// order. The symbol enumeration order of a ModuleSymbolTable intentionally
InputFile::symbol_iterator(SymTab.symbols().end(), SymTab,		// matches the order of an irsymtab, but when we read the irsymtab in
nullptr))) {		// InputFile::create we omit some symbols that are irrelevant to LTO. The
		// Skip() function skips the same symbols from the module as InputFile does
		// from the symbol table.
		auto MsymI = SymTab.symbols().begin(), MsymE = SymTab.symbols().end();
		auto Skip = [&]() {
		while (MsymI != MsymE) {
		auto Flags = SymTab.getSymbolFlags(*MsymI);
		if ((Flags & object::BasicSymbolRef::SF_Global) &&
		!(Flags & object::BasicSymbolRef::SF_FormatSpecific))
		return;
		++MsymI;
		}
		};
		Skip();

		for (const InputFile::Symbol &Sym : Syms) {
assert(ResI != ResE);		assert(ResI != ResE);
SymbolResolution Res = *ResI++;		SymbolResolution Res = *ResI++;
addSymbolToGlobalRes(Used, Sym, Res, 0);		addSymbolToGlobalRes(Sym, Res, 0);

if (Sym.isGV()) {		assert(MsymI != MsymE);
GlobalValue *GV = Sym.getGV();		ModuleSymbolTable::Symbol Msym = *MsymI++;
		Skip();

		if (GlobalValue *GV = Msym.dyn_cast<GlobalValue *>()) {
if (Res.Prevailing) {		if (Res.Prevailing) {
if (Sym.isUndefined())		if (Sym.isUndefined())
continue;		continue;
Keep.push_back(GV);		Keep.push_back(GV);
switch (GV->getLinkage()) {		switch (GV->getLinkage()) {
default:		default:
break;		break;
case GlobalValue::LinkOnceAnyLinkage:		case GlobalValue::LinkOnceAnyLinkage:
[21 lines not shown]	if (GlobalValue *GV = Msym.dyn_cast<GlobalValue *>()) {
}		}
}		}
// Common resolution: collect the maximum size/alignment over all commons.		// Common resolution: collect the maximum size/alignment over all commons.
// We also record if we see an instance of a common as prevailing, so that		// We also record if we see an instance of a common as prevailing, so that
// if none is prevailing we can ignore it later.		// if none is prevailing we can ignore it later.
if (Sym.isCommon()) {		if (Sym.isCommon()) {
// FIXME: We should figure out what to do about commons defined by asm.		// FIXME: We should figure out what to do about commons defined by asm.
// For now they aren't reported correctly by ModuleSymbolTable.		// For now they aren't reported correctly by ModuleSymbolTable.
auto &CommonRes = RegularLTO.Commons[Sym.getGV()->getName()];		auto &CommonRes = RegularLTO.Commons[Sym.getIRName()];
CommonRes.Size = std::max(CommonRes.Size, Sym.getCommonSize());		CommonRes.Size = std::max(CommonRes.Size, Sym.getCommonSize());
CommonRes.Align = std::max(CommonRes.Align, Sym.getCommonAlignment());		CommonRes.Align = std::max(CommonRes.Align, Sym.getCommonAlignment());
CommonRes.Prevailing |= Res.Prevailing;		CommonRes.Prevailing |= Res.Prevailing;
}		}

// FIXME: use proposed local attribute for FinalDefinitionInLinkageUnit.		// FIXME: use proposed local attribute for FinalDefinitionInLinkageUnit.
}		}
		assert(MsymI == MsymE);

return RegularLTO.Mover->move(std::move(*MOrErr), Keep,		return RegularLTO.Mover->move(std::move(*MOrErr), Keep,
[](GlobalValue &, IRMover::ValueAdder) {},		[](GlobalValue &, IRMover::ValueAdder) {},
/* IsPerformingImport */ false);		/* IsPerformingImport */ false);
}		}

// Add a ThinLTO object to the link.		// Add a ThinLTO object to the link.
// FIXME: This function should not need to take as many parameters once we have		Error LTO::addThinLTO(BitcodeModule BM,
// a bitcode symbol table.		ArrayRef<InputFile::Symbol> Syms,
Error LTO::addThinLTO(BitcodeModule BM, Module &M,
iterator_range<InputFile::symbol_iterator> Syms,
const SymbolResolution *&ResI,		const SymbolResolution *&ResI,
const SymbolResolution *ResE) {		const SymbolResolution *ResE) {
SmallPtrSet<GlobalValue *, 8> Used;
collectUsedGlobalVariables(M, Used, /*CompilerUsed*/ false);

Expected<std::unique_ptr<ModuleSummaryIndex>> SummaryOrErr = BM.getSummary();		Expected<std::unique_ptr<ModuleSummaryIndex>> SummaryOrErr = BM.getSummary();
if (!SummaryOrErr)		if (!SummaryOrErr)
return SummaryOrErr.takeError();		return SummaryOrErr.takeError();
ThinLTO.CombinedIndex.mergeFrom(std::move(*SummaryOrErr),		ThinLTO.CombinedIndex.mergeFrom(std::move(*SummaryOrErr),
ThinLTO.ModuleMap.size());		ThinLTO.ModuleMap.size());

for (const InputFile::Symbol &Sym : Syms) {		for (const InputFile::Symbol &Sym : Syms) {
assert(ResI != ResE);		assert(ResI != ResE);
SymbolResolution Res = *ResI++;		SymbolResolution Res = *ResI++;
addSymbolToGlobalRes(Used, Sym, Res, ThinLTO.ModuleMap.size() + 1);		addSymbolToGlobalRes(Sym, Res, ThinLTO.ModuleMap.size() + 1);

if (Res.Prevailing && Sym.isGV())		if (Res.Prevailing) {
ThinLTO.PrevailingModuleForGUID[Sym.getGV()->getGUID()] =		if (!Sym.getIRName().empty()) {
BM.getModuleIdentifier();		auto GUID = GlobalValue::getGUID(GlobalValue::getGlobalIdentifier(
		Sym.getIRName(), GlobalValue::ExternalLinkage, ""));
		ThinLTO.PrevailingModuleForGUID[GUID] = BM.getModuleIdentifier();
		}
		}
}		}

if (!ThinLTO.ModuleMap.insert({BM.getModuleIdentifier(), BM}).second)		if (!ThinLTO.ModuleMap.insert({BM.getModuleIdentifier(), BM}).second)
return make_error<StringError>(		return make_error<StringError>(
"Expected at most one ThinLTO module per bitcode file",		"Expected at most one ThinLTO module per bitcode file",
inconvertibleErrorCode());		inconvertibleErrorCode());

return Error::success();		return Error::success();
[446 lines not shown]
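
The comment in `addRegularLTO` describes walking two sequences in lockstep: `Syms`, which was filtered once in `InputFile::create`, and the module's own symbol table, which is filtered on the fly by `Skip()` using the same predicate. The idea can be sketched without any LLVM types as follows. `Entry`, `filterNames`, and `pairUp` are hypothetical names introduced for this illustration only.

```cpp
#include <cassert>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a symbol table entry.
struct Entry {
  std::string Name;
  bool Global;
  bool FormatSpecific;
};

// The shared predicate: both sides must agree on it exactly.
static bool isRelevant(const Entry &E) { return E.Global && !E.FormatSpecific; }

// Eager filter, standing in for the loop in InputFile::create.
std::vector<std::string> filterNames(const std::vector<Entry> &All) {
  std::vector<std::string> Out;
  for (const Entry &E : All)
    if (isRelevant(E))
      Out.push_back(E.Name);
  return Out;
}

// Walk the pre-filtered list and the full list together, skipping the
// irrelevant entries of the full list as we go.
std::vector<std::pair<std::string, const Entry *>>
pairUp(const std::vector<std::string> &Filtered,
       const std::vector<Entry> &All) {
  auto I = All.begin(), E = All.end();
  auto Skip = [&]() {
    while (I != E && !isRelevant(*I))
      ++I;
  };
  Skip();

  std::vector<std::pair<std::string, const Entry *>> Pairs;
  for (const std::string &Name : Filtered) {
    assert(I != E && "full list ran out before the filtered list");
    Pairs.push_back({Name, &*I});
    ++I;
    Skip();
  }
  assert(I == E && "full list still has relevant entries");
  return Pairs;
}
```

The two asserts play the same role as `assert(MsymI != MsymE)` and `assert(MsymI == MsymE)` in the diff: if the two predicates ever drift apart, the walk fails loudly instead of silently pairing the wrong symbols.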

llvm/trunk/lib/Object/CMakeLists.txt

	add_llvm_library(LLVMObject			add_llvm_library(LLVMObject
	Archive.cpp			Archive.cpp
	ArchiveWriter.cpp			ArchiveWriter.cpp
	Binary.cpp			Binary.cpp
	COFFObjectFile.cpp			COFFObjectFile.cpp
	Decompressor.cpp			Decompressor.cpp
	ELF.cpp			ELF.cpp
	ELFObjectFile.cpp			ELFObjectFile.cpp
	Error.cpp			Error.cpp
	IRObjectFile.cpp			IRObjectFile.cpp
				IRSymtab.cpp
	MachOObjectFile.cpp			MachOObjectFile.cpp
	MachOUniversal.cpp			MachOUniversal.cpp
	ModuleSummaryIndexObjectFile.cpp			ModuleSummaryIndexObjectFile.cpp
	ModuleSymbolTable.cpp			ModuleSymbolTable.cpp
	Object.cpp			Object.cpp
	ObjectFile.cpp			ObjectFile.cpp
	RecordStreamer.cpp			RecordStreamer.cpp
	SymbolicFile.cpp			SymbolicFile.cpp
	[9 lines not shown]

llvm/trunk/lib/Object/IRSymtab.cpp

				//===- IRSymtab.cpp - implementation of IR symbol tables --------*- C++ -*-===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/Object/IRSymtab.h"
				#include "llvm/CodeGen/Analysis.h"
				#include "llvm/CodeGen/TargetLoweringObjectFileImpl.h"
				#include "llvm/IR/Module.h"
				#include "llvm/MC/StringTableBuilder.h"
				#include "llvm/Object/ModuleSymbolTable.h"
				#include "llvm/Support/Allocator.h"
				#include "llvm/Support/StringSaver.h"

				using namespace llvm;
				using namespace irsymtab;

				namespace {

				/// Stores the temporary state that is required to build an IR symbol table.
				struct Builder {
				SmallVector<char, 0> &Symtab;
				SmallVector<char, 0> &Strtab;
				Builder(SmallVector<char, 0> &Symtab, SmallVector<char, 0> &Strtab)
				: Symtab(Symtab), Strtab(Strtab) {}

				StringTableBuilder StrtabBuilder{StringTableBuilder::ELF};

				BumpPtrAllocator Alloc;
				StringSaver Saver{Alloc};

				DenseMap<const Comdat *, unsigned> ComdatMap;
				ModuleSymbolTable Msymtab;
				SmallPtrSet<GlobalValue *, 8> Used;
				Mangler Mang;
				Triple TT;

				std::vector<storage::Comdat> Comdats;
				std::vector<storage::Module> Mods;
				std::vector<storage::Symbol> Syms;
				std::vector<storage::Uncommon> Uncommons;

				std::string COFFLinkerOpts;
				raw_string_ostream COFFLinkerOptsOS{COFFLinkerOpts};

				void setStr(storage::Str &S, StringRef Value) {
				S.Offset = StrtabBuilder.add(Value);
				}
				template <typename T>
				void writeRange(storage::Range<T> &R, const std::vector<T> &Objs) {
				R.Offset = Symtab.size();
				R.Size = Objs.size();
				Symtab.insert(Symtab.end(), reinterpret_cast<const char *>(Objs.data()),
				reinterpret_cast<const char *>(Objs.data() + Objs.size()));
				}

				Error addModule(Module *M);
				Error addSymbol(ModuleSymbolTable::Symbol Sym);

				Error build(ArrayRef<Module *> Mods);
				};

				Error Builder::addModule(Module *M) {
				collectUsedGlobalVariables(*M, Used, /*CompilerUsed*/ false);

				storage::Module Mod;
				Mod.Begin = Msymtab.symbols().size();
				Msymtab.addModule(M);
				Mod.End = Msymtab.symbols().size();
				Mods.push_back(Mod);

				if (TT.isOSBinFormatCOFF()) {
				if (auto E = M->materializeMetadata())
				return E;
				if (Metadata *Val = M->getModuleFlag("Linker Options")) {
				MDNode *LinkerOptions = cast<MDNode>(Val);
				for (const MDOperand &MDOptions : LinkerOptions->operands())
				for (const MDOperand &MDOption : cast<MDNode>(MDOptions)->operands())
				COFFLinkerOptsOS << " " << cast<MDString>(MDOption)->getString();
				}
				}

				return Error::success();
				}

				Error Builder::addSymbol(ModuleSymbolTable::Symbol Msym) {
				Syms.emplace_back();
				storage::Symbol &Sym = Syms.back();
				Sym = {};

				Sym.UncommonIndex = -1;
				storage::Uncommon *Unc = nullptr;
				auto Uncommon = [&]() -> storage::Uncommon & {
				if (Unc)
				return *Unc;
				Sym.UncommonIndex = Uncommons.size();
				Uncommons.emplace_back();
				Unc = &Uncommons.back();
				*Unc = {};
				setStr(Unc->COFFWeakExternFallbackName, "");
				return *Unc;
				};

				SmallString<64> Name;
				{
				raw_svector_ostream OS(Name);
				Msymtab.printSymbolName(OS, Msym);
				}
				setStr(Sym.Name, Saver.save(StringRef(Name)));

				auto Flags = Msymtab.getSymbolFlags(Msym);
				if (Flags & object::BasicSymbolRef::SF_Undefined)
				Sym.Flags |= 1 << storage::Symbol::FB_undefined;
				if (Flags & object::BasicSymbolRef::SF_Weak)
				Sym.Flags |= 1 << storage::Symbol::FB_weak;
				if (Flags & object::BasicSymbolRef::SF_Common)
				Sym.Flags |= 1 << storage::Symbol::FB_common;
				if (Flags & object::BasicSymbolRef::SF_Indirect)
				Sym.Flags |= 1 << storage::Symbol::FB_indirect;
				if (Flags & object::BasicSymbolRef::SF_Global)
				Sym.Flags |= 1 << storage::Symbol::FB_global;
				if (Flags & object::BasicSymbolRef::SF_FormatSpecific)
				Sym.Flags |= 1 << storage::Symbol::FB_format_specific;

				Sym.ComdatIndex = -1;
				auto *GV = Msym.dyn_cast<GlobalValue *>();
				if (!GV) {
				setStr(Sym.IRName, "");
				return Error::success();
				}

				setStr(Sym.IRName, GV->getName());

				if (Used.count(GV))
				Sym.Flags |= 1 << storage::Symbol::FB_used;
				if (GV->isThreadLocal())
				Sym.Flags |= 1 << storage::Symbol::FB_tls;
				if (GV->hasGlobalUnnamedAddr())
				Sym.Flags |= 1 << storage::Symbol::FB_unnamed_addr;
				if (canBeOmittedFromSymbolTable(GV))
				Sym.Flags |= 1 << storage::Symbol::FB_may_omit;
				Sym.Flags |= unsigned(GV->getVisibility()) << storage::Symbol::FB_visibility;

				if (Flags & object::BasicSymbolRef::SF_Common) {
				Uncommon().CommonSize = GV->getParent()->getDataLayout().getTypeAllocSize(
				GV->getType()->getElementType());
				Uncommon().CommonAlign = GV->getAlignment();
				}

				const GlobalObject *Base = GV->getBaseObject();
				if (!Base)
				return make_error<StringError>("Unable to determine comdat of alias!",
				inconvertibleErrorCode());
				if (const Comdat *C = Base->getComdat()) {
				auto P = ComdatMap.insert(std::make_pair(C, Comdats.size()));
				Sym.ComdatIndex = P.first->second;

				if (P.second) {
				storage::Comdat Comdat;
				setStr(Comdat.Name, C->getName());
				Comdats.push_back(Comdat);
				}
				}

				if (TT.isOSBinFormatCOFF()) {
				emitLinkerFlagsForGlobalCOFF(COFFLinkerOptsOS, GV, TT, Mang);

				if ((Flags & object::BasicSymbolRef::SF_Weak) &&
				(Flags & object::BasicSymbolRef::SF_Indirect)) {
				std::string FallbackName;
				raw_string_ostream OS(FallbackName);
				Msymtab.printSymbolName(
				OS, cast<GlobalValue>(
				cast<GlobalAlias>(GV)->getAliasee()->stripPointerCasts()));
				OS.flush();
				setStr(Uncommon().COFFWeakExternFallbackName, Saver.save(FallbackName));
				}
				}

				return Error::success();
				}

				Error Builder::build(ArrayRef<Module *> IRMods) {
				storage::Header Hdr;

				assert(!IRMods.empty());
				setStr(Hdr.SourceFileName, IRMods[0]->getSourceFileName());
				TT = Triple(IRMods[0]->getTargetTriple());

				// This adds the symbols for each module to Msymtab.
				for (auto *M : IRMods)
				if (Error Err = addModule(M))
				return Err;

				for (ModuleSymbolTable::Symbol Msym : Msymtab.symbols())
				if (Error Err = addSymbol(Msym))
				return Err;

				COFFLinkerOptsOS.flush();
				setStr(Hdr.COFFLinkerOpts, COFFLinkerOpts);

				// We are about to fill in the header's range fields, so reserve space for it
				// and copy it in afterwards.
				Symtab.resize(sizeof(storage::Header));
				writeRange(Hdr.Modules, Mods);
				writeRange(Hdr.Comdats, Comdats);
				writeRange(Hdr.Symbols, Syms);
				writeRange(Hdr.Uncommons, Uncommons);

				*reinterpret_cast<storage::Header *>(Symtab.data()) = Hdr;

				raw_svector_ostream OS(Strtab);
				StrtabBuilder.finalizeInOrder();
				StrtabBuilder.write(OS);

				return Error::success();
				}

				} // anonymous namespace

				Error irsymtab::build(ArrayRef<Module *> Mods, SmallVector<char, 0> &Symtab,
				SmallVector<char, 0> &Strtab) {
				return Builder(Symtab, Strtab).build(Mods);
				}
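
`Builder::build` reserves space for the header at the front of the buffer, appends each table while recording its offset and element count in the header, and only then copies the finished header into the reserved space. A minimal sketch of that layout, with a matching reader, is shown below. The `Range`, `Header`, and `Sym` types and the `buildTable`, `readHeader`, and `readRange` helpers are simplified stand-ins invented for this example, and the sketch uses `std::memcpy` where the diff uses `reinterpret_cast`.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical, simplified versions of the storage:: types.
struct Range { uint32_t Offset = 0, Size = 0; }; // Size counts elements
struct Header { Range Symbols; Range Sizes; };
struct Sym { uint32_t NameOffset; uint32_t Flags; };

// Append Objs to Buf and record where they landed.
template <typename T>
void writeRange(std::vector<char> &Buf, Range &R, const std::vector<T> &Objs) {
  R.Offset = static_cast<uint32_t>(Buf.size());
  R.Size = static_cast<uint32_t>(Objs.size());
  const char *Begin = reinterpret_cast<const char *>(Objs.data());
  Buf.insert(Buf.end(), Begin, Begin + Objs.size() * sizeof(T));
}

std::vector<char> buildTable(const std::vector<Sym> &Syms,
                             const std::vector<uint64_t> &Sizes) {
  // Reserve space for the header; its range fields are not known yet.
  std::vector<char> Buf(sizeof(Header));
  Header Hdr;
  writeRange(Buf, Hdr.Symbols, Syms);
  writeRange(Buf, Hdr.Sizes, Sizes);
  // All ranges are filled in, so the header can now be copied into place.
  std::memcpy(Buf.data(), &Hdr, sizeof(Hdr));
  return Buf;
}

Header readHeader(const std::vector<char> &Buf) {
  Header Hdr;
  std::memcpy(&Hdr, Buf.data(), sizeof(Hdr));
  return Hdr;
}

template <typename T>
std::vector<T> readRange(const std::vector<char> &Buf, Range R) {
  std::vector<T> Out(R.Size);
  if (R.Size)
    std::memcpy(Out.data(), Buf.data() + R.Offset, R.Size * sizeof(T));
  return Out;
}
```

Because every cross-reference is an offset into the buffer rather than a pointer, the buffer can be copied or stored as-is. The reader in the diff goes one step further and returns `ArrayRef` views into the buffer instead of copying elements out.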

llvm/trunk/tools/gold/gold-plugin.cpp

[459 lines not shown]	static ld_plugin_status claim_file_hook(const ld_plugin_input_file *file,
Expected<std::unique_ptr<InputFile>> ObjOrErr = InputFile::create(BufferRef);		Expected<std::unique_ptr<InputFile>> ObjOrErr = InputFile::create(BufferRef);
if (!ObjOrErr) {		if (!ObjOrErr) {
handleAllErrors(ObjOrErr.takeError(), [&](const ErrorInfoBase &EI) {		handleAllErrors(ObjOrErr.takeError(), [&](const ErrorInfoBase &EI) {
std::error_code EC = EI.convertToErrorCode();		std::error_code EC = EI.convertToErrorCode();
if (EC == object::object_error::invalid_file_type ||		if (EC == object::object_error::invalid_file_type ||
EC == object::object_error::bitcode_section_not_found)		EC == object::object_error::bitcode_section_not_found)
*claimed = 0;		*claimed = 0;
else		else
message(LDPL_ERROR,		message(LDPL_FATAL,
"LLVM gold plugin has failed to create LTO module: %s",		"LLVM gold plugin has failed to create LTO module: %s",
EI.message().c_str());		EI.message().c_str());
});		});

return *claimed ? LDPS_ERR : LDPS_OK;		return *claimed ? LDPS_ERR : LDPS_OK;
}		}

std::unique_ptr<InputFile> Obj = std::move(*ObjOrErr);		std::unique_ptr<InputFile> Obj = std::move(*ObjOrErr);
[54 lines not shown]	if (Sym.isUndefined()) {
sym.def = LDPK_COMMON;		sym.def = LDPK_COMMON;
else if (Sym.isWeak())		else if (Sym.isWeak())
sym.def = LDPK_WEAKDEF;		sym.def = LDPK_WEAKDEF;
else		else
sym.def = LDPK_DEF;		sym.def = LDPK_DEF;

sym.size = 0;		sym.size = 0;
sym.comdat_key = nullptr;		sym.comdat_key = nullptr;
int CI = check(Sym.getComdatIndex());		int CI = Sym.getComdatIndex();
if (CI != -1) {		if (CI != -1) {
StringRef C = Obj->getComdatTable()[CI];		StringRef C = Obj->getComdatTable()[CI];
sym.comdat_key = strdup(C.str().c_str());		sym.comdat_key = strdup(C.str().c_str());
}		}

sym.resolution = LDPR_UNKNOWN;		sym.resolution = LDPR_UNKNOWN;
}		}

[401 lines not shown]