This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/lld/Core/
-
lld/
-
Core/
-
DefinedAtom.h
-
lib/
-
Core/
-
SymbolTable.cpp
-
ReaderWriter/PECOFF/
-
PECOFF/
-
Atoms.h
-
ReaderCOFF.cpp

Differential D4042

Add DefinedAtom::sectionSize().
Needs RevisionPublic

Authored by ruiu on Jun 6 2014, 1:06 AM.

Download Raw Diff

Details

Reviewers

Bigcheese
shankarke

Summary

Merge::mergeByLargestSection is defined in terms of section size,
but there was no way to get the section size for an atom. Previously
we followed layout-before/layout-after chains to collect all atoms
for a section, but that's not the right way to do it because there's
no guarantee that layout-before/layout-after exist. Even if they
exist, it's not guaranteed that they represent a section.

So this patch adds sectionSize() member function to DefinedAtom.

Diff Detail

Event Timeline

ruiu updated this revision to Diff 10168.Jun 6 2014, 1:06 AM

ruiu retitled this revision from to Add DefinedAtom::sectionSize()..

ruiu updated this object.

ruiu edited the test plan for this revision. (Show Details)

ruiu added reviewers: Bigcheese, shankarke, atanasyan.

ruiu added a subscriber: Unknown Object (MLST).

Iis this because of associated sections ? That you cannot get the section size ?

Can you please let us know why you can't get the section size from the layout after references / layout before references.

I am not in agreement for this as there is not a use of this on MachO or ELF.

It's not related to associated sections.

As I wrote in the parch description, layout-before and layout-after are
mechanisms to layout atoms in some order, and it's not always guaranteed to
be the same as sections. The current implementation that to be removed by
this parch is a hack that depends on an incorrect assumption that
layout-after/layout-before can be used to walk all atoms in the same
section.
2014/06/06 22:47 "Shankar Easwaran" <shankar.kalpathi.easwaran@gmail.com>:

Iis this because of associated sections ? That you cannot get the section
size ?

Can you please let us know why you can't get the section size from the
layout after references / layout before references.

I am not in agreement for this as there is not a use of this on MachO or
ELF.

http://reviews.llvm.org/D4042

Ping.

Is it possible to keep the sectionSize() method in the COFFDefinedAtom class only? In general every defined atom has non-zero section size. But this method returns a sensible result for a very specific set of atoms.

We wont be able to keep the sectionSize in COFFDefinedAtom because the Native/YAML only understand DefinedAtoms. I was proposing a way to do this earlier by having atoms contain a set of key value pairs (key=>value) that can be used to set various values.

We could start a discussion on the thread, and arrive at a way to solve this if you think we should do that.

I like this idea because now I try to figure out how to propagate the --as-needed from the ELFFileNode::parse() to the point where we create the DT_NEEDED dynamic tag. Your idea might be a solution to this problem too.

Adding generic key-value pair container to DefinedAtom class will probably end up with too limited or too generic code. If we limit the type of value to int64 (or some fixed-type size), it'd be pretty easy to dump it to YAML/Native, but it's restricted so it won't solve the problems we have. On the other hand, if we allow arbitrary type, it's hard to handle and needs custom dumper for Native anyway.

The thing I'd like to focus here is not the above one, but this. Merge::mergeByLargestSection is already there, and its definition is "pick the atom according to its source section size". However it currently lacks the way to get the section size for an atom. It's just wrong. Merge::mergeByLargestSection should go away (which we don't want) or we need a way to get a section size for an atom.

In general the patch is LGTM. The only thing concerns me is that we start to populate 'Core' classes by functions which really used by the single target.

I basically agree, we have to be careful not to bloat Core with lots of target specific stuff. However something that is needed for linking stage itself needs to be in Core, and we already have quite a lot for ELF. I'd think this change is small and acceptable.

I am not sure which instance in the Core Atom model that you are pointing at. The core atom model models generically for all flavors, I dont see something in the Atom model specifically designed for ELF.

That said, please check with Nick before you would like to commit this change.

This revision now requires changes to proceed.Jun 23 2014, 2:46 PM

It is not ideal to add DefinedAtom::sectionSize() to every platform, when this is only needed in rare cases on one platform.

A more pay-to-play approach would be to require mergeByLargestSection atoms to have a (group?) reference to a new atom which has a special ContentType and is a placeholder for the section, and its size() method returns the size of the section. The resolver, when finding a mergeByLargestSection atom looks for the Reference to the magic "section" atom and gets its size for comparison. The PE/COFF Writer would then also need to ignore atoms of the special contentType atoms.

DefinedAtom::sectionSize() is basically free if your atom does not have mergeByLargestSection attribute. It's a virtual function so it'll use one more slot for vftable, but that's it. There's no extra cost for each atomwhich does not use mergeByLargestSection. Making a virtual group atom only for size would work, but it's too complicated.

Yes, adding a virtual method is "free" in that it does not increase the size of each object, but this patch alone does not solve the round-trip (yaml/native) that Shankar brought up. My suggestion to add an atom representing the section and a reference from the mergeByLargestSection atom to the section atom, does work with round tripping.

The extra section atom and reference are not just arbitrary. They actually correspond to this merge mode. The atom is saying "pick me if my section is biggest". The reference answers the question "what is your section?" and the section atom answers the question "what is your section's size".

But there may also be another approach to this. Rather than have the SymbolTable understand all the merge modes, maybe we can have it call out to the LinkingContext to pick between custom modes? Or have it support mergeByLargestSection by calling out to the LinkingContext for the section size of a particular atom?

I would think adding a reference to another atom, and querying the atom for the size could be the approach we may want to take here.

The LinkingContext approach may work but it doesnot fit in the atom model properly IMHO.

atanasyan resigned from this revision.Feb 2 2016, 10:27 PM

atanasyan removed a reviewer: atanasyan.

Revision Contents

Path

Size

include/

lld/

Core/

DefinedAtom.h

4 lines

lib/

Core/

SymbolTable.cpp

38 lines

ReaderWriter/

PECOFF/

Atoms.h

10 lines

ReaderCOFF.cpp

8 lines

Diff 10168

include/lld/Core/DefinedAtom.h

Show First 20 Lines • Show All 249 Lines • ▼ Show 20 Lines	public:

/// \brief If sectionChoice() != sectionBasedOnContent, then this return the		/// \brief If sectionChoice() != sectionBasedOnContent, then this return the
/// name of the section the atom should be placed into.		/// name of the section the atom should be placed into.
virtual StringRef customSectionName() const = 0;		virtual StringRef customSectionName() const = 0;

/// \brief constraints on whether the linker may dead strip away this atom.		/// \brief constraints on whether the linker may dead strip away this atom.
virtual SectionPosition sectionPosition() const = 0;		virtual SectionPosition sectionPosition() const = 0;

		/// \brief If merge() == mergeByLargestSection, this returns the
		/// size of the section for this atom.
		virtual uint64_t sectionSize() const { return 0; }

/// \brief constraints on whether the linker may dead strip away this atom.		/// \brief constraints on whether the linker may dead strip away this atom.
virtual DeadStripKind deadStrip() const = 0;		virtual DeadStripKind deadStrip() const = 0;

/// \brief Under which conditions should this atom be dynamically exported.		/// \brief Under which conditions should this atom be dynamically exported.
virtual DynamicExport dynamicExport() const {		virtual DynamicExport dynamicExport() const {
return dynamicExportNormal;		return dynamicExportNormal;
}		}

▲ Show 20 Lines • Show All 92 Lines • Show Last 20 Lines

lib/Core/SymbolTable.cpp

Show First 20 Lines • Show All 129 Lines • ▼ Show 20 Lines

static MergeResolution mergeSelect(DefinedAtom::Merge first,		static MergeResolution mergeSelect(DefinedAtom::Merge first,
DefinedAtom::Merge second) {		DefinedAtom::Merge second) {
assert(first != DefinedAtom::mergeByContent);		assert(first != DefinedAtom::mergeByContent);
assert(second != DefinedAtom::mergeByContent);		assert(second != DefinedAtom::mergeByContent);
return mergeCases[first][second];		return mergeCases[first][second];
}		}

static const DefinedAtom followReference(const DefinedAtom atom,
uint32_t kind) {
for (const Reference r : atom)
if (r->kindNamespace() == Reference::KindNamespace::all &&
r->kindArch() == Reference::KindArch::all &&
r->kindValue() == kind)
return cast<const DefinedAtom>(r->target());
return nullptr;
}

static uint64_t getSizeFollowReferences(const DefinedAtom *atom,
uint32_t kind) {
uint64_t size = 0;
for (;;) {
atom = followReference(atom, kind);
if (!atom)
return size;
size += atom->size();
}
}

// Returns the size of the section containing the given atom. Atoms in the same
// section are connected by layout-before and layout-after edges, so this
// function traverses them to get the total size of atoms in the same section.
static uint64_t sectionSize(const DefinedAtom *atom) {
return atom->size()
+ getSizeFollowReferences(atom, lld::Reference::kindLayoutBefore)
+ getSizeFollowReferences(atom, lld::Reference::kindLayoutAfter);
}

bool SymbolTable::addByName(const Atom &newAtom) {		bool SymbolTable::addByName(const Atom &newAtom) {
StringRef name = newAtom.name();		StringRef name = newAtom.name();
assert(!name.empty());		assert(!name.empty());
const Atom *existing = findByName(name);		const Atom *existing = findByName(name);
if (existing == nullptr) {		if (existing == nullptr) {
// Name is not in symbol table yet, add it associate with this atom.		// Name is not in symbol table yet, add it associate with this atom.
_nameTable[name] = &newAtom;		_nameTable[name] = &newAtom;
return true;		return true;
Show All 19 Lines	switch (mergeSelect(((DefinedAtom*)existing)->merge(),
((DefinedAtom*)&newAtom)->merge())) {		((DefinedAtom*)&newAtom)->merge())) {
case MCR_First:		case MCR_First:
useNew = false;		useNew = false;
break;		break;
case MCR_Second:		case MCR_Second:
useNew = true;		useNew = true;
break;		break;
case MCR_Largest: {		case MCR_Largest: {
uint64_t existingSize = sectionSize((DefinedAtom*)existing);		uint64_t existingSize = ((DefinedAtom*)existing)->sectionSize();
uint64_t newSize = sectionSize((DefinedAtom*)&newAtom);		uint64_t newSize = ((DefinedAtom*)&newAtom)->sectionSize();
useNew = (newSize >= existingSize);		useNew = (newSize >= existingSize);
break;		break;
}		}
case MCR_SameSize: {		case MCR_SameSize: {
uint64_t existingSize = sectionSize((DefinedAtom*)existing);		uint64_t existingSize = ((DefinedAtom*)existing)->sectionSize();
uint64_t newSize = sectionSize((DefinedAtom*)&newAtom);		uint64_t newSize = ((DefinedAtom*)&newAtom)->sectionSize();
if (existingSize == newSize) {		if (existingSize == newSize) {
useNew = true;		useNew = true;
break;		break;
}		}
llvm::errs() << "Size mismatch: "		llvm::errs() << "Size mismatch: "
<< existing->name() << " (" << existingSize << ") "		<< existing->name() << " (" << existingSize << ") "
<< newAtom.name() << " (" << newSize << ")\n";		<< newAtom.name() << " (" << newSize << ")\n";
// fallthrough		// fallthrough
▲ Show 20 Lines • Show All 202 Lines • Show Last 20 Lines

lib/ReaderWriter/PECOFF/Atoms.h

Show First 20 Lines • Show All 176 Lines • ▼ Show 20 Lines	private:
std::vector<std::unique_ptr<COFFReference> > _references;		std::vector<std::unique_ptr<COFFReference> > _references;
};		};

// A COFFDefinedAtom represents an atom read from a file and has contents.		// A COFFDefinedAtom represents an atom read from a file and has contents.
class COFFDefinedAtom : public COFFDefinedFileAtom {		class COFFDefinedAtom : public COFFDefinedFileAtom {
public:		public:
COFFDefinedAtom(const File &file, StringRef name, StringRef sectionName,		COFFDefinedAtom(const File &file, StringRef name, StringRef sectionName,
Scope scope, ContentType type, bool isComdat,		Scope scope, ContentType type, bool isComdat,
ContentPermissions perms, Merge merge, ArrayRef<uint8_t> data,		ContentPermissions perms, Merge merge, uint64_t sectionSize,
uint64_t ordinal)		ArrayRef<uint8_t> data, uint64_t ordinal)
: COFFDefinedFileAtom(file, name, sectionName, scope, type, perms,		: COFFDefinedFileAtom(file, name, sectionName, scope, type, perms,
ordinal),		ordinal),
_isComdat(isComdat), _merge(merge), _dataref(data) {}		_isComdat(isComdat), _merge(merge), _sectionSize(sectionSize),
		_dataref(data) {}

Merge merge() const override { return _merge; }		Merge merge() const override { return _merge; }
uint64_t size() const override { return _dataref.size(); }		uint64_t size() const override { return _dataref.size(); }
ArrayRef<uint8_t> rawContent() const override { return _dataref; }		ArrayRef<uint8_t> rawContent() const override { return _dataref; }

		uint64_t sectionSize() const override { return _sectionSize; }

DeadStripKind deadStrip() const override {		DeadStripKind deadStrip() const override {
// Only COMDAT symbols would be dead-stripped.		// Only COMDAT symbols would be dead-stripped.
return _isComdat ? deadStripNormal : deadStripNever;		return _isComdat ? deadStripNormal : deadStripNever;
}		}

private:		private:
bool _isComdat;		bool _isComdat;
Merge _merge;		Merge _merge;
		uint64_t _sectionSize;
ArrayRef<uint8_t> _dataref;		ArrayRef<uint8_t> _dataref;
};		};

// A COFFDefinedAtom represents an atom for BSS section.		// A COFFDefinedAtom represents an atom for BSS section.
class COFFBSSAtom : public COFFDefinedFileAtom {		class COFFBSSAtom : public COFFDefinedFileAtom {
public:		public:
COFFBSSAtom(const File &file, StringRef name, Scope scope,		COFFBSSAtom(const File &file, StringRef name, Scope scope,
ContentPermissions perms, Merge merge, uint32_t size,		ContentPermissions perms, Merge merge, uint32_t size,
▲ Show 20 Lines • Show All 166 Lines • Show Last 20 Lines

lib/ReaderWriter/PECOFF/ReaderCOFF.cpp

Show First 20 Lines • Show All 630 Lines • ▼ Show 20 Lines	error_code FileCOFF::AtomizeDefinedSymbolsInSection(
DefinedAtom::ContentPermissions perms = getPermissions(section);		DefinedAtom::ContentPermissions perms = getPermissions(section);
bool isComdat = (_comdatSections.count(section) == 1);		bool isComdat = (_comdatSections.count(section) == 1);

// Create an atom for the entire section.		// Create an atom for the entire section.
if (symbols.empty()) {		if (symbols.empty()) {
ArrayRef<uint8_t> data(secData.data(), secData.size());		ArrayRef<uint8_t> data(secData.data(), secData.size());
auto *atom = new (_alloc) COFFDefinedAtom(		auto *atom = new (_alloc) COFFDefinedAtom(
*this, "", sectionName, Atom::scopeTranslationUnit, type, isComdat,		*this, "", sectionName, Atom::scopeTranslationUnit, type, isComdat,
perms, _merge[section], data, _ordinal++);		perms, _merge[section], section->SizeOfRawData, data, _ordinal++);
atoms.push_back(atom);		atoms.push_back(atom);
_definedAtomLocations[section][0].push_back(atom);		_definedAtomLocations[section][0].push_back(atom);
return error_code();		return error_code();
}		}

// Create an unnamed atom if the first atom isn't at the start of the		// Create an unnamed atom if the first atom isn't at the start of the
// section.		// section.
if (symbols[0]->Value != 0) {		if (symbols[0]->Value != 0) {
uint64_t size = symbols[0]->Value;		uint64_t size = symbols[0]->Value;
ArrayRef<uint8_t> data(secData.data(), size);		ArrayRef<uint8_t> data(secData.data(), size);
auto *atom = new (_alloc) COFFDefinedAtom(		auto *atom = new (_alloc) COFFDefinedAtom(
*this, "", sectionName, Atom::scopeTranslationUnit, type, isComdat,		*this, "", sectionName, Atom::scopeTranslationUnit, type, isComdat,
perms, _merge[section], data, _ordinal++);		perms, _merge[section], section->SizeOfRawData, data, _ordinal++);
atoms.push_back(atom);		atoms.push_back(atom);
_definedAtomLocations[section][0].push_back(atom);		_definedAtomLocations[section][0].push_back(atom);
}		}

for (auto si = symbols.begin(), se = symbols.end(); si != se; ++si) {		for (auto si = symbols.begin(), se = symbols.end(); si != se; ++si) {
const uint8_t start = secData.data() + (si)->Value;		const uint8_t start = secData.data() + (si)->Value;
// if this is the last symbol, take up the remaining data.		// if this is the last symbol, take up the remaining data.
const uint8_t *end = (si + 1 == se) ? secData.data() + secData.size()		const uint8_t *end = (si + 1 == se) ? secData.data() + secData.size()
: secData.data() + (*(si + 1))->Value;		: secData.data() + (*(si + 1))->Value;
ArrayRef<uint8_t> data(start, end);		ArrayRef<uint8_t> data(start, end);
auto *atom = new (_alloc) COFFDefinedAtom(		auto *atom = new (_alloc) COFFDefinedAtom(
this, _symbolName[si], sectionName, getScope(*si), type, isComdat,		this, _symbolName[si], sectionName, getScope(*si), type, isComdat,
perms, _merge[section], data, _ordinal++);		perms, _merge[section], section->SizeOfRawData, data, _ordinal++);
atoms.push_back(atom);		atoms.push_back(atom);
_symbolAtom[*si] = atom;		_symbolAtom[*si] = atom;
_definedAtomLocations[section][(*si)->Value].push_back(atom);		_definedAtomLocations[section][(*si)->Value].push_back(atom);
}		}

// Finally, set alignment to the first atom so that the section contents		// Finally, set alignment to the first atom so that the section contents
// will be aligned as specified by the object section header.		// will be aligned as specified by the object section header.
_definedAtomLocations[section][0][0]->setAlignment(getAlignment(section));		_definedAtomLocations[section][0][0]->setAlignment(getAlignment(section));
▲ Show 20 Lines • Show All 155 Lines • ▼ Show 20 Lines	error_code FileCOFF::maybeCreateSXDataAtoms() {
if (sxdata.empty())		if (sxdata.empty())
return error_code();		return error_code();

std::vector<uint8_t> atomContent =		std::vector<uint8_t> atomContent =
*new (_alloc) std::vector<uint8_t>((size_t)sxdata.size());		*new (_alloc) std::vector<uint8_t>((size_t)sxdata.size());
auto *atom = new (_alloc) COFFDefinedAtom(		auto *atom = new (_alloc) COFFDefinedAtom(
*this, "", ".sxdata", Atom::scopeTranslationUnit, DefinedAtom::typeData,		*this, "", ".sxdata", Atom::scopeTranslationUnit, DefinedAtom::typeData,
false /isComdat/, DefinedAtom::permR__, DefinedAtom::mergeNo,		false /isComdat/, DefinedAtom::permR__, DefinedAtom::mergeNo,
atomContent, _ordinal++);		0 /* sectionSize */, atomContent, _ordinal++);

const ulittle32_t *symbolIndex =		const ulittle32_t *symbolIndex =
reinterpret_cast<const ulittle32_t *>(sxdata.data());		reinterpret_cast<const ulittle32_t *>(sxdata.data());
int numSymbols = sxdata.size() / sizeof(uint32_t);		int numSymbols = sxdata.size() / sizeof(uint32_t);

for (int i = 0; i < numSymbols; ++i) {		for (int i = 0; i < numSymbols; ++i) {
Atom *handlerFunc;		Atom *handlerFunc;
if (error_code ec = getAtomBySymbolIndex(symbolIndex[i], handlerFunc))		if (error_code ec = getAtomBySymbolIndex(symbolIndex[i], handlerFunc))
▲ Show 20 Lines • Show All 326 Lines • Show Last 20 Lines