This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lld/trunk/ELF/
-
trunk/
-
ELF/
-
SymbolTable.cpp
-
Symbols.h

Differential D66130

[ELF] Initialize 2 fields of Symbol in SymbolTable::insert
ClosedPublic

Authored by MaskRay on Aug 13 2019, 2:32 AM.

Download Raw Diff

Details

Reviewers

grimar
peter.smith
ruiu
• espindola

Commits

rGf1d538cce54b: [ELF] Initialize 2 fields of Symbol in SymbolTable::insert
rLLD368784: [ELF] Initialize 2 fields of Symbol in SymbolTable::insert
rL368784: [ELF] Initialize 2 fields of Symbol in SymbolTable::insert

Summary

A new symbol is added to elf::symtab in 3 steps:

SymbolTable::insert creates a placeholder.
Symbol::mergeProperties
Symbol::replace

Fields referenced by steps 2) and 3) should be initialized in
SymbolTable::insert. traced and referenced were missed previously.
This did not cause problems because compilers generated code that
initialized them (bit fields) to 0.

Diff Detail

Repository: rL LLVM

Event Timeline

MaskRay created this revision.Aug 13 2019, 2:32 AM

Herald added a reviewer: • espindola. · View Herald TranscriptAug 13 2019, 2:32 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: llvm-commits, arichardson, emaste. · View Herald Transcript

Harbormaster completed remote builds in B36651: Diff 214787.Aug 13 2019, 2:32 AM

LGTM. Please wait for other reviewer(s) opinion about comments (I'd remove them).

ELF/SymbolTable.cpp
75 ↗	(On Diff #214787)	I am not sure that comments of this patch are valuable. You just initializing fields because they can be used anywhere in LLD. Not sure it is worth to mention where they are used/copied.

This revision is now accepted and ready to land.Aug 13 2019, 2:38 AM

Improve comments

Harbormaster completed remote builds in B36652: Diff 214789.Aug 13 2019, 2:51 AM

LGTM too, I agree with George that making the comment specific isn't helpful but a general one isn't doing any harm.

An alternative is to make an initialization function in Symbol and have the default constructor call that for the bitfields. This could be called from SymbolTable::insert. This would have the benefit that we'd only have to update one place for a new field. I notice that canInline is given a different default value from the default constructor, but if this is important it could be a parameter to the initialization function.

A further alternative is to wrap the bitfields in a struct that can be initialized and copied as a whole, but would require every use of one of the fields to be updated.

In D66130#1626705, @peter.smith wrote:

LGTM too, I agree with George that making the comment specific isn't helpful but a general one isn't doing any harm.

An alternative is to make an initialization function in Symbol and have the default constructor call that for the bitfields. This could be called from SymbolTable::insert. This would have the benefit that we'd only have to update one place for a new field. I notice that canInline is given a different default value from the default constructor, but if this is important it could be a parameter to the initialization function.

Yes, canInline is weird. The member initializer arbitrarily sets it to false, then SymbolTable::insert changes it to true, in --wrap handling it gets assigned to false (true->false semilattice).

A further alternative is to wrap the bitfields in a struct that can be initialized and copied as a whole, but would require every use of one of the fields to be updated.

The problem is that I can't give the member function a reasonable name :( Conceptually any field referenced when the symbol is still a placeholder should be initialized (these fields are all bit fields currently). Note, there are many other fields that are not referenced - they are not initialized in SymbolTable::insert. Some bit fields are initialized differently in the member initializer list and SymbolTable::insert

visibility
isUsedInRegularObj
canInline
exportDynamic

So a member function may not save us a lot of trouble.

Delete a sentence from a comment

Harbormaster completed remote builds in B36655: Diff 214794.Aug 13 2019, 3:17 AM

Closed by commit rL368784: [ELF] Initialize 2 fields of Symbol in SymbolTable::insert (authored by MaskRay). · Explain WhyAug 13 2019, 6:52 PM

This revision was automatically updated to reflect the committed changes.

It might make sense to zero-initialize a new instance with memset(sym, 0, sizeof(Symbol)) before setting any (non-zero/false) values to sym members, as it is less error-prone. What do you think?

In D66130#1635001, @ruiu wrote:

It might make sense to zero-initialize a new instance with memset(sym, 0, sizeof(Symbol)) before setting any (non-zero/false) values to sym members, as it is less error-prone. What do you think?

The list documents the fields that may be used while the symbol is still a placeholder. I worry that such auto zero initialization can hide problems that are harder to diagnose.

Revision Contents

Path

Size

lld/

trunk/

ELF/

SymbolTable.cpp

4 lines

Symbols.h

2 lines

Diff 215005

lld/trunk/ELF/SymbolTable.cpp

Show First 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	Symbol *SymbolTable::insert(StringRef name) {
bool isNew = p.second;		bool isNew = p.second;

if (!isNew)		if (!isNew)
return symVector[symIndex];		return symVector[symIndex];

Symbol sym = reinterpret_cast<Symbol >(make<SymbolUnion>());		Symbol sym = reinterpret_cast<Symbol >(make<SymbolUnion>());
symVector.push_back(sym);		symVector.push_back(sym);

		// *sym was not initialized by a constructor. Fields that may get referenced
		// when it is a placeholder must be initialized here.
sym->setName(name);		sym->setName(name);
sym->symbolKind = Symbol::PlaceholderKind;		sym->symbolKind = Symbol::PlaceholderKind;
sym->versionId = VER_NDX_GLOBAL;		sym->versionId = VER_NDX_GLOBAL;
sym->visibility = STV_DEFAULT;		sym->visibility = STV_DEFAULT;
sym->isUsedInRegularObj = false;		sym->isUsedInRegularObj = false;
sym->exportDynamic = false;		sym->exportDynamic = false;
sym->inDynamicList = false;		sym->inDynamicList = false;
sym->canInline = true;		sym->canInline = true;
		sym->referenced = false;
		sym->traced = false;
sym->scriptDefined = false;		sym->scriptDefined = false;
sym->partition = 1;		sym->partition = 1;
return sym;		return sym;
}		}

Symbol *SymbolTable::addSymbol(const Symbol &newSym) {		Symbol *SymbolTable::addSymbol(const Symbol &newSym) {
Symbol *sym = symtab->insert(newSym.getName());		Symbol *sym = symtab->insert(newSym.getName());
sym->resolve(newSym);		sym->resolve(newSym);
▲ Show 20 Lines • Show All 177 Lines • Show Last 20 Lines

lld/trunk/ELF/Symbols.h

Show First 20 Lines • Show All 531 Lines • ▼ Show 20 Lines	void Symbol::replace(const Symbol &newSym) {
if (symbolKind != PlaceholderKind && !isLazy() && !newSym.isLazy() &&		if (symbolKind != PlaceholderKind && !isLazy() && !newSym.isLazy() &&
(type == STT_TLS) != (newSym.type == STT_TLS))		(type == STT_TLS) != (newSym.type == STT_TLS))
error("TLS attribute mismatch: " + toString(*this) + "\n>>> defined in " +		error("TLS attribute mismatch: " + toString(*this) + "\n>>> defined in " +
toString(newSym.file) + "\n>>> defined in " + toString(file));		toString(newSym.file) + "\n>>> defined in " + toString(file));

Symbol old = *this;		Symbol old = *this;
memcpy(this, &newSym, newSym.getSymbolSize());		memcpy(this, &newSym, newSym.getSymbolSize());

		// old may be a placeholder. The referenced fields must be initialized in
		// SymbolTable::insert.
versionId = old.versionId;		versionId = old.versionId;
visibility = old.visibility;		visibility = old.visibility;
isUsedInRegularObj = old.isUsedInRegularObj;		isUsedInRegularObj = old.isUsedInRegularObj;
exportDynamic = old.exportDynamic;		exportDynamic = old.exportDynamic;
inDynamicList = old.inDynamicList;		inDynamicList = old.inDynamicList;
canInline = old.canInline;		canInline = old.canInline;
referenced = old.referenced;		referenced = old.referenced;
traced = old.traced;		traced = old.traced;
Show All 20 Lines