This is an archive of the discontinued LLVM Phabricator instance.

Differential D54369

[llvm-size][libobject] Add explicit "inTextSegment" methods similar to "isText" section methods to calculate size correctly.
ClosedPublic

Authored by rupprecht on Nov 9 2018, 5:18 PM.

Details

Reviewers

echristo
Bigcheese
MaskRay

Commits

rG4888c4aba5a1: [llvm-size][libobject] Add explicit "inTextSegment" methods similar to "isText"…
rL349074: [llvm-size][libobject] Add explicit "inTextSegment" methods similar to "isText"…

Summary

llvm-size uses "isText()" etc. which seem to indicate whether the section contains code-like things, not whether or not it will actually go in the text segment when in a fully linked executable.

The unit test added (elf-sizes.test) shows some types of sections that cause discrepancies versus the GNU size tool. llvm-size is not correctly reporting sizes of things mapping to text/data segments, at least for ELF files.

This fixes pr38723.

Diff Detail

Repository: rL LLVM

Event Timeline

rupprecht created this revision. Nov 9 2018, 5:18 PM

Herald added a subscriber: llvm-commits. Nov 9 2018, 5:19 PM

Harbormaster completed remote builds in B24834: Diff 173478. Nov 9 2018, 5:19 PM

echristo added a comment.

I think this is the right direction, a couple of comments:

More detail in the comments for the *Segment methods. Maybe something elaborating about link/execution time or something else? Examples as well :)
While it's fairly obvious it'll work, a couple of sections with the right flags but not named the standard thing would be good for the test. .text.unlikely and .data1 maybe?

MaskRay added inline comments. Nov 10 2018, 1:25 PM

include/llvm/Object/ELFObjectFile.h
767 ↗	(On Diff #173478)	The parentheses outside of `!` are redundant.

`GNU size` uses Berkeley format by default (`static int berkeley_format = BSD_DEFAULT;`). I'd say it uses a weird counting:

https://sourceware.org/git/gitweb.cgi?p=binutils-gdb.git;a=blob;f=binutils/size.c;h=3c72e484752d0272a24ac628f497f89ecf36d547;hb=refs/heads/master#l459

    if ((flags & SEC_CODE) != 0 || (flags & SEC_READONLY) != 0)
      textsize += size;
    else if ((flags & SEC_HAS_CONTENTS) != 0)
      datasize += size;
    else
      bsssize += size;

Basically `SHF_EXECINSTR` sections (`.text`) + `!SHF_WRITE` sections (`.rodata .eh_frame` ...).

I feel the name `inTextSegment` is a bit misleading. How about `isBerkeleyText`? `inDataSegment` may be renamed to `isBerkeleyData`.
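The counting rule quoted from binutils `size.c` can be restated as a small standalone function. This is only a sketch: the flag constants below are placeholder bit values chosen for illustration (not the real BFD `SEC_*` values), and `BerkeleyKind` and `classifyBerkeley` are hypothetical names that do not appear in binutils or LLVM.

```cpp
#include <cassert>
#include <cstdint>

// Placeholder bit values for illustration only; the real BFD SEC_* constants
// are defined by binutils and are not reproduced here.
constexpr uint32_t kSecHasContents = 1u << 0;
constexpr uint32_t kSecReadOnly = 1u << 1;
constexpr uint32_t kSecCode = 1u << 2;

enum class BerkeleyKind { Text, Data, Bss };

// Mirrors the quoted if/else chain: code or read-only sections count as text,
// any other section with contents counts as data, and the rest counts as bss.
BerkeleyKind classifyBerkeley(uint32_t flags) {
  if ((flags & kSecCode) != 0 || (flags & kSecReadOnly) != 0)
    return BerkeleyKind::Text;
  if ((flags & kSecHasContents) != 0)
    return BerkeleyKind::Data;
  return BerkeleyKind::Bss;
}

// Usage: a read-only section with contents (such as .rodata) is counted as
// text, while a writable section with contents is counted as data.
void classifyBerkeleyExample() {
  assert(classifyBerkeley(kSecReadOnly | kSecHasContents) == BerkeleyKind::Text);
  assert(classifyBerkeley(kSecHasContents) == BerkeleyKind::Data);
}
```

The order of the checks matters: a section that is both read-only and has contents is counted once, as text, because the first matching branch wins.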

MaskRay added inline comments. Nov 10 2018, 1:50 PM

include/llvm/Object/ELFObjectFile.h
767 ↗	(On Diff #173478)	The Berkeley format is not that weird. It uses some simple condition to approximate how the traditional linkers partition sections into segments. With the advent of fine-grained segments (split of `R` and `RX`): `-z separate-code` in ld.bfd and lld's default (unless `--no-rosegment`), the computation may be less relevant. `isBerkeleyText` makes more sense to me.

Remove useless parens
Rename inXXXSegment->isBerkeleyXXX
Clarify segment comments
Add test cases that ensure section flags are used instead of section names

Harbormaster completed remote builds in B25970: Diff 177953. Dec 12 2018, 3:38 PM

Sorry for not getting to this patch sooner, I dropped it due to vacation + some unexpected things... I'm still eager to get this patch in :)

In D54369#1293994, @echristo wrote:

I think this is the right direction, a couple of comments:

More detail in the comments for the *Segment methods. Maybe something elaborating about link/execution time or something else? Examples as well :)

Expanded a bit, I can add more if this isn't enough

While it's fairly obvious it'll work, a couple of sections with the right flags but not named the standard thing would be good for the test. .text.unlikely and .data1 maybe?

Done, added a couple examples both ways (named .text.* but not actually text, and named .something_random but is actually data)

(Addressed Ray's comments too)

MaskRay accepted this revision. Dec 12 2018, 3:52 PM

MaskRay added inline comments.

include/llvm/Object/ObjectFile.h
111 ↗	(On Diff #177953)	Typo here. initialized -> uninitialized https://en.wikipedia.org/wiki/.bss "BSS reserves a block of uninitialized data"

This revision is now accepted and ready to land. Dec 12 2018, 3:52 PM

Closed by commit rL349074: [llvm-size][libobject] Add explicit "inTextSegment" methods similar to "isText"… (authored by rupprecht). Dec 13 2018, 11:43 AM

This revision was automatically updated to reflect the committed changes.

rupprecht marked an inline comment as done.

Revision Contents

Path                                                     Size
llvm/trunk/include/llvm/Object/ELFObjectFile.h           16 lines
llvm/trunk/include/llvm/Object/ObjectFile.h              22 lines
llvm/trunk/lib/Object/ObjectFile.cpp                     8 lines
llvm/trunk/test/tools/llvm-size/X86/elf-sizes.test       55 lines
llvm/trunk/test/tools/llvm-size/X86/ignore-sections.s    4 lines
llvm/trunk/tools/llvm-size/llvm-size.cpp                 4 lines

Diff 178104

llvm/trunk/include/llvm/Object/ELFObjectFile.h

[... 254 lines hidden ...] protected:
   std::error_code getSectionContents(DataRefImpl Sec,
                                      StringRef &Res) const override;
   uint64_t getSectionAlignment(DataRefImpl Sec) const override;
   bool isSectionCompressed(DataRefImpl Sec) const override;
   bool isSectionText(DataRefImpl Sec) const override;
   bool isSectionData(DataRefImpl Sec) const override;
   bool isSectionBSS(DataRefImpl Sec) const override;
   bool isSectionVirtual(DataRefImpl Sec) const override;
+  bool isBerkeleyText(DataRefImpl Sec) const override;
+  bool isBerkeleyData(DataRefImpl Sec) const override;
   relocation_iterator section_rel_begin(DataRefImpl Sec) const override;
   relocation_iterator section_rel_end(DataRefImpl Sec) const override;
   std::vector<SectionRef> dynamic_relocation_sections() const override;
   section_iterator getRelocatedSection(DataRefImpl Sec) const override;

   void moveRelocationNext(DataRefImpl &Rel) const override;
   uint64_t getRelocationOffset(DataRefImpl Rel) const override;
   symbol_iterator getRelocationSymbol(DataRefImpl Rel) const override;
[... 484 lines hidden ...]
 }

 template <class ELFT>
 bool ELFObjectFile<ELFT>::isSectionVirtual(DataRefImpl Sec) const {
   return getSection(Sec)->sh_type == ELF::SHT_NOBITS;
 }

 template <class ELFT>
+bool ELFObjectFile<ELFT>::isBerkeleyText(DataRefImpl Sec) const {
+  return getSection(Sec)->sh_flags & ELF::SHF_ALLOC &&
+         (getSection(Sec)->sh_flags & ELF::SHF_EXECINSTR ||
+          !(getSection(Sec)->sh_flags & ELF::SHF_WRITE));
+}
+
+template <class ELFT>
+bool ELFObjectFile<ELFT>::isBerkeleyData(DataRefImpl Sec) const {
+  const Elf_Shdr *EShdr = getSection(Sec);
+  return !isBerkeleyText(Sec) && EShdr->sh_type != ELF::SHT_NOBITS &&
+         EShdr->sh_flags & ELF::SHF_ALLOC;
+}
+
+template <class ELFT>
 relocation_iterator
 ELFObjectFile<ELFT>::section_rel_begin(DataRefImpl Sec) const {
   DataRefImpl RelData;
   auto SectionsOrErr = EF.sections();
   if (!SectionsOrErr)
     return relocation_iterator(RelocationRef());
   uintptr_t SHT = reinterpret_cast<uintptr_t>((*SectionsOrErr).begin());
   RelData.d.a = (Sec.p - SHT) / EF.getHeader()->e_shentsize;
[... 392 lines hidden ...]
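To make the two ELF predicates easier to check by hand, here is a minimal standalone sketch that applies the same flag tests to a plain struct instead of `Elf_Shdr`. The `SectionHeader` struct, the `k`-prefixed constant names, and `berkeleyPredicateExample` are hypothetical; the numeric values are the standard ELF `SHF_*` and `SHT_*` values.

```cpp
#include <cassert>
#include <cstdint>

// Standard ELF values, renamed to avoid clashing with any system <elf.h>.
constexpr uint64_t kShfWrite = 0x1;
constexpr uint64_t kShfAlloc = 0x2;
constexpr uint64_t kShfExecInstr = 0x4;
constexpr uint32_t kShtProgbits = 1;
constexpr uint32_t kShtNobits = 8;

// Hypothetical stand-in for the two Elf_Shdr fields the predicates read.
struct SectionHeader {
  uint32_t type;
  uint64_t flags;
};

// Allocatable, and either executable or not writable.
bool isBerkeleyText(const SectionHeader &S) {
  bool Alloc = (S.flags & kShfAlloc) != 0;
  bool Exec = (S.flags & kShfExecInstr) != 0;
  bool Write = (S.flags & kShfWrite) != 0;
  return Alloc && (Exec || !Write);
}

// Allocatable, occupies file space (not NOBITS), and not already text.
bool isBerkeleyData(const SectionHeader &S) {
  bool Alloc = (S.flags & kShfAlloc) != 0;
  return !isBerkeleyText(S) && S.type != kShtNobits && Alloc;
}

// Usage: typical flag combinations for .eh_frame, .data, .bss and .debug_info.
void berkeleyPredicateExample() {
  SectionHeader EhFrame{kShtProgbits, kShfAlloc};
  SectionHeader Data{kShtProgbits, kShfAlloc | kShfWrite};
  SectionHeader Bss{kShtNobits, kShfAlloc | kShfWrite};
  SectionHeader Debug{kShtProgbits, 0};
  assert(isBerkeleyText(EhFrame) && !isBerkeleyData(EhFrame));
  assert(!isBerkeleyText(Data) && isBerkeleyData(Data));
  assert(!isBerkeleyText(Bss) && !isBerkeleyData(Bss));
  assert(!isBerkeleyText(Debug) && !isBerkeleyData(Debug));
}
```

Non-allocatable sections such as `.debug_info` satisfy neither predicate, so they drop out of the Berkeley totals entirely.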

llvm/trunk/include/llvm/Object/ObjectFile.h

[... 98 lines hidden ...] public:
   uint64_t getIndex() const;
   uint64_t getSize() const;
   std::error_code getContents(StringRef &Result) const;

   /// Get the alignment of this section as the actual value (not log 2).
   uint64_t getAlignment() const;

   bool isCompressed() const;
+  /// Whether this section contains instructions.
   bool isText() const;
+  /// Whether this section contains data, not instructions.
   bool isData() const;
+  /// Whether this section contains BSS uninitialized data.
   bool isBSS() const;
   bool isVirtual() const;
   bool isBitcode() const;
   bool isStripped() const;

+  /// Whether this section will be placed in the text segment, according to the
+  /// Berkeley size format. This is true if the section is allocatable, and
+  /// contains either code or readonly data.
+  bool isBerkeleyText() const;
+  /// Whether this section will be placed in the data segment, according to the
+  /// Berkeley size format. This is true if the section is allocatable and
+  /// contains data (e.g. PROGBITS), but is not text.
+  bool isBerkeleyData() const;
+
   bool containsSymbol(SymbolRef S) const;

   relocation_iterator relocation_begin() const;
   relocation_iterator relocation_end() const;
   iterator_range<relocation_iterator> relocations() const {
     return make_range(relocation_begin(), relocation_end());
   }
   section_iterator getRelocatedSection() const;
[... 111 lines hidden ...] protected:
   virtual bool isSectionCompressed(DataRefImpl Sec) const = 0;
   virtual bool isSectionText(DataRefImpl Sec) const = 0;
   virtual bool isSectionData(DataRefImpl Sec) const = 0;
   virtual bool isSectionBSS(DataRefImpl Sec) const = 0;
   // A section is 'virtual' if its contents aren't present in the object image.
   virtual bool isSectionVirtual(DataRefImpl Sec) const = 0;
   virtual bool isSectionBitcode(DataRefImpl Sec) const;
   virtual bool isSectionStripped(DataRefImpl Sec) const;
+  virtual bool isBerkeleyText(DataRefImpl Sec) const;
+  virtual bool isBerkeleyData(DataRefImpl Sec) const;
   virtual relocation_iterator section_rel_begin(DataRefImpl Sec) const = 0;
   virtual relocation_iterator section_rel_end(DataRefImpl Sec) const = 0;
   virtual section_iterator getRelocatedSection(DataRefImpl Sec) const;

   // Same as above for RelocationRef.
   friend class RelocationRef;
   virtual void moveRelocationNext(DataRefImpl &Rel) const = 0;
   virtual uint64_t getRelocationOffset(DataRefImpl Rel) const = 0;
[... 195 lines hidden ...]
 inline bool SectionRef::isBitcode() const {
   return OwningObject->isSectionBitcode(SectionPimpl);
 }

 inline bool SectionRef::isStripped() const {
   return OwningObject->isSectionStripped(SectionPimpl);
 }

+inline bool SectionRef::isBerkeleyText() const {
+  return OwningObject->isBerkeleyText(SectionPimpl);
+}
+
+inline bool SectionRef::isBerkeleyData() const {
+  return OwningObject->isBerkeleyData(SectionPimpl);
+}
+
 inline relocation_iterator SectionRef::relocation_begin() const {
   return OwningObject->section_rel_begin(SectionPimpl);
 }

 inline relocation_iterator SectionRef::relocation_end() const {
   return OwningObject->section_rel_end(SectionPimpl);
 }

[... 55 lines hidden ...]

llvm/trunk/lib/Object/ObjectFile.cpp

[... 71 lines hidden ...] bool ObjectFile::isSectionBitcode(DataRefImpl Sec) const {
   StringRef SectName;
   if (!getSectionName(Sec, SectName))
     return SectName == ".llvmbc";
   return false;
 }

 bool ObjectFile::isSectionStripped(DataRefImpl Sec) const { return false; }

+bool ObjectFile::isBerkeleyText(DataRefImpl Sec) const {
+  return isSectionText(Sec);
+}
+
+bool ObjectFile::isBerkeleyData(DataRefImpl Sec) const {
+  return isSectionData(Sec);
+}
+
 section_iterator ObjectFile::getRelocatedSection(DataRefImpl Sec) const {
   return section_iterator(SectionRef(Sec, this));
 }

 Triple ObjectFile::makeTriple() const {
   Triple TheTriple;
   auto Arch = getArch();
   TheTriple.setArch(Triple::ArchType(Arch));
[... 80 lines hidden ...]
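The base-class definitions above give every object format a fallback (the Berkeley predicate defers to the existing text/data check), and only the ELF class overrides it. A minimal sketch of that default-plus-override pattern follows. `Section`, `GenericObject`, and `ElfLikeObject` are hypothetical, heavily simplified stand-ins and not LLVM classes; in particular, the real `isSectionText` is pure virtual, whereas here it has a trivial body so the example is self-contained.

```cpp
#include <cassert>

// Hypothetical, simplified section description.
struct Section {
  bool Alloc;
  bool Exec;
  bool Write;
};

// Stand-in for the generic base class: by default the Berkeley predicate
// simply reuses the format's own notion of "text".
class GenericObject {
public:
  virtual ~GenericObject() = default;
  virtual bool isSectionText(const Section &S) const { return S.Exec; }
  virtual bool isBerkeleyText(const Section &S) const {
    return isSectionText(S);
  }
};

// Stand-in for the ELF subclass: it overrides only the Berkeley predicate,
// so read-only allocatable sections are also counted as text.
class ElfLikeObject : public GenericObject {
public:
  bool isBerkeleyText(const Section &S) const override {
    return S.Alloc && (S.Exec || !S.Write);
  }
};

// Usage: a read-only, allocatable, non-executable section (like .eh_frame).
void fallbackExample() {
  Section EhFrame{/*Alloc=*/true, /*Exec=*/false, /*Write=*/false};
  GenericObject Generic;
  ElfLikeObject Elf;
  assert(!Generic.isBerkeleyText(EhFrame)); // falls back to isSectionText
  assert(Elf.isBerkeleyText(EhFrame));      // uses the flag-based override
}
```

With this shape, callers can switch to the Berkeley predicates unconditionally, and formats without an override keep their previous behaviour.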

llvm/trunk/test/tools/llvm-size/X86/elf-sizes.test

+# RUN: yaml2obj %s > %t.o
+# RUN: llvm-size -B %t.o | FileCheck %s
+
+!ELF
+FileHeader:
+  Class: ELFCLASS64
+  Data: ELFDATA2LSB
+  Type: ET_EXEC
+  Machine: EM_X86_64
+Sections:
+  - Name: .bss
+    Type: SHT_NOBITS
+    Flags: [ SHF_ALLOC, SHF_WRITE ]
+    Size: 1
+  - Name: .text
+    Type: SHT_PROGBITS
+    Flags: [ SHF_ALLOC, SHF_EXECINSTR ]
+    Size: 2
+  - Name: .unusual_name_for_code
+    Type: SHT_PROGBITS
+    Flags: [ SHF_ALLOC, SHF_EXECINSTR ]
+    Size: 64
+  - Name: .eh_frame
+    Type: SHT_PROGBITS
+    Flags: [ SHF_ALLOC ]
+    Size: 4
+  - Name: .data
+    Type: SHT_PROGBITS
+    Flags: [ SHF_ALLOC, SHF_WRITE ]
+    Size: 8
+  - Name: .moar_stuff
+    Type: SHT_PROGBITS
+    Flags: [ SHF_ALLOC, SHF_WRITE ]
+    Size: 128
+  - Name: .text.but_not_really
+    Type: SHT_PROGBITS
+    Flags: [ ]
+    Size: 256
+  - Name: .debug_info
+    Type: SHT_PROGBITS
+    Flags: [ ]
+    Size: 16
+  - Name: .init_array
+    Type: SHT_INIT_ARRAY
+    Flags: [ SHF_ALLOC, SHF_WRITE ]
+    Size: 32
+
+# text is .text, .eh_frame, .unusual_name_for_code: 2 + 4 + 64 = 70
+# data is .data, .init_array, .moar_stuff: 8 + 32 + 128 = 168
+# bss is .bss: 1
+# total: 239
+# unaccounted for (not affecting total) is .debug_info, .text.but_not_really
+
+# CHECK: text data bss dec
+# CHECK: 70 168 1 239
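The expected totals in the test comments can be reproduced with a short standalone sketch that walks the same section table. The `Sec` and `Totals` structs and both function names are hypothetical. The bss rule used here (allocatable `SHT_NOBITS`) is an assumption, since the `isBSS` implementation is not part of this diff.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical flattened view of one section from the YAML above.
struct Sec {
  const char *Name;
  bool NoBits; // Type is SHT_NOBITS
  bool Alloc;  // SHF_ALLOC
  bool Write;  // SHF_WRITE
  bool Exec;   // SHF_EXECINSTR
  uint64_t Size;
};

struct Totals {
  uint64_t Text = 0, Data = 0, Bss = 0;
  uint64_t dec() const { return Text + Data + Bss; }
};

// Same if / else-if shape as the llvm-size loop: each section is counted at
// most once, in the first category that matches.
Totals berkeleyTotals(const std::vector<Sec> &Secs) {
  Totals T;
  for (const Sec &S : Secs) {
    bool IsText = S.Alloc && (S.Exec || !S.Write);
    bool IsData = !IsText && !S.NoBits && S.Alloc;
    bool IsBss = S.NoBits && S.Alloc; // assumption, see the note above
    if (IsText)
      T.Text += S.Size;
    else if (IsData)
      T.Data += S.Size;
    else if (IsBss)
      T.Bss += S.Size;
  }
  return T;
}

// The nine sections declared in elf-sizes.test.
std::vector<Sec> elfSizesTestSections() {
  return {
      {".bss", true, true, true, false, 1},
      {".text", false, true, false, true, 2},
      {".unusual_name_for_code", false, true, false, true, 64},
      {".eh_frame", false, true, false, false, 4},
      {".data", false, true, true, false, 8},
      {".moar_stuff", false, true, true, false, 128},
      {".text.but_not_really", false, false, false, false, 256},
      {".debug_info", false, false, false, false, 16},
      {".init_array", false, true, true, false, 32},
  };
}
```

Feeding `elfSizesTestSections()` to `berkeleyTotals` gives text 70, data 168, bss 1 and a total of 239, matching the CHECK line. `.text.but_not_really` and `.debug_info` contribute nothing because they are not allocatable.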

llvm/trunk/test/tools/llvm-size/X86/ignore-sections.s

[... 19 lines hidden ...]
 // SYSV-NEXT: .data 4 0
 // SYSV-NEXT: .bss 4 0
 // SYSV-NEXT: .comment 5 0
 // SYSV-NEXT: foo 4 0
 // SYSV-NEXT: .eh_frame 48 0
 // SYSV-NEXT: Total 69

 // BSD: text data bss dec hex filename
-// BSD-NEXT: 4 4 4 12 c {{[ -\(\)_A-Za-z0-9.\\/:]+}}
+// BSD-NEXT: 52 4 4 60 3c {{[ -\(\)_A-Za-z0-9.\\/:]+}}
-// BSD-NEXT: 4 4 4 12 c (TOTALS)
+// BSD-NEXT: 52 4 4 60 3c (TOTALS)
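The updated BSD line follows from the SYSV sizes if `.eh_frame` (48 bytes) is now counted as text alongside a 4-byte `.text`: 4 + 48 = 52, and 52 + 4 + 4 = 60, which is 3c in hexadecimal. The 4-byte `.text` size is an assumption, since that line is hidden here, though it is consistent with the SYSV total of 69. The sketch below only reproduces the numeric columns; `berkeleyRow` is a hypothetical helper, and the single-space separators match the flattened form shown above rather than llvm-size's real column layout.

```cpp
#include <cassert>
#include <cstdint>
#include <sstream>
#include <string>

// Builds the numeric part of a Berkeley-format row: text, data, bss, then the
// total in decimal and in lowercase hexadecimal.
std::string berkeleyRow(uint64_t Text, uint64_t Data, uint64_t Bss) {
  uint64_t Total = Text + Data + Bss;
  std::ostringstream OS;
  OS << Text << ' ' << Data << ' ' << Bss << ' ' << Total << ' ' << std::hex
     << Total;
  return OS.str();
}

// Usage: the rows before and after the change to ignore-sections.s.
void berkeleyRowExample() {
  assert(berkeleyRow(4, 4, 4) == "4 4 4 12 c");
  assert(berkeleyRow(4 + 48, 4, 4) == "52 4 4 60 3c");
}
```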

llvm/trunk/tools/llvm-size/llvm-size.cpp

[... 451 lines hidden ...] else if (OutputFormat == sysv) {
     // displays the cumulative size for each section type.
     uint64_t total_text = 0;
     uint64_t total_data = 0;
     uint64_t total_bss = 0;

     // Make one pass over the section table to calculate sizes.
     for (const SectionRef &Section : Obj->sections()) {
       uint64_t size = Section.getSize();
-      bool isText = Section.isText();
-      bool isData = Section.isData();
+      bool isText = Section.isBerkeleyText();
+      bool isData = Section.isBerkeleyData();
       bool isBSS = Section.isBSS();
       if (isText)
         total_text += size;
       else if (isData)
         total_data += size;
       else if (isBSS)
         total_bss += size;
     }
[... 425 lines hidden ...]