This is an archive of the discontinued LLVM Phabricator instance.

Differential D56123

[llvm-objdump] - Print symbol addressed when dumping disassembly output (-d)
ClosedPublic

Authored by grimar on Dec 28 2018, 2:22 AM.

Details

Reviewers

echristo
dblaikie
davide
jhenderson

Commits

rG3ba0f3c0fb44: [llvm-objdump] - Print symbol addressed when dumping disassembly output (-d)
rL350726: [llvm-objdump] - Print symbol addressed when dumping disassembly output (-d)

Summary

When GNU objdump dumps the input with -d it prints the symbol addresses,
for example:

0000000000000031 <foo>:
  31:	00 00                	add    %al,(%rax)
	...

0000000000000035 <bar>:
	...

When llvm-objdump dumps the same object, it does not:

foo:
		...
      39:	00 00 	addb	%al, (%rax)

bar:
		...
      39:	00 00 	addb	%al, (%rax)
      3b:	00 00 	addb	%al, (%rax)

The reason to do the same is the following.
I am working on D56083, which implements -z/--disassemble-zeroes.
Normally, disassembly output skips blocks of zeroes. Currently, GNU objdump skips them
by default, but llvm-objdump does not. The sample above shows the resulting issue:
if we omit the bytes at the beginning of the section (see bar above), then the first
address (0x0000000000000035) is not printed. That is inconvenient and makes the output
less useful, because we no longer see the start address of the symbol.

So I suggest following the GNU objdump behavior and also printing the address, unless the
-no-leading-addr flag is set.
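
As a rough illustration of the proposed header format, here is a minimal standalone sketch. It assumes the address is printed as 16 zero-padded lowercase hex digits followed by a space, as in the diff below. The helper name formatSymbolHeader and its parameters are hypothetical and are not part of llvm-objdump.

```cpp
#include <cinttypes>
#include <cstdint>
#include <cstdio>
#include <string>

// Hypothetical helper (not an llvm-objdump function): builds the line that
// introduces a symbol in the disassembly output.
std::string formatSymbolHeader(std::uint64_t Address, const std::string &Name,
                               bool NoLeadingAddr) {
  std::string Header;
  if (!NoLeadingAddr) {
    // 16 zero-padded hex digits plus a trailing space, e.g. "0000000000000035 ".
    char Buf[32];
    std::snprintf(Buf, sizeof(Buf), "%016" PRIx64 " ", Address);
    Header += Buf;
  }
  Header += Name;
  Header += ":\n";
  return Header;
}
```

With these assumptions, formatSymbolHeader(0x35, "bar", false) yields "0000000000000035 bar:" followed by a newline, while passing true for NoLeadingAddr yields the old "bar:" form.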

Diff Detail

Repository: rL LLVM

Event Timeline

grimar created this revision. Dec 28 2018, 2:22 AM

grimar mentioned this in D56083: [llvm-objdump] - Implement -z/--disassemble-zeroes. Dec 28 2018, 4:13 AM

grimar added a child revision: D56083: [llvm-objdump] - Implement -z/--disassemble-zeroes.

Sure.

-eric

This revision is now accepted and ready to land. Dec 30 2018, 10:58 PM

LGTM too. Not that I'm saying you should change it, as I don't know how complex it would be, but perhaps an alternative to consider might have been to disassemble the first zeroes after a symbol, even without -z. Something like this:

foo:
10: 00 00
...
bar:
40: 00 00
...

I'm not sure how desirable that is versus the alternative though (really, I'm not sure I see much benefit to the behaviour of not disassembling zeroes, but that's just me).

In reply to @jhenderson (D56123#1343558):

I am definitely not against further tweaking of the -z/no -z behavior, but I think printing the symbol address (this patch) can and should be an independent thing.
Skipping or not skipping zeroes is heuristic logic that might change, so I believe it is better to reduce, eliminate, or at least not introduce any dependencies or assumptions here.
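
To make the independence argument concrete, here is a minimal sketch of one possible zero-skipping rule. It assumes the simplest heuristic, skipping every leading zero byte of a range. This is not the logic that llvm-objdump or GNU objdump actually use, and the helper name skipLeadingZeroes is hypothetical.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: returns the index of the first non-zero byte in
// [Start, End), or End if every byte in the range is zero.
std::size_t skipLeadingZeroes(const std::vector<std::uint8_t> &Bytes,
                              std::size_t Start, std::size_t End) {
  std::size_t Index = Start;
  while (Index < End && Bytes[Index] == 0)
    ++Index;
  return Index;
}
```

Whatever rule is chosen, the first offset printed for a symbol can be later than the symbol's start, so printing the start address in the symbol header keeps that information visible regardless of how the heuristic evolves.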

Closed by commit rL350726: [llvm-objdump] - Print symbol addressed when dumping disassembly output (-d) (authored by grimar). Jan 9 2019, 6:47 AM

This revision was automatically updated to reflect the committed changes.

grimar mentioned this in rL350728: [LLD][ELF] - Fix BB after r350726. Jan 9 2019, 7:12 AM

grimar mentioned this in rLLD350728: [LLD][ELF] - Fix BB after r350726.

Revision Contents

Path                                                          Size
llvm/trunk/test/tools/llvm-objdump/X86/print-symbol-addr.s    29 lines
llvm/trunk/tools/llvm-objdump/llvm-objdump.cpp                3 lines

Diff 180836

llvm/trunk/test/tools/llvm-objdump/X86/print-symbol-addr.s

// RUN: llvm-mc %s -filetype=obj -triple=x86_64-pc-linux -o %t.o

// Check we print the address of `foo` and `bar`.
// RUN: llvm-objdump -d %t.o | FileCheck %s
// CHECK: Disassembly of section .text:
// CHECK-NEXT: 0000000000000000 foo:
// CHECK-NEXT: 0: {{.*}} nop
// CHECK-NEXT: 1: {{.*}} nop
// CHECK: 0000000000000002 bar:
// CHECK-NEXT: 2: {{.*}} nop

// Check we do not print the addresses with -no-leading-addr.
// RUN: llvm-objdump -d -no-leading-addr %t.o | FileCheck %s --check-prefix=NOADDR
// NOADDR: Disassembly of section .text:
// NOADDR-NEXT: {{^}}foo:
// NOADDR-NEXT: {{.*}} nop
// NOADDR-NEXT: {{.*}} nop
// NOADDR: {{^}}bar:
// NOADDR-NEXT: {{.*}} nop

.text
.globl foo
.type foo, @function
foo:
 nop
 nop

bar:
 nop

llvm/trunk/tools/llvm-objdump/llvm-objdump.cpp

   for (unsigned si = 0, se = Symbols.size(); si != se; ++si) {
   ...
           const auto Limit = End - (std::min)(EndAlign, End - Start);
           while (End > Limit &&
                  *reinterpret_cast<const support::ulittle32_t *>(&Bytes[End - 4]) == 0)
             End -= 4;
         }
       }

       outs() << '\n';
+      if (!NoLeadingAddr)
+        outs() << format("%016" PRIx64 " ", SectionAddr + Start);
+
       StringRef SymbolName = std::get<1>(Symbols[si]);
       if (Demangle)
         outs() << demangle(SymbolName) << ":\n";
       else
         outs() << SymbolName << ":\n";

       // Don't print raw contents of a virtual section. A virtual section
       // doesn't have any contents in the file.
   ...