This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/CodeGen/AsmPrinter/
-
CodeGen/
-
AsmPrinter/
-
DwarfUnit.cpp
-
test/DebugInfo/PowerPC/
-
DebugInfo/
-
PowerPC/
-
export-symbol.ll

Differential D100440

[Debug-Info] DW_AT_export_symbols shouldn't be generated before version-5 of DWARF.
AbandonedPublic

Authored by Esme on Apr 13 2021, 8:25 PM.

Download Raw Diff

Details

Reviewers

shchenz
aprantl
dblaikie
jsji

Group Reviewers

Restricted Project

Summary

DW_AT_export_symbols is an attribute introduced in DWARF-5, which should not be generated under previous DWARF versions. And this unexpected behavior caused the DBX to fail.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

Esme created this revision.Apr 13 2021, 8:25 PM

Herald added a subscriber: hiraditya. · View Herald TranscriptApr 13 2021, 8:25 PM

Esme requested review of this revision.Apr 13 2021, 8:25 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 13 2021, 8:25 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

shchenz added a project: debug-info.Apr 13 2021, 8:42 PM

Harbormaster completed remote builds in B98604: Diff 337322.Apr 13 2021, 11:28 PM

@aprantl @probinson @jhenderson - any of you folks want this in older versions? I think this is for C++ inline namespaces - so if you used a modern C++ standard library (that uses inline namespaces like "std::__1::basic_string", etc) users would have to use these extra qualifiers when naming types, functions, etc. Not unworkable, but a bit inconvenient.

Can we guard the fix only for DBX? So that other consumers will not be impacted?

Addressed Zheng's comments.

Herald added subscribers: ormris, nemanjai. · View Herald TranscriptApr 16 2021, 6:18 AM

Harbormaster completed remote builds in B99154: Diff 338082.Apr 16 2021, 6:19 AM

In D100440#2688023, @dblaikie wrote:

@aprantl @probinson @jhenderson - any of you folks want this in older versions? I think this is for C++ inline namespaces - so if you used a modern C++ standard library (that uses inline namespaces like "std::__1::basic_string", etc) users would have to use these extra qualifiers when naming types, functions, etc. Not unworkable, but a bit inconvenient.

I'm not the right person to ask, but I wouldn't expect a DWARF v4 consumer to be able to recognise a DWARF v5 attribute in general, and therefore we shouldn't be using emitting it in older cases. @probinson or maybe @jmorse will probably have a better insight here.

In D100440#2688023, @dblaikie wrote:

@aprantl @probinson @jhenderson - any of you folks want this in older versions? I think this is for C++ inline namespaces - so if you used a modern C++ standard library (that uses inline namespaces like "std::__1::basic_string", etc) users would have to use these extra qualifiers when naming types, functions, etc. Not unworkable, but a bit inconvenient.

That's right. LLDB is using this attribute even when parsing DWARF v4 to model inline namespaces (such as libc++'s __1 namespace). If we don't have the attribute then we end up with a bunch of __1 in the type names we show to the user (+ some other minor changes in behaviour). It shouldn't break any LLDB scripts though as the type names used in scripts always specify inline namespaces.

If we decide to remove this for all consumers we can just special-case the namespace parsing in LLDB to infer that std::__1 is an inline namespace. It's the only namespace that is important for users (and the test suite).

In D100440#2698501, @jhenderson wrote:

In D100440#2688023, @dblaikie wrote:

@aprantl @probinson @jhenderson - any of you folks want this in older versions? I think this is for C++ inline namespaces - so if you used a modern C++ standard library (that uses inline namespaces like "std::__1::basic_string", etc) users would have to use these extra qualifiers when naming types, functions, etc. Not unworkable, but a bit inconvenient.

I'm not the right person to ask, but I wouldn't expect a DWARF v4 consumer to be able to recognise a DWARF v5 attribute in general, and therefore we shouldn't be using emitting it in older cases. @probinson or maybe @jmorse will probably have a better insight here.

Generally we figure emitting new tags/attributes is OK, because consumers are meant to ignore them if they don't recognize them - and some consumers can recognize them as an extension. (such as lldb)

This conversation's gotten fragmented over a bunch of reviews, unfortunately - but it sounds like the direction it's headed is that maybe -gstrict-dwarf will be wired up to do all this pedantry/not emit things from future standards, and DBX tuning will opt in to -gstrict-dwarf mode.

In D100440#2698675, @teemperor wrote:

In D100440#2688023, @dblaikie wrote:

@aprantl @probinson @jhenderson - any of you folks want this in older versions? I think this is for C++ inline namespaces - so if you used a modern C++ standard library (that uses inline namespaces like "std::__1::basic_string", etc) users would have to use these extra qualifiers when naming types, functions, etc. Not unworkable, but a bit inconvenient.

That's right. LLDB is using this attribute even when parsing DWARF v4 to model inline namespaces (such as libc++'s __1 namespace). If we don't have the attribute then we end up with a bunch of __1 in the type names we show to the user (+ some other minor changes in behaviour). It shouldn't break any LLDB scripts though as the type names used in scripts always specify inline namespaces.

If we decide to remove this for all consumers we can just special-case the namespace parsing in LLDB to infer that std::__1 is an inline namespace. It's the only namespace that is important for users (and the test suite).

Thanks for the context. (from other threads, sounds like we might be moving more towards DBX folks wiring up -gstrict-dwarf to implement all this stuff and enabling -gstrict-dwarf when targeting DBX by default - leaving all these future-spec-feature-in-prior-spec-mode on by default for everyone else)

In D100440#2698987, @dblaikie wrote:

In D100440#2698501, @jhenderson wrote:

In D100440#2688023, @dblaikie wrote:

@aprantl @probinson @jhenderson - any of you folks want this in older versions? I think this is for C++ inline namespaces - so if you used a modern C++ standard library (that uses inline namespaces like "std::__1::basic_string", etc) users would have to use these extra qualifiers when naming types, functions, etc. Not unworkable, but a bit inconvenient.

I'm not the right person to ask, but I wouldn't expect a DWARF v4 consumer to be able to recognise a DWARF v5 attribute in general, and therefore we shouldn't be using emitting it in older cases. @probinson or maybe @jmorse will probably have a better insight here.

Generally we figure emitting new tags/attributes is OK, because consumers are meant to ignore them if they don't recognize them - and some consumers can recognize them as an extension. (such as lldb)

This conversation's gotten fragmented over a bunch of reviews, unfortunately - but it sounds like the direction it's headed is that maybe -gstrict-dwarf will be wired up to do all this pedantry/not emit things from future standards, and DBX tuning will opt in to -gstrict-dwarf mode.

Thanks for the explanation @dblaikie Yes, we just add -gstrict-dwarf support in D100809 and D100826. This option is turned off by default and only enabled for DBX.

I think we also need to change this patch accordingly. @Esme

use strict dwarf flag instead of DBX debugger

Harbormaster completed remote builds in B99674: Diff 338802.Apr 20 2021, 3:53 AM

Esme added a parent revision: D100826: [Debug-Info][NFC] add -gstrict-dwarf support in backend.Apr 20 2021, 3:53 AM

There may be an opportunity to do this more robustly, instead of scattering the checks all over the place. Note that the Dwarf.def file already knows the version for each tag and attribute; DwarfDebug already knows the version we're emitting; so if DwarfDebug also had the strict-dwarf flag, it would be easy to add a helper predicate or two that would do the checking for any tag or attribute.

Then the methods that add attributes could generally call the predicate themselves, instead of having higher-level code do it.
For tags you probably do want the higher-level code to call the predicate, but it seems like it would be a lot simpler to do something like if (canEmitTag(dwarf::DW_TAG_rvalue_reference)) than the kind of thing we're seeing in these patches.

Does that seem reasonable?

In D100440#2702203, @probinson wrote:

There may be an opportunity to do this more robustly, instead of scattering the checks all over the place. Note that the Dwarf.def file already knows the version for each tag and attribute; DwarfDebug already knows the version we're emitting; so if DwarfDebug also had the strict-dwarf flag, it would be easy to add a helper predicate or two that would do the checking for any tag or attribute.

Then the methods that add attributes could generally call the predicate themselves, instead of having higher-level code do it.
For tags you probably do want the higher-level code to call the predicate, but it seems like it would be a lot simpler to do something like if (canEmitTag(dwarf::DW_TAG_rvalue_reference)) than the kind of thing we're seeing in these patches.

Does that seem reasonable?

Agreed - brought up something similar here: https://reviews.llvm.org/D100826#2702135

(For the tags, yeah, I'm not sure - wonder if the code for emitting the DIEs could return null/empty DIE/something to indicate that the DIE wasn't created - though that might be hard to retrofit if it means revisiting every place that constructs a DIE to ensure it can handle a "nothing" return)

shchenz mentioned this in D100826: [Debug-Info][NFC] add -gstrict-dwarf support in backend.Apr 21 2021, 8:19 PM

Esme abandoned this revision.Apr 25 2021, 6:41 PM

Revision Contents

Path

Size

llvm/

lib/

CodeGen/

AsmPrinter/

DwarfUnit.cpp

7 lines

test/

DebugInfo/

PowerPC/

export-symbol.ll

34 lines

Diff 338802

llvm/lib/CodeGen/AsmPrinter/DwarfUnit.cpp

Show All 30 Lines
#include "llvm/MC/MCContext.h"		#include "llvm/MC/MCContext.h"
#include "llvm/MC/MCDwarf.h"		#include "llvm/MC/MCDwarf.h"
#include "llvm/MC/MCSection.h"		#include "llvm/MC/MCSection.h"
#include "llvm/MC/MCStreamer.h"		#include "llvm/MC/MCStreamer.h"
#include "llvm/MC/MachineLocation.h"		#include "llvm/MC/MachineLocation.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Target/TargetLoweringObjectFile.h"		#include "llvm/Target/TargetLoweringObjectFile.h"
		#include "llvm/Target/TargetMachine.h"
#include <cassert>		#include <cassert>
#include <cstdint>		#include <cstdint>
#include <string>		#include <string>
#include <utility>		#include <utility>

using namespace llvm;		using namespace llvm;

#define DEBUG_TYPE "dwarfdebug"		#define DEBUG_TYPE "dwarfdebug"
▲ Show 20 Lines • Show All 878 Lines • ▼ Show 20 Lines	for (const auto *Element : Elements) {
constructTypeDIE(VariantPart, Composite);		constructTypeDIE(VariantPart, Composite);
}		}
}		}
}		}

if (CTy->isAppleBlockExtension())		if (CTy->isAppleBlockExtension())
addFlag(Buffer, dwarf::DW_AT_APPLE_block);		addFlag(Buffer, dwarf::DW_AT_APPLE_block);

if (CTy->getExportSymbols())		if (CTy->getExportSymbols() &&
		(!Asm->TM.Options.DebugStrictDwarf \|\| DD->getDwarfVersion() >= 5))
addFlag(Buffer, dwarf::DW_AT_export_symbols);		addFlag(Buffer, dwarf::DW_AT_export_symbols);

// This is outside the DWARF spec, but GDB expects a DW_AT_containing_type		// This is outside the DWARF spec, but GDB expects a DW_AT_containing_type
// inside C++ composite types to point to the base class with the vtable.		// inside C++ composite types to point to the base class with the vtable.
// Rust uses DW_AT_containing_type to link a vtable to the type		// Rust uses DW_AT_containing_type to link a vtable to the type
// for which it was created.		// for which it was created.
if (auto *ContainingType = CTy->getVTableHolder())		if (auto *ContainingType = CTy->getVTableHolder())
addDIEEntry(Buffer, dwarf::DW_AT_containing_type,		addDIEEntry(Buffer, dwarf::DW_AT_containing_type,
▲ Show 20 Lines • Show All 117 Lines • ▼ Show 20 Lines	DIE DwarfUnit::getOrCreateNameSpace(const DINamespace NS) {

StringRef Name = NS->getName();		StringRef Name = NS->getName();
if (!Name.empty())		if (!Name.empty())
addString(NDie, dwarf::DW_AT_name, NS->getName());		addString(NDie, dwarf::DW_AT_name, NS->getName());
else		else
Name = "(anonymous namespace)";		Name = "(anonymous namespace)";
DD->addAccelNamespace(*CUNode, Name, NDie);		DD->addAccelNamespace(*CUNode, Name, NDie);
addGlobalName(Name, NDie, NS->getScope());		addGlobalName(Name, NDie, NS->getScope());
if (NS->getExportSymbols())		if (NS->getExportSymbols() &&
		(!Asm->TM.Options.DebugStrictDwarf \|\| DD->getDwarfVersion() >= 5))
addFlag(NDie, dwarf::DW_AT_export_symbols);		addFlag(NDie, dwarf::DW_AT_export_symbols);
return &NDie;		return &NDie;
}		}

DIE DwarfUnit::getOrCreateModule(const DIModule M) {		DIE DwarfUnit::getOrCreateModule(const DIModule M) {
// Construct the context before querying for the existence of the DIE in case		// Construct the context before querying for the existence of the DIE in case
// such construction creates the DIE.		// such construction creates the DIE.
DIE *ContextDIE = getOrCreateContextDIE(M->getScope());		DIE *ContextDIE = getOrCreateContextDIE(M->getScope());
▲ Show 20 Lines • Show All 736 Lines • Show Last 20 Lines

llvm/test/DebugInfo/PowerPC/export-symbol.ll

				; RUN: %llc_dwarf -O0 -filetype=obj -mtriple=powerpc64le-unknown-linux-gnu < %s \| \
				; RUN: llvm-dwarfdump -debug-info - \| FileCheck %s
				; RUN: %llc_dwarf -O0 -filetype=obj -mtriple=powerpc64le-unknown-linux-gnu \
				; RUN: -strict-dwarf=true < %s \| llvm-dwarfdump -debug-info - \| \
				; RUN: FileCheck %s --check-prefix=STRICT

				; CHECK: DW_AT_export_symbols
				; STRICT-NOT: DW_AT_export_symbols

				%struct.A = type { %struct.anon }
				%struct.anon = type { i32 }

				@a = global %struct.A zeroinitializer, align 4, !dbg !0

				!llvm.module.flags = !{!14, !15}
				!llvm.dbg.cu = !{!2}
				!llvm.ident = !{!16}

				!0 = !DIGlobalVariableExpression(var: !1, expr: !DIExpression())
				!1 = distinct !DIGlobalVariable(name: "a", scope: !2, file: !3, line: 5, type: !6, isLocal: false, isDefinition: true)
				!2 = distinct !DICompileUnit(language: DW_LANG_C_plus_plus, file: !3, producer: "clang version 10.0.0", isOptimized: false, runtimeVersion: 0, emissionKind: FullDebug, enums: !4, globals: !5, nameTableKind: GNU)
				!3 = !DIFile(filename: "simple_anon_class.cpp", directory: "/dir")
				!4 = !{}
				!5 = !{!0}
				!6 = distinct !DICompositeType(tag: DW_TAG_structure_type, name: "A", file: !3, line: 1, size: 32, flags: DIFlagTypePassByValue, elements: !7, identifier: "_ZTS1A")
				!7 = !{!8}
				!8 = !DIDerivedType(tag: DW_TAG_member, scope: !6, file: !3, line: 2, baseType: !9, size: 32)
				!9 = distinct !DICompositeType(tag: DW_TAG_structure_type, scope: !6, file: !3, line: 2, size: 32, flags: DIFlagExportSymbols \| DIFlagTypePassByValue, elements: !10, identifier: "_ZTSN1AUt_E")
				!10 = !{!11}
				!11 = !DIDerivedType(tag: DW_TAG_member, name: "y", scope: !9, file: !3, line: 3, baseType: !12, size: 32)
				!12 = !DIBasicType(name: "int", size: 32, encoding: DW_ATE_signed)
				!14 = !{i32 2, !"Dwarf Version", i32 4}
				!15 = !{i32 2, !"Debug Info Version", i32 3}
				!16 = !{!"clang version 10.0.0"}