This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/lib/Driver/ToolChains/
-
lib/
-
Driver/
-
ToolChains/
-
Clang.cpp
-
Cuda.cpp

Differential D94123

[NVPTX] Fix debugging information being added to NVPTX target if remarks are enabled
ClosedPublic

Authored by jhuber6 on Jan 5 2021, 2:33 PM.

Download Raw Diff

Details

Reviewers

jdoerfert
tra
jholewinski
serge-sans-paille

Commits

rG1ca5e68aa07e: [NVPTX] Fix debugging information being added to NVPTX target if remarks are…

Summary

Optimized debugging is not supported by ptxas. Debugging information is degraded to line information only if optimizations are enabled, but debugging information would be added back in by the driver if remarks were enabled. This solves https://bugs.llvm.org/show_bug.cgi?id=48153.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jhuber6 created this revision.Jan 5 2021, 2:33 PM

Herald added subscribers: hiraditya, jholewinski. · View Herald TranscriptJan 5 2021, 2:33 PM

jhuber6 requested review of this revision.Jan 5 2021, 2:33 PM

Herald added a subscriber: llvm-commits. · View Herald TranscriptJan 5 2021, 2:33 PM

Can we have a test for this?

@tra @jholewinski I'd be interested to hear what you think about this solution. It should allow us to stop disabling -g in the frontend, thereby providing source information to things like the remarks emitted for GPU code.
@serge-sans-paille Is the a NPM way of doing this?

There's --cuda-noopt-device-debug option specifically to allow compiling GPU code with full debug info. Clang will generate optimized PTX, but ptxas optimizations will be disabled.

Without that flag clang automatically downgrades debug info generation to lineinfo only. I think -fsave-optimization-record should do the same.
Adding a pass to strip debug info may not be the best place to deal with the issue. I think not enabling full debug info would be a better choice.

Harbormaster completed remote builds in B84118: Diff 314723.Jan 5 2021, 3:04 PM

In D94123#2480633, @tra wrote:

There's --cuda-noopt-device-debug option specifically to allow compiling GPU code with full debug info. Clang will generate optimized PTX, but ptxas optimizations will be disabled.

Without that flag clang automatically downgrades debug info generation to lineinfo only. I think -fsave-optimization-record should do the same.
Adding a pass to strip debug info may not be the best place to deal with the issue. I think not enabling full debug info would be a better choice.

Okay, so without that flag Clang will not create debug symbols in the PTX assembly output. And if the user specified --cuda-noopt-device-debug then the Cuda driver will not pass the optimization flags to the ptxas invocation, right? So if that's the case, then the problem with -fsave-optimization-record is that it's not being correctly picked up as generating debug info. So the solution here would be to make sure it treats that flag as debug information. You should be able to see it not working by checking the *.s output when build with -fsave-optimization-record having debug in the target.

In D94123#2480848, @jhuber6 wrote:

Okay, so without that flag Clang will not create debug symbols in the PTX assembly output.

Only if optimizations are enabled. W/o optimization, full debug info will be there.
--cuda-noopt-device-debug re-enables full debug info but tells ptxas to expect it (and that requires disabling ptxas optimizations)
E.g. https://godbolt.org/z/1jPcnd

Changing the solution. The problem seems to be that after adjusting the debug info, the driver would change the debug kind if remarks were enabled. Now it adjusts the debug information after performing that change. This means that some diagnostics won't work with optimizations but it's necessary to compile correctly.

Herald added a project: Restricted Project. · View Herald TranscriptJan 6 2021, 8:05 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

Harbormaster completed remote builds in B84218: Diff 314899.Jan 6 2021, 8:38 AM

OK with me, @tra ?

LGTM.

This revision is now accepted and ready to land.Jan 6 2021, 10:37 AM

Closed by commit rG1ca5e68aa07e: [NVPTX] Fix debugging information being added to NVPTX target if remarks are… (authored by jhuber6). · Explain WhyJan 6 2021, 10:45 AM

This revision was automatically updated to reflect the committed changes.

jhuber6 added a commit: rG1ca5e68aa07e: [NVPTX] Fix debugging information being added to NVPTX target if remarks are….

If you use arc diff, you can obtain Reviewed-by: line from Phabricator. It is more useful than Reviewers: (a list of reviewers do not mean they endorse or accept the patch)

In D94123#2482636, @MaskRay wrote:

If you use arc diff, you can obtain Reviewed-by: line from Phabricator. It is more useful than Reviewers: (a list of reviewers do not mean they endorse or accept the patch)

arc land did work, now it is arc land --onto main, but it does these things for you. I like it.

Revision Contents

Path

Size

clang/

lib/

Driver/

ToolChains/

Clang.cpp

6 lines

Cuda.cpp

2 lines

Diff 314899

clang/lib/Driver/ToolChains/Clang.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,934 Lines • ▼ Show 20 Lines	static void RenderDebugOptions(const ToolChain &TC, const Driver &D,
}		}

// Omit inline line tables if requested.		// Omit inline line tables if requested.
if (Args.hasFlag(options::OPT_gno_inline_line_tables,		if (Args.hasFlag(options::OPT_gno_inline_line_tables,
options::OPT_ginline_line_tables, false)) {		options::OPT_ginline_line_tables, false)) {
CmdArgs.push_back("-gno-inline-line-tables");		CmdArgs.push_back("-gno-inline-line-tables");
}		}

// Adjust the debug info kind for the given toolchain.
TC.adjustDebugInfoKind(DebugInfoKind, Args);

// When emitting remarks, we need at least debug lines in the output.		// When emitting remarks, we need at least debug lines in the output.
if (willEmitRemarks(Args) &&		if (willEmitRemarks(Args) &&
DebugInfoKind <= codegenoptions::DebugDirectivesOnly)		DebugInfoKind <= codegenoptions::DebugDirectivesOnly)
DebugInfoKind = codegenoptions::DebugLineTablesOnly;		DebugInfoKind = codegenoptions::DebugLineTablesOnly;

		// Adjust the debug info kind for the given toolchain.
		TC.adjustDebugInfoKind(DebugInfoKind, Args);

RenderDebugEnablingArgs(Args, CmdArgs, DebugInfoKind, EffectiveDWARFVersion,		RenderDebugEnablingArgs(Args, CmdArgs, DebugInfoKind, EffectiveDWARFVersion,
DebuggerTuning);		DebuggerTuning);

// -fdebug-macro turns on macro debug info generation.		// -fdebug-macro turns on macro debug info generation.
if (Args.hasFlag(options::OPT_fdebug_macro, options::OPT_fno_debug_macro,		if (Args.hasFlag(options::OPT_fdebug_macro, options::OPT_fno_debug_macro,
false))		false))
if (checkDebugInfoOption(Args.getLastArg(options::OPT_fdebug_macro), Args,		if (checkDebugInfoOption(Args.getLastArg(options::OPT_fdebug_macro), Args,
D, TC))		D, TC))
▲ Show 20 Lines • Show All 3,477 Lines • Show Last 20 Lines

clang/lib/Driver/ToolChains/Cuda.cpp

Show First 20 Lines • Show All 378 Lines • ▼ Show 20 Lines	if (const Arg *A = Args.getLastArg(options::OPT_g_Group)) {
if (Opt.matches(options::OPT_gN_Group)) {		if (Opt.matches(options::OPT_gN_Group)) {
if (Opt.matches(options::OPT_g0) \|\| Opt.matches(options::OPT_ggdb0))		if (Opt.matches(options::OPT_g0) \|\| Opt.matches(options::OPT_ggdb0))
return DisableDebugInfo;		return DisableDebugInfo;
if (Opt.matches(options::OPT_gline_directives_only))		if (Opt.matches(options::OPT_gline_directives_only))
return DebugDirectivesOnly;		return DebugDirectivesOnly;
}		}
return IsDebugEnabled ? EmitSameDebugInfoAsHost : DebugDirectivesOnly;		return IsDebugEnabled ? EmitSameDebugInfoAsHost : DebugDirectivesOnly;
}		}
return DisableDebugInfo;		return willEmitRemarks(Args) ? DebugDirectivesOnly : DisableDebugInfo;
}		}

void NVPTX::Assembler::ConstructJob(Compilation &C, const JobAction &JA,		void NVPTX::Assembler::ConstructJob(Compilation &C, const JobAction &JA,
const InputInfo &Output,		const InputInfo &Output,
const InputInfoList &Inputs,		const InputInfoList &Inputs,
const ArgList &Args,		const ArgList &Args,
const char *LinkingOutput) const {		const char *LinkingOutput) const {
const auto &TC =		const auto &TC =
▲ Show 20 Lines • Show All 558 Lines • Show Last 20 Lines