This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/clang/
-
clang/
-
Basic/
-
DiagnosticIDs.h
-
DiagnosticOptions.def
-
Driver/
-
CC1Options.td
-
lib/
-
Basic/
-
DiagnosticIDs.cpp
-
Frontend/
-
CompilerInvocation.cpp
-
TextDiagnosticPrinter.cpp
-
test/Frontend/
-
Frontend/
-
diagnostics-option-diag-ids.cpp

Differential D35175

New option that adds the DiagID enum name and index to Diagnostic output.
AbandonedPublic

Authored by hintonda on Jul 8 2017, 10:24 PM.

Download Raw Diff

Details

Reviewers

srhines
rjmccall

Summary

This option helps locate the origin of a diagnostic message
by outputing the enum name and index associated with a specific
DiagID, allowing users to grep the code for the enum name directly
without having to find it in the td files first.

Additional ideas:

add another option to pass in the index (or enum) to force an assert or backtrace when a specific DiagID is seen.
capture FILE and LINE when a diagnostic is created and output it. This would make it easier to find the specific instance, and verify all instances are actually tested. Currently, it's almost impossible to determine if all instances are actually tested.
keep track of the permutations and make sure each one is tested.

Diff Detail

Build Status

Buildable 8148
Build 8148: arc lint + arc unit

Event Timeline

hintonda created this revision.Jul 8 2017, 10:24 PM

Harbormaster completed remote builds in B8076: Diff 105753.Jul 9 2017, 6:15 AM

Thanks, that's pretty cool!

How bigger did the clang binary get after you've added all these strings?
I feel like this is more of a CC1 option as well.

I have some feedback for your additional ideas:

add another option to pass in the index (or enum) to force an assert or backtrace when a specific DiagID is seen.

That sounds quite useful, but could it be something that's more suited for an external debugging script? I have a personal script that computes the enum value for a particular diagnostic, launches clang in lldb, sets a breakpoint for that particular diagnostic enum and runs clang. I could work on upstreaming it into clang/utils if people are interested.

capture FILE and LINE when a diagnostic is created and output it. This would make it easier to find the specific instance, and verify all instances are actually tested. Currently, it's almost impossible to determine if all instances are actually tested.

I reckon the first part (find the specific instance) could be useful, but I think that if you can force a backtrace on a specific DiagID then it becomes less useful. I disagree with the second part, can't you use our coverage bots and see if the all places where the diagnostic is emitted are covered to see if they are tested? It might be tedious to find these places, but maybe we can add a search for our coverage viewer so you quickly find the lines that have the name of diagnostic?

In D35175#803496, @arphaman wrote:

Thanks, that's pretty cool!

How bigger did the clang binary get after you've added all these strings?

Currently looks like around 200k (4534 @ 33 byte avg length + ptr). If this is too much, we could make it conditional based on NDEBUG or some other macro at compile time.

I feel like this is more of a CC1 option as well.

Sure, I can do that.

I have some feedback for your additional ideas:

add another option to pass in the index (or enum) to force an assert or backtrace when a specific DiagID is seen.

That sounds quite useful, but could it be something that's more suited for an external debugging script? I have a personal script that computes the enum value for a particular diagnostic, launches clang in lldb, sets a breakpoint for that particular diagnostic enum and runs clang. I could work on upstreaming it into clang/utils if people are interested.

This type of behavior (either an assert/bt or coupled with debugger) could be useful as a quick and easy solution. However, capturing __FILE__ and __LINE__ when a diagnostic is reported, would be my preference. However, that change would be very invasive and should probably be handled by a source to source transformation -- I did some of this by hand as a proof of concept, but doing all of clang manually would take quite a while, not to mention various tools that also use diagnostics.

capture FILE and LINE when a diagnostic is created and output it. This would make it easier to find the specific instance, and verify all instances are actually tested. Currently, it's almost impossible to determine if all instances are actually tested.

I reckon the first part (find the specific instance) could be useful, but I think that if you can force a backtrace on a specific DiagID then it becomes less useful. I disagree with the second part, can't you use our coverage bots and see if the all places where the diagnostic is emitted are covered to see if they are tested? It might be tedious to find these places, but maybe we can add a search for our coverage viewer so you quickly find the lines that have the name of diagnostic?

Agreed, but the problem with relying exclusively on coverage is that it can't cover the various permutations, e.g., "%select{A|B|C}0". It's pretty difficult to tell if A, B, and C were actually tested -- or if that makes a difference.

If we included enum name (and permutation) with __FILE__ and __LINE__ in the output, then we could quickly analyze the test output and produce a simple report. I think this would also be helpful in allowing test writers to see exactly which diagnostic report was triggered for each test.

Make option cc1 only. Rename function, and add test.

Currently looks like around 200k (4534 @ 33 byte avg length + ptr). If this is too much, we could make it conditional based on NDEBUG or some other macro at compile time.

I think this is mostly useful during development, so some conditional mechanism would make sense IMO. I think that it makes sense to avoid the growth of our release binaries if such growth can be avoided.

This type of behavior (either an assert/bt or coupled with debugger) could be useful as a quick and easy solution. However, capturing __FILE__ and __LINE__ when a diagnostic is reported, would be my preference. However, that change would be very invasive and should probably be handled by a source to source transformation -- I did some of this by hand as a proof of concept, but doing all of clang manually would take quite a while, not to mention various tools that also use diagnostics.

I can definitely see the usefulness of __FILE__ and __LINE__ markers. However, I think that should be a development only feature as well. I agree about the source to source transformation, a refactoring tool should handle it.

Agreed, but the problem with relying exclusively on coverage is that it can't cover the various permutations, e.g., "%select{A|B|C}0". It's pretty difficult to tell if A, B, and C were actually tested -- or if that makes a difference.

If we included enum name (and permutation) with __FILE__ and __LINE__ in the output, then we could quickly analyze the test output and produce a simple report. I think this would also be helpful in allowing test writers to see exactly which diagnostic report was triggered for each test.

That makes sense, thanks for pointing it out. I agree, It would be useful if we had such "coverage" reports for diagnostics.

Only maintain enum names in debug builds. Current cost of maintaining enum names is 176k.

hintonda added a reviewer: rjmccall.Jul 11 2017, 3:39 PM

This is a cute hack, but... I'm not sure I accept the premise that it's a noteworthy obstacle to Clang development to do two greps instead of one. And I don't think I've ever had to debug a diagnostic where FILE and LINE information would've been helpful.

Also, you could easily write a script that could automatically annotate arbitrary text with this information retroactively by turning the diagnostic text into a bunch of regular expressions, and then you wouldn't even need to re-run the compiler.

It's just an effort to make the code a bit more accessible, especially for new users (or ones not used to running find/grep).

Steve had suggested adding an option that took the entire message and matched it when it was produced. However, that won't work very well since the message isn't actually produced until just before it is printed, which means the assert/backtrace isn't in the correctly location if the it's delayed. That's why I chose to simply print just the enum name and index.

Adding __FILE__/__LINE__ info would help identify the exact location (could be multiple for the same error message), but due to the way the code is structured it isn't really doable.

It might be kinda fun writing the script you suggest -- I'll look into it, but printing the enum in the first place for little or no cost seems a bit more elegant.

To me, features that only serve to help compiler development need to meet a higher bar than this. This just seems really marginal.

Like Alex said, you should be able to pretty easily write a debugger script that breaks when it sees a specific diagnostic, or a diagnostic at a specific line and column. That would be quite welcome in utils/.

Okay, sounds good. Look forward to seeing Alex's script...

My script relies on a hack to map the name of the diagnostic to the enum value. We need to come up with a better way to map the diagnostic name to the enum value. I propose a new utility tool that would take the name of the diagnostic and map it back to the enum value.

Not sure how to do this all in a script, but perhaps we could enhance diagtool to generate this mapping for you. It currently only lists warnings, but I don't think it would be difficult to extend it and generate a mapping you could use in your script.

Right. I was aware of the diagtool before, but didn't really look into what it did. TIL! It would make sense to add this kind of mapping functionality to that tool then.

I'd be happy to do that if it would help. If so, should I do it here create a new diff?

Perhaps we might even make sense add the ability to pass in a message and find the matching name/index.

I was impatient, so I already started on a patch for diagtool. I'll post it soon.

Great, thanks...

hintonda mentioned this in D36083: [utils] Add a script that runs clang in LLDB and stops it when a specified diagnostic is emitted.Aug 1 2017, 9:55 AM

Revision Contents

Path

Size

include/

clang/

Basic/

DiagnosticIDs.h

3 lines

DiagnosticOptions.def

1 line

Driver/

CC1Options.td

2 lines

lib/

Basic/

DiagnosticIDs.cpp

24 lines

Frontend/

CompilerInvocation.cpp

2 lines

TextDiagnosticPrinter.cpp

7 lines

test/

Frontend/

diagnostics-option-diag-ids.cpp

15 lines

Diff 106094

include/clang/Basic/DiagnosticIDs.h

Show First 20 Lines • Show All 217 Lines • ▼ Show 20 Lines	public:
static unsigned getCategoryNumberForDiag(unsigned DiagID);		static unsigned getCategoryNumberForDiag(unsigned DiagID);

/// \brief Return the number of diagnostic categories.		/// \brief Return the number of diagnostic categories.
static unsigned getNumberOfCategories();		static unsigned getNumberOfCategories();

/// \brief Given a category ID, return the name of the category.		/// \brief Given a category ID, return the name of the category.
static StringRef getCategoryNameFromID(unsigned CategoryID);		static StringRef getCategoryNameFromID(unsigned CategoryID);

		/// \brief Given a DiagID, return the name.
		static StringRef getNameFromID(unsigned DiagID);

/// \brief Return true if a given diagnostic falls into an ARC diagnostic		/// \brief Return true if a given diagnostic falls into an ARC diagnostic
/// category.		/// category.
static bool isARCDiagnostic(unsigned DiagID);		static bool isARCDiagnostic(unsigned DiagID);

/// \brief Enumeration describing how the emission of a diagnostic should		/// \brief Enumeration describing how the emission of a diagnostic should
/// be treated when it occurs during C++ template argument deduction.		/// be treated when it occurs during C++ template argument deduction.
enum SFINAEResponse {		enum SFINAEResponse {
/// \brief The diagnostic should not be reported, but it should cause		/// \brief The diagnostic should not be reported, but it should cause
▲ Show 20 Lines • Show All 84 Lines • Show Last 20 Lines

include/clang/Basic/DiagnosticOptions.def

	Show First 20 Lines • Show All 55 Lines • ▼ Show 20 Lines
	DIAGOPT(ShowSourceRanges, 1, 0) /// Show source ranges in numeric form.			DIAGOPT(ShowSourceRanges, 1, 0) /// Show source ranges in numeric form.
	DIAGOPT(ShowParseableFixits, 1, 0) /// Show machine parseable fix-its.			DIAGOPT(ShowParseableFixits, 1, 0) /// Show machine parseable fix-its.
	DIAGOPT(ShowPresumedLoc, 1, 0) /// Show presumed location for diagnostics.			DIAGOPT(ShowPresumedLoc, 1, 0) /// Show presumed location for diagnostics.
	DIAGOPT(ShowOptionNames, 1, 0) /// Show the option name for mappable			DIAGOPT(ShowOptionNames, 1, 0) /// Show the option name for mappable
	/// diagnostics.			/// diagnostics.
	DIAGOPT(ShowNoteIncludeStack, 1, 0) /// Show include stacks for notes.			DIAGOPT(ShowNoteIncludeStack, 1, 0) /// Show include stacks for notes.
	VALUE_DIAGOPT(ShowCategories, 2, 0) /// Show categories: 0 -> none, 1 -> Number,			VALUE_DIAGOPT(ShowCategories, 2, 0) /// Show categories: 0 -> none, 1 -> Number,
	/// 2 -> Full Name.			/// 2 -> Full Name.
				DIAGOPT(ShowDiagIDs, 1, 0) /// Show Diagnostic ID

	ENUM_DIAGOPT(Format, TextDiagnosticFormat, 2, Clang) /// Format for diagnostics:			ENUM_DIAGOPT(Format, TextDiagnosticFormat, 2, Clang) /// Format for diagnostics:

	DIAGOPT(ShowColors, 1, 0) /// Show diagnostics with ANSI color sequences.			DIAGOPT(ShowColors, 1, 0) /// Show diagnostics with ANSI color sequences.
	ENUM_DIAGOPT(ShowOverloads, OverloadsShown, 1,			ENUM_DIAGOPT(ShowOverloads, OverloadsShown, 1,
	Ovl_All) /// Overload candidates to show.			Ovl_All) /// Overload candidates to show.
	DIAGOPT(VerifyDiagnostics, 1, 0) /// Check that diagnostics match the expected			DIAGOPT(VerifyDiagnostics, 1, 0) /// Check that diagnostics match the expected
	/// diagnostics, indicated by markers in the			/// diagnostics, indicated by markers in the
	Show All 31 Lines

include/clang/Driver/CC1Options.td

	Show First 20 Lines • Show All 374 Lines • ▼ Show 20 Lines
	def verify : Flag<["-"], "verify">,			def verify : Flag<["-"], "verify">,
	HelpText<"Verify diagnostic output using comment directives">;			HelpText<"Verify diagnostic output using comment directives">;
	def verify_ignore_unexpected : Flag<["-"], "verify-ignore-unexpected">,			def verify_ignore_unexpected : Flag<["-"], "verify-ignore-unexpected">,
	HelpText<"Ignore unexpected diagnostic messages">;			HelpText<"Ignore unexpected diagnostic messages">;
	def verify_ignore_unexpected_EQ : CommaJoined<["-"], "verify-ignore-unexpected=">,			def verify_ignore_unexpected_EQ : CommaJoined<["-"], "verify-ignore-unexpected=">,
	HelpText<"Ignore unexpected diagnostic messages">;			HelpText<"Ignore unexpected diagnostic messages">;
	def Wno_rewrite_macros : Flag<["-"], "Wno-rewrite-macros">,			def Wno_rewrite_macros : Flag<["-"], "Wno-rewrite-macros">,
	HelpText<"Silence ObjC rewriting warnings">;			HelpText<"Silence ObjC rewriting warnings">;
				def fdiagnostics_show_diag_ids : Flag<["-"], "fdiagnostics-show-diag-ids">,
				HelpText<"Show Diagnostic enum name and index">;

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// Frontend Options			// Frontend Options
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	// This isn't normally used, it is just here so we can parse a			// This isn't normally used, it is just here so we can parse a
	// CompilerInvocation out of a driver-derived argument vector.			// CompilerInvocation out of a driver-derived argument vector.
	def cc1 : Flag<["-"], "cc1">;			def cc1 : Flag<["-"], "cc1">;
	▲ Show 20 Lines • Show All 409 Lines • Show Last 20 Lines

lib/Basic/DiagnosticIDs.cpp

	Show First 20 Lines • Show All 222 Lines • ▼ Show 20 Lines
	/// category, an empty string if CategoryID is zero, or null if CategoryID is			/// category, an empty string if CategoryID is zero, or null if CategoryID is
	/// invalid.			/// invalid.
	StringRef DiagnosticIDs::getCategoryNameFromID(unsigned CategoryID) {			StringRef DiagnosticIDs::getCategoryNameFromID(unsigned CategoryID) {
	if (CategoryID >= getNumberOfCategories())			if (CategoryID >= getNumberOfCategories())
	return StringRef();			return StringRef();
	return CategoryNameTable[CategoryID].getName();			return CategoryNameTable[CategoryID].getName();
	}			}

				StringRef DiagnosticIDs::getNameFromID(unsigned DiagID) {
				#ifndef NDEBUG
				static const char *StaticDiagName[] = {
				#define DIAG(ENUM, CLASS, DEFAULT_SEVERITY, DESC, GROUP, SFINAE, NOWERROR, \
				SHOWINSYSHEADER, CATEGORY) \
				#ENUM,
				#include "clang/Basic/DiagnosticCommonKinds.inc"
				#include "clang/Basic/DiagnosticDriverKinds.inc"
				#include "clang/Basic/DiagnosticFrontendKinds.inc"
				#include "clang/Basic/DiagnosticSerializationKinds.inc"
				#include "clang/Basic/DiagnosticLexKinds.inc"
				#include "clang/Basic/DiagnosticParseKinds.inc"
				#include "clang/Basic/DiagnosticASTKinds.inc"
				#include "clang/Basic/DiagnosticCommentKinds.inc"
				#include "clang/Basic/DiagnosticSemaKinds.inc"
				#include "clang/Basic/DiagnosticAnalysisKinds.inc"
				#undef DIAG
				};

				if (const StaticDiagInfoRec *Info = GetDiagInfo(DiagID))
				return StringRef(StaticDiagName[Info - StaticDiagInfo]);
				#endif

				return StringRef();
				}

	DiagnosticIDs::SFINAEResponse			DiagnosticIDs::SFINAEResponse
	DiagnosticIDs::getDiagnosticSFINAEResponse(unsigned DiagID) {			DiagnosticIDs::getDiagnosticSFINAEResponse(unsigned DiagID) {
	if (const StaticDiagInfoRec *Info = GetDiagInfo(DiagID))			if (const StaticDiagInfoRec *Info = GetDiagInfo(DiagID))
	return static_cast<DiagnosticIDs::SFINAEResponse>(Info->SFINAE);			return static_cast<DiagnosticIDs::SFINAEResponse>(Info->SFINAE);
	return SFINAE_Report;			return SFINAE_Report;
	}			}

	▲ Show 20 Lines • Show All 477 Lines • Show Last 20 Lines

lib/Frontend/CompilerInvocation.cpp

Show First 20 Lines • Show All 1,073 Lines • ▼ Show 20 Lines	bool clang::ParseDiagnosticArgs(DiagnosticOptions &Opts, ArgList &Args,
else {		else {
Success = false;		Success = false;
if (Diags)		if (Diags)
Diags->Report(diag::err_drv_invalid_value)		Diags->Report(diag::err_drv_invalid_value)
<< Args.getLastArg(OPT_fdiagnostics_show_category)->getAsString(Args)		<< Args.getLastArg(OPT_fdiagnostics_show_category)->getAsString(Args)
<< ShowCategory;		<< ShowCategory;
}		}

		Opts.ShowDiagIDs = Args.hasArg(OPT_fdiagnostics_show_diag_ids);

StringRef Format =		StringRef Format =
Args.getLastArgValue(OPT_fdiagnostics_format, "clang");		Args.getLastArgValue(OPT_fdiagnostics_format, "clang");
if (Format == "clang")		if (Format == "clang")
Opts.setFormat(DiagnosticOptions::Clang);		Opts.setFormat(DiagnosticOptions::Clang);
else if (Format == "msvc")		else if (Format == "msvc")
Opts.setFormat(DiagnosticOptions::MSVC);		Opts.setFormat(DiagnosticOptions::MSVC);
else if (Format == "msvc-fallback") {		else if (Format == "msvc-fallback") {
Opts.setFormat(DiagnosticOptions::MSVC);		Opts.setFormat(DiagnosticOptions::MSVC);
▲ Show 20 Lines • Show All 1,760 Lines • Show Last 20 Lines

lib/Frontend/TextDiagnosticPrinter.cpp

Show All 13 Lines
#include "clang/Frontend/TextDiagnosticPrinter.h"		#include "clang/Frontend/TextDiagnosticPrinter.h"
#include "clang/Basic/DiagnosticOptions.h"		#include "clang/Basic/DiagnosticOptions.h"
#include "clang/Basic/SourceManager.h"		#include "clang/Basic/SourceManager.h"
#include "clang/Frontend/TextDiagnostic.h"		#include "clang/Frontend/TextDiagnostic.h"
#include "clang/Lex/Lexer.h"		#include "clang/Lex/Lexer.h"
#include "llvm/ADT/SmallString.h"		#include "llvm/ADT/SmallString.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
		#include "llvm/Support/Signals.h"
#include <algorithm>		#include <algorithm>
using namespace clang;		using namespace clang;

TextDiagnosticPrinter::TextDiagnosticPrinter(raw_ostream &os,		TextDiagnosticPrinter::TextDiagnosticPrinter(raw_ostream &os,
DiagnosticOptions *diags,		DiagnosticOptions *diags,
bool _OwnsOutputStream)		bool _OwnsOutputStream)
: OS(os), DiagOpts(diags),		: OS(os), DiagOpts(diags),
OwnsOutputStream(_OwnsOutputStream) {		OwnsOutputStream(_OwnsOutputStream) {
▲ Show 20 Lines • Show All 68 Lines • ▼ Show 20 Lines	if (DiagCategory) {
if (DiagOpts.ShowCategories == 1)		if (DiagOpts.ShowCategories == 1)
OS << DiagCategory;		OS << DiagCategory;
else {		else {
assert(DiagOpts.ShowCategories == 2 && "Invalid ShowCategories value");		assert(DiagOpts.ShowCategories == 2 && "Invalid ShowCategories value");
OS << DiagnosticIDs::getCategoryNameFromID(DiagCategory);		OS << DiagnosticIDs::getCategoryNameFromID(DiagCategory);
}		}
}		}
}		}
		// If the user wants to see diagnostic ids, include it too.
		if (DiagOpts.ShowDiagIDs) {
		OS << (Started ? "," : " [");
		Started = true;
		OS << DiagnosticIDs::getNameFromID(Info.getID()) << ":" << Info.getID();
		}
if (Started)		if (Started)
OS << ']';		OS << ']';
}		}

void TextDiagnosticPrinter::HandleDiagnostic(DiagnosticsEngine::Level Level,		void TextDiagnosticPrinter::HandleDiagnostic(DiagnosticsEngine::Level Level,
const Diagnostic &Info) {		const Diagnostic &Info) {
// Default implementation (Warnings/errors count).		// Default implementation (Warnings/errors count).
DiagnosticConsumer::HandleDiagnostic(Level, Info);		DiagnosticConsumer::HandleDiagnostic(Level, Info);
▲ Show 20 Lines • Show All 45 Lines • Show Last 20 Lines

test/Frontend/diagnostics-option-diag-ids.cpp

This file was added.

				// RUN: not %clang_cc1 -fdiagnostics-show-option -fdiagnostics-show-diag-ids %s 2> %t
				// RUN: FileCheck < %t %s

				class xx {
				xx(){}
				};

				void foo(sss) {
				// CHECK: [[@LINE-1]]:10: error: unknown type name 'sss' [err_unknown_typename:{{[0-9]+}}]
				int;
				// CHECK: [[@LINE-1]]:3: warning: declaration does not declare anything [-Wmissing-declarations,ext_no_declarators:{{[0-9]+}}]
				xx x;
				// CHECK: [[@LINE-1]]:6: error: calling a private constructor of class 'xx' [err_access_ctor:{{[0-9]+}}]
				// CHECK: [[@LINE-9]]:3: note: implicitly declared private here [note_access_natural:{{[0-9]+}}]
				}