This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/clang/Basic/
-
clang/
-
Basic/
1
Diagnostic.h
-
lib/
-
Lex/
-
PPDirectives.cpp
-
Sema/
-
SemaTemplateInstantiate.cpp
-
unittests/Frontend/
-
Frontend/
1
PCHPreambleTest.cpp

Differential D50462

Try building complete AST after a fatal error was emitted if further diagnostics are expected
Needs ReviewPublic

Authored by Dmitry.Kozhevnikov on Aug 8 2018, 10:05 AM.

Download Raw Diff

Details

Reviewers

ilya-biryukov
sammccall
milianw
vsapsai

Summary

There is some code here and there that assume that no sane output is required if a fatal error has occurred. When using clangd/libclang, it's no longer true: diagnostics might still be requested, and AST might still be required for other IDE/tooling features, so it has to be as complete as possible.

Here I try to separate the following use cases:

Some clients check hasFatalErrorOccurred() because they are known to work unstable in presence of compile errors and want to mitigate it - they'll work as before
However, we don't want to take shortcuts in PP and Sema and still want to process include directives and instantiate templates

Note: I've found out that initially the flag in DiagnosticsEngine (which is now called SuppressAfterFatalError) had different meaning and was just demoting all fatal errors to non-fatal (https://reviews.llvm.org/rL262318). This would also fix this issue, however, it was partly reverted in https://reviews.llvm.org/rL301992 to the current state. Maybe we should go with the old approach instead (I assume the issue was that this flag was not serialized/restored, but probably should?)

Diff Detail

Repository: rC Clang

Event Timeline

Dmitry.Kozhevnikov created this revision.Aug 8 2018, 10:05 AM

Herald added subscribers: cfe-commits, ioeric. · View Herald TranscriptAug 8 2018, 10:05 AM

This seems sensible to me.

include/clang/Basic/Diagnostic.h
758	Looks like this line wasn't formatted with clang-format.

On a second look I think that there is value separating the concepts of 'producing diagnostics' after hitting the fatal error (which is SuppressAfterFatalError currently does), and trying to build a more complete AST.
For example,

An editor client might not want to show the diagnostics after the fatal error has been reached to match the diagnostics observed during build, but it would benefit from a more complete AST for other editing features.
Index-while-building: the compiler shouldn't show the diagnostics after a fatal error has been reached, but the indexer would be able to produce better indexing data from a more complete AST.

Addressed the review comment about separating "suppressing diagnostics" from "providing more complete AST"

In D50462#1193180, @arphaman wrote:

On a second look I think that there is value separating the concepts of 'producing diagnostics' after hitting the fatal error (which is SuppressAfterFatalError currently does), and trying to build a more complete AST.

Sorry for the delay.

I've introduced an accordingly-named method. I haven't created another flag yet, though, because it feels a bit weird currently since there are no clients that require it, added a todo note instead. What do you think?

unittests/Frontend/PCHPreambleTest.cpp
220	The condition in `SemaTemplateInstantiate.cpp` was causing the lookup of `Alias` to fail: it was marked as invalid and wasn't stored in the preamble. I'll be happy to make a more straightforward test, but I've not yet managed to: when placing everything in a single file, the lookup succeeds.

Adding Volodymyr who landed related patch recently.

Have you checked that produced AST is of sufficient quality to be used? Because, for example, for invalid code error recovery tries to keep going but the end result isn't always good.

In the test you verify that you can ignore bogus includes. Is this the real-world use case you want to address? Or something like "stddef.h not found"?

In D50462#1203038, @vsapsai wrote:

Have you checked that produced AST is of sufficient quality to be used? Because, for example, for invalid code error recovery tries to keep going but the end result isn't always good.
In the test you verify that you can ignore bogus includes. Is this the real-world use case you want to address? Or something like "stddef.h not found"?

I'm assuming it should help when the recovery is good enough (e.g. when the symbols from the missing header are used rarely, or only used inside function bodies). For the case like "stddef.h not found" there is a chance that resulting AST wouldn't be too useful, however, for IDE-like tasks, where we usually want to get as much info from the incomplete code as possible, it's not too bad IMO (comparing to just disabling all IDE features altogether after a missing include).

The only required property is that the resulting issues shouldn't be catastrophic (further processing shouldn't crash or hang), however, it would be a separate issue - after all, you always can remove the include directive in question.

The main use case I have is that it's inconvenient to browse or edit code in clang-based IDEs when either:

there is a typo in an include directive somewhere (e.g. because I'm still modifying the code, moving files around, etc)
there is a missing/misconfigured 3rd-party dependency. Obviously, the code is not supposed to compile, however, I'd expect e.g. "Goto Declaration" still work for unrelated code, assuming it recovered properly, which, in my anecdotical experience, it usually does
(a very specific use case I'm often hitting) you often have missing includes when you browse llvm/clang code base and haven't built some part of it, so you don't have generated headers. The parser usually recovers very well in this case, if not for these two cut-offs I'm changing in this patch

Thanks, Dmitry, for your explanation. It does help to understand your use case better.

What about having a mode that treats missing header as non-fatal error? Because I believe there are cases where there is no sense to continue after a fatal error. For example, consider hitting the error limit. Showing too many errors isn't useful and also after 50 errors it is a little bit too optimistic to assume that AST is in a good shape. I don't know if there are other fatal-but-we-can-continue errors and if we need a more general solution. So far looks like missing include is the biggest blocker for IDE-like functionality.

Herald added a subscriber: kadircet. · View Herald TranscriptAug 20 2018, 11:28 AM

Sorry for chiming in, wanted to add my 2 cents to the conversation.

In D50462#1206203, @vsapsai wrote:

What about having a mode that treats missing header as non-fatal error? Because I believe there are cases where there is no sense to continue after a fatal error. For example, consider hitting the error limit. Showing too many errors isn't useful and also after 50 errors it is a little bit too optimistic to assume that AST is in a good shape. I don't know if there are other fatal-but-we-can-continue errors and if we need a more general solution. So far looks like missing include is the biggest blocker for IDE-like functionality.

Your arguments are definitely valid (too many errors may be confusing and AST may not be prepared to recover on some fatal errors), but I would argue that the IDE use-case is different enough from the command-line tools to make the new behavior a preferred one.

Specifically, a few questions regarding the aforementioned fatal errors:

Why should hitting some error limit be considered a fatal error? One of the reasons I see for the command-line tools is not spamming the output buffers with too many errors, so that navigating to the first errors is easier. It looks like a non-issue for IDEs: they can both show all emitted errors in the text editor and make it easy to navigate to the first error (by providing a convenient UI for an error list, shortcuts to go to the first error, etc.). On the contrary, not seeing errors where I type because clang hit the limit before my line looks like a confusing experience for the editors/IDEs.

Why should an unresolved include (which is considered a fatal error) give a totally different result from the missing include (which is, obviously, undetectable)? They both require the same amount of work to recover and we obviously want clang to work in absence of some includes.

Besides, errors can also be easily "promoted" to fatal with command-line flags (-Wfatal-errors) and editor tools should probably respect those and not override their severity.

W.R.T. to the AST not being good enough: it may not be, but shouldn't we instead invest in improving the recovery on things like unresolved references?
This would give better experience in absence of non-fatal errors too (common case when copy-pasting code, removing too many #include directives from the file, etc.) and it looks doable.

Overall, I would argue that letting clang recover from fatal errors is the right thing to do for IDEs and editor integrations and the original patch was moving in the right direction.
WDYT?

When lookin through the list of the fatal errors, I think there are different categories:

"Scary" ones - "module was relocated", "invalid vfs overlay file", we probably shouldn't try to recover from it
"User" errors (include not found, too many errors) - we definitely shouldn't treat them as fatal in IDE
(subclass of the previous one): "something is too deep" (instantiations, brackets) - I'm afraid treating them as non-fatal right now would lead to a possibility of them happening again and again as we recover and proceed. So, in the future, the recovery might be more clever (i.e. going all the way up the instantiation/brackets stack, and then proceeding normally), however, while it's not done, I'm a bit uneasy demoting them from fatal.

So I'm preparing an alternative patch to demote "file not found" in include directive from the fatal error in .td file, and then immediately promote it back by default (but not in clangd). WDYT?

In general, I find the whole concept of "Fatal error occurred/Uncompilable error occurred/Too many errors occurred/etc" too coarse-grained for tooling/IDEs. Parse/PP/Sema should probably try harder to recover, and not report such global state change. I'm not ready to approach it, though :)

In D50462#1214120, @Dmitry.Kozhevnikov wrote:

When lookin through the list of the fatal errors, I think there are different categories:

"Scary" ones - "module was relocated", "invalid vfs overlay file", we probably shouldn't try to recover from it

"User" errors (include not found, too many errors) - we definitely shouldn't treat them as fatal in IDE

(subclass of the previous one): "something is too deep" (instantiations, brackets) - I'm afraid treating them as non-fatal right now would lead to a possibility of them happening again and again as we recover and proceed. So, in the future, the recovery might be more clever (i.e. going all the way up the instantiation/brackets stack, and then proceeding normally), however, while it's not done, I'm a bit uneasy demoting them from fatal.

So I'm preparing an alternative patch to demote "file not found" in include directive from the fatal error in .td file, and then immediately promote it back by default (but not in clangd). WDYT?

It feels we're uncertain whether continuing on fatal errors would work, so we choose to not do it.
E.g., about the scary ones:

on "invalid vfs overlay file" we wouldn't even run to the parsing, as it fires when parsing compile args and noone recovers from that.
on "module relocated". Haven't seen this one, so speculating here. If this happens at import directives, can we just pretend the import was unresolved and continue?

Having a mechanism to recover on specific errors seems like a more fine-grained approach that should work, but the current version of the patch looks very simple and if it turns out it works.
It would be a shame to have the added complexity if the simple solution would also work.

And there is -Wfatal-errors that turns every error into fatal, how would promote/demote approach work in that case?
Maybe we can first add an option, experiment with it and see if it works? And if not, it would give us enough data to classify the failure modes and fix them?

I wonder what others think about it.

In general, I find the whole concept of "Fatal error occurred/Uncompilable error occurred/Too many errors occurred/etc" too coarse-grained for tooling/IDEs. Parse/PP/Sema should probably try harder to recover, and not report such global state change. I'm not ready to approach it, though :)

Totally agree, I can't imagine a fatal error that makes it absolutely impossible for parser to recover (after the parser starts, i.e. the compilation args are correct). Clang can (and will) produce too much noise when recovering, but that's up to IDEs/tools to fix this.

PS we definitely want recovery from missing includes in clangd.
We have this interesting problem when building the preamble:

#include "resolved.h" // imagine this defines a struct named 'Foo'
#include "unresolved.h"

int main() {
  Foo f;
  f. // <-- complete after dot here
}

In that case, completion would work even when the second header is unresolved.
However, if we swap the includes, completion stops working because the fatal error (unresolved include) happens before the definition of the struct, so preamble does not contain the struct definition.

Agree that fatal/non-fatal error is too coarse and tooling/IDEs need more details and more control to provide better experience. But I don't think we are in a state to claim that all errors are recoverable (in theory and in current implementation). Instead of continuing on all errors, I prefer to select errors that are important for tooling and improve those first.

Regarding the current patch, I don't like creating coupling between hasFatalErrorOccurred and shouldRecoverAfterFatalErrors. Looks like after this patch you'll need to call these methods together in many cases. For example, probably Sema::makeTypoCorrectionConsumer in

if (Diags.hasFatalErrorOccurred() || !getLangOpts().SpellChecking ||
    DisableTypoCorrection)
  return nullptr;

should check shouldRecoverAfterFatalErrors too.

nridge added a subscriber: nridge.May 28 2021, 1:07 AM

Herald added a subscriber: usaxena95. · View Herald TranscriptMay 28 2021, 1:07 AM

Revision Contents

Path

Size

include/

clang/

Basic/

Diagnostic.h

9 lines

lib/

Lex/

PPDirectives.cpp

3 lines

Sema/

SemaTemplateInstantiate.cpp

1 line

unittests/

Frontend/

PCHPreambleTest.cpp

37 lines

Diff 160848

include/clang/Basic/Diagnostic.h

Show First 20 Lines • Show All 746 Lines • ▼ Show 20 Lines	public:

/// Errors that actually prevent compilation, not those that are		/// Errors that actually prevent compilation, not those that are
/// upgraded from a warning by -Werror.		/// upgraded from a warning by -Werror.
bool hasUncompilableErrorOccurred() const {		bool hasUncompilableErrorOccurred() const {
return UncompilableErrorOccurred;		return UncompilableErrorOccurred;
}		}
bool hasFatalErrorOccurred() const { return FatalErrorOccurred; }		bool hasFatalErrorOccurred() const { return FatalErrorOccurred; }

		/// Determine whether the client should because a fatal error was issued, so
		/// clients can cut off extra processing that might be wasteful in this case.
		bool shouldRecoverAfterFatalErrors() const {
		// todo: a separate flag might be required for a client that doesn't want to
		arphamanUnsubmitted Not Done Reply Inline Actions Looks like this line wasn't formatted with clang-format. arphaman: Looks like this line wasn't formatted with clang-format.
		// show any errors after the fatal, but still wants to collect as much
		// information from the source code as possible.
		return SuppressAfterFatalError;
		}

/// Determine whether any kind of unrecoverable error has occurred.		/// Determine whether any kind of unrecoverable error has occurred.
bool hasUnrecoverableErrorOccurred() const {		bool hasUnrecoverableErrorOccurred() const {
return FatalErrorOccurred \|\| UnrecoverableErrorOccurred;		return FatalErrorOccurred \|\| UnrecoverableErrorOccurred;
}		}

unsigned getNumWarnings() const { return NumWarnings; }		unsigned getNumWarnings() const { return NumWarnings; }

void setNumWarnings(unsigned NumWarnings) {		void setNumWarnings(unsigned NumWarnings) {
▲ Show 20 Lines • Show All 831 Lines • Show Last 20 Lines

lib/Lex/PPDirectives.cpp

Show First 20 Lines • Show All 1,893 Lines • ▼ Show 20 Lines	#endif
bool ShouldEnter = true;		bool ShouldEnter = true;

if (PPOpts->SingleFileParseMode)		if (PPOpts->SingleFileParseMode)
ShouldEnter = false;		ShouldEnter = false;

// Any diagnostics after the fatal error will not be visible. As the		// Any diagnostics after the fatal error will not be visible. As the
// compilation failed already and errors in subsequently included files won't		// compilation failed already and errors in subsequently included files won't
// be visible, avoid preprocessing those files.		// be visible, avoid preprocessing those files.
if (ShouldEnter && Diags->hasFatalErrorOccurred())		if (ShouldEnter && Diags->hasFatalErrorOccurred() &&
		!Diags->shouldRecoverAfterFatalErrors())
ShouldEnter = false;		ShouldEnter = false;

// Determine whether we should try to import the module for this #include, if		// Determine whether we should try to import the module for this #include, if
// there is one. Don't do so if precompiled module support is disabled or we		// there is one. Don't do so if precompiled module support is disabled or we
// are processing this module textually (because we're building the module).		// are processing this module textually (because we're building the module).
if (ShouldEnter && File && SuggestedModule && getLangOpts().Modules &&		if (ShouldEnter && File && SuggestedModule && getLangOpts().Modules &&
!isForModuleBuilding(SuggestedModule.getModule(),		!isForModuleBuilding(SuggestedModule.getModule(),
getLangOpts().CurrentModule,		getLangOpts().CurrentModule,
▲ Show 20 Lines • Show All 1,064 Lines • Show Last 20 Lines

lib/Sema/SemaTemplateInstantiate.cpp

Show First 20 Lines • Show All 213 Lines • ▼ Show 20 Lines	Sema::InstantiatingTemplate::InstantiatingTemplate(
SourceLocation PointOfInstantiation, SourceRange InstantiationRange,		SourceLocation PointOfInstantiation, SourceRange InstantiationRange,
Decl Entity, NamedDecl Template, ArrayRef<TemplateArgument> TemplateArgs,		Decl Entity, NamedDecl Template, ArrayRef<TemplateArgument> TemplateArgs,
sema::TemplateDeductionInfo *DeductionInfo)		sema::TemplateDeductionInfo *DeductionInfo)
: SemaRef(SemaRef) {		: SemaRef(SemaRef) {
// Don't allow further instantiation if a fatal error and an uncompilable		// Don't allow further instantiation if a fatal error and an uncompilable
// error have occurred. Any diagnostics we might have raised will not be		// error have occurred. Any diagnostics we might have raised will not be
// visible, and we do not need to construct a correct AST.		// visible, and we do not need to construct a correct AST.
if (SemaRef.Diags.hasFatalErrorOccurred() &&		if (SemaRef.Diags.hasFatalErrorOccurred() &&
		!SemaRef.Diags.shouldRecoverAfterFatalErrors() &&
SemaRef.Diags.hasUncompilableErrorOccurred()) {		SemaRef.Diags.hasUncompilableErrorOccurred()) {
Invalid = true;		Invalid = true;
return;		return;
}		}
Invalid = CheckInstantiationDepth(PointOfInstantiation, InstantiationRange);		Invalid = CheckInstantiationDepth(PointOfInstantiation, InstantiationRange);
if (!Invalid) {		if (!Invalid) {
CodeSynthesisContext Inst;		CodeSynthesisContext Inst;
Inst.Kind = Kind;		Inst.Kind = Kind;
▲ Show 20 Lines • Show All 2,776 Lines • Show Last 20 Lines

unittests/Frontend/PCHPreambleTest.cpp

Show First 20 Lines • Show All 71 Lines • ▼ Show 20 Lines	void AddFile(const std::string &Filename, const std::string &Contents) {
::time(&now);		::time(&now);
VFS->addFile(Filename, now, MemoryBuffer::getMemBufferCopy(Contents, Filename));		VFS->addFile(Filename, now, MemoryBuffer::getMemBufferCopy(Contents, Filename));
}		}

void RemapFile(const std::string &Filename, const std::string &Contents) {		void RemapFile(const std::string &Filename, const std::string &Contents) {
RemappedFiles[Filename] = Contents;		RemappedFiles[Filename] = Contents;
}		}

std::unique_ptr<ASTUnit> ParseAST(const std::string &EntryFile) {		std::unique_ptr<ASTUnit> ParseAST(const std::string &EntryFile,
		bool SuppressAfterFatalError = true) {
PCHContainerOpts = std::make_shared<PCHContainerOperations>();		PCHContainerOpts = std::make_shared<PCHContainerOperations>();
std::shared_ptr<CompilerInvocation> CI(new CompilerInvocation);		std::shared_ptr<CompilerInvocation> CI(new CompilerInvocation);
CI->getFrontendOpts().Inputs.push_back(		CI->getFrontendOpts().Inputs.push_back(
FrontendInputFile(EntryFile, FrontendOptions::getInputKindForExtension(		FrontendInputFile(EntryFile, FrontendOptions::getInputKindForExtension(
llvm::sys::path::extension(EntryFile).substr(1))));		llvm::sys::path::extension(EntryFile).substr(1))));

CI->getTargetOpts().Triple = "i386-unknown-linux-gnu";		CI->getTargetOpts().Triple = "i386-unknown-linux-gnu";

CI->getPreprocessorOpts().RemappedFileBuffers = GetRemappedFiles();		CI->getPreprocessorOpts().RemappedFileBuffers = GetRemappedFiles();

		CI->getLangOpts()->CPlusPlus = true;
		CI->getLangOpts()->CPlusPlus11 = true;

PreprocessorOptions &PPOpts = CI->getPreprocessorOpts();		PreprocessorOptions &PPOpts = CI->getPreprocessorOpts();
PPOpts.RemappedFilesKeepOriginalName = true;		PPOpts.RemappedFilesKeepOriginalName = true;

IntrusiveRefCntPtr<DiagnosticsEngine>		IntrusiveRefCntPtr<DiagnosticsEngine>
Diags(CompilerInstance::createDiagnostics(new DiagnosticOptions, new DiagnosticConsumer));		Diags(CompilerInstance::createDiagnostics(new DiagnosticOptions, new DiagnosticConsumer));
		Diags->setSuppressAfterFatalError(SuppressAfterFatalError);

FileManager *FileMgr = new FileManager(FSOpts, VFS);		FileManager *FileMgr = new FileManager(FSOpts, VFS);

std::unique_ptr<ASTUnit> AST = ASTUnit::LoadFromCompilerInvocation(		std::unique_ptr<ASTUnit> AST = ASTUnit::LoadFromCompilerInvocation(
CI, PCHContainerOpts, Diags, FileMgr, false, false,		CI, PCHContainerOpts, Diags, FileMgr, false, false,
/PrecompilePreambleAfterNParses=/1);		/PrecompilePreambleAfterNParses=/1);
return AST;		return AST;
}		}
▲ Show 20 Lines • Show All 88 Lines • ▼ Show 20 Lines	RemapFile(Main,
"int main() { return random() -2; }");		"int main() { return random() -2; }");

ASSERT_TRUE(ReparseAST(AST));		ASSERT_TRUE(ReparseAST(AST));
ASSERT_FALSE(AST->getDiagnostics().hasErrorOccurred());		ASSERT_FALSE(AST->getDiagnostics().hasErrorOccurred());

ASSERT_LE(HeaderReadCount, GetFileReadCount(Header));		ASSERT_LE(HeaderReadCount, GetFileReadCount(Header));
}		}

		TEST_F(PCHPreambleTest, MissingHeader) {
		std::string Header1 = "//./header1.h";
		AddFile(Header1, "template <class T, class U = int> class C;\n"
		"template <class T, class U> class C{};\n");

		std::string Header2 = "//./header2.h";
		AddFile(Header2, "using Alias = C<int>;\n");

		std::string Main = "//./main.cpp";
		AddFile(Main, "#include \"nonexistent1\"\n"
		"#include \"//./header1.h\"\n"
		"#include \"nonexistent2\"\n"
		"#include \"//./header2.h\"\n"
		"Alias Var;");

		std::unique_ptr<ASTUnit> AST(
		Dmitry.KozhevnikovAuthorUnsubmitted Not Done Reply Inline Actions The condition in `SemaTemplateInstantiate.cpp` was causing the lookup of `Alias` to fail: it was marked as invalid and wasn't stored in the preamble. I'll be happy to make a more straightforward test, but I've not yet managed to: when placing everything in a single file, the lookup succeeds. Dmitry.Kozhevnikov: The condition in `SemaTemplateInstantiate.cpp` was causing the lookup of `Alias` to fail: it…
		ParseAST(Main, /SuppressAfterFatalError=/false));
		ASSERT_TRUE(AST.get());

		// only "file not found" errors should be emitted,
		// "Alias" should be visible for lookup.
		auto ExpectedErrorsCount = 2u;

		ASSERT_EQ(AST->getDiagnostics().getClient()->getNumErrors(),
		ExpectedErrorsCount);
		ASSERT_TRUE(ReparseAST(AST));
		ASSERT_EQ(AST->getDiagnostics().getClient()->getNumErrors(),
		ExpectedErrorsCount);
		}

} // anonymous namespace		} // anonymous namespace