This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
cfe/trunk/
-
trunk/
-
lib/Parse/
-
Parse/
-
ParseExpr.cpp
-
test/Sema/
-
Sema/
-
typo-correction.c

Differential D20490

[Parser] Fix a crash on invalid where a delayed TypoExpr was corrected twice
ClosedPublic

Authored by erik.pilkington on May 20 2016, 2:02 PM.

Download Raw Diff

Details

Reviewers

rsmith

Commits

rG142a87489053: [Parser] Only correct delayed typos when needed
rC272587: [Parser] Only correct delayed typos when needed
rL272587: [Parser] Only correct delayed typos when needed

Summary

Previously, Clang crashed when parsing a BinaryOperator in C with a typo when a typo was already found. This is because the parser called Sema::ActOnBinOp, which corrects the found typo in C mode, then corrects the typo again from the parser. During the first correction pass, the TypoExprState corresponding to the typo was cleared from Sema when it was corrected. During a second pass, an assert fails in Sema::getTypoExprState because it cannot find the TypoExprState. The fix is to avoid correcting delayed typos in the parser in that case.

This patch looks like it fixes PR26700, PR27231, and PR27038.

On a more general note, the handling of delayed typo expressions is very messy right now, some of them are handled in semantic analysis, and some are handled in the parser, leading to easy to make responsibility bugs like this one. I think I might take a look at moving the correcting to one side or the other in a future patch.

Diff Detail

Repository: rL LLVM

Event Timeline

erik.pilkington updated this revision to Diff 57980.May 20 2016, 2:02 PM

erik.pilkington retitled this revision from to [Parser] Fix a crash on invalid where a delayed TypoExpr was corrected twice.

erik.pilkington updated this object.

erik.pilkington added a reviewer: rsmith.

erik.pilkington added a subscriber: cfe-commits.

Ping!!

Pong!!

rsmith added inline comments.Jun 9 2016, 1:27 PM

lib/Parse/ParseExpr.cpp
450–452 ↗	(On Diff #57980)	The inconsistent behavior of `ActOnBinOp` seems somewhere between an implementation detail and a bug; it doesn't seem reasonable for the parser to rely on that. I'm not particularly happy about making changes like this without some documentation of the overall design that shows whose responsibility it is to correct typos in which cases. Before we introduced `TypoExpr`, the parser was permitted to simply discard `Expr` nodes that it didn't use (because it'd hit a parse error). Ideally, I'd like to return to that state of affairs, by removing the relevant `CorrectDelayedTyposInExpr` calls from the parser and having Sema automatically diagnose them when we get to the end of the relevant context, if we've not already done so. Another reasonable-seeming option would be to add a `Sema::ActOnDiscardedExpr(Expr)` that the parser can call (which calls `CorrectDelayedTyposInExpr`), and make it clear that the parser is responsible for passing each Expr that it receives from Sema to exactly one ActOn function (unless otherwise specified) -- that way, at least the responsibilities will be clear, but it doesn't help us avoid bugs where `TypoExpr`s are accidentally discarded.

Let's go ahead with this for now and figure out the proper way to handle this as a follow-up change.

This revision is now accepted and ready to land.Jun 13 2016, 1:22 PM

Closed by commit rL272587: [Parser] Only correct delayed typos when needed (authored by epilk). · Explain WhyJun 13 2016, 2:03 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

cfe/

trunk/

lib/

Parse/

ParseExpr.cpp

4 lines

test/

Sema/

typo-correction.c

8 lines

Diff 60603

cfe/trunk/lib/Parse/ParseExpr.cpp

Show First 20 Lines • Show All 440 Lines • ▼ Show 20 Lines	if (!LHS.isInvalid()) {
if (!GreaterThanIsOperator && OpToken.is(tok::greatergreater))		if (!GreaterThanIsOperator && OpToken.is(tok::greatergreater))
SuggestParentheses(OpToken.getLocation(),		SuggestParentheses(OpToken.getLocation(),
diag::warn_cxx11_right_shift_in_template_arg,		diag::warn_cxx11_right_shift_in_template_arg,
SourceRange(Actions.getExprRange(LHS.get()).getBegin(),		SourceRange(Actions.getExprRange(LHS.get()).getBegin(),
Actions.getExprRange(RHS.get()).getEnd()));		Actions.getExprRange(RHS.get()).getEnd()));

LHS = Actions.ActOnBinOp(getCurScope(), OpToken.getLocation(),		LHS = Actions.ActOnBinOp(getCurScope(), OpToken.getLocation(),
OpToken.getKind(), LHS.get(), RHS.get());		OpToken.getKind(), LHS.get(), RHS.get());

		// In this case, ActOnBinOp performed the CorrectDelayedTyposInExpr check.
		if (!getLangOpts().CPlusPlus)
		continue;
} else {		} else {
LHS = Actions.ActOnConditionalOp(OpToken.getLocation(), ColonLoc,		LHS = Actions.ActOnConditionalOp(OpToken.getLocation(), ColonLoc,
LHS.get(), TernaryMiddle.get(),		LHS.get(), TernaryMiddle.get(),
RHS.get());		RHS.get());
}		}
}		}
// Ensure potential typos aren't left undiagnosed.		// Ensure potential typos aren't left undiagnosed.
if (LHS.isInvalid()) {		if (LHS.isInvalid()) {
▲ Show 20 Lines • Show All 2,411 Lines • Show Last 20 Lines

cfe/trunk/test/Sema/typo-correction.c

	Show First 20 Lines • Show All 51 Lines • ▼ Show 20 Lines

	extern long afunction(int); // expected-note {{'afunction' declared here}}			extern long afunction(int); // expected-note {{'afunction' declared here}}
	void fn2() {			void fn2() {
	f(THIS_IS_AN_ERROR, // expected-error {{use of undeclared identifier 'THIS_IS_AN_ERROR'}}			f(THIS_IS_AN_ERROR, // expected-error {{use of undeclared identifier 'THIS_IS_AN_ERROR'}}
	afunction(afunction_)); // expected-error {{use of undeclared identifier 'afunction_'; did you mean 'afunction'?}}			afunction(afunction_)); // expected-error {{use of undeclared identifier 'afunction_'; did you mean 'afunction'?}}
	}			}

	int d = X ? d : L; // expected-error 2 {{use of undeclared identifier}}			int d = X ? d : L; // expected-error 2 {{use of undeclared identifier}}

				int fn_with_ids() { ID = ID == ID >= ID ; } // expected-error 4 {{use of undeclared identifier}}

				int fn_with_rs(int r) { r = TYPO + r * TYPO; } // expected-error 2 {{use of undeclared identifier}}

				void fn_with_unknown(int a, int b) {
				fn_with_unknown(unknown, unknown \| unknown); // expected-error 3 {{use of undeclared identifier}}
				}