This is an archive of the discontinued LLVM Phabricator instance.

Differential D26350

Keep invalid Switch in the AST
Needs Review (Public)

Authored by ogoffart on Nov 7 2016, 6:40 AM.

Details

Reviewers

rsmith
aaron.ballman
erikjv

Summary

When the condition is invalid, replace it with an OpaqueValueExpr.

When parsing an invalid CaseStmt, don't drop the sub-statement; return it instead.

In Sema::ActOnStartOfSwitchStmt, always keep the SwitchStmt, even if it has duplicate case or default statements or its condition cannot be converted to an integral type.

Diff Detail

Event Timeline

I believe only the change in ActOnFinishSwitchStmt might be controversial.
Does keeping invalid switches in the AST break an invariant?

ping?

ping2

Ping

re-ping

aaron.ballman edited reviewers, added: aaron.ballman; removed: cfe-commits. Jul 14 2017, 5:28 AM

aaron.ballman added a subscriber: cfe-commits.

You've explained how you are accomplishing this but not why. I don't think Clang typically keeps erroneous AST nodes in the tree. What kind of problem is this intended to solve?

The problem I'm trying to solve is precisely to keep as much as possible of the valid AST in the main AST, despite errors.
I've already done some work with r249982, r272962 and more, and there is still a lot to do. But the goal is to keep as much as possible of it.

The reason I'm working on this is highlighting of code that might be invalid (because you are editing it, or because the tool doesn't have access to all headers).
Constructs such as the if statement from my previous patch, or the switch statement in this patch, have the most impact, because not keeping them removes highlighting for potentially big blocks of code.

This is useful, for example, for IDEs such as KDevelop, which use clang for highlighting, or for my own tool [code.woboq.org].

A random example is https://code.woboq.org/linux/linux/arch/arm/kernel/module.c.html?style=kdevelop#101
(generated with this patch applied). There are errors because this is an ARM file built with the options for an Intel kernel. Yet most of the file is properly highlighted. Without this patch, the whole switch statement would be removed from the AST and nothing within it would be highlighted, only because some case labels are invalid.

In D26350#809577, @ogoffart wrote:

The problem I'm trying to solve is precisely to keep as much as possible of the valid AST in the main AST, despite errors.
I've already done some work with r249982, r272962 and more, and there is still a lot to do. But the goal is to keep as much as possible of it.

Thank you for the explanation -- I like the goal, but am definitely concerned about the assumptions it might break by doing this (in general, not specific to this patch). We try to recover gracefully whenever possible, but still, a whole lot of frontend code relies on knowing when something is invalid to prevent the compiler from doing even worse things. I'm not certain the best balance to strike with that, but suspect it's going to adversely impact tools like clang-tidy which tend to assume that AST matchers match *valid* code and not invalid code.

lib/Parse/ParseStmt.cpp
1297	succeed -> succeeds. Are the concerns pointed out in the FIXME addressed by code not in this patch?
lib/Sema/SemaStmt.cpp
669	This makes the condition result valid when it isn't. Users of this condition result may expect a valid condition result to return nonnull values when calling `get()`, which makes me uncomfortable.

Thanks for your review; I'll try to address the concerns.

I believe tools that really need valid code rely on the diagnostics and bail out on error. On the other hand, tools that may work on code containing errors make a best effort to work on the remaining AST.
Patches like this one improve the remaining AST, so tools like clang-tidy will also be able to do valid transformations inside the switch statement, which would not be possible when the whole body is gone.

The AST stays "valid" in the sense that all the nodes exist (no nullptr), and so it conforms to the expectations of the code. The condition of a switch may now be an OpaqueValueExpr, which should not disturb the matchers.

lib/Parse/ParseStmt.cpp
1297	The FIXME points out problems that occur if the parser finds a 'default' or 'case' statement but cannot connect it to the corresponding 'switch' statement (because that switch statement did not exist, as it was removed from the AST). Now that we always keep the 'switch' statement, this is no longer a problem.
lib/Sema/SemaStmt.cpp
669	get() returns a non-null value. That's why I'm constructing an OpaqueValueExpr placeholder expression. The ConditionVar (nullptr in the line below) can be null. It is actually null in valid code most of the time, when one does not declare a new variable in the condition. The result is that users of this condition will get an OpaqueValueExpr when calling get() and should not be disturbed by it, as they will just treat it as an expression.
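
To make the placeholder idea concrete, here is a minimal, self-contained sketch. It does not use Clang's classes: ToyExpr, ToySwitch and buildSwitch are hypothetical stand-ins, and a null pointer models an invalid condition result. It only illustrates that the switch node is always built and that its condition is never null.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>

// Toy stand-ins for AST nodes; these are NOT Clang's classes.
struct ToyExpr {
  enum Kind { Ordinary, OpaquePlaceholder } kind;
  std::string text;
};

struct ToySwitch {
  std::unique_ptr<ToyExpr> cond; // never null once the switch is built
};

// Hypothetical helper: build a switch node even when the parsed condition
// is missing (nullptr models an invalid condition result).
inline ToySwitch buildSwitch(std::unique_ptr<ToyExpr> parsedCond) {
  if (!parsedCond) {
    // Substitute an opaque placeholder instead of dropping the switch.
    parsedCond = std::make_unique<ToyExpr>(
        ToyExpr{ToyExpr::OpaquePlaceholder, "<invalid condition>"});
  }
  ToySwitch sw{std::move(parsedCond)};
  assert(sw.cond && "the condition is always non-null");
  return sw;
}
```

Calling buildSwitch(nullptr) yields a switch whose condition has kind OpaquePlaceholder, so a consumer that only dereferences the condition keeps working.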

This looks reasonable to me, but you should wait for @rsmith to sign off before committing.

lib/Sema/SemaStmt.cpp
669	Ah, sorry, I misread the code in my haste.

ogoffart added a reviewer: erikjv. Oct 10 2017, 2:58 AM

rsmith added inline comments. Oct 10 2017, 3:07 PM

lib/Parse/ParseStmt.cpp
1297	I'm uncomfortable about this; this change couples Parser to the implementation details of Sema. How about this: remove this assert and change `ActOnFinishSwitchStmt` to take a `StmtResult` instead (which might be invalid). Then you can tell from within `ActOnFinishSwitchStmt` whether to check the case statements against the condition based on whether the switch is in fact invalid. (Alternatively: change `ActOnStartOfSwitchStmt` to return `void` and make `ActOnFinishSwitchStmt` pick up the switch statement from the `SwitchStack`.)
lib/Sema/SemaStmt.cpp
672	Won't this result in warnings or errors later on if we have `case` labels with expressions of other types (e.g., narrowing warnings/errors)? Please instead (somehow) track that the switch condition is invalid and skip those checks -- perhaps either by returning an invalid-but-not-null statement here and passing that back into `ActOnFinishSwitchStmt`, or by tracking an "invalid" flag on the `SwitchStack` entry.
1173–1177	Hmm. Removing this will result in us producing invalid ASTs in some cases (with duplicate `case` or `default` labels). That's a condition that it would be reasonable for AST consumers to assert on currently, so this is concerning. That said... it's inevitable that this work to keep more invalid constructs in the AST will result in such changes. Perhaps what we need is just a marker to say "beyond this point the AST does not necessarily correspond to any valid source code" for `Stmt` nodes, analogous to the `Invalid` marker on declarations. (Maybe a wrapper `InvalidStmt` node, so that tree traversals can easily avoid walking through it.) Let's try this change out as-is. It may be that this concern is baseless.
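
The wrapper-node idea in the last comment can be sketched as follows. This is a toy model under stated assumptions, not Clang code: ToyStmt, wrapInvalid and countNodes are hypothetical names, and the "invalid" marker is a plain bool. It shows how a traversal could choose not to walk through a subtree that may not correspond to valid source.

```cpp
#include <cassert>
#include <memory>
#include <utility>
#include <vector>

// Toy statement node; NOT Clang's Stmt.
struct ToyStmt {
  bool isInvalidWrapper = false; // hypothetical "beyond this point" marker
  std::vector<std::unique_ptr<ToyStmt>> children;
};

// Wrap a possibly-invalid subtree in a marker node (hypothetical helper).
inline std::unique_ptr<ToyStmt> wrapInvalid(std::unique_ptr<ToyStmt> inner) {
  auto wrapper = std::make_unique<ToyStmt>();
  wrapper->isInvalidWrapper = true;
  wrapper->children.push_back(std::move(inner));
  return wrapper;
}

// Count nodes; with skipInvalid set, the traversal stops at marker nodes
// and does not count them or anything below them.
inline int countNodes(const ToyStmt &s, bool skipInvalid) {
  if (skipInvalid && s.isInvalidWrapper)
    return 0;
  int n = 1;
  for (const auto &child : s.children)
    n += countNodes(*child, skipInvalid);
  return n;
}
```

A consumer that assumes valid code would pass skipInvalid = true, while a highlighter would pass false and still see everything under the wrapper.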

Updated the patch so that ActOnStartOfSwitchStmt returns void and ActOnFinishSwitchStmt skips some checks in case of error.
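
The shape of that update can be illustrated with a small, self-contained model. All names here (ToySema, startSwitch, addCase, finishSwitch) are hypothetical, and the "invalid" flag follows the reviewer's suggestion of tracking it on the stack entry rather than what the patch itself does. The start step returns nothing and always pushes a switch; the finish step pops it from the stack and skips the case checks when the condition was invalid.

```cpp
#include <cassert>
#include <set>
#include <vector>

// Toy model; the names do not match Clang's Sema.
struct ToySwitch {
  bool condInvalid = false;      // set when the condition failed to convert
  std::vector<int> caseValues;   // case labels seen while parsing the body
  bool hasDuplicateCase = false; // result of the check, when it ran
};

struct ToySema {
  std::vector<ToySwitch> switchStack;

  // Returns nothing: the switch is always created and pushed.
  void startSwitch(bool condIsValid) {
    ToySwitch sw;
    sw.condInvalid = !condIsValid;
    switchStack.push_back(sw);
  }

  void addCase(int value) {
    assert(!switchStack.empty() && "case outside of a switch");
    switchStack.back().caseValues.push_back(value);
  }

  // Picks the switch up from the stack; the duplicate-case check is
  // skipped when the condition was invalid.
  ToySwitch finishSwitch() {
    assert(!switchStack.empty() && "switch stack missing push/pop!");
    ToySwitch sw = switchStack.back();
    switchStack.pop_back();
    if (!sw.condInvalid) {
      std::set<int> seen;
      for (int v : sw.caseValues)
        if (!seen.insert(v).second)
          sw.hasDuplicateCase = true;
    }
    return sw; // the node is kept even when something was wrong
  }
};
```

Because the parser never holds the switch between the two calls, it does not need to know whether Sema considered the condition valid.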

rsmith added inline comments. Feb 28 2018, 4:49 PM

lib/Sema/SemaStmt.cpp
823	It's fragile to assume that the only way you can see an `OpaqueValueExpr` here is by it being created in `ActOnStartOfSwitchStmt`. We could tunnel this information through in another way, though, such as by tracking a bool in the `SwitchStack` in addition to the statement. However, perhaps it's time to bite the bullet and add actual support for error nodes in the AST. For example, we could add a new kind of placeholder type for an erroneous expression, and build syntactic expression trees with that type when we encounter errors.

aaron.ballman added inline comments. Feb 28 2018, 6:45 PM

lib/Sema/SemaStmt.cpp
823	FWIW, I would find error nodes in the AST to be extremely useful.
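
As a rough illustration of the "placeholder type for an erroneous expression" idea, the sketch below uses a toy type enum with an Error member. ToyType, ToyExpr, buildAdd and shouldCheckCases are hypothetical names and not part of Clang. The point is that later checks ask about the type of the condition instead of guessing from the node kind.

```cpp
#include <cassert>

// Toy type system with a dedicated error type; NOT Clang's QualType.
enum class ToyType { Int, Bool, Error };

struct ToyExpr {
  ToyType type;
};

// Build a binary expression: an erroneous operand makes the whole
// expression erroneous, but a node is still produced.
inline ToyExpr buildAdd(const ToyExpr &lhs, const ToyExpr &rhs) {
  if (lhs.type == ToyType::Error || rhs.type == ToyType::Error)
    return ToyExpr{ToyType::Error};
  return ToyExpr{ToyType::Int};
}

// Case-label checks only make sense for a non-erroneous condition.
inline bool shouldCheckCases(const ToyExpr &cond) {
  return cond.type != ToyType::Error;
}
```

With this shape, a check like the `isa<OpaqueValueExpr>` test discussed above would no longer depend on where the placeholder node was created.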

Revision Contents

Path                                     Size
include/clang/Sema/Sema.h                8 lines
lib/Parse/ParseStmt.cpp                  21 lines
lib/Sema/SemaStmt.cpp                    55 lines
lib/Sema/TreeTransform.h                 19 lines
test/Misc/ast-dump-invalid-switch.cpp    105 lines
test/SemaCXX/switch.cpp                  17 lines

Diff 119142

include/clang/Sema/Sema.h

	[92 lines not shown]
	StmtResult ActOnIfStmt(SourceLocation IfLoc, bool IsConstexpr,			StmtResult ActOnIfStmt(SourceLocation IfLoc, bool IsConstexpr,
	Stmt *InitStmt,			Stmt *InitStmt,
	ConditionResult Cond, Stmt *ThenVal,			ConditionResult Cond, Stmt *ThenVal,
	SourceLocation ElseLoc, Stmt *ElseVal);			SourceLocation ElseLoc, Stmt *ElseVal);
	StmtResult BuildIfStmt(SourceLocation IfLoc, bool IsConstexpr,			StmtResult BuildIfStmt(SourceLocation IfLoc, bool IsConstexpr,
	Stmt *InitStmt,			Stmt *InitStmt,
	ConditionResult Cond, Stmt *ThenVal,			ConditionResult Cond, Stmt *ThenVal,
	SourceLocation ElseLoc, Stmt *ElseVal);			SourceLocation ElseLoc, Stmt *ElseVal);
	StmtResult ActOnStartOfSwitchStmt(SourceLocation SwitchLoc,			void ActOnStartOfSwitchStmt(SourceLocation SwitchLoc, Stmt *InitStmt,
	Stmt *InitStmt,			ConditionResult Cond);
	ConditionResult Cond);			StmtResult ActOnFinishSwitchStmt(SourceLocation SwitchLoc, Stmt *Body);
	StmtResult ActOnFinishSwitchStmt(SourceLocation SwitchLoc,
	Stmt *Switch, Stmt *Body);
	StmtResult ActOnWhileStmt(SourceLocation WhileLoc, ConditionResult Cond,			StmtResult ActOnWhileStmt(SourceLocation WhileLoc, ConditionResult Cond,
	Stmt *Body);			Stmt *Body);
	StmtResult ActOnDoStmt(SourceLocation DoLoc, Stmt *Body,			StmtResult ActOnDoStmt(SourceLocation DoLoc, Stmt *Body,
	SourceLocation WhileLoc, SourceLocation CondLParen,			SourceLocation WhileLoc, SourceLocation CondLParen,
	Expr *Cond, SourceLocation CondRParen);			Expr *Cond, SourceLocation CondRParen);

	StmtResult ActOnForStmt(SourceLocation ForLoc,			StmtResult ActOnForStmt(SourceLocation ForLoc,
	SourceLocation LParenLoc,			SourceLocation LParenLoc,
	[92 lines not shown]

lib/Parse/ParseStmt.cpp

	[92 lines not shown]
	}			}

	// Install the body into the most deeply-nested case.			// Install the body into the most deeply-nested case.
	if (DeepestParsedCaseStmt) {			if (DeepestParsedCaseStmt) {
	// Broken sub-stmt shouldn't prevent forming the case statement properly.			// Broken sub-stmt shouldn't prevent forming the case statement properly.
	if (SubStmt.isInvalid())			if (SubStmt.isInvalid())
	SubStmt = Actions.ActOnNullStmt(SourceLocation());			SubStmt = Actions.ActOnNullStmt(SourceLocation());
	Actions.ActOnCaseStmtBody(DeepestParsedCaseStmt, SubStmt.get());			Actions.ActOnCaseStmtBody(DeepestParsedCaseStmt, SubStmt.get());
				} else {
				// The case statement is invalid, recover by returning the statement body.
				return SubStmt;
	}			}

	// Return the top level parsed statement tree.			// Return the top level parsed statement tree.
	return TopLevelCase;			return TopLevelCase;
	}			}

	/// ParseDefaultStatement			/// ParseDefaultStatement
	/// labeled-statement:			/// labeled-statement:
	[169 lines not shown]
	// not the case for C90. Start the switch scope.			// not the case for C90. Start the switch scope.
	//			//
	// C++ 6.4p3:			// C++ 6.4p3:
	// A name introduced by a declaration in a condition is in scope from its			// A name introduced by a declaration in a condition is in scope from its
	// point of declaration until the end of the substatements controlled by the			// point of declaration until the end of the substatements controlled by the
	// condition.			// condition.
	// C++ 3.3.2p4:			// C++ 3.3.2p4:
	// Names declared in the for-init-statement, and in the condition of if,			// Names declared in the for-init-statement, and in the condition of if,
	// while, for, and switch statements are local to the if, while, for, or			// while, for, and switch statements are local to the if, while, for, or
	// switch statement (including the controlled statement).			// switch statement (including the controlled statement).
	//			//
	unsigned ScopeFlags = Scope::SwitchScope;			unsigned ScopeFlags = Scope::SwitchScope;
	if (C99orCXX)			if (C99orCXX)
	ScopeFlags |= Scope::DeclScope | Scope::ControlScope;			ScopeFlags |= Scope::DeclScope | Scope::ControlScope;
	ParseScope SwitchScope(this, ScopeFlags);			ParseScope SwitchScope(this, ScopeFlags);

	// Parse the condition.			// Parse the condition.
	StmtResult InitStmt;			StmtResult InitStmt;
	Sema::ConditionResult Cond;			Sema::ConditionResult Cond;
	if (ParseParenExprOrCondition(&InitStmt, Cond, SwitchLoc,			if (ParseParenExprOrCondition(&InitStmt, Cond, SwitchLoc,
	Sema::ConditionKind::Switch))			Sema::ConditionKind::Switch))
	return StmtError();			return StmtError();

	StmtResult Switch =			Actions.ActOnStartOfSwitchStmt(SwitchLoc, InitStmt.get(), Cond);
	Actions.ActOnStartOfSwitchStmt(SwitchLoc, InitStmt.get(), Cond);

	if (Switch.isInvalid()) {
	// Skip the switch body.
	// FIXME: This is not optimal recovery, but parsing the body is more
	// dangerous due to the presence of case and default statements, which
	// will have no place to connect back with the switch.
	if (Tok.is(tok::l_brace)) {
	ConsumeBrace();
	SkipUntil(tok::r_brace);
	} else
	SkipUntil(tok::semi);
	return Switch;
	}

	// C99 6.8.4p3 - In C99, the body of the switch statement is a scope, even if			// C99 6.8.4p3 - In C99, the body of the switch statement is a scope, even if
	// there is no compound stmt. C90 does not have this clause. We only do this			// there is no compound stmt. C90 does not have this clause. We only do this
	// if the body isn't a compound statement to avoid push/pop in common cases.			// if the body isn't a compound statement to avoid push/pop in common cases.
	//			//
	// C++ 6.4p1:			// C++ 6.4p1:
	// The substatement in a selection-statement (each substatement, in the else			// The substatement in a selection-statement (each substatement, in the else
	// form of the if statement) implicitly defines a local scope.			// form of the if statement) implicitly defines a local scope.
	[11 lines not shown]

	// Read the body statement.			// Read the body statement.
	StmtResult Body(ParseStatement(TrailingElseLoc));			StmtResult Body(ParseStatement(TrailingElseLoc));

	// Pop the scopes.			// Pop the scopes.
	InnerScope.Exit();			InnerScope.Exit();
	SwitchScope.Exit();			SwitchScope.Exit();

	return Actions.ActOnFinishSwitchStmt(SwitchLoc, Switch.get(), Body.get());			return Actions.ActOnFinishSwitchStmt(SwitchLoc, Body.get());
	}			}

	/// ParseWhileStatement			/// ParseWhileStatement
	/// while-statement: [C99 6.8.5.1]			/// while-statement: [C99 6.8.5.1]
	/// 'while' '(' expression ')' statement			/// 'while' '(' expression ')' statement
	/// [C++] 'while' '(' condition ')' statement			/// [C++] 'while' '(' condition ')' statement
	StmtResult Parser::ParseWhileStatement(SourceLocation *TrailingElseLoc) {			StmtResult Parser::ParseWhileStatement(SourceLocation *TrailingElseLoc) {
	assert(Tok.is(tok::kw_while) && "Not a while stmt!");			assert(Tok.is(tok::kw_while) && "Not a while stmt!");
	[92 lines not shown]

lib/Sema/SemaStmt.cpp

	[89 lines not shown]
	} SwitchDiagnoser(Cond);			} SwitchDiagnoser(Cond);

	ExprResult CondResult =			ExprResult CondResult =
	PerformContextualImplicitConversion(SwitchLoc, Cond, SwitchDiagnoser);			PerformContextualImplicitConversion(SwitchLoc, Cond, SwitchDiagnoser);
	if (CondResult.isInvalid())			if (CondResult.isInvalid())
	return ExprError();			return ExprError();

	// C99 6.8.4.2p5 - Integer promotions are performed on the controlling expr.			// C99 6.8.4.2p5 - Integer promotions are performed on the controlling expr.
	return UsualUnaryConversions(CondResult.get());			return UsualUnaryConversions(CondResult.get());
	}			}

	StmtResult Sema::ActOnStartOfSwitchStmt(SourceLocation SwitchLoc,			void Sema::ActOnStartOfSwitchStmt(SourceLocation SwitchLoc, Stmt *InitStmt,
	Stmt *InitStmt, ConditionResult Cond) {			ConditionResult Cond) {
	if (Cond.isInvalid())			if (Cond.isInvalid())
	return StmtError();			Cond = ConditionResult(
				*this, nullptr,
				MakeFullExpr(new (Context) OpaqueValueExpr(SourceLocation(),
				Context.IntTy, VK_RValue),
				SwitchLoc),
				false);

	getCurFunction()->setHasBranchIntoScope();			getCurFunction()->setHasBranchIntoScope();

	SwitchStmt *SS = new (Context)			SwitchStmt *SS = new (Context)
	SwitchStmt(Context, InitStmt, Cond.get().first, Cond.get().second);			SwitchStmt(Context, InitStmt, Cond.get().first, Cond.get().second);
	getCurFunction()->SwitchStack.push_back(SS);			getCurFunction()->SwitchStack.push_back(SS);
	return SS;
	}			}

	static void AdjustAPSInt(llvm::APSInt &Val, unsigned BitWidth, bool IsSigned) {			static void AdjustAPSInt(llvm::APSInt &Val, unsigned BitWidth, bool IsSigned) {
	Val = Val.extOrTrunc(BitWidth);			Val = Val.extOrTrunc(BitWidth);
	Val.setIsSigned(IsSigned);			Val.setIsSigned(IsSigned);
	}			}

	/// Check the specified case value is in range for the given unpromoted switch			/// Check the specified case value is in range for the given unpromoted switch
	[73 lines not shown]
	if (S.Context.hasSameUnqualifiedType(CondType, CaseType))			if (S.Context.hasSameUnqualifiedType(CondType, CaseType))
	return;			return;

	S.Diag(Case->getExprLoc(), diag::warn_comparison_of_mixed_enum_types_switch)			S.Diag(Case->getExprLoc(), diag::warn_comparison_of_mixed_enum_types_switch)
	<< CondType << CaseType << Cond->getSourceRange()			<< CondType << CaseType << Cond->getSourceRange()
	<< Case->getSourceRange();			<< Case->getSourceRange();
	}			}

	StmtResult			StmtResult Sema::ActOnFinishSwitchStmt(SourceLocation SwitchLoc,
	Sema::ActOnFinishSwitchStmt(SourceLocation SwitchLoc, Stmt *Switch,			Stmt *BodyStmt) {
	Stmt *BodyStmt) {
	SwitchStmt *SS = cast<SwitchStmt>(Switch);			SwitchStmt *SS = getCurFunction()->SwitchStack.back();
	assert(SS == getCurFunction()->SwitchStack.back() &&			assert(SS && "switch stack missing push/pop!");
	"switch stack missing push/pop!");

	getCurFunction()->SwitchStack.pop_back();			getCurFunction()->SwitchStack.pop_back();

	if (!BodyStmt) return StmtError();			if (!BodyStmt) return StmtError();
	SS->setBody(BodyStmt, SwitchLoc);			SS->setBody(BodyStmt, SwitchLoc);

	Expr *CondExpr = SS->getCond();			Expr *CondExpr = SS->getCond();
	if (!CondExpr) return StmtError();			if (!CondExpr) return StmtError();
	[11 lines not shown]
	// type (before the promotion) doesn't make sense, even when it can			// type (before the promotion) doesn't make sense, even when it can
	// be represented by the promoted type. Therefore we need to find			// be represented by the promoted type. Therefore we need to find
	// the pre-promotion type of the switch condition.			// the pre-promotion type of the switch condition.
	if (!CondExpr->isTypeDependent()) {			if (!CondExpr->isTypeDependent()) {
	// We have already converted the expression to an integral or enumeration			// We have already converted the expression to an integral or enumeration
	// type, when we started the switch statement. If we don't have an			// type, when we started the switch statement. If we don't have an
	// appropriate type now, just return an error.			// appropriate type now, just return an error.
	if (!CondType->isIntegralOrEnumerationType())			if (!CondType->isIntegralOrEnumerationType())
	return StmtError();			return SS;

	if (CondExpr->isKnownToHaveBooleanValue()) {			if (CondExpr->isKnownToHaveBooleanValue()) {
	// switch(bool_expr) {...} is often a programmer error, e.g.			// switch(bool_expr) {...} is often a programmer error, e.g.
	// switch(n && mask) { ... } // Doh - should be "n & mask".			// switch(n && mask) { ... } // Doh - should be "n & mask".
	// One can always use an if statement instead of switch(bool_expr).			// One can always use an if statement instead of switch(bool_expr).
	Diag(SwitchLoc, diag::warn_bool_switch_condition)			Diag(SwitchLoc, diag::warn_bool_switch_condition)
	<< CondExpr->getSourceRange();			<< CondExpr->getSourceRange();
	}			}
	}			}

	// Get the bitwidth of the switched-on value after promotions. We must			// Get the bitwidth of the switched-on value after promotions. We must
	// convert the integer case values to this width before comparison.			// convert the integer case values to this width before comparison.
	bool HasDependentValue			bool HasDependentValueOrError = CondExpr->isTypeDependent() ||
	= CondExpr->isTypeDependent() || CondExpr->isValueDependent();			CondExpr->isValueDependent() ||
	unsigned CondWidth = HasDependentValue ? 0 : Context.getIntWidth(CondType);			isa<OpaqueValueExpr>(CondExpr);
				unsigned CondWidth =
				HasDependentValueOrError ? 0 : Context.getIntWidth(CondType);
	bool CondIsSigned = CondType->isSignedIntegerOrEnumerationType();			bool CondIsSigned = CondType->isSignedIntegerOrEnumerationType();

	// Get the width and signedness that the condition might actually have, for			// Get the width and signedness that the condition might actually have, for
	// warning purposes.			// warning purposes.
	// FIXME: Grab an IntRange for the condition rather than using the unpromoted			// FIXME: Grab an IntRange for the condition rather than using the unpromoted
	// type.			// type.
	unsigned CondWidthBeforePromotion			unsigned CondWidthBeforePromotion =
	= HasDependentValue ? 0 : Context.getIntWidth(CondTypeBeforePromotion);			HasDependentValueOrError ? 0
				: Context.getIntWidth(CondTypeBeforePromotion);
	bool CondIsSignedBeforePromotion			bool CondIsSignedBeforePromotion
	= CondTypeBeforePromotion->isSignedIntegerOrEnumerationType();			= CondTypeBeforePromotion->isSignedIntegerOrEnumerationType();

	// Accumulate all of the case values in a vector so that we can sort them			// Accumulate all of the case values in a vector so that we can sort them
	// and detect duplicates. This vector contains the APInt for the case after			// and detect duplicates. This vector contains the APInt for the case after
	// it has been converted to the condition type.			// it has been converted to the condition type.
	typedef SmallVector<std::pair<llvm::APSInt, CaseStmt*>, 64> CaseValsTy;			typedef SmallVector<std::pair<llvm::APSInt, CaseStmt*>, 64> CaseValsTy;
	CaseValsTy CaseVals;			CaseValsTy CaseVals;

	// Keep track of any GNU case ranges we see. The APSInt is the low value.			// Keep track of any GNU case ranges we see. The APSInt is the low value.
	typedef std::vector<std::pair<llvm::APSInt, CaseStmt*> > CaseRangesTy;			typedef std::vector<std::pair<llvm::APSInt, CaseStmt*> > CaseRangesTy;
	CaseRangesTy CaseRanges;			CaseRangesTy CaseRanges;

	DefaultStmt *TheDefaultStmt = nullptr;			DefaultStmt *TheDefaultStmt = nullptr;

	bool CaseListIsErroneous = false;			bool CaseListIsErroneous = false;

	for (SwitchCase *SC = SS->getSwitchCaseList(); SC && !HasDependentValue;			for (SwitchCase *SC = SS->getSwitchCaseList();
	SC = SC->getNextSwitchCase()) {			SC && !HasDependentValueOrError; SC = SC->getNextSwitchCase()) {

	if (DefaultStmt *DS = dyn_cast<DefaultStmt>(SC)) {			if (DefaultStmt *DS = dyn_cast<DefaultStmt>(SC)) {
	if (TheDefaultStmt) {			if (TheDefaultStmt) {
	Diag(DS->getDefaultLoc(), diag::err_multiple_default_labels_defined);			Diag(DS->getDefaultLoc(), diag::err_multiple_default_labels_defined);
	Diag(TheDefaultStmt->getDefaultLoc(), diag::note_duplicate_case_prev);			Diag(TheDefaultStmt->getDefaultLoc(), diag::note_duplicate_case_prev);

	// FIXME: Remove the default statement from the switch block so that			// FIXME: Remove the default statement from the switch block so that
	// we'll return a valid AST. This requires recursing down the AST and			// we'll return a valid AST. This requires recursing down the AST and
	// finding it, not something we are set up to do right now. For now,			// finding it, not something we are set up to do right now. For now,
	// just lop the entire switch stmt out of the AST.			// just lop the entire switch stmt out of the AST.
	CaseListIsErroneous = true;			CaseListIsErroneous = true;
	}			}
	TheDefaultStmt = DS;			TheDefaultStmt = DS;

	} else {			} else {
	CaseStmt *CS = cast<CaseStmt>(SC);			CaseStmt *CS = cast<CaseStmt>(SC);

	Expr *Lo = CS->getLHS();			Expr *Lo = CS->getLHS();

	if (Lo->isTypeDependent() || Lo->isValueDependent()) {			if (Lo->isTypeDependent() || Lo->isValueDependent()) {
	HasDependentValue = true;			HasDependentValueOrError = true;
	break;			break;
	}			}

	checkEnumTypesInSwitchStmt(*this, CondExpr, Lo);			checkEnumTypesInSwitchStmt(*this, CondExpr, Lo);

	llvm::APSInt LoVal;			llvm::APSInt LoVal;

	if (getLangOpts().CPlusPlus11) {			if (getLangOpts().CPlusPlus11) {
	[26 lines not shown]
	AdjustAPSInt(LoVal, CondWidth, CondIsSigned);			AdjustAPSInt(LoVal, CondWidth, CondIsSigned);

	CS->setLHS(Lo);			CS->setLHS(Lo);

	// If this is a case range, remember it in CaseRanges, otherwise CaseVals.			// If this is a case range, remember it in CaseRanges, otherwise CaseVals.
	if (CS->getRHS()) {			if (CS->getRHS()) {
	if (CS->getRHS()->isTypeDependent() ||			if (CS->getRHS()->isTypeDependent() ||
	CS->getRHS()->isValueDependent()) {			CS->getRHS()->isValueDependent()) {
	HasDependentValue = true;			HasDependentValueOrError = true;
	break;			break;
	}			}
	CaseRanges.push_back(std::make_pair(LoVal, CS));			CaseRanges.push_back(std::make_pair(LoVal, CS));
	} else			} else
	CaseVals.push_back(std::make_pair(LoVal, CS));			CaseVals.push_back(std::make_pair(LoVal, CS));
	}			}
	}			}

	if (!HasDependentValue) {			if (!HasDependentValueOrError) {
	// If we don't have a default statement, check whether the			// If we don't have a default statement, check whether the
	// condition is constant.			// condition is constant.
	llvm::APSInt ConstantCondValue;			llvm::APSInt ConstantCondValue;
	bool HasConstantCond = false;			bool HasConstantCond = false;
	if (!HasDependentValue && !TheDefaultStmt) {			if (!HasDependentValueOrError && !TheDefaultStmt) {
	HasConstantCond = CondExpr->EvaluateAsInt(ConstantCondValue, Context,			HasConstantCond = CondExpr->EvaluateAsInt(ConstantCondValue, Context,
	Expr::SE_AllowSideEffects);			Expr::SE_AllowSideEffects);
	assert(!HasConstantCond ||			assert(!HasConstantCond ||
	(ConstantCondValue.getBitWidth() == CondWidth &&			(ConstantCondValue.getBitWidth() == CondWidth &&
	ConstantCondValue.isSigned() == CondIsSigned));			ConstantCondValue.isSigned() == CondIsSigned));
	}			}
	bool ShouldCheckConstantCond = HasConstantCond;			bool ShouldCheckConstantCond = HasConstantCond;

	[147 lines not shown]
	while (CI != CaseVals.end() && CI->first < EI->first)			while (CI != CaseVals.end() && CI->first < EI->first)
	CI++;			CI++;

	if (CI != CaseVals.end() && CI->first == EI->first)			if (CI != CaseVals.end() && CI->first == EI->first)
	continue;			continue;

	// Drop unneeded case ranges			// Drop unneeded case ranges
	for (; RI != CaseRanges.end(); RI++) {			for (; RI != CaseRanges.end(); RI++) {
	llvm::APSInt Hi =			llvm::APSInt Hi =
	RI->second->getRHS()->EvaluateKnownConstInt(Context);			RI->second->getRHS()->EvaluateKnownConstInt(Context);
	AdjustAPSInt(Hi, CondWidth, CondIsSigned);			AdjustAPSInt(Hi, CondWidth, CondIsSigned);
	if (EI->first <= Hi)			if (EI->first <= Hi)
	break;			break;
	}			}

	if (RI == CaseRanges.end() || EI->first < RI->first) {			if (RI == CaseRanges.end() || EI->first < RI->first) {
	hasCasesNotInSwitch = true;			hasCasesNotInSwitch = true;
	UnhandledNames.push_back(EI->second->getDeclName());			UnhandledNames.push_back(EI->second->getDeclName());
	}			}
	}			}

	SS->setAllEnumCasesCovered();			SS->setAllEnumCasesCovered();
	}			}
	}			}

	if (BodyStmt)			if (BodyStmt)
	DiagnoseEmptyStmtBody(CondExpr->getLocEnd(), BodyStmt,			DiagnoseEmptyStmtBody(CondExpr->getLocEnd(), BodyStmt,
	diag::warn_empty_switch_body);			diag::warn_empty_switch_body);

	// FIXME: If the case list was broken is some way, we don't have a good system
	// to patch it up. Instead, just return the whole substmt as broken.
	if (CaseListIsErroneous)
	return StmtError();

	return SS;			return SS;
	}			}

	void			void
	Sema::DiagnoseAssignmentEnum(QualType DstType, QualType SrcType,			Sema::DiagnoseAssignmentEnum(QualType DstType, QualType SrcType,
	Expr *SrcExpr) {			Expr *SrcExpr) {
	if (Diags.isIgnored(diag::warn_not_in_enum_assignment, SrcExpr->getExprLoc()))			if (Diags.isIgnored(diag::warn_not_in_enum_assignment, SrcExpr->getExprLoc()))
	return;			return;
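
rsmith's inline comment above floats a wrapper `InvalidStmt` node that tree traversals could easily avoid walking through. As a rough illustration only, the following self-contained toy sketch shows how such a wrapper might behave. Everything in it (`Stmt`, `makeStmt`, `wrapInvalid`, `countKind`, `buildExample`) is a hypothetical name invented for this sketch; none of it is Clang's API, and no such node is part of this patch.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <utility>
#include <vector>

// Hypothetical toy AST node; this is NOT clang::Stmt.
struct Stmt {
  std::string Kind;              // e.g. "SwitchStmt", "DefaultStmt"
  bool IsInvalidWrapper = false; // true only for the hypothetical InvalidStmt
  std::vector<std::unique_ptr<Stmt>> Children;
};

std::unique_ptr<Stmt> makeStmt(std::string Kind) {
  auto S = std::make_unique<Stmt>();
  S->Kind = std::move(Kind);
  return S;
}

// Wrap a subtree that may not correspond to any valid source code.
std::unique_ptr<Stmt> wrapInvalid(std::unique_ptr<Stmt> Sub) {
  auto W = makeStmt("InvalidStmt");
  W->IsInvalidWrapper = true;
  W->Children.push_back(std::move(Sub));
  return W;
}

// Count nodes of a given kind. Strict consumers pass EnterInvalid = false
// and never see anything beneath an InvalidStmt; tolerant consumers such as
// a highlighter pass true and walk the whole tree.
int countKind(const Stmt &S, const std::string &Kind, bool EnterInvalid) {
  if (S.IsInvalidWrapper && !EnterInvalid)
    return 0;
  int N = (S.Kind == Kind) ? 1 : 0;
  for (const auto &C : S.Children)
    N += countKind(*C, Kind, EnterInvalid);
  return N;
}

// Build a body containing a switch with two `default` labels, kept in the
// tree behind the wrapper instead of being dropped.
std::unique_ptr<Stmt> buildExample() {
  auto Switch = makeStmt("SwitchStmt");
  Switch->Children.push_back(makeStmt("DefaultStmt"));
  Switch->Children.push_back(makeStmt("DefaultStmt"));
  auto Body = makeStmt("CompoundStmt");
  Body->Children.push_back(wrapInvalid(std::move(Switch)));
  return Body;
}
```

In this sketch, a consumer that asserts on "at most one `default` per switch" would call `countKind` with `EnterInvalid = false` and never reach the duplicate labels, while a highlighting tool would pass `true` and still see both. Whether a wrapper node or a per-node flag is the better design is left open by the review.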

lib/Sema/TreeTransform.h

	return getSema().ActOnIfStmt(IfLoc, IsConstexpr, Init, Cond, Then,			return getSema().ActOnIfStmt(IfLoc, IsConstexpr, Init, Cond, Then,
	ElseLoc, Else);			ElseLoc, Else);
	}			}

	/// \brief Start building a new switch statement.			/// \brief Start building a new switch statement.
	///			///
	/// By default, performs semantic analysis to build the new statement.			/// By default, performs semantic analysis to build the new statement.
	/// Subclasses may override this routine to provide different behavior.			/// Subclasses may override this routine to provide different behavior.
	StmtResult RebuildSwitchStmtStart(SourceLocation SwitchLoc, Stmt *Init,			void RebuildSwitchStmtStart(SourceLocation SwitchLoc, Stmt *Init,
	Sema::ConditionResult Cond) {			Sema::ConditionResult Cond) {
	return getSema().ActOnStartOfSwitchStmt(SwitchLoc, Init, Cond);			getSema().ActOnStartOfSwitchStmt(SwitchLoc, Init, Cond);
	}			}

	/// \brief Attach the body to the switch statement.			/// \brief Attach the body to the switch statement.
	///			///
	/// By default, performs semantic analysis to build the new statement.			/// By default, performs semantic analysis to build the new statement.
	/// Subclasses may override this routine to provide different behavior.			/// Subclasses may override this routine to provide different behavior.
	StmtResult RebuildSwitchStmtBody(SourceLocation SwitchLoc,			StmtResult RebuildSwitchStmtBody(SourceLocation SwitchLoc, Stmt *Body) {
	Stmt *Switch, Stmt *Body) {			return getSema().ActOnFinishSwitchStmt(SwitchLoc, Body);
	return getSema().ActOnFinishSwitchStmt(SwitchLoc, Switch, Body);
	}			}

	/// \brief Build a new while statement.			/// \brief Build a new while statement.
	///			///
	/// By default, performs semantic analysis to build the new statement.			/// By default, performs semantic analysis to build the new statement.
	/// Subclasses may override this routine to provide different behavior.			/// Subclasses may override this routine to provide different behavior.
	StmtResult RebuildWhileStmt(SourceLocation WhileLoc,			StmtResult RebuildWhileStmt(SourceLocation WhileLoc,
	Sema::ConditionResult Cond, Stmt *Body) {			Sema::ConditionResult Cond, Stmt *Body) {
	// Transform the condition.			// Transform the condition.
	Sema::ConditionResult Cond = getDerived().TransformCondition(			Sema::ConditionResult Cond = getDerived().TransformCondition(
	S->getSwitchLoc(), S->getConditionVariable(), S->getCond(),			S->getSwitchLoc(), S->getConditionVariable(), S->getCond(),
	Sema::ConditionKind::Switch);			Sema::ConditionKind::Switch);
	if (Cond.isInvalid())			if (Cond.isInvalid())
	return StmtError();			return StmtError();

	// Rebuild the switch statement.			// Rebuild the switch statement.
	StmtResult Switch			getDerived().RebuildSwitchStmtStart(S->getSwitchLoc(), Init.get(), Cond);
	= getDerived().RebuildSwitchStmtStart(S->getSwitchLoc(), Init.get(), Cond);
	if (Switch.isInvalid())
	return StmtError();

	// Transform the body of the switch statement.			// Transform the body of the switch statement.
	StmtResult Body = getDerived().TransformStmt(S->getBody());			StmtResult Body = getDerived().TransformStmt(S->getBody());
	if (Body.isInvalid())			if (Body.isInvalid())
	return StmtError();			return StmtError();

	// Complete the switch statement.			// Complete the switch statement.
	return getDerived().RebuildSwitchStmtBody(S->getSwitchLoc(), Switch.get(),			return getDerived().RebuildSwitchStmtBody(S->getSwitchLoc(), Body.get());
	Body.get());
	}			}

	template<typename Derived>			template<typename Derived>
	StmtResult			StmtResult
	TreeTransform<Derived>::TransformWhileStmt(WhileStmt *S) {			TreeTransform<Derived>::TransformWhileStmt(WhileStmt *S) {
	// Transform the condition			// Transform the condition
	Sema::ConditionResult Cond = getDerived().TransformCondition(			Sema::ConditionResult Cond = getDerived().TransformCondition(
	S->getWhileLoc(), S->getConditionVariable(), S->getCond(),			S->getWhileLoc(), S->getConditionVariable(), S->getCond(),

test/Misc/ast-dump-invalid-switch.cpp

This file was added.

				// RUN: not %clang_cc1 -std=c++11 -triple x86_64-linux-gnu -fms-extensions -ast-dump -ast-dump-filter Test %s | FileCheck -check-prefix CHECK -strict-whitespace %s

				/* This test ensures that the AST is still complete, even for invalid code */

				namespace TestInvalidSwithCondition {
				int f(int x) {
				switch (_invalid_) {
				case 0:
				return 1;
				default:
				return 2;
				}
				}
				}

				// CHECK: NamespaceDecl {{.*}} TestInvalidSwithCondition
				// CHECK-NEXT: `-FunctionDecl
				// CHECK-NEXT: |-ParmVarDecl
				// CHECK-NEXT: `-CompoundStmt
				// CHECK-NEXT: `-SwitchStmt
				// CHECK-NEXT: |-<<<NULL>>>
				// CHECK-NEXT: |-<<<NULL>>>
				// CHECK-NEXT: |-OpaqueValueExpr
				// CHECK-NEXT: `-CompoundStmt
				// CHECK-NEXT: |-CaseStmt
				// CHECK-NEXT: | |-IntegerLiteral {{.*}} 'int' 0
				// CHECK-NEXT: | |-<<<NULL>>>
				// CHECK-NEXT: | `-ReturnStmt
				// CHECK-NEXT: | `-IntegerLiteral {{.*}} 'int' 1
				// CHECK-NEXT: `-DefaultStmt
				// CHECK-NEXT: `-ReturnStmt
				// CHECK-NEXT: `-IntegerLiteral {{.*}} 'int' 2

				namespace TestSwitchConditionNotIntegral {
				int g(int *x) {
				switch (x) {
				case 0:
				return 1;
				default:
				return 2;
				}
				}
				}

				// CHECK: NamespaceDecl {{.*}} TestSwitchConditionNotIntegral
				// CHECK-NEXT: `-FunctionDecl
				// CHECK-NEXT: |-ParmVarDecl
				// CHECK-NEXT: `-CompoundStmt
				// CHECK-NEXT: `-SwitchStmt
				// CHECK-NEXT: |-<<<NULL>>>
				// CHECK-NEXT: |-<<<NULL>>>
				// CHECK-NEXT: |-ImplicitCastExpr
				// CHECK-NEXT: | `-DeclRefExpr {{.*}} 'x' 'int *'
				// CHECK-NEXT: `-CompoundStmt
				// CHECK-NEXT: |-CaseStmt
				// CHECK-NEXT: | |-IntegerLiteral {{.*}} 'int' 0
				// CHECK-NEXT: | |-<<<NULL>>>
				// CHECK-NEXT: | `-ReturnStmt
				// CHECK-NEXT: | `-IntegerLiteral {{.*}} 'int' 1
				// CHECK-NEXT: `-DefaultStmt
				// CHECK-NEXT: `-ReturnStmt
				// CHECK-NEXT: `-IntegerLiteral {{.*}} 'int' 2

				namespace TestSwitchInvalidCases {
				int g(int x) {
				switch (x) {
				case _invalid_:
				return 1;
				case _invalid_:
				return 2;
				case x:
				return 3;
				default:
				return 4;
				default:
				return 5;
				}
				}
				}

				// CHECK: NamespaceDecl {{.*}} TestSwitchInvalidCases
				// CHECK-NEXT: `-FunctionDecl
				// CHECK-NEXT: |-ParmVarDecl
				// CHECK-NEXT: `-CompoundStmt
				// CHECK-NEXT: `-SwitchStmt
				// CHECK-NEXT: |-<<<NULL>>>
				// CHECK-NEXT: |-<<<NULL>>>
				// CHECK-NEXT: |-ImplicitCastExpr
				// CHECK-NEXT: | `-DeclRefExpr {{.*}}'x' 'int'
				// CHECK-NEXT: `-CompoundStmt
				// CHECK-NEXT: |-ReturnStmt
				// CHECK-NEXT: | `-IntegerLiteral {{.*}} 'int' 1
				// CHECK-NEXT: |-ReturnStmt
				// CHECK-NEXT: | `-IntegerLiteral {{.*}} 'int' 2
				// CHECK-NEXT: |-CaseStmt
				// CHECK-NEXT: | |-DeclRefExpr {{.*}} 'x' 'int'
				// CHECK-NEXT: | |-<<<NULL>>>
				// CHECK-NEXT: | `-ReturnStmt
				// CHECK-NEXT: | `-IntegerLiteral {{.*}} 'int' 3
				// CHECK-NEXT: |-DefaultStmt
				// CHECK-NEXT: | `-ReturnStmt
				// CHECK-NEXT: | `-IntegerLiteral {{.*}} 'int' 4
				// CHECK-NEXT: `-DefaultStmt
				// CHECK-NEXT: `-ReturnStmt
				// CHECK-NEXT: `-IntegerLiteral {{.*}} 'int' 5

test/SemaCXX/switch.cpp

	case Defined::a:	case Defined::a:
	break;	break;
	case (Defined)2: // expected-warning {{case value not in enumerated type 'OpaqueEnumWarnings::Defined'}}	case (Defined)2: // expected-warning {{case value not in enumerated type 'OpaqueEnumWarnings::Defined'}}
	break;	break;
	}	}
	}	}

	}	}

		namespace InvalidCondition {
		enum class color { red,
		blue,
		green };
		void test() {
		// When the condition is invalid, there should be no errors or warnings
		switch (invalidCode) { // expected-error {{use of undeclared identifier}}
		case 0:
		case -(1ll << 62) - 1:
		case (1ll << 62) + 1:
		case color::red:
		default:
		break;
		}
		}
		} // namespace InvalidCondition