This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
cfe/trunk/
-
trunk/
-
include/clang/Lex/
-
clang/
-
Lex/
-
Preprocessor.h
-
lib/Lex/
-
Lex/
-
PPCaching.cpp
-
Pragma.cpp
-
test/CodeCompletion/
-
CodeCompletion/
-
pragma-macro-token-caching.c

Differential D28772

[Preprocessor] Fix incorrect token caching that occurs when lexing _Pragma in macro argument pre-expansion mode when skipping a function body
ClosedPublic

Authored by arphaman on Jan 16 2017, 7:29 AM.

Download Raw Diff

Details

Reviewers

bruno
akyrtzi
rsmith

Commits

rG24a1bedf765f: [Preprocessor] Fix incorrect token caching that occurs when lexing _Pragma in…
rC296140: [Preprocessor] Fix incorrect token caching that occurs when lexing _Pragma
rL296140: [Preprocessor] Fix incorrect token caching that occurs when lexing _Pragma

Summary

This patch fixes a token caching problem that currently occurs when clang is skipping a function body (e.g. when looking for a code completion token) and at the same time caching the tokens for _Pragma when lexing it in macro argument pre-expansion mode.

When _Pragma is being lexed in macro argument pre-expansion mode, it caches the tokens so that it can avoid interpreting the pragma immediately (as the macro argument may not be used in the macro body), and then either backtracks over or commits these tokens. The problem is that, when we're backtracking/committing in such a scenario, there's already a previous backtracking position stored in BacktrackPositions (as we're skipping the function body), and this leads to a situation where the cached tokens from the pragma (like '(' 'string_literal' and ')') will remain in the cached tokens array incorrectly even after they're consumed (in the case of backtracking) or just ignored (in the case when they're committed). Furthermore, what makes it even worse, is that because of a previous backtracking position, the logic that deals with when should we call ExitCachingLexMode in CachingLex no longer works for us in this situation, and more tokens in the macro argument get cached, to the point where the EOF token that corresponds to the macro argument EOF is cached. This problem leads to all sorts of issues in code completion mode, where incorrect errors get presented and code completion completely fails to produce completion results.

Thanks for taking a look

Diff Detail

Repository: rL LLVM

Event Timeline

arphaman updated this revision to Diff 84561.Jan 16 2017, 7:29 AM

arphaman retitled this revision from to [Preprocessor] Fix incorrect token caching that occurs when lexing _Pragma in macro argument pre-expansion mode when skipping a function body.

arphaman updated this object.

arphaman added reviewers: bruno, rsmith, akyrtzi.

arphaman set the repository for this revision to rL LLVM.

arphaman added a subscriber: cfe-commits.

Herald added a subscriber: nemanjai. · View Herald TranscriptJan 16 2017, 7:29 AM

Can we instead address this locally in _Pragma handling, by getting it to clear out the junk it inserted into the token stream when it's done (if backtracking is enabled)?

Sorry about the delay.
As per Richard's suggestion, the updated patch now makes the _Pragma parser responsible for initiating the removal of cached tokens.

Ping.

LGTM!

This revision is now accepted and ready to land.Feb 23 2017, 5:22 PM

Closed by commit rL296140: [Preprocessor] Fix incorrect token caching that occurs when lexing _Pragma (authored by arphaman). · Explain WhyFeb 24 2017, 9:57 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

cfe/

trunk/

include/

clang/

Lex/

Preprocessor.h

18 lines

lib/

Lex/

PPCaching.cpp

30 lines

Pragma.cpp

11 lines

test/

CodeCompletion/

pragma-macro-token-caching.c

18 lines

Diff 89688

cfe/trunk/include/clang/Lex/Preprocessor.h

Show First 20 Lines • Show All 1,071 Lines • ▼ Show 20 Lines	public:
/// at some point after EnableBacktrackAtThisPos. If you don't, caching of		/// at some point after EnableBacktrackAtThisPos. If you don't, caching of
/// tokens will continue indefinitely.		/// tokens will continue indefinitely.
///		///
void EnableBacktrackAtThisPos();		void EnableBacktrackAtThisPos();

/// \brief Disable the last EnableBacktrackAtThisPos call.		/// \brief Disable the last EnableBacktrackAtThisPos call.
void CommitBacktrackedTokens();		void CommitBacktrackedTokens();

		struct CachedTokensRange {
		CachedTokensTy::size_type Begin, End;
		};

		private:
		/// \brief A range of cached tokens that should be erased after lexing
		/// when backtracking requires the erasure of such cached tokens.
		Optional<CachedTokensRange> CachedTokenRangeToErase;

		public:
		/// \brief Returns the range of cached tokens that were lexed since
		/// EnableBacktrackAtThisPos() was previously called.
		CachedTokensRange LastCachedTokenRange();

		/// \brief Erase the range of cached tokens that were lexed since
		/// EnableBacktrackAtThisPos() was previously called.
		void EraseCachedTokens(CachedTokensRange TokenRange);

/// \brief Make Preprocessor re-lex the tokens that were lexed since		/// \brief Make Preprocessor re-lex the tokens that were lexed since
/// EnableBacktrackAtThisPos() was previously called.		/// EnableBacktrackAtThisPos() was previously called.
void Backtrack();		void Backtrack();

/// \brief True if EnableBacktrackAtThisPos() was called and		/// \brief True if EnableBacktrackAtThisPos() was called and
/// caching of tokens is on.		/// caching of tokens is on.
bool isBacktrackEnabled() const { return !BacktrackPositions.empty(); }		bool isBacktrackEnabled() const { return !BacktrackPositions.empty(); }

▲ Show 20 Lines • Show All 886 Lines • Show Last 20 Lines

cfe/trunk/lib/Lex/PPCaching.cpp

	Show All 29 Lines

	// Disable the last EnableBacktrackAtThisPos call.			// Disable the last EnableBacktrackAtThisPos call.
	void Preprocessor::CommitBacktrackedTokens() {			void Preprocessor::CommitBacktrackedTokens() {
	assert(!BacktrackPositions.empty()			assert(!BacktrackPositions.empty()
	&& "EnableBacktrackAtThisPos was not called!");			&& "EnableBacktrackAtThisPos was not called!");
	BacktrackPositions.pop_back();			BacktrackPositions.pop_back();
	}			}

				Preprocessor::CachedTokensRange Preprocessor::LastCachedTokenRange() {
				assert(isBacktrackEnabled());
				auto PrevCachedLexPos = BacktrackPositions.back();
				return CachedTokensRange{PrevCachedLexPos, CachedLexPos};
				}

				void Preprocessor::EraseCachedTokens(CachedTokensRange TokenRange) {
				assert(TokenRange.Begin <= TokenRange.End);
				if (CachedLexPos == TokenRange.Begin && TokenRange.Begin != TokenRange.End) {
				// We have backtracked to the start of the token range as we want to consume
				// them again. Erase the tokens only after consuming then.
				assert(!CachedTokenRangeToErase);
				CachedTokenRangeToErase = TokenRange;
				return;
				}
				// The cached tokens were committed, so they should be erased now.
				assert(TokenRange.End == CachedLexPos);
				CachedTokens.erase(CachedTokens.begin() + TokenRange.Begin,
				CachedTokens.begin() + TokenRange.End);
				CachedLexPos = TokenRange.Begin;
				ExitCachingLexMode();
				}

	// Make Preprocessor re-lex the tokens that were lexed since			// Make Preprocessor re-lex the tokens that were lexed since
	// EnableBacktrackAtThisPos() was previously called.			// EnableBacktrackAtThisPos() was previously called.
	void Preprocessor::Backtrack() {			void Preprocessor::Backtrack() {
	assert(!BacktrackPositions.empty()			assert(!BacktrackPositions.empty()
	&& "EnableBacktrackAtThisPos was not called!");			&& "EnableBacktrackAtThisPos was not called!");
	CachedLexPos = BacktrackPositions.back();			CachedLexPos = BacktrackPositions.back();
	BacktrackPositions.pop_back();			BacktrackPositions.pop_back();
	recomputeCurLexerKind();			recomputeCurLexerKind();
	}			}

	void Preprocessor::CachingLex(Token &Result) {			void Preprocessor::CachingLex(Token &Result) {
	if (!InCachingLexMode())			if (!InCachingLexMode())
	return;			return;

	if (CachedLexPos < CachedTokens.size()) {			if (CachedLexPos < CachedTokens.size()) {
	Result = CachedTokens[CachedLexPos++];			Result = CachedTokens[CachedLexPos++];
				// Erase the some of the cached tokens after they are consumed when
				// asked to do so.
				if (CachedTokenRangeToErase &&
				CachedTokenRangeToErase->End == CachedLexPos) {
				EraseCachedTokens(*CachedTokenRangeToErase);
				CachedTokenRangeToErase = None;
				}
	return;			return;
	}			}

	ExitCachingLexMode();			ExitCachingLexMode();
	Lex(Result);			Lex(Result);

	if (isBacktrackEnabled()) {			if (isBacktrackEnabled()) {
	// Cache the lexed token.			// Cache the lexed token.
	▲ Show 20 Lines • Show All 83 Lines • Show Last 20 Lines

cfe/trunk/lib/Lex/Pragma.cpp

Show First 20 Lines • Show All 154 Lines • ▼ Show 20 Lines	LexingFor_PragmaRAII(Preprocessor &PP, bool InMacroArgPreExpansion,
if (InMacroArgPreExpansion) {		if (InMacroArgPreExpansion) {
PragmaTok = OutTok;		PragmaTok = OutTok;
PP.EnableBacktrackAtThisPos();		PP.EnableBacktrackAtThisPos();
}		}
}		}

~LexingFor_PragmaRAII() {		~LexingFor_PragmaRAII() {
if (InMacroArgPreExpansion) {		if (InMacroArgPreExpansion) {
		// When committing/backtracking the cached pragma tokens in a macro
		// argument pre-expansion we want to ensure that either the tokens which
		// have been committed will be removed from the cache or that the tokens
		// over which we just backtracked won't remain in the cache after they're
		// consumed and that the caching will stop after consuming them.
		// Otherwise the caching will interfere with the way macro expansion
		// works, because we will continue to cache tokens after consuming the
		// backtracked tokens, which shouldn't happen when we're dealing with
		// macro argument pre-expansion.
		auto CachedTokenRange = PP.LastCachedTokenRange();
if (Failed) {		if (Failed) {
PP.CommitBacktrackedTokens();		PP.CommitBacktrackedTokens();
} else {		} else {
PP.Backtrack();		PP.Backtrack();
OutTok = PragmaTok;		OutTok = PragmaTok;
}		}
		PP.EraseCachedTokens(CachedTokenRange);
}		}
}		}

void failed() {		void failed() {
Failed = true;		Failed = true;
}		}
};		};

▲ Show 20 Lines • Show All 1,381 Lines • Show Last 20 Lines

cfe/trunk/test/CodeCompletion/pragma-macro-token-caching.c


				#define Outer(action) action

				void completeParam(int param) {
				;
				Outer(__extension__({ _Pragma("clang diagnostic push") }));
				param;
				}

				// RUN: %clang_cc1 -fsyntax-only -code-completion-at=%s:7:1 %s \| FileCheck %s
				// CHECK: param : [#int#]param

				void completeParamPragmaError(int param) {
				Outer(__extension__({ _Pragma(2) })); // expected-error {{_Pragma takes a parenthesized string literal}}
				param;
				}

				// RUN: %clang_cc1 -fsyntax-only -verify -code-completion-at=%s:16:1 %s \| FileCheck %s