This is an archive of the discontinued LLVM Phabricator instance.

Add a warning, flags and pragmas to limit the number of pre-processor tokens in a translation unit
ClosedPublic

Authored by hans on Jan 14 2020, 7:16 AM.

Details

Summary

See https://docs.google.com/document/d/1xMkTZMKx9llnMPgso0jrx3ankI4cv60xeZ0y4ksf4wc/preview for background discussion.

This adds a warning, flags and pragmas to limit the number of pre-processor tokens either at a certain point in a translation unit, or overall.

The idea is that this would allow projects to limit the size of certain widely included headers, or of translation units overall, as a way to insert backstops against header bloat and prevent compile-time regressions.
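
For illustration, a minimal sketch of what such a backstop might look like in a widely included header. The pragma spelling is the one the review converges on below (max_tokens_here), and the header name and limit are made-up examples rather than anything from the patch:

```
// big_header.h (hypothetical): cap how many preprocessor tokens have been
// lexed by the time this header has been fully processed.
#pragma once

#include <map>
#include <string>
#include <vector>

// Warn if the translation unit has produced more than 50,000 preprocessor
// tokens by this point (the limit is chosen purely for illustration).
#pragma clang max_tokens_here 50000
```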

What do you think?

Diff Detail

Event Timeline

hans created this revision.Jan 14 2020, 7:16 AM
rnk added a comment.Jan 21 2020, 4:08 PM

I waited to see if there was any other feedback, but I'm in favor of this.

Should we try to come up with better pragma names? clang max_tokens doesn't seem to call to mind what it does: warn if there have been more than this many tokens so far in the translation unit. max_file_tokens has to do with the number of tokens in the translation unit overall, but it uses the terminology "file" instead of "translation unit". The user could interpret that as being in the current source file, ignoring includes.

Some ideas for the immediate version:

  • clang max_tokens_so_far
  • clang max_tokens_lexed
  • clang max_tokens_here

Some ideas for end-of-tu:

  • clang max_translation_unit_tokens
  • clang max_tu_tokens
  • clang global_max_tokens
kimgr added a subscriber: kimgr.Jan 21 2020, 11:01 PM

I just want to say that finding the correlation between token count and compile time is a bit of a breakthrough! Could you expose a flag for printing token count so users can run their own analysis? Or does that already exist in baseline clang? It's easier to set a maximum for a codebase if the distribution is known.

Thanks!

hans added a comment.Jan 22 2020, 9:01 AM
In D72703#1832678, @rnk wrote:

I waited to see if there was any other feedback, but I'm in favor of this.

Should we try to come up with better pragma names? clang max_tokens doesn't seem to call to mind what it does: warn if there have been more than this many tokens so far in the translation unit. max_file_tokens has to do with the number of tokens in the translation unit overall, but it uses the terminology "file" instead of "translation unit". The user could interpret that as being in the current source file, ignoring includes.

Thanks for thinking about the names. I agree they are not ideal.

Some ideas for the immediate version:

  • clang max_tokens_so_far
  • clang max_tokens_lexed
  • clang max_tokens_here

I went with max_tokens because it's shorter, and I figured maybe the "here" could be implicit as most things happen where the pragma is. But since we'll also have the per-tu variant, maybe it makes sense to have a longer name. Of your alternatives I like max_tokens_here best.

Some ideas for end-of-tu:

  • clang max_translation_unit_tokens
  • clang max_tu_tokens
  • clang global_max_tokens

I went with "file" because tu is such a technical term and I'm not sure we generally use it in clang's interface. What do you think about max_tokens_total?

hans added a comment.Jan 22 2020, 9:06 AM

I just want to say that finding the correlation between token count and compile time is a bit of a breakthrough!

I assume the same correlation could also be found with lines of code, but I think token count is a better dimension to measure since it's less likely to be gamed, and it's also kind of the basic work unit that the compiler deals with.

Could you expose a flag for printing token count so users can run their own analysis? Or does that already exist in baseline clang? It's easier to set a maximum for a codebase if the distribution is known.

I used this patch with -fmax-tokens 1 and scraped the output for my measurements. I'd like to avoid adding a separate flag if we can.
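
Roughly, that measurement workflow looks like this (a sketch; the file name is hypothetical and the flag spelling is the one quoted above, which may differ in the final patch):

```
// probe.cc (hypothetical): any ordinary translation unit works; nothing
// special is needed in the source itself.
#include <vector>

int main() { return 0; }

// Compiling with an impossibly low limit makes the warning fire for every
// TU and report the actual token count, which can then be scraped from the
// build log to see the distribution across a codebase:
//
//   clang++ -c probe.cc -fmax-tokens 1
```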

rnk added a comment.Jan 22 2020, 10:59 AM

I like the max_tokens_here / max_tokens_total variants.

hans updated this revision to Diff 239944.Jan 23 2020, 10:52 AM

Doing max_tokens_here / max_tokens_total.

Please take another look.
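
For reference, a sketch of how the two renamed pragmas might be used side by side; the file names and limits are made up, and the argument syntax is assumed to match the earlier example:

```
// common.h (hypothetical): backstop for this header and everything it
// transitively includes, checked at the point where the pragma appears.
#pragma clang max_tokens_here 60000

// main.cc (hypothetical): backstop for the translation unit as a whole,
// checked once the entire TU has been lexed.
#pragma clang max_tokens_total 1500000
```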

rnk accepted this revision.Jan 23 2020, 2:47 PM

lgtm :)

This revision is now accepted and ready to land.Jan 23 2020, 2:47 PM

kimgr added a comment.

I assume the same correlation could also be found with lines of code, but I think token count is a better dimension to measure since it's less likely to be gamed, and it's also kind of the basic work unit that the compiler deals with.

Yeah, and code with lots of documentation comments isn't penalized.

Could you expose a flag for printing token count so users can run their own analysis? Or does that already exist in baseline clang? It's easier to set a maximum for a codebase if the distribution is known.

I used this patch with -fmax-tokens 1 and scraped the output for my measurements. I'd like to avoid adding a separate flag if we can.

Ah, it didn't occur to me that -fmax-tokens itself could be used like this. Thanks!

FWIW, I'm not ecstatic about max_tokens_here; I thought max_tokens_lexed had a nicer ring to it. /peanut.

hans added a comment.Jan 27 2020, 6:59 AM

FWIW, I'm not ecstatic about max_tokens_here; I thought max_tokens_lexed had a nicer ring to it. /peanut.

I mainly like the clarity of the difference between _here and _total. With _lexed, I don't think the difference would be as clear.

This revision was automatically updated to reflect the committed changes.
Herald added a project: Restricted Project.Jan 27 2020, 7:07 AM