This is an archive of the discontinued LLVM Phabricator instance.

Differential D36492

[time-report] Add preprocessor timer
Abandoned · Public

Authored by modocache on Aug 8 2017, 3:25 PM.

Details

Reviewers

vsk

Summary

gcc -ftime-report provides a breakdown of how much time GCC spends
doing preprocessing, parsing, template instantiation, and more:

g++ -ftime-report foo.cpp
Execution times (seconds)
 phase setup             :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall    1414 kB ( 8%) ggc
 phase parsing           :   0.10 (100%) usr   0.08 (100%) sys   0.18 (95%) wall   15955 kB (88%) ggc
 phase opt and generate  :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 5%) wall     489 kB ( 3%) ggc
 |name lookup            :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.04 (21%) wall    1054 kB ( 6%) ggc
 preprocessing           :   0.02 (20%) usr   0.01 (13%) sys   0.03 (16%) wall     896 kB ( 5%) ggc
 parser (global)         :   0.00 ( 0%) usr   0.03 (38%) sys   0.04 (21%) wall    6603 kB (37%) ggc
 parser struct body      :   0.04 (40%) usr   0.00 ( 0%) sys   0.04 (21%) wall    2791 kB (15%) ggc
 parser enumerator list  :   0.01 (10%) usr   0.00 ( 0%) sys   0.00 ( 0%) wall      44 kB ( 0%) ggc
 parser function body    :   0.00 ( 0%) usr   0.02 (25%) sys   0.02 (11%) wall    1047 kB ( 6%) ggc
 parser inl. meth. body  :   0.01 (10%) usr   0.01 (13%) sys   0.03 (16%) wall    1093 kB ( 6%) ggc
 template instantiation  :   0.02 (20%) usr   0.01 (12%) sys   0.02 (11%) wall    3280 kB (18%) ggc
 LRA non-specific        :   0.00 ( 0%) usr   0.00 ( 0%) sys   0.01 ( 5%) wall       0 kB ( 0%) ggc
 TOTAL                 :   0.10             0.08             0.19              18028 kB

clang -ftime-report gives an incredibly detailed breakdown of how long
each LLVM pass takes, but for front-end times it provides only the
generic "Clang front-end timer" and "Code Generation Time".
Here's an example of its output:
https://gist.github.com/modocache/d74833818107ed50d11387a5a4e3fb72

As a result, when attempting to diagnose slow compile times with Clang,
users are forced to use external profiling tools in order to determine
whether the bottleneck is in parsing, template instantiation, or LLVM.

This diff adds the first of several timers that aim to match
gcc -ftime-report: a timer that outputs the amount of time spent in the Clang
preprocessor.

Diff Detail

Build Status

Buildable 9296
Build 9296: arc lint + arc unit

Event Timeline

modocache created this revision. Aug 8 2017, 3:25 PM

Harbormaster completed remote builds in B9157: Diff 110291. Aug 8 2017, 3:25 PM

Thanks for working on this. Collecting better timing information in the frontend sgtm. It's cheap to do, and we can use the information to guide our efforts re: attacking the compile-time problem. Feel free to add me to future timing-related reviews. Regarding this specific patch:

Could you add a short test (perhaps in test/Misc?) that checks that -ftime-report prints out an entry for 'Preprocessing'?

It'd be nice to dump this timer from Preprocessor::PrintStats(), too.

lib/Lex/Preprocessor.cpp
661	I wonder whether this is too fine-grained. I think setting up a timer in Preprocessor::Lex() might capture more information. Would you mind experimenting with that?
661	Nitpick: it may be useful to add PPOpts::getTimer(), in case we find more sites where we need to either get back the PP timer or nullptr.

Thanks for the feedback, @vsk, I really appreciate it! I have some other work done for this on my local checkout, but I was going a little bonkers working on it without knowing whether people would want it merged or not. I'll update this with your feedback and upload the rest for review as well.

> It'd be nice to dump this timer from Preprocessor::PrintStats(), too.

Good idea, will do!

lib/Lex/Preprocessor.cpp
661	> I think setting up a timer in `Preprocessor::Lex()` might capture more information.
Yes, can do! As it happens, the timers in this current diff cover all of the event counters that are incremented and then printed in `Preprocessor::PrintStats()`, except two: `NumTokenPaste` and `NumFastTokenPaste`. Beginning the timer in the Lexer would allow me to measure the time it takes to do token pasting as well, so I think this is a great idea. Thanks!
> it may be useful to add `PPOpts::getTimer()`
Agreed, will do!

FWIW, I strongly support the idea of adding more detailed timers into the frontend. Thanks for working on it!
I probably won't be very helpful in reviewing this code, but I'd appreciate if you CC me in the future patches.

Thanks,
Michael

Add PreprocessorOptions::getTimer, and move the timer to the top of Preprocessor::Lex().

Oops, sorry, didn't mean to remove the subscribers. arc diff --verbatim strikes again.

modocache marked 3 inline comments as done. Aug 15 2017, 10:29 AM

Friendly ping! I think this is ready to be reviewed. It adds an additional row, Preprocessing, to the Miscellaneous Ungrouped Timers section of the clang -ftime-report output:

===-------------------------------------------------------------------------===
                         Miscellaneous Ungrouped Timers
===-------------------------------------------------------------------------===

   ---User Time---   --System Time--   --User+System--   ---Wall Time---  --- Name ---
   0.0360 ( 17.8%)   0.0060 ( 35.3%)   0.0420 ( 19.2%)   0.3440 ( 50.6%)  Code Generation Time
   0.0390 ( 19.3%)   0.0050 ( 29.4%)   0.0440 ( 20.1%)   0.1734 ( 25.5%)  LLVM IR Generation Time
   0.1270 ( 62.9%)   0.0060 ( 35.3%)   0.1330 ( 60.7%)   0.1622 ( 23.9%)  Preprocessing
   0.2020 (100.0%)   0.0170 (100.0%)   0.2190 (100.0%)   0.6796 (100.0%)  Total

Grouping these timers is something I'd like to explore in future commits.

LGTM.

This revision is now accepted and ready to land. Aug 16 2017, 11:59 AM

This looks really useful, thanks for working on this!

lib/Lex/Preprocessor.cpp
746	Doesn't this just start a timer and immediately end the timer? Shouldn't we do: `llvm::TimeRegion LexTime(PPOpts->getTimer())` so that the dtor gets called when this function returns and we track the time spent in this function? Also: this is a pretty hot function, and it looks like TimeRegion does some non-trivial work if time is being tracked. Have you tried testing this on a big c++ file with and without this patch and seeing what the difference in compile time looks like?
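The lifetime issue raised in this comment can be reproduced without LLVM. The following is a minimal sketch, not the code from the patch: `EventLog`, `ScopedRegion`, `lexWithTemporary` and `lexWithNamedGuard` are hypothetical names, and the guard only records events instead of measuring time.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for a timer: it only records start/stop events.
struct EventLog {
  std::vector<std::string> Events;
};

// Hypothetical RAII guard (not llvm::TimeRegion itself): records "start" on
// construction and "stop" on destruction, and ignores a null log.
class ScopedRegion {
  EventLog *Log;

public:
  explicit ScopedRegion(EventLog *L) : Log(L) {
    if (Log)
      Log->Events.push_back("start");
  }
  ~ScopedRegion() {
    if (Log)
      Log->Events.push_back("stop");
  }
  ScopedRegion(const ScopedRegion &) = delete;
  ScopedRegion &operator=(const ScopedRegion &) = delete;
};

// Unnamed temporary: it is destroyed at the end of its own statement, so
// "stop" is recorded before any of the work below runs.
inline void lexWithTemporary(EventLog &Log) {
  ScopedRegion{&Log};
  Log.Events.push_back("work");
}

// Named guard: it lives until the function returns, so the work is
// bracketed by "start" and "stop".
inline void lexWithNamedGuard(EventLog &Log) {
  ScopedRegion Region(&Log);
  // The guard has already recorded "start" at this point.
  assert(!Log.Events.empty() && Log.Events.back() == "start");
  Log.Events.push_back("work");
}
```

Calling `lexWithTemporary` on a fresh log records start, stop, work, while `lexWithNamedGuard` records start, work, stop. Only the second ordering attributes the work to the timed region.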

modocache planned changes to this revision. Aug 16 2017, 2:02 PM

modocache added inline comments.

lib/Lex/Preprocessor.cpp
746	Ah, yes you're right! Sorry about that. Actually keeping the timer alive for the duration of the method also reveals that the method is called recursively, which causes an assert, because timers can't be started twice. Another spot in Clang works around this with a "reference counted" timer: https://github.com/llvm-mirror/clang/blob/6ac9c51ede0a50cca13dd4ac03562c036f7a3f48/lib/CodeGen/CodeGenAction.cpp#L130-L134. I have a more generic version of this "reference counting timer" that I've been using for some of the other timers I've been adding; maybe I'll use it here as well.
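One way to picture the "reference counted" timer idea is a depth counter that lets nested starts and stops pair up, so that only the outermost pair touches the clock. This is a rough sketch under that assumption. It is neither the CodeGenAction.cpp code linked above nor the author's generic version, and `ReentrantTimer`, `ReentrantRegion` and `lexRecursive` are hypothetical names.

```cpp
#include <cassert>
#include <chrono>

// Hypothetical reentrant timer: nested start() calls only bump a depth
// counter, and the clock is read when the outermost region opens and closes.
class ReentrantTimer {
  using Clock = std::chrono::steady_clock;
  Clock::time_point Begin{};
  Clock::duration Total{};
  unsigned Depth = 0;

public:
  void start() {
    if (Depth++ == 0)
      Begin = Clock::now();
  }
  void stop() {
    assert(Depth > 0 && "stop() without a matching start()");
    if (--Depth == 0)
      Total += Clock::now() - Begin;
  }
  unsigned depth() const { return Depth; }
  bool isRunning() const { return Depth != 0; }
  std::chrono::steady_clock::duration total() const { return Total; }
};

// RAII guard that tolerates a null timer (timing disabled).
class ReentrantRegion {
  ReentrantTimer *T;

public:
  explicit ReentrantRegion(ReentrantTimer *Timer) : T(Timer) {
    if (T)
      T->start();
  }
  ~ReentrantRegion() {
    if (T)
      T->stop();
  }
  ReentrantRegion(const ReentrantRegion &) = delete;
  ReentrantRegion &operator=(const ReentrantRegion &) = delete;
};

// Stand-in for an entry point that re-enters itself. Returns the nesting
// depth observed at the innermost call (0 when timing is disabled).
inline unsigned lexRecursive(ReentrantTimer *T, unsigned Levels) {
  ReentrantRegion Region(T);
  if (Levels == 0)
    return T ? T->depth() : 0;
  return lexRecursive(T, Levels - 1);
}
```

With this shape, a recursive call no longer trips a "started twice" assertion, and the accumulated total covers the outermost call only, so nested time is not counted twice.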

eduardo-elizondo added a subscriber: eduardo-elizondo. Aug 18 2017, 1:52 PM

eduardo-elizondo added inline comments.

include/clang/Lex/PreprocessorOptions.h
165	Should this be named "Lexing Time" or "Lexing" instead of "Preprocessing"?

modocache added inline comments. Aug 19 2017, 1:15 PM

include/clang/Lex/PreprocessorOptions.h
165	Good idea! Now that the timer's being started up in the `Preprocessor::Lex` method, it probably should be named "Lexing". Alternatively, I could move this into, say, `Lexer::Lex`. I guess there's not much of a distinction in Clang between "lexing" and "preprocessing." I had originally picked this name because that's what appears in `gcc -ftime-report`, but I guess we don't need to keep the names the same.

eduardo-elizondo removed a subscriber: eduardo-elizondo. Aug 20 2017, 6:28 PM

eduardo-elizondo added a subscriber: eduardo-elizondo.

MatzeB added a subscriber: MatzeB. Aug 22 2017, 1:13 PM

MatzeB added inline comments.

lib/Lex/Preprocessor.cpp
746	FWIW: I share Erik's concerns about compile time. Timers are enabled in optimized builds, and querying them is not free. So putting one into a function that is called a lot and is time-critical seems like a bad idea (do benchmarking to prove or disprove this!).

MatzeB mentioned this in D36847: [Support] Add reentrant start/stop Timer methods. Aug 23 2017, 9:55 AM

vsk added inline comments. Aug 23 2017, 10:19 AM

lib/Lex/Preprocessor.cpp
746	The timer is not started or queried unless -ftime-report is enabled. In the common case the overhead amounts to one extra null-check. And when we're collecting timing information, some performance degradation (say, within 5%) should be acceptable. I agree that we should get a sense for what the overhead is, but am not convinced that this should be a blocking issue.
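The "one extra null-check" argument relies on the accessor returning a null pointer when timing is off and on the region guard doing nothing for a null timer. Below is a minimal sketch of that pattern. The `getTimer()` shape follows the accessor in this diff, but `CountingTimer`, `Options`, `NullableRegion` and `lexOnce` are hypothetical stand-ins rather than LLVM or Clang types.

```cpp
#include <cassert>

// Hypothetical minimal timer that only counts how often it was started.
struct CountingTimer {
  bool Running = false;
  unsigned Starts = 0;

  void start() {
    assert(!Running && "timer started twice");
    Running = true;
    ++Starts;
  }
  void stop() {
    assert(Running && "timer stopped while idle");
    Running = false;
  }
};

// Stand-in for the options object: the timer is handed out only when
// timing was requested, otherwise callers receive nullptr.
class Options {
  CountingTimer PPTimer;

public:
  bool ShowTimers = false;

  CountingTimer *getTimer() { return ShowTimers ? &PPTimer : nullptr; }
  const CountingTimer &timer() const { return PPTimer; }
};

// Guard whose constructor and destructor each reduce to a null check
// when timing is disabled.
class NullableRegion {
  CountingTimer *T;

public:
  explicit NullableRegion(CountingTimer *Timer) : T(Timer) {
    if (T)
      T->start();
  }
  ~NullableRegion() {
    if (T)
      T->stop();
  }
  NullableRegion(const NullableRegion &) = delete;
  NullableRegion &operator=(const NullableRegion &) = delete;
};

// Stand-in for a hot function that is wrapped in a timed region.
inline void lexOnce(Options &Opts) {
  NullableRegion Region(Opts.getTimer());
  // ... token work would happen here ...
}
```

With `ShowTimers` left at false the timer is never started, so the per-call cost is the guard's construction and destruction with a null pointer. Whether that cost is negligible in a function as hot as `Lex()` is the open question in this thread and would still need measuring.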

avt77 added a subscriber: avt77. Feb 14 2018, 5:06 AM

avt77 added inline comments. Feb 14 2018, 5:40 AM

include/clang/Lex/PreprocessorOptions.h
165	Preprocessing does not mean lexing. It includes much more: for example, file inclusion, include paths elaboration, macro expansion, etc. We could merge all these parts or we could output the info for every single piece. In any case I think we need a group of timers related to the frontend (or maybe several groups: for pp, parser, Sema, codegen...). And maybe we need special switches to disable/enable the groups (-ftime-report could be used as a kind of super switch for all of them). As a result we need some repository of the timers, and most probably this repository should be kept in PreprocessorOpts. If there are no objections I can take this job on and come up with a new, more general version of the patch. If it's OK then I'd like to collect proposals about the required details of the time profile: what exactly would we like to see in the output? We have the gcc output as an initial example but maybe we need more (or fewer) details?
lib/Lex/Preprocessor.cpp
746	Here we have more than "one extra null-check": we're dealing with constructor/destructor as well (and it's for every Lex()). I agree that it should be acceptable for us but we have to profile it (maybe we should not profile Lex() but something above it).
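The timer-repository idea from this comment (named groups, each with its own enable switch) could look roughly like the sketch below. It is only an illustration of the proposal: `TimerRegistry` and its methods are hypothetical, the group names are made up, and nothing here reflects a design that was actually adopted.

```cpp
#include <cassert>
#include <chrono>
#include <map>
#include <string>

// Hypothetical repository of timers, organised into named groups that can
// be switched on and off independently.
class TimerRegistry {
public:
  using Duration = std::chrono::milliseconds;

  void setGroupEnabled(const std::string &Group, bool On) {
    Enabled[Group] = On;
  }

  bool isEnabled(const std::string &Group) const {
    auto It = Enabled.find(Group);
    return It != Enabled.end() && It->second;
  }

  // Adds a measurement to Group/Name; measurements for disabled groups
  // are dropped.
  void record(const std::string &Group, const std::string &Name, Duration D) {
    assert(D >= Duration::zero() && "durations are non-negative");
    if (!isEnabled(Group))
      return;
    Totals[Group][Name] += D;
  }

  // Returns the accumulated time for Group/Name, or zero if nothing was
  // recorded.
  Duration total(const std::string &Group, const std::string &Name) const {
    auto G = Totals.find(Group);
    if (G == Totals.end())
      return Duration::zero();
    auto N = G->second.find(Name);
    return N == G->second.end() ? Duration::zero() : N->second;
  }

private:
  std::map<std::string, bool> Enabled;
  std::map<std::string, std::map<std::string, Duration>> Totals;
};
```

A single top-level switch, as suggested for -ftime-report, would then amount to calling `setGroupEnabled` for every known group.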

modocache abandoned this revision. Feb 14 2018, 7:40 AM

modocache added inline comments.

include/clang/Lex/PreprocessorOptions.h
165	> If there are no objections I can take this job on and come up with a new, more general version of the patch. If it's OK then I'd like to collect proposals about the required details of the time profile: what exactly would we like to see in the output? We have the gcc output as an initial example but maybe we need more (or fewer) details?
Yes, absolutely! This sounds fantastic, thank you for doing this work. I'll abandon this diff so it doesn't clog up anyone else's review queues. Thanks again!

lrineau added a subscriber: lrineau. Feb 13 2019, 2:09 AM

Herald added a subscriber: jdoerfert. Feb 13 2019, 2:09 AM

Revision Contents

Path	Size
include/clang/Lex/PreprocessorOptions.h	19 lines
lib/Frontend/CompilerInstance.cpp	4 lines
lib/Lex/Preprocessor.cpp	2 lines

Diff 111195

include/clang/Lex/PreprocessorOptions.h

//===--- PreprocessorOptions.h ----------------------------------- C++ --===//		//===--- PreprocessorOptions.h ----------------------------------- C++ --===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_CLANG_LEX_PREPROCESSOROPTIONS_H_		#ifndef LLVM_CLANG_LEX_PREPROCESSOROPTIONS_H_
#define LLVM_CLANG_LEX_PREPROCESSOROPTIONS_H_		#define LLVM_CLANG_LEX_PREPROCESSOROPTIONS_H_

#include "clang/Basic/SourceLocation.h"		#include "clang/Basic/SourceLocation.h"
#include "llvm/ADT/IntrusiveRefCntPtr.h"		#include "llvm/ADT/IntrusiveRefCntPtr.h"
#include "llvm/ADT/StringRef.h"		#include "llvm/ADT/StringRef.h"
#include "llvm/ADT/StringSet.h"		#include "llvm/ADT/StringSet.h"
		#include "llvm/Support/Timer.h"
#include <cassert>		#include <cassert>
#include <set>		#include <set>
#include <string>		#include <string>
#include <utility>		#include <utility>
#include <vector>		#include <vector>

namespace llvm {		namespace llvm {
class MemoryBuffer;		class MemoryBuffer;
}		}

namespace clang {		namespace clang {

class Preprocessor;		class Preprocessor;
class LangOptions;		class LangOptions;

/// \brief Enumerate the kinds of standard library that		/// \brief Enumerate the kinds of standard library that
enum ObjCXXARCStandardLibraryKind {		enum ObjCXXARCStandardLibraryKind {
ARCXX_nolib,		ARCXX_nolib,
/// \brief libc++		/// \brief libc++
ARCXX_libcxx,		ARCXX_libcxx,
/// \brief libstdc++		/// \brief libstdc++
ARCXX_libstdcxx		ARCXX_libstdcxx
};		};

/// PreprocessorOptions - This class is used for passing the various options		/// PreprocessorOptions - This class is used for passing the various options
/// used in preprocessor initialization to InitializePreprocessor().		/// used in preprocessor initialization to InitializePreprocessor().
class PreprocessorOptions {		class PreprocessorOptions {
		llvm::Timer PPTimer;
public:		public:
std::vector<std::pair<std::string, bool/*isUndef*/> > Macros;		std::vector<std::pair<std::string, bool/*isUndef*/> > Macros;
std::vector<std::string> Includes;		std::vector<std::string> Includes;
std::vector<std::string> MacroIncludes;		std::vector<std::string> MacroIncludes;

/// \brief Initialize the preprocessor with the compiler and target specific		/// \brief Initialize the preprocessor with the compiler and target specific
/// predefines.		/// predefines.
unsigned UsePredefines : 1;		unsigned UsePredefines : 1;
[70 lines not shown]	public:

/// \brief Whether the compiler instance should retain (i.e., not free)		/// \brief Whether the compiler instance should retain (i.e., not free)
/// the buffers associated with remapped files.		/// the buffers associated with remapped files.
///		///
/// This flag defaults to false; it can be set true only through direct		/// This flag defaults to false; it can be set true only through direct
/// manipulation of the compiler invocation object, in cases where the		/// manipulation of the compiler invocation object, in cases where the
/// compiler invocation and its buffers will be reused.		/// compiler invocation and its buffers will be reused.
bool RetainRemappedFileBuffers;		bool RetainRemappedFileBuffers;

		/// \brief Whether to measure the amount of time spent in code related to
		/// preprocessing. This flag defaults to false.
		bool ShowTimers;

/// \brief The Objective-C++ ARC standard library that we should support,		/// \brief The Objective-C++ ARC standard library that we should support,
/// by providing appropriate definitions to retrofit the standard library		/// by providing appropriate definitions to retrofit the standard library
/// with support for lifetime-qualified pointers.		/// with support for lifetime-qualified pointers.
ObjCXXARCStandardLibraryKind ObjCXXARCStandardLibrary;		ObjCXXARCStandardLibraryKind ObjCXXARCStandardLibrary;

/// \brief Records the set of modules		/// \brief Records the set of modules
class FailedModulesSet {		class FailedModulesSet {
llvm::StringSet<> Failed;		llvm::StringSet<> Failed;
[12 lines not shown]	public:
///		///
/// This pointer will be shared among all of the compiler instances created		/// This pointer will be shared among all of the compiler instances created
/// to (re)build modules, so that once a module fails to build anywhere,		/// to (re)build modules, so that once a module fails to build anywhere,
/// other instances will see that the module has failed and won't try to		/// other instances will see that the module has failed and won't try to
/// build it again.		/// build it again.
std::shared_ptr<FailedModulesSet> FailedModules;		std::shared_ptr<FailedModulesSet> FailedModules;

public:		public:
PreprocessorOptions() : UsePredefines(true), DetailedRecord(false),		PreprocessorOptions() : PPTimer("preprocessor", "Preprocessing"),
		UsePredefines(true),
		DetailedRecord(false),
DisablePCHValidation(false),		DisablePCHValidation(false),
AllowPCHWithCompilerErrors(false),		AllowPCHWithCompilerErrors(false),
DumpDeserializedPCHDecls(false),		DumpDeserializedPCHDecls(false),
PrecompiledPreambleBytes(0, true),		PrecompiledPreambleBytes(0, true),
GeneratePreamble(false),		GeneratePreamble(false),
RemappedFilesKeepOriginalName(true),		RemappedFilesKeepOriginalName(true),
RetainRemappedFileBuffers(false),		RetainRemappedFileBuffers(false),
		ShowTimers(false),
ObjCXXARCStandardLibrary(ARCXX_nolib) { }		ObjCXXARCStandardLibrary(ARCXX_nolib) { }

void addMacroDef(StringRef Name) { Macros.emplace_back(Name, false); }		void addMacroDef(StringRef Name) { Macros.emplace_back(Name, false); }
void addMacroUndef(StringRef Name) { Macros.emplace_back(Name, true); }		void addMacroUndef(StringRef Name) { Macros.emplace_back(Name, true); }
void addRemappedFile(StringRef From, StringRef To) {		void addRemappedFile(StringRef From, StringRef To) {
RemappedFiles.emplace_back(From, To);		RemappedFiles.emplace_back(From, To);
}		}

[17 lines not shown]	void resetNonModularOptions() {
ImplicitPTHInclude.clear();		ImplicitPTHInclude.clear();
TokenCache.clear();		TokenCache.clear();
SingleFileParseMode = false;		SingleFileParseMode = false;
LexEditorPlaceholders = true;		LexEditorPlaceholders = true;
RetainRemappedFileBuffers = true;		RetainRemappedFileBuffers = true;
PrecompiledPreambleBytes.first = 0;		PrecompiledPreambleBytes.first = 0;
PrecompiledPreambleBytes.second = 0;		PrecompiledPreambleBytes.second = 0;
}		}

		llvm::Timer *getTimer() {
		return ShowTimers ? &PPTimer : nullptr;
		}
};		};

} // end namespace clang		} // end namespace clang

#endif		#endif

lib/Frontend/CompilerInstance.cpp

[361 lines not shown]	static void InitializeFileRemapping(DiagnosticsEngine &Diags,

SourceMgr.setOverridenFilesKeepOriginalName(		SourceMgr.setOverridenFilesKeepOriginalName(
InitOpts.RemappedFilesKeepOriginalName);		InitOpts.RemappedFilesKeepOriginalName);
}		}

// Preprocessor		// Preprocessor

void CompilerInstance::createPreprocessor(TranslationUnitKind TUKind) {		void CompilerInstance::createPreprocessor(TranslationUnitKind TUKind) {
const PreprocessorOptions &PPOpts = getPreprocessorOpts();		PreprocessorOptions &PPOpts = getPreprocessorOpts();
		if (getFrontendOpts().ShowTimers)
		PPOpts.ShowTimers = true;

// Create a PTH manager if we are using some form of a token cache.		// Create a PTH manager if we are using some form of a token cache.
PTHManager *PTHMgr = nullptr;		PTHManager *PTHMgr = nullptr;
if (!PPOpts.TokenCache.empty())		if (!PPOpts.TokenCache.empty())
PTHMgr = PTHManager::Create(PPOpts.TokenCache, getDiagnostics());		PTHMgr = PTHManager::Create(PPOpts.TokenCache, getDiagnostics());

// Create the Preprocessor.		// Create the Preprocessor.
HeaderSearch *HeaderInfo =		HeaderSearch *HeaderInfo =
[1,686 lines not shown]

lib/Lex/Preprocessor.cpp

[652 lines not shown]
/// IdentifierInfo methods that compute these properties will need to change to		/// IdentifierInfo methods that compute these properties will need to change to
/// match.		/// match.
bool Preprocessor::HandleIdentifier(Token &Identifier) {		bool Preprocessor::HandleIdentifier(Token &Identifier) {
assert(Identifier.getIdentifierInfo() &&		assert(Identifier.getIdentifierInfo() &&
"Can't handle identifiers without identifier info!");		"Can't handle identifiers without identifier info!");

IdentifierInfo &II = *Identifier.getIdentifierInfo();		IdentifierInfo &II = *Identifier.getIdentifierInfo();

// If the information about this identifier is out of date, update it from		// If the information about this identifier is out of date, update it from
// the external source.		// the external source.
// We have to treat __VA_ARGS__ in a special way, since it gets		// We have to treat __VA_ARGS__ in a special way, since it gets
// serialized with isPoisoned = true, but our preprocessor may have		// serialized with isPoisoned = true, but our preprocessor may have
// unpoisoned it if we're defining a C99 macro.		// unpoisoned it if we're defining a C99 macro.
if (II.isOutOfDate()) {		if (II.isOutOfDate()) {
bool CurrentIsPoisoned = false;		bool CurrentIsPoisoned = false;
if (&II == Ident__VA_ARGS__)		if (&II == Ident__VA_ARGS__)
CurrentIsPoisoned = Ident__VA_ARGS__->isPoisoned();		CurrentIsPoisoned = Ident__VA_ARGS__->isPoisoned();
[68 lines not shown]	if (((LastTokenWasAt && II.isModulesImport()) ||
ModuleImportPath.clear();		ModuleImportPath.clear();
ModuleImportExpectsIdentifier = true;		ModuleImportExpectsIdentifier = true;
CurLexerKind = CLK_LexAfterModuleImport;		CurLexerKind = CLK_LexAfterModuleImport;
}		}
return true;		return true;
}		}

void Preprocessor::Lex(Token &Result) {		void Preprocessor::Lex(Token &Result) {
		llvm::TimeRegion(PPOpts->getTimer());

// We loop here until a lex function returns a token; this avoids recursion.		// We loop here until a lex function returns a token; this avoids recursion.
bool ReturnedToken;		bool ReturnedToken;
do {		do {
switch (CurLexerKind) {		switch (CurLexerKind) {
case CLK_Lexer:		case CLK_Lexer:
ReturnedToken = CurLexer->Lex(Result);		ReturnedToken = CurLexer->Lex(Result);
break;		break;
case CLK_PTHLexer:		case CLK_PTHLexer:
[206 lines not shown]