This is an archive of the discontinued LLVM Phabricator instance.

Differential D114095

[clang][lex] Include tracking: simplify and move to preprocessor
ClosedPublic

Authored by jansvoboda11 on Nov 17 2021, 8:20 AM.

Details

Reviewers

Bigcheese
dexonsmith
vsapsai

Commits

rGf72027233044: [clang][lex] Include tracking: simplify and move to preprocessor

Summary

This patch replaces the exact include count of each file in HeaderFileInfo with a set of included files in Preprocessor.

The number of includes isn't a property of a header file but rather a preprocessor state. The exact number of includes is not used anywhere except statistic tracking.
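As a rough illustration of the idea in the summary, include tracking can be modeled as set membership owned by the preprocessor rather than a per-file counter. The sketch below is not the patch itself: `FileEntry` is a stub, `MiniPreprocessor` and `numIncludedFiles` are hypothetical names, and `std::unordered_set` stands in for `llvm::DenseSet`.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <unordered_set>

// Hypothetical stand-in for clang's FileEntry; only pointer identity matters.
struct FileEntry {
  std::string Name;
};

// Hypothetical miniature of the preprocessor-side state.
class MiniPreprocessor {
  // One entry per file that has been included at least once.
  std::unordered_set<const FileEntry *> IncludedFiles;

public:
  // Record the inclusion; returns true only the first time a file is seen.
  bool markIncluded(const FileEntry *File) {
    return IncludedFiles.insert(File).second;
  }

  // True if the file has been included at least once.
  bool alreadyIncluded(const FileEntry *File) const {
    return IncludedFiles.count(File) != 0;
  }

  // Number of distinct files included so far (illustrative helper).
  std::size_t numIncludedFiles() const { return IncludedFiles.size(); }
};
```

Including the same file twice leaves the set unchanged, which is all the "has this been included?" queries need; the exact count is simply no longer stored.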

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jansvoboda11 created this revision. Nov 17 2021, 8:20 AM

Herald added a subscriber: mgrang. Nov 17 2021, 8:20 AM

jansvoboda11 requested review of this revision. Nov 17 2021, 8:20 AM

Herald added a project: Restricted Project. Nov 17 2021, 8:20 AM

Herald added a subscriber: cfe-commits.

jansvoboda11 added a child revision: D114096: [clang][lex][modules] Stop tracking number of includes. Nov 17 2021, 8:22 AM

Harbormaster completed remote builds in B134746: Diff 387950. Nov 17 2021, 9:10 AM

I'm not sure this commit stands on its own, since it introduces a (temporary) regression in .pcm size; I suggest squashing with the follow-up, https://reviews.llvm.org/D114096.

clang/include/clang/Lex/HeaderSearch.h
61	Note that `clang-format` is complaining about a trailing space / here.

dexonsmith mentioned this in D114096: [clang][lex][modules] Stop tracking number of includes. Nov 17 2021, 12:54 PM

Squash with D114096.

jansvoboda11 retitled this revision from "[clang][lex][modules] Move number of includes from HeaderFileInfo to Preprocessor" to "[clang][lex] Include tracking: simplify and move to preprocessor". Nov 18 2021, 9:10 AM

jansvoboda11 edited the summary of this revision.

jansvoboda11 added a child revision: D112915: [clang][modules] Track included files per submodule. Nov 18 2021, 9:37 AM

Harbormaster completed remote builds in B134922: Diff 388231. Nov 18 2021, 9:46 AM

vsapsai mentioned this in D112915: [clang][modules] Track included files per submodule. Nov 18 2021, 3:21 PM

Haven't checked the details of serialization/deserialization. High-level question about different serialization approaches (bitvector vs list of file ids) is in D112915.

clang/lib/Lex/Preprocessor.cpp
553	Why do you need to call `getFileInfo` if you don't use the result? I have no objections, but it looks like it deserves a comment because it's not obvious.

vsapsai added inline comments. Nov 18 2021, 4:01 PM

clang/include/clang/Serialization/ASTBitCodes.h
700	Small detail. Do you need to add the new record to `ASTWriter::WriteBlockInfoBlock`? It's not critical but you are one of those who might be grateful later seeing `PP_INCLUDED_FILES` instead of `UnknownCode66` in .pcm dump.
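To make the `UnknownCode66` point concrete, here is a minimal sketch of how a dump tool might label records: it prints a registered name when one exists and otherwise falls back to a generic label. `RecordNames` and `describeRecord` are hypothetical names for illustration only, not LLVM or clang APIs.

```cpp
#include <cassert>
#include <map>
#include <string>

// Hypothetical name table, playing the role of the record names that a
// writer registers in the block info block.
using RecordNames = std::map<unsigned, std::string>;

// Return the registered name for a record code, or a generic fallback label.
inline std::string describeRecord(const RecordNames &Names, unsigned Code) {
  auto It = Names.find(Code);
  if (It != Names.end())
    return It->second;
  return "UnknownCode" + std::to_string(Code);
}
```

With an empty table, code 66 is shown as `UnknownCode66`; once the name is registered, the same code reads as `PP_INCLUDED_FILES`.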

Add new record into block info.

jansvoboda11 marked an inline comment as done. Nov 19 2021, 12:48 AM

jansvoboda11 added inline comments.

clang/lib/Lex/Preprocessor.cpp

553

Without the call, I'm hitting some assertions when running C++20 modules tests:

Assertion failed: (CurDiagID == std::numeric_limits<unsigned>::max() && "Multiple diagnostics in flight at once!"), function Report, file Diagnostic.h, line 1526.

fatal error: error in backend: -verify directives found after rather than during normal parsing of <llvm>/clang/test/CXX/modules-ts/basic/basic.def.odr/p6/global-vs-module.cpp

Might need to investigate more to be able to write up a reasonable comment here.

Harbormaster completed remote builds in B135054: Diff 388407. Nov 19 2021, 1:23 AM

vsapsai added inline comments. Nov 19 2021, 11:12 AM

clang/lib/Serialization/ASTWriter.cpp
882	I believe `PP_INCLUDED_FILES` is located in `AST_BLOCK`. Yes, you write it in `ASTWriter::WritePreprocessor` but after `Stream.ExitBlock()` together with `MACRO_OFFSET`. And then you read it in `ASTReader::ReadASTBlock` at the top level and not inside `case PREPROCESSOR_BLOCK_ID` or from `ModuleFile::MacroCursor`.

jansvoboda11 added inline comments. Nov 22 2021, 12:09 AM

clang/lib/Serialization/ASTWriter.cpp
882	That's right. This is modeled after `PP_CONDITIONAL_STACK` and `PP_COUNTER_VALUE`. The problem is that the whole `PREPROCESSOR_BLOCK_ID` is treated as a "macros only" block that's not split into individual records right away and is instead deserialized lazily in `ReadMacroRecord`, `ReadDefinedMacros`, `resolvePendingMacro`. I think this should be fixed eventually, but I didn't want to expand the scope of my changes, since they're already somewhat complex.

Move record ID from PREPROCESSOR_BLOCK to AST_BLOCK.

Harbormaster completed remote builds in B135351: Diff 388814. Nov 22 2021, 1:49 AM

This LGTM by the way (not sure if that was already clear, since you abandoned a different review than I anticipated when squashing, and I'd "accepted" the other one); also see my suggestion to move the call to getFileInfo inside of markIncluded (might be error prone not to make that change).

Haven't clicked "accept" here in case @vsapsai has more comments.

clang/lib/Lex/Preprocessor.cpp
553	Not sure precisely why that assertion fires, but the call to `getFileInfo` makes sense since it's mutating: it adds a HeaderFileInfo entry (and `IncrementIncludeCount()` used to call it). There are code paths that call `getExistingFileInfo` that will expect it to have been created on inclusion. I suggest moving the `getFileInfo()` call to inside `markIncluded()` and adding a comment that says "Create the HeaderFileInfo if it doesn't already exist" or something. Part of marking it included should be to ensure that callers of HeaderSearch::getExistingFileInfo get something back. (This'd be more obvious (and the comment unnecessary) if getFileInfo were renamed to getOrCreateFileInfo, after which getExistingFileInfo could be renamed to getFileInfo.)
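The distinction drawn above between a mutating lookup and a non-mutating one can be sketched as follows. This is a simplified model, not clang's code: `MiniHeaderSearch` and `MiniPreprocessor` are hypothetical, files are keyed by an integer UID, and `getOrCreateFileInfo` uses the name proposed in the comment rather than the existing `getFileInfo`.

```cpp
#include <cassert>
#include <unordered_map>
#include <unordered_set>

// Simplified stand-in for clang's HeaderFileInfo.
struct HeaderFileInfo {
  bool IsValid = false;
};

// Hypothetical miniature of HeaderSearch, keyed by an integer file UID.
class MiniHeaderSearch {
  std::unordered_map<unsigned, HeaderFileInfo> FileInfo;

public:
  // Mutating lookup: creates the entry if it does not exist yet.
  HeaderFileInfo &getOrCreateFileInfo(unsigned UID) {
    HeaderFileInfo &HFI = FileInfo[UID];
    HFI.IsValid = true;
    return HFI;
  }

  // Non-mutating lookup: returns nullptr when no entry was ever created.
  const HeaderFileInfo *getExistingFileInfo(unsigned UID) const {
    auto It = FileInfo.find(UID);
    return It == FileInfo.end() ? nullptr : &It->second;
  }
};

// Hypothetical miniature of the preprocessor that owns the included-file set.
class MiniPreprocessor {
  MiniHeaderSearch HeaderInfo;
  std::unordered_set<unsigned> IncludedFiles;

public:
  // Returns true if this is the first time the file was included.
  bool markIncluded(unsigned UID) {
    // Create the HeaderFileInfo if it doesn't already exist.
    HeaderInfo.getOrCreateFileInfo(UID);
    return IncludedFiles.insert(UID).second;
  }

  const MiniHeaderSearch &getHeaderSearchInfo() const { return HeaderInfo; }
};
```

Keeping the creation inside `markIncluded` means a caller cannot mark a file as included and forget to create its info entry, which is the error-prone case the suggestion is trying to rule out.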

I've mentioned it in D112915, as we've discussed the stored data format there. But my concern was that bitvector packing might not be the most space-efficient encoding. I haven't done proper testing, just an off-the-cuff comparison, and it looks like for most frameworks in the iOS SDK, storing included headers per submodule takes less space than encoding them as a bitvector. I have an idea why that might be happening, but I haven't checked it in a debugger, so I'll keep it to myself to avoid derailing the discussion.

In D114095#3160103, @vsapsai wrote:

I've mentioned it in D112915, as we've discussed the stored data format there. But my concern was that bitvector packing might not be the most space-efficient encoding. I haven't done proper testing, just an off-the-cuff comparison, and it looks like for most frameworks in the iOS SDK, storing included headers per submodule takes less space than encoding them as a bitvector. I have an idea why that might be happening, but I haven't checked it in a debugger, so I'll keep it to myself to avoid derailing the discussion.

Let's bring the conversation over here. I ran the same UIKit test you did and compared the following:

current trunk
current trunk with this patch
current trunk with this patch, with bitvector replaced by vector of IDs (32-bit integers).

The following table shows sizes of .pcm files in bytes and their delta compared to trunk:

+----------+-----------------+-----------------+
|   trunk  |    bit vector   |    ID vector    |
+----------+-----------------+-----------------+
|   281932 |   281944    +12 |   281988    +56 |
|   989840 |   989784    -56 |   989968   +128 |
|   837116 |   837084    -32 |   837212    +96 |
|   899924 |   899912    -12 |   900004    +80 |
|   710296 |   710296     +0 |   710376    +80 |
|   273140 |   273144     +4 |   273196    +56 |
|  3649856 |  3649024   -832 |  3650804   +948 |
|   207676 |   207692    +16 |   207740    +64 |
|   342792 |   342804    +12 |   342860    +68 |
|  4137660 |  4137460   -200 |  4137940   +280 |
|   173536 |   173564    +28 |   173580    +44 |
|   787120 |   787144    +24 |   787180    +60 |
|  1260652 |  1260596    -56 |  1260804   +152 |
|   255072 |   255092    +20 |   255128    +56 |
|   973204 |   973228    +24 |   973268    +64 |
|   398952 |   398940    -12 |   399036    +84 |
|   631516 |   631516     +0 |   631588    +72 |
|  5252932 |  5252348   -584 |  5253612   +680 |
|   230160 |   230168     +8 |   230228    +68 |
|    24460 |    24500    +40 |    24500    +40 |
|    53244 |    53280    +36 |    53288    +44 |
|    75932 |    75952    +20 |    75972    +40 |
|    32840 |    32876    +36 |    32884    +44 |
+----------+-----------------+-----------------+
| 22479852 | 22478348  -1504 | 22483156  +3304 |
+----------+-----------------+-----------------+

Used command:

echo '#import <UIKit/UIKit.h>' | ./bin/clang -fsyntax-only -isysroot "$(xcrun --sdk iphoneos --show-sdk-path)" -target arm64-apple-ios -fmodules -fmodules-cache-path=modules.noindex -x objective-c -

Patch that I applied on top of the one under review to get vector of IDs:

bitvector-to-id-vector.diff (2 KB)

I see how the bitvector could explode for large fine-grained modules. They have lots of input files (-> large bitvectors in each submodule), but each submodule only includes a handful files (-> bitvectors are sparse). It seems like this doesn't actually happen, at least in our SDK.
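The trade-off described above can be approximated with a back-of-the-envelope model of the raw payload sizes. This is only a sketch under stated assumptions: it counts one bit per input file for the bitvector and one 32-bit integer per included file for the ID vector, and it ignores any bitstream overhead (abbreviations, alignment, record headers) in real .pcm files. The function names are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Bytes needed for one bit per input file, rounded up to whole bytes.
constexpr std::size_t bitVectorBytes(std::size_t NumInputFiles) {
  return (NumInputFiles + 7) / 8;
}

// Bytes needed for one 32-bit ID per included file.
constexpr std::size_t idVectorBytes(std::size_t NumIncludedFiles) {
  return NumIncludedFiles * sizeof(std::uint32_t);
}

// True when the bitvector payload is strictly smaller than the ID payload.
constexpr bool bitVectorIsSmaller(std::size_t NumInputFiles,
                                  std::size_t NumIncludedFiles) {
  return bitVectorBytes(NumInputFiles) < idVectorBytes(NumIncludedFiles);
}
```

In this simplified model the two encodings break even at roughly one included file per 32 input files (about 3% density); sparser sets favor the ID vector, denser ones favor the bitvector.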

@vsapsai Do you think this warrants more thorough investigation?

Thanks for measuring the .pcm impact! And I appreciate including the trunk baseline.

The results aren't exactly the same as what I've seen before, so please forgive my suspicion. When you mention

current trunk with this patch

do you mean "just this D114095 patch"? Or were you testing with D112915 applied? Because I was testing with per-submodule tracking applied. I am going to check again but it would be useful to know if you were comparing with per-submodule tracking (D112915) or not.

You're right, I measured only this patch, not per-submodule include tracking (D112915).

With per-submodule tracking, the results look like this:

+----------+------------------+------------------+-------------------+------------------+
| original |     ID vector    |    bit vector    | subm. w incl. [%] | 1 in bitvec. [%] |
+----------+------------------+------------------+-------------------+------------------+
|    23348 |    23380     +32 |    23380     +32 |       100.0       |       33.3       |
|    52188 |    52224     +36 |    52224     +36 |       100.0       |        6.3       |
|    74808 |    74856     +48 |    74840     +32 |       100.0       |        9.4       |
|   171772 |   171836     +64 |   171840     +68 |        40.0       |        5.7       |
|   206524 |   206584     +60 |   206540     +16 |       100.0       |       32.5       |
|   227716 |   227904    +188 |   227856    +140 |         6.3       |       33.3       |
|   253656 |   253812    +156 |   253788    +132 |        90.0       |        7.6       |
|   271332 |   271584    +252 |   271524    +192 |        85.7       |        8.5       |
|   280280 |   280460    +180 |   280428    +148 |        91.7       |        5.6       |
|   340024 |   340176    +152 |   340144    +120 |        21.4       |       10.9       |
|   394692 |   394928    +236 |   394872    +180 |        25.0       |       18.1       |
|   629740 |   630028    +288 |   629940    +200 |        83.3       |       20.0       |
|   707456 |   707732    +276 |   707676    +220 |        85.7       |       13.9       |
|   785508 |   785632    +124 |   785616    +108 |        85.7       |       18.1       |
|   835204 |   835824    +620 |   835616    +412 |        93.8       |       12.5       |
|   887764 |   888004    +240 |   887940    +176 |         8.7       |       19.1       |
|   971352 |   971504    +152 |   971500    +148 |        66.7       |        6.0       |
|   994112 |   995048    +936 |   994672    +560 |        93.5       |        9.1       |
|  1248888 |  1249408    +520 |  1249352    +464 |        29.6       |        5.0       |
|  3642908 |  3650076   +7168 |  3652668   +9760 |        74.5       |        2.8       |
|  4112848 |  4114016   +1168 |  4113780    +932 |        17.7       |        5.3       |
|  5213344 |  5216228   +2884 |  5216552   +3208 |        22.0       |        2.2       |
+----------+------------------+------------------+-------------------+------------------+
| 22325464 | 22341244  +15780 | 22342748  +17284 |                   |                  |
+----------+------------------+------------------+-------------------+------------------+

The `subm. w incl. [%]` column shows the percentage of submodules that include any headers. For the ones that do, `1 in bitvec. [%]` shows how sparse the bitvectors are on average (what percentage of 1 bits they contain).

It seems like smaller modules are generally better off with bitvectors, but for larger modules with a greater cumulative number of includes, the bitvectors get long and sparse. And it's the larger modules whose size ends up impacting the overall size of the module cache. I think that matches my intuition and roughly corresponds to your own measurements.

(Note that in any case, the module cache growth is negligible: .071% for ID vector and .077% for bitvector.)
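For reference, the growth percentages quoted above follow from the totals row of the table (baseline 22325464 bytes, deltas +15780 and +17284). A small helper, named `growthPercent` here purely for illustration, reproduces the arithmetic:

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>

// Relative growth, in percent, of a baseline size after adding DeltaBytes.
inline double growthPercent(std::uint64_t BaselineBytes,
                            std::uint64_t DeltaBytes) {
  return 100.0 * static_cast<double>(DeltaBytes) /
         static_cast<double>(BaselineBytes);
}
```

Calling `growthPercent(22325464, 15780)` gives about 0.071 and `growthPercent(22325464, 17284)` gives about 0.077, matching the figures in the comment.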

Given that, I think we should commit this patch with ID vectors, even though in isolation (without D112915) it's the worse solution. WDYT?

In D114095#3188557, @jansvoboda11 wrote:

Given that, I think we should commit this patch with ID vectors, even though in isolation (without D112915) it's the worse solution. WDYT?

I have to admit that I think the ID vector code and format are both easier to understand. And if the bitvector approach is more complicated but doesn't gain us much in efficiency, then I think it's not worth pursuing. Though I'm glad and grateful that you've tried the bitvector approach, that's very helpful.

My preference is to use ID vectors but I'm open to other opinions as I might be missing other trade-offs besides complexity and .pcm file size.

And I want to thank you for checking the sparseness of submodules and how with the bigger modules the impact of the sparseness becomes more pronounced.

@vsapsai do you have any further concerns? My only intended change at this point is Duncan's suggestion.

In D114095#3261073, @jansvoboda11 wrote:

@vsapsai do you have any further concerns? My only intended change at this point is Duncan's suggestion.

My understanding was that we were going with the ID vector approach and not with bitvectors, and the current code still uses bitvectors. Also, I have a minor readability concern about `auto` usage: it seems to be used more often than the coding standard asks for. But that usage is in serialization/deserialization code, which I wasn't sure was going to stay.

Move HeaderSearch::getFileInfo() call into Preprocessor::markIncluded().
Switch from bitvector to vector of file IDs in AST files.
Remove unnecessary usages of auto.
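To illustrate what "vector of file IDs" means as a stored format, the sketch below packs 32-bit IDs into a byte blob and reads them back. It is an assumption-laden illustration rather than the patch's actual serialization: `encodeIDs` and `decodeIDs` are hypothetical helpers, the little-endian layout is chosen here for concreteness, and real AST files go through the LLVM bitstream writer and reader.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Append each ID to the blob as 4 little-endian bytes.
inline std::string encodeIDs(const std::vector<std::uint32_t> &IDs) {
  std::string Blob;
  Blob.reserve(IDs.size() * 4);
  for (std::uint32_t ID : IDs)
    for (int Shift = 0; Shift < 32; Shift += 8)
      Blob.push_back(static_cast<char>((ID >> Shift) & 0xFF));
  return Blob;
}

// Read IDs back, 4 bytes at a time; trailing partial data is ignored.
inline std::vector<std::uint32_t> decodeIDs(const std::string &Blob) {
  std::vector<std::uint32_t> IDs;
  for (std::size_t I = 0; I + 4 <= Blob.size(); I += 4) {
    std::uint32_t ID = 0;
    for (int Byte = 0; Byte < 4; ++Byte)
      ID |= static_cast<std::uint32_t>(
                static_cast<unsigned char>(Blob[I + Byte]))
            << (8 * Byte);
    IDs.push_back(ID);
  }
  return IDs;
}
```

The payload grows by exactly four bytes per included file, independent of how many input files the module has, which is why this layout behaves well for large, sparse modules.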

In D114095#3263086, @vsapsai wrote:

My understanding was that we were going with the ID vector approach and not with bitvectors, and the current code still uses bitvectors. Also, I have a minor readability concern about `auto` usage: it seems to be used more often than the coding standard asks for. But that usage is in serialization/deserialization code, which I wasn't sure was going to stay.

Ah, sorry about that. Fixed in latest revision.

Looks good to me. Thanks for working on it!

Though I haven't built it locally, so it may be useful to wait for the pre-merge checks to finish.

This revision is now accepted and ready to land. Jan 25 2022, 12:34 PM

Harbormaster completed remote builds in B145557: Diff 402968. Jan 26 2022, 2:24 AM

This revision was landed with ongoing or failed builds. Jan 26 2022, 6:56 AM

Closed by commit rGf72027233044: [clang][lex] Include tracking: simplify and move to preprocessor (authored by jansvoboda11).

This revision was automatically updated to reflect the committed changes.

jansvoboda11 added a commit: rGf72027233044: [clang][lex] Include tracking: simplify and move to preprocessor.

jansvoboda11 mentioned this in D155131: [clang][modules] Deserialize included files lazily. Jul 12 2023, 3:33 PM

jansvoboda11 mentioned this in rG6504d87fc0c8: [clang][modules] Deserialize included files lazily. Jul 13 2023, 3:00 PM

jansvoboda11 mentioned this in D157559: [clang][modules] Respect "-fmodule-name=" when serializing included files into a PCH. Aug 9 2023, 4:05 PM

jansvoboda11 mentioned this in rGbbdb0c7e4496: [clang][modules] Respect "-fmodule-name=" when serializing included files into…. Aug 10 2023, 10:25 AM

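Revision Contents

+-------------------------------------------------+----------+
| Path                                            | Size     |
+-------------------------------------------------+----------+
| clang/include/clang/Lex/HeaderSearch.h          | 11 lines |
| clang/include/clang/Lex/Preprocessor.h          | 21 lines |
| clang/include/clang/Serialization/ASTBitCodes.h | 3 lines  |
| clang/include/clang/Serialization/ASTReader.h   | 1 line   |
| clang/include/clang/Serialization/ASTWriter.h   | 1 line   |
| clang/lib/Lex/HeaderSearch.cpp                  | 20 lines |
| clang/lib/Lex/PPDirectives.cpp                  | 2 lines  |
| clang/lib/Lex/Preprocessor.cpp                  | 2 lines  |
| clang/lib/Serialization/ASTReader.cpp           | 24 lines |
| clang/lib/Serialization/ASTWriter.cpp           | 41 lines |
+-------------------------------------------------+----------+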

Diff 403251

clang/include/clang/Lex/HeaderSearch.h

@@ 51 unchanged lines hidden @@
 class LangOptions;
 class Module;
 class Preprocessor;
 class TargetInfo;

 /// The preprocessor keeps track of this information for each
 /// file that is \#included.
 struct HeaderFileInfo {
+  // TODO: Whether the file was imported is not a property of the file itself.
+  // It's a preprocessor state, move it there.
   [Inline comment, dexonsmith (Not Done)]: Note that `clang-format` is complaining about a trailing space / here.
   /// True if this is a \#import'd file.
   unsigned isImport : 1;

   /// True if this is a \#pragma once file.
   unsigned isPragmaOnce : 1;

   /// Keep track of whether this is a system header, and if so,
   /// whether it is C++ clean or not. This can be set by the include paths or
@@ 22 unchanged lines hidden @@ struct HeaderFileInfo {
   /// into the appropriate framework subdirectories, and therefore are
   /// provided via a header map. This bit indicates when this is one of
   /// those framework headers.
   unsigned IndexHeaderMapHeader : 1;

   /// Whether this file has been looked up as a header.
   unsigned IsValid : 1;

-  /// The number of times the file has been included already.
-  unsigned short NumIncludes = 0;
-
   /// The ID number of the controlling macro.
   ///
   /// This ID number will be non-zero when there is a controlling
   /// macro whose IdentifierInfo may not yet have been loaded from
   /// external storage.
   unsigned ControllingMacroID = 0;

   /// If this file has a \#ifndef XXX (or equivalent) guard that
@@ 355 unchanged lines hidden @@ void AddFileAlias(const FileEntry *File, StringRef Alias) {
     getFileInfo(File).Aliases.insert(Alias);
   }

   /// Mark the specified file as part of a module.
   void MarkFileModuleHeader(const FileEntry *FE,
                             ModuleMap::ModuleHeaderRole Role,
                             bool isCompilingModuleHeader);

-  /// Increment the count for the number of times the specified
-  /// FileEntry has been entered.
-  void IncrementIncludeCount(const FileEntry *File) {
-    ++getFileInfo(File).NumIncludes;
-  }
-
   /// Mark the specified file as having a controlling macro.
   ///
   /// This is used by the multiple-include optimization to eliminate
   /// no-op \#includes.
   void SetFileControllingMacro(const FileEntry *File,
                                const IdentifierInfo *ControllingMacro) {
     getFileInfo(File).ControllingMacro = ControllingMacro;
   }
@@ 377 unchanged lines hidden @@

clang/include/clang/Lex/Preprocessor.h

@@ 444 unchanged lines hidden @@ struct PreambleSkipInfo {
     PreambleSkipInfo(SourceLocation HashTokenLoc, SourceLocation IfTokenLoc,
                      bool FoundNonSkipPortion, bool FoundElse,
                      SourceLocation ElseLoc)
         : HashTokenLoc(HashTokenLoc), IfTokenLoc(IfTokenLoc),
           FoundNonSkipPortion(FoundNonSkipPortion), FoundElse(FoundElse),
           ElseLoc(ElseLoc) {}
   };

+  using IncludedFilesSet = llvm::DenseSet<const FileEntry *>;
+
 private:
   friend class ASTReader;
   friend class MacroArgs;

   class PreambleConditionalStackStore {
     enum State {
       Off = 0,
       Recording = 1,
@@ 299 unchanged lines hidden @@ private:

   /// The preprocessor state for preprocessing outside of any submodule.
   SubmoduleState NullSubmoduleState;

   /// The current submodule state. Will be \p NullSubmoduleState if we're not
   /// in a submodule.
   SubmoduleState *CurSubmoduleState;

+  /// The files that have been included.
+  IncludedFilesSet IncludedFiles;
+
   /// The set of known macros exported from modules.
   llvm::FoldingSet<ModuleMacro> ModuleMacros;

   /// The names of potential module macros that we've not yet processed.
   llvm::SmallVector<const IdentifierInfo *, 32> PendingModuleMacroNames;

   /// The list of module macros, for each identifier, that are not overridden by
   /// any other module macro.
@@ 443 unchanged lines hidden @@ public:
   macros(bool IncludeExternalMacros = true) const {
     macro_iterator begin = macro_begin(IncludeExternalMacros);
     macro_iterator end = macro_end(IncludeExternalMacros);
     return llvm::make_range(begin, end);
   }

   /// \}

+  /// Mark the file as included.
+  /// Returns true if this is the first time the file was included.
+  bool markIncluded(const FileEntry *File) {
+    HeaderInfo.getFileInfo(File);
+    return IncludedFiles.insert(File).second;
+  }
+
+  /// Return true if this header has already been included.
+  bool alreadyIncluded(const FileEntry *File) const {
+    return IncludedFiles.count(File);
+  }
+
+  /// Get the set of included files.
+  IncludedFilesSet &getIncludedFiles() { return IncludedFiles; }
+  const IncludedFilesSet &getIncludedFiles() const { return IncludedFiles; }
+
   /// Return the name of the macro defined before \p Loc that has
   /// spelling \p Tokens. If there are multiple macros with same spelling,
   /// return the last one defined.
   StringRef getLastMacroWithSpelling(SourceLocation Loc,
                                      ArrayRef<TokenValue> Tokens) const;

   const std::string &getPredefines() const { return Predefines; }

@@ 1,280 unchanged lines hidden @@

clang/include/clang/Serialization/ASTBitCodes.h

@@ 689 unchanged lines hidden @@ enum ASTRecordTypes {
   /// A table of skipped ranges within the preprocessing record.
   PPD_SKIPPED_RANGES = 63,

   /// Record code for the Decls to be checked for deferred diags.
   DECLS_TO_CHECK_FOR_DEFERRED_DIAGS = 64,

   /// Record code for \#pragma float_control options.
   FLOAT_CONTROL_PRAGMA_OPTIONS = 65,
+
+  /// Record code for included files.
+  PP_INCLUDED_FILES = 66,
   [Inline comment, vsapsai (Done)]: Small detail. Do you need to add the new record to `ASTWriter::WriteBlockInfoBlock`? It's not critical but you are one of those who might be grateful later seeing `PP_INCLUDED_FILES` instead of `UnknownCode66` in .pcm dump.
 };

 /// Record types used within a source manager block.
 enum SourceManagerRecordTypes {
   /// Describes a source location entry (SLocEntry) for a
   /// file.
   SM_SLOC_FILE_ENTRY = 1,

@@ 1,440 unchanged lines hidden @@

clang/include/clang/Serialization/ASTReader.h

@@ 1,323 unchanged lines hidden @@ private:

   llvm::Error ReadASTBlock(ModuleFile &F, unsigned ClientLoadCapabilities);
   llvm::Error ReadExtensionBlock(ModuleFile &F);
   void ReadModuleOffsetMap(ModuleFile &F) const;
   void ParseLineTable(ModuleFile &F, const RecordData &Record);
   llvm::Error ReadSourceManagerBlock(ModuleFile &F);
   llvm::BitstreamCursor &SLocCursorForID(int ID);
   SourceLocation getImportLocation(ModuleFile *F);
+  void readIncludedFiles(ModuleFile &F, StringRef Blob, Preprocessor &PP);
   ASTReadResult ReadModuleMapFileBlock(RecordData &Record, ModuleFile &F,
                                        const ModuleFile *ImportedBy,
                                        unsigned ClientLoadCapabilities);
   llvm::Error ReadSubmoduleBlock(ModuleFile &F,
                                  unsigned ClientLoadCapabilities);
   static bool ParseLanguageOptions(const RecordData &Record, bool Complain,
                                    ASTReaderListener &Listener,
                                    bool AllowCompatibleDifferences);
@@ 978 unchanged lines hidden @@

clang/include/clang/Serialization/ASTWriter.h

@@ 459 unchanged lines hidden @@ private:
   /// Calculate hash of the pcm content.
   static std::pair<ASTFileSignature, ASTFileSignature>
   createSignature(StringRef AllBytes, StringRef ASTBlockBytes);

   void WriteInputFiles(SourceManager &SourceMgr, HeaderSearchOptions &HSOpts,
                        std::set<const FileEntry *> &AffectingModuleMaps);
   void WriteSourceManagerBlock(SourceManager &SourceMgr,
                                const Preprocessor &PP);
+  void writeIncludedFiles(raw_ostream &Out, const Preprocessor &PP);
   void WritePreprocessor(const Preprocessor &PP, bool IsModule);
   void WriteHeaderSearch(const HeaderSearch &HS);
   void WritePreprocessorDetail(PreprocessingRecord &PPRec,
                                uint64_t MacroOffsetsBase);
   void WriteSubmodules(Module *WritingModule);

   void WritePragmaDiagnosticMappings(const DiagnosticsEngine &Diag,
                                      bool isModule);
@@ 311 unchanged lines hidden @@

clang/lib/Lex/HeaderSearch.cpp

Show First 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	HeaderSearch::HeaderSearch(std::shared_ptr<HeaderSearchOptions> HSOpts,
const TargetInfo *Target)		const TargetInfo *Target)
: HSOpts(std::move(HSOpts)), Diags(Diags),		: HSOpts(std::move(HSOpts)), Diags(Diags),
FileMgr(SourceMgr.getFileManager()), FrameworkMap(64),		FileMgr(SourceMgr.getFileManager()), FrameworkMap(64),
ModMap(SourceMgr, Diags, LangOpts, Target, *this) {}		ModMap(SourceMgr, Diags, LangOpts, Target, *this) {}

void HeaderSearch::PrintStats() {		void HeaderSearch::PrintStats() {
llvm::errs() << "\n*** HeaderSearch Stats:\n"		llvm::errs() << "\n*** HeaderSearch Stats:\n"
<< FileInfo.size() << " files tracked.\n";		<< FileInfo.size() << " files tracked.\n";
unsigned NumOnceOnlyFiles = 0, MaxNumIncludes = 0, NumSingleIncludedFiles = 0;		unsigned NumOnceOnlyFiles = 0;
for (unsigned i = 0, e = FileInfo.size(); i != e; ++i) {		for (unsigned i = 0, e = FileInfo.size(); i != e; ++i)
NumOnceOnlyFiles += (FileInfo[i].isPragmaOnce \|\| FileInfo[i].isImport);		NumOnceOnlyFiles += (FileInfo[i].isPragmaOnce \|\| FileInfo[i].isImport);
if (MaxNumIncludes < FileInfo[i].NumIncludes)		llvm::errs() << " " << NumOnceOnlyFiles << " #import/#pragma once files.\n";
MaxNumIncludes = FileInfo[i].NumIncludes;
NumSingleIncludedFiles += FileInfo[i].NumIncludes == 1;
}
llvm::errs() << " " << NumOnceOnlyFiles << " #import/#pragma once files.\n"
<< " " << NumSingleIncludedFiles << " included exactly once.\n"
<< " " << MaxNumIncludes << " max times a file is included.\n";

llvm::errs() << " " << NumIncluded << " #include/#include_next/#import.\n"		llvm::errs() << " " << NumIncluded << " #include/#include_next/#import.\n"
<< " " << NumMultiIncludeFileOptzn		<< " " << NumMultiIncludeFileOptzn
<< " #includes skipped due to the multi-include optimization.\n";		<< " #includes skipped due to the multi-include optimization.\n";

llvm::errs() << NumFrameworkLookups << " framework lookups.\n"		llvm::errs() << NumFrameworkLookups << " framework lookups.\n"
<< NumSubFrameworkLookups << " subframework lookups.\n";		<< NumSubFrameworkLookups << " subframework lookups.\n";
}		}
▲ Show 20 Lines • Show All 1,127 Lines • ▼ Show 20 Lines
/// header file info (\p HFI)		/// header file info (\p HFI)
static void mergeHeaderFileInfo(HeaderFileInfo &HFI,		static void mergeHeaderFileInfo(HeaderFileInfo &HFI,
const HeaderFileInfo &OtherHFI) {		const HeaderFileInfo &OtherHFI) {
assert(OtherHFI.External && "expected to merge external HFI");		assert(OtherHFI.External && "expected to merge external HFI");

HFI.isImport |= OtherHFI.isImport;		HFI.isImport |= OtherHFI.isImport;
HFI.isPragmaOnce |= OtherHFI.isPragmaOnce;		HFI.isPragmaOnce |= OtherHFI.isPragmaOnce;
HFI.isModuleHeader |= OtherHFI.isModuleHeader;		HFI.isModuleHeader |= OtherHFI.isModuleHeader;
HFI.NumIncludes += OtherHFI.NumIncludes;

if (!HFI.ControllingMacro && !HFI.ControllingMacroID) {		if (!HFI.ControllingMacro && !HFI.ControllingMacroID) {
HFI.ControllingMacro = OtherHFI.ControllingMacro;		HFI.ControllingMacro = OtherHFI.ControllingMacro;
HFI.ControllingMacroID = OtherHFI.ControllingMacroID;		HFI.ControllingMacroID = OtherHFI.ControllingMacroID;
}		}

HFI.DirInfo = OtherHFI.DirInfo;		HFI.DirInfo = OtherHFI.DirInfo;
HFI.External = (!HFI.IsValid || HFI.External);		HFI.External = (!HFI.IsValid || HFI.External);
▲ Show 20 Lines • Show All 144 Lines • ▼ Show 20 Lines	bool HeaderSearch::ShouldEnterIncludeFile(Preprocessor &PP,

// If this is a #import directive, check that we have not already imported		// If this is a #import directive, check that we have not already imported
// this header.		// this header.
if (isImport) {		if (isImport) {
// If this has already been imported, don't import it again.		// If this has already been imported, don't import it again.
FileInfo.isImport = true;		FileInfo.isImport = true;

// Has this already been #import'ed or #include'd?		// Has this already been #import'ed or #include'd?
if (FileInfo.NumIncludes && !TryEnterImported())		if (PP.alreadyIncluded(File) && !TryEnterImported())
return false;		return false;
} else {		} else {
// Otherwise, if this is a #include of a file that was previously #import'd		// Otherwise, if this is a #include of a file that was previously #import'd
// or if this is the second #include of a #pragma once file, ignore it.		// or if this is the second #include of a #pragma once file, ignore it.
if ((FileInfo.isPragmaOnce || FileInfo.isImport) && !TryEnterImported())		if ((FileInfo.isPragmaOnce || FileInfo.isImport) && !TryEnterImported())
return false;		return false;
}		}

// Next, check to see if the file is wrapped with #ifndef guards. If so, and		// Next, check to see if the file is wrapped with #ifndef guards. If so, and
// if the macro that guards it is defined, we know the #include has no effect.		// if the macro that guards it is defined, we know the #include has no effect.
if (const IdentifierInfo *ControllingMacro		if (const IdentifierInfo *ControllingMacro
= FileInfo.getControllingMacro(ExternalLookup)) {		= FileInfo.getControllingMacro(ExternalLookup)) {
// If the header corresponds to a module, check whether the macro is already		// If the header corresponds to a module, check whether the macro is already
// defined in that module rather than checking in the current set of visible		// defined in that module rather than checking in the current set of visible
// modules.		// modules.
if (M ? PP.isMacroDefinedInLocalModule(ControllingMacro, M)		if (M ? PP.isMacroDefinedInLocalModule(ControllingMacro, M)
: PP.isMacroDefined(ControllingMacro)) {		: PP.isMacroDefined(ControllingMacro)) {
++NumMultiIncludeFileOptzn;		++NumMultiIncludeFileOptzn;
return false;		return false;
}		}
}		}

// Increment the number of times this file has been included.		IsFirstIncludeOfFile = PP.markIncluded(File);
++FileInfo.NumIncludes;

IsFirstIncludeOfFile = FileInfo.NumIncludes == 1;

return true;		return true;
}		}

size_t HeaderSearch::getTotalMemory() const {		size_t HeaderSearch::getTotalMemory() const {
return SearchDirs.capacity()		return SearchDirs.capacity()
+ llvm::capacity_in_bytes(FileInfo)		+ llvm::capacity_in_bytes(FileInfo)
+ llvm::capacity_in_bytes(HeaderMaps)		+ llvm::capacity_in_bytes(HeaderMaps)

clang/lib/Lex/PPDirectives.cpp

Show First 20 Lines • Show All 2,052 Lines • ▼ Show 20 Lines	Preprocessor::ImportAction Preprocessor::HandleHeaderIncludeOrImport(

if (PPOpts->SingleFileParseMode)		if (PPOpts->SingleFileParseMode)
Action = IncludeLimitReached;		Action = IncludeLimitReached;

// If we've reached the max allowed include depth, it is usually due to an		// If we've reached the max allowed include depth, it is usually due to an
// include cycle. Don't enter already processed files again as it can lead to		// include cycle. Don't enter already processed files again as it can lead to
// reaching the max allowed include depth again.		// reaching the max allowed include depth again.
if (Action == Enter && HasReachedMaxIncludeDepth && File &&		if (Action == Enter && HasReachedMaxIncludeDepth && File &&
HeaderInfo.getFileInfo(&File->getFileEntry()).NumIncludes)		alreadyIncluded(*File))
Action = IncludeLimitReached;		Action = IncludeLimitReached;

// Determine whether we should try to import the module for this #include, if		// Determine whether we should try to import the module for this #include, if
// there is one. Don't do so if precompiled module support is disabled or we		// there is one. Don't do so if precompiled module support is disabled or we
// are processing this module textually (because we're building the module).		// are processing this module textually (because we're building the module).
if (Action == Enter && File && SuggestedModule && getLangOpts().Modules &&		if (Action == Enter && File && SuggestedModule && getLangOpts().Modules &&
!isForModuleBuilding(SuggestedModule.getModule(),		!isForModuleBuilding(SuggestedModule.getModule(),
getLangOpts().CurrentModule,		getLangOpts().CurrentModule,

clang/lib/Lex/Preprocessor.cpp

Show First 20 Lines • Show All 543 Lines • ▼ Show 20 Lines	if (!SourceMgr.isLoadedFileID(MainFileID)) {
// precompiled preamble), do so now.		// precompiled preamble), do so now.
if (SkipMainFilePreamble.first > 0)		if (SkipMainFilePreamble.first > 0)
CurLexer->SetByteOffset(SkipMainFilePreamble.first,		CurLexer->SetByteOffset(SkipMainFilePreamble.first,
SkipMainFilePreamble.second);		SkipMainFilePreamble.second);

// Tell the header info that the main file was entered. If the file is later		// Tell the header info that the main file was entered. If the file is later
// #imported, it won't be re-entered.		// #imported, it won't be re-entered.
if (const FileEntry *FE = SourceMgr.getFileEntryForID(MainFileID))		if (const FileEntry *FE = SourceMgr.getFileEntryForID(MainFileID))
HeaderInfo.IncrementIncludeCount(FE);		markIncluded(FE);
}		}
vsapsai [Not Done]: Why do you need to call `getFileInfo` if you don't use the result? I have no objections, but it deserves a comment because it's not obvious.
jansvoboda11 (author) [Done]: Without the call, I'm hitting some assertions when running C++20 modules tests: Assertion failed: (CurDiagID == std::numeric_limits<unsigned>::max() && "Multiple diagnostics in flight at once!"), function Report, file Diagnostic.h, line 1526. fatal error: error in backend: -verify directives found after rather than during normal parsing of <llvm>/clang/test/CXX/modules-ts/basic/basic.def.odr/p6/global-vs-module.cpp. I might need to investigate more to be able to write up a reasonable comment here.
dexonsmith [Not Done]: Not sure precisely why that assertion fires, but the call to `getFileInfo` makes sense since it's mutating: it adds a HeaderFileInfo entry (and `IncrementIncludeCount()` used to call it). There are code paths that call `getExistingFileInfo` that will expect the entry to have been created on inclusion. I suggest moving the `getFileInfo()` call inside `markIncluded()` and adding a comment that says "Create the HeaderFileInfo if it doesn't already exist" or something similar. Part of marking a file included should be to ensure that callers of HeaderSearch::getExistingFileInfo get something back. (This would be more obvious, and the comment unnecessary, if getFileInfo were renamed to getOrCreateFileInfo, after which getExistingFileInfo could be renamed to getFileInfo.)
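The suggestion in this thread can be sketched in isolation. The following is a minimal model, not the clang implementation: `FileEntry`, `HeaderFileInfo`, `HeaderSearchSketch`, and `PreprocessorSketch` are hypothetical stand-ins, and only the names `markIncluded`, `alreadyIncluded`, `getFileInfo`, and `getExistingFileInfo` come from the diff and the comments above. It assumes that `markIncluded` returns whether this was the first include, which is how the diff uses its return value in `ShouldEnterIncludeFile`.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>
#include <unordered_set>

// Hypothetical stand-ins for illustration; these are not the clang types.
struct FileEntry {
  std::string Name;
};
struct HeaderFileInfo {
  bool isImport = false;
  bool isPragmaOnce = false;
};

class HeaderSearchSketch {
  std::unordered_map<const FileEntry *, HeaderFileInfo> FileInfo;

public:
  // Mutating lookup: creates the entry if it does not exist yet.
  HeaderFileInfo &getFileInfo(const FileEntry *FE) { return FileInfo[FE]; }

  // Non-mutating lookup: returns nullptr if no entry was ever created.
  const HeaderFileInfo *getExistingFileInfo(const FileEntry *FE) const {
    auto It = FileInfo.find(FE);
    return It == FileInfo.end() ? nullptr : &It->second;
  }
};

class PreprocessorSketch {
  HeaderSearchSketch &HS;
  std::unordered_set<const FileEntry *> IncludedFiles;

public:
  explicit PreprocessorSketch(HeaderSearchSketch &HS) : HS(HS) {}

  // Returns true only the first time a file is marked as included.
  bool markIncluded(const FileEntry *File) {
    // Create the HeaderFileInfo if it doesn't already exist, so that later
    // getExistingFileInfo() callers get something back.
    (void)HS.getFileInfo(File);
    return IncludedFiles.insert(File).second;
  }

  bool alreadyIncluded(const FileEntry *File) const {
    return IncludedFiles.count(File) != 0;
  }
};

// Small usage walk-through of the sketch above.
void demoIncludeTracking() {
  HeaderSearchSketch HS;
  PreprocessorSketch PP(HS);
  FileEntry Foo{"foo.h"};

  assert(!PP.alreadyIncluded(&Foo));
  assert(HS.getExistingFileInfo(&Foo) == nullptr);
  assert(PP.markIncluded(&Foo));  // first include
  assert(!PP.markIncluded(&Foo)); // repeated include
  assert(HS.getExistingFileInfo(&Foo) != nullptr);
}
```

Keeping the entry creation inside `markIncluded` means callers cannot forget it, which is the point of the suggestion; the set membership alone replaces the old `NumIncludes` counter.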

// Preprocess Predefines to populate the initial preprocessor state.		// Preprocess Predefines to populate the initial preprocessor state.
std::unique_ptr<llvm::MemoryBuffer> SB =		std::unique_ptr<llvm::MemoryBuffer> SB =
llvm::MemoryBuffer::getMemBufferCopy(Predefines, "<built-in>");		llvm::MemoryBuffer::getMemBufferCopy(Predefines, "<built-in>");
assert(SB && "Cannot create predefined source buffer");		assert(SB && "Cannot create predefined source buffer");
FileID FID = SourceMgr.createFileID(std::move(SB));		FileID FID = SourceMgr.createFileID(std::move(SB));
assert(FID.isValid() && "Could not create FileID for predefines?");		assert(FID.isValid() && "Could not create FileID for predefines?");
setPredefinesFileID(FID);		setPredefinesFileID(FID);

clang/lib/Serialization/ASTReader.cpp


Show First 20 Lines • Show All 1,881 Lines • ▼ Show 20 Lines	HeaderFileInfoTrait::ReadData(internal_key_ref key, const unsigned char *d,
const unsigned char *End = d + DataLen;		const unsigned char *End = d + DataLen;
HeaderFileInfo HFI;		HeaderFileInfo HFI;
unsigned Flags = *d++;		unsigned Flags = *d++;
// FIXME: Refactor with mergeHeaderFileInfo in HeaderSearch.cpp.		// FIXME: Refactor with mergeHeaderFileInfo in HeaderSearch.cpp.
HFI.isImport |= (Flags >> 5) & 0x01;		HFI.isImport |= (Flags >> 5) & 0x01;
HFI.isPragmaOnce |= (Flags >> 4) & 0x01;		HFI.isPragmaOnce |= (Flags >> 4) & 0x01;
HFI.DirInfo = (Flags >> 1) & 0x07;		HFI.DirInfo = (Flags >> 1) & 0x07;
HFI.IndexHeaderMapHeader = Flags & 0x01;		HFI.IndexHeaderMapHeader = Flags & 0x01;
// FIXME: Find a better way to handle this. Maybe just store a
// "has been included" flag?
HFI.NumIncludes = std::max(endian::readNext<uint16_t, little, unaligned>(d),
HFI.NumIncludes);
HFI.ControllingMacroID = Reader.getGlobalIdentifierID(		HFI.ControllingMacroID = Reader.getGlobalIdentifierID(
M, endian::readNext<uint32_t, little, unaligned>(d));		M, endian::readNext<uint32_t, little, unaligned>(d));
if (unsigned FrameworkOffset =		if (unsigned FrameworkOffset =
endian::readNext<uint32_t, little, unaligned>(d)) {		endian::readNext<uint32_t, little, unaligned>(d)) {
// The framework offset is 1 greater than the actual offset,		// The framework offset is 1 greater than the actual offset,
// since 0 is used as an indicator for "no framework name".		// since 0 is used as an indicator for "no framework name".
StringRef FrameworkName(FrameworkStrings + FrameworkOffset - 1);		StringRef FrameworkName(FrameworkStrings + FrameworkOffset - 1);
HFI.Framework = HS->getUniqueFrameworkName(FrameworkName);		HFI.Framework = HS->getUniqueFrameworkName(FrameworkName);
▲ Show 20 Lines • Show All 1,055 Lines • ▼ Show 20 Lines	case INPUT_FILE_OFFSETS:
(const llvm::support::unaligned_uint64_t *)Blob.data();		(const llvm::support::unaligned_uint64_t *)Blob.data();
F.InputFilesLoaded.resize(NumInputs);		F.InputFilesLoaded.resize(NumInputs);
F.NumUserInputFiles = NumUserInputs;		F.NumUserInputFiles = NumUserInputs;
break;		break;
}		}
}		}
}		}

		void ASTReader::readIncludedFiles(ModuleFile &F, StringRef Blob,
		Preprocessor &PP) {
		using namespace llvm::support;

		const unsigned char *D = (const unsigned char *)Blob.data();
		unsigned FileCount = endian::readNext<uint32_t, little, unaligned>(D);

		for (unsigned I = 0; I < FileCount; ++I) {
		size_t ID = endian::readNext<uint32_t, little, unaligned>(D);
		InputFileInfo IFI = readInputFileInfo(F, ID);
		if (llvm::ErrorOr<const FileEntry *> File =
		PP.getFileManager().getFile(IFI.Filename))
		PP.getIncludedFiles().insert(*File);
		}
		}

llvm::Error ASTReader::ReadASTBlock(ModuleFile &F,		llvm::Error ASTReader::ReadASTBlock(ModuleFile &F,
unsigned ClientLoadCapabilities) {		unsigned ClientLoadCapabilities) {
BitstreamCursor &Stream = F.Stream;		BitstreamCursor &Stream = F.Stream;

if (llvm::Error Err = Stream.EnterSubBlock(AST_BLOCK_ID))		if (llvm::Error Err = Stream.EnterSubBlock(AST_BLOCK_ID))
return Err;		return Err;
F.ASTBlockStartOffset = Stream.GetCurrentBitNo();		F.ASTBlockStartOffset = Stream.GetCurrentBitNo();

▲ Show 20 Lines • Show All 722 Lines • ▼ Show 20 Lines	case MACRO_OFFSET: {
std::make_pair(LocalBaseMacroID,		std::make_pair(LocalBaseMacroID,
F.BaseMacroID - LocalBaseMacroID));		F.BaseMacroID - LocalBaseMacroID));

MacrosLoaded.resize(MacrosLoaded.size() + F.LocalNumMacros);		MacrosLoaded.resize(MacrosLoaded.size() + F.LocalNumMacros);
}		}
break;		break;
}		}

		case PP_INCLUDED_FILES:
		readIncludedFiles(F, Blob, PP);
		break;

case LATE_PARSED_TEMPLATE:		case LATE_PARSED_TEMPLATE:
LateParsedTemplates.emplace_back(		LateParsedTemplates.emplace_back(
std::piecewise_construct, std::forward_as_tuple(&F),		std::piecewise_construct, std::forward_as_tuple(&F),
std::forward_as_tuple(Record.begin(), Record.end()));		std::forward_as_tuple(Record.begin(), Record.end()));
break;		break;

case OPTIMIZE_PRAGMA_OPTIONS:		case OPTIMIZE_PRAGMA_OPTIONS:
if (Record.size() != 1)		if (Record.size() != 1)

clang/lib/Serialization/ASTWriter.cpp

Show First 20 Lines • Show All 856 Lines • ▼ Show 20 Lines	#define RECORD(X) EmitRecordID(X, #X, Stream, Record)
RECORD(OPTIMIZE_PRAGMA_OPTIONS);		RECORD(OPTIMIZE_PRAGMA_OPTIONS);
RECORD(MSSTRUCT_PRAGMA_OPTIONS);		RECORD(MSSTRUCT_PRAGMA_OPTIONS);
RECORD(POINTERS_TO_MEMBERS_PRAGMA_OPTIONS);		RECORD(POINTERS_TO_MEMBERS_PRAGMA_OPTIONS);
RECORD(UNUSED_LOCAL_TYPEDEF_NAME_CANDIDATES);		RECORD(UNUSED_LOCAL_TYPEDEF_NAME_CANDIDATES);
RECORD(DELETE_EXPRS_TO_ANALYZE);		RECORD(DELETE_EXPRS_TO_ANALYZE);
RECORD(CUDA_PRAGMA_FORCE_HOST_DEVICE_DEPTH);		RECORD(CUDA_PRAGMA_FORCE_HOST_DEVICE_DEPTH);
RECORD(PP_CONDITIONAL_STACK);		RECORD(PP_CONDITIONAL_STACK);
RECORD(DECLS_TO_CHECK_FOR_DEFERRED_DIAGS);		RECORD(DECLS_TO_CHECK_FOR_DEFERRED_DIAGS);
		RECORD(PP_INCLUDED_FILES);

// SourceManager Block.		// SourceManager Block.
BLOCK(SOURCE_MANAGER_BLOCK);		BLOCK(SOURCE_MANAGER_BLOCK);
RECORD(SM_SLOC_FILE_ENTRY);		RECORD(SM_SLOC_FILE_ENTRY);
RECORD(SM_SLOC_BUFFER_ENTRY);		RECORD(SM_SLOC_BUFFER_ENTRY);
RECORD(SM_SLOC_BUFFER_BLOB);		RECORD(SM_SLOC_BUFFER_BLOB);
RECORD(SM_SLOC_BUFFER_BLOB_COMPRESSED);		RECORD(SM_SLOC_BUFFER_BLOB_COMPRESSED);
RECORD(SM_SLOC_EXPANSION_ENTRY);		RECORD(SM_SLOC_EXPANSION_ENTRY);

// Preprocessor Block.		// Preprocessor Block.
BLOCK(PREPROCESSOR_BLOCK);		BLOCK(PREPROCESSOR_BLOCK);
RECORD(PP_MACRO_DIRECTIVE_HISTORY);		RECORD(PP_MACRO_DIRECTIVE_HISTORY);
RECORD(PP_MACRO_FUNCTION_LIKE);		RECORD(PP_MACRO_FUNCTION_LIKE);
RECORD(PP_MACRO_OBJECT_LIKE);		RECORD(PP_MACRO_OBJECT_LIKE);
RECORD(PP_MODULE_MACRO);		RECORD(PP_MODULE_MACRO);
RECORD(PP_TOKEN);		RECORD(PP_TOKEN);

vsapsai [Not Done]: I believe `PP_INCLUDED_FILES` is located in `AST_BLOCK`. Yes, you write it in `ASTWriter::WritePreprocessor`, but after `Stream.ExitBlock()`, together with `MACRO_OFFSET`. And then you read it in `ASTReader::ReadASTBlock` at the top level and not inside `case PREPROCESSOR_BLOCK_ID` or from `ModuleFile::MacroCursor`.
jansvoboda11 (author) [Done]: That's right. This is modeled after `PP_CONDITIONAL_STACK` and `PP_COUNTER_VALUE`. The problem is that the whole `PREPROCESSOR_BLOCK_ID` is treated as a "macros only" block that is not split into individual records right away and is instead deserialized lazily in `ReadMacroRecord`, `ReadDefinedMacros`, and `resolvePendingMacro`. I think this should be fixed eventually, but I didn't want to expand the scope of my changes, since the patch is already somewhat complex.
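For context on what the record under discussion carries: as written by `writeIncludedFiles` and read by `readIncludedFiles` in this diff, the `PP_INCLUDED_FILES` blob is a 32-bit little-endian count followed by that many sorted 32-bit input file IDs. The sketch below reproduces only that byte layout using the standard library. The function names `encodeIncludedFileIDs` and `decodeIncludedFileIDs` are hypothetical, it does not use LLVM's `endian` helpers or the bitstream writer, and the size check in the decoder is an addition for the sketch rather than something the diff does.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <vector>

// Append a 32-bit value in little-endian byte order.
static void writeLE32(std::vector<unsigned char> &Out, uint32_t V) {
  for (int Shift = 0; Shift < 32; Shift += 8)
    Out.push_back(static_cast<unsigned char>((V >> Shift) & 0xFF));
}

// Read a 32-bit little-endian value and advance the cursor.
static uint32_t readLE32(const unsigned char *&D) {
  uint32_t V = 0;
  for (int Shift = 0; Shift < 32; Shift += 8)
    V |= static_cast<uint32_t>(*D++) << Shift;
  return V;
}

// Hypothetical helper: count first, then the IDs in sorted order.
std::vector<unsigned char> encodeIncludedFileIDs(std::vector<uint32_t> IDs) {
  std::sort(IDs.begin(), IDs.end());
  std::vector<unsigned char> Blob;
  writeLE32(Blob, static_cast<uint32_t>(IDs.size()));
  for (uint32_t ID : IDs)
    writeLE32(Blob, ID);
  return Blob;
}

// Hypothetical helper: returns an empty vector if the blob is too short
// (this check is an assumption of the sketch, not part of the diff).
std::vector<uint32_t>
decodeIncludedFileIDs(const std::vector<unsigned char> &Blob) {
  std::vector<uint32_t> IDs;
  if (Blob.size() < 4)
    return IDs;
  const unsigned char *D = Blob.data();
  uint32_t Count = readLE32(D);
  if (Blob.size() - 4 < static_cast<uint64_t>(Count) * 4)
    return IDs;
  IDs.reserve(Count);
  for (uint32_t I = 0; I < Count; ++I)
    IDs.push_back(readLE32(D));
  return IDs;
}

// Round-trip walk-through of the layout.
void demoIncludedFilesBlob() {
  std::vector<unsigned char> Blob = encodeIncludedFileIDs({7, 3, 5});
  assert(Blob.size() == 16); // 4 bytes of count + 3 * 4 bytes of IDs
  assert(decodeIncludedFileIDs(Blob) == (std::vector<uint32_t>{3, 5, 7}));
}
```

Sorting the IDs before writing, as the diff does with `llvm::sort`, keeps the output independent of the iteration order of the included-files set.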
// Submodule Block.		// Submodule Block.
BLOCK(SUBMODULE_BLOCK);		BLOCK(SUBMODULE_BLOCK);
RECORD(SUBMODULE_METADATA);		RECORD(SUBMODULE_METADATA);
RECORD(SUBMODULE_DEFINITION);		RECORD(SUBMODULE_DEFINITION);
RECORD(SUBMODULE_UMBRELLA_HEADER);		RECORD(SUBMODULE_UMBRELLA_HEADER);
RECORD(SUBMODULE_HEADER);		RECORD(SUBMODULE_HEADER);
RECORD(SUBMODULE_TOPHEADER);		RECORD(SUBMODULE_TOPHEADER);
RECORD(SUBMODULE_UMBRELLA_DIR);		RECORD(SUBMODULE_UMBRELLA_DIR);
▲ Show 20 Lines • Show All 878 Lines • ▼ Show 20 Lines	hash_value_type ComputeHash(key_type_ref key) {
// match even when symlinking or excess path elements ("foo/../", "../")		// match even when symlinking or excess path elements ("foo/../", "../")
// change the form of the name. However, complete path is still the key.		// change the form of the name. However, complete path is still the key.
return llvm::hash_combine(key.Size, key.ModTime);		return llvm::hash_combine(key.Size, key.ModTime);
}		}

std::pair<unsigned, unsigned>		std::pair<unsigned, unsigned>
EmitKeyDataLength(raw_ostream& Out, key_type_ref key, data_type_ref Data) {		EmitKeyDataLength(raw_ostream& Out, key_type_ref key, data_type_ref Data) {
unsigned KeyLen = key.Filename.size() + 1 + 8 + 8;		unsigned KeyLen = key.Filename.size() + 1 + 8 + 8;
unsigned DataLen = 1 + 2 + 4 + 4;		unsigned DataLen = 1 + 4 + 4;
for (auto ModInfo : Data.KnownHeaders)		for (auto ModInfo : Data.KnownHeaders)
if (Writer.getLocalOrImportedSubmoduleID(ModInfo.getModule()))		if (Writer.getLocalOrImportedSubmoduleID(ModInfo.getModule()))
DataLen += 4;		DataLen += 4;
if (Data.Unresolved.getPointer())		if (Data.Unresolved.getPointer())
DataLen += 4;		DataLen += 4;
return emitULEBKeyDataLength(KeyLen, DataLen, Out);		return emitULEBKeyDataLength(KeyLen, DataLen, Out);
}		}

Show All 15 Lines	void EmitData(raw_ostream &Out, key_type_ref key,
endian::Writer LE(Out, little);		endian::Writer LE(Out, little);
uint64_t Start = Out.tell(); (void)Start;		uint64_t Start = Out.tell(); (void)Start;

unsigned char Flags = (Data.HFI.isImport << 5)		unsigned char Flags = (Data.HFI.isImport << 5)
| (Data.HFI.isPragmaOnce << 4)		| (Data.HFI.isPragmaOnce << 4)
| (Data.HFI.DirInfo << 1)		| (Data.HFI.DirInfo << 1)
| Data.HFI.IndexHeaderMapHeader;		| Data.HFI.IndexHeaderMapHeader;
LE.write<uint8_t>(Flags);		LE.write<uint8_t>(Flags);
LE.write<uint16_t>(Data.HFI.NumIncludes);

if (!Data.HFI.ControllingMacro)		if (!Data.HFI.ControllingMacro)
LE.write<uint32_t>(Data.HFI.ControllingMacroID);		LE.write<uint32_t>(Data.HFI.ControllingMacroID);
else		else
LE.write<uint32_t>(Writer.getIdentifierRef(Data.HFI.ControllingMacro));		LE.write<uint32_t>(Writer.getIdentifierRef(Data.HFI.ControllingMacro));

unsigned Offset = 0;		unsigned Offset = 0;
if (!Data.HFI.Framework.empty()) {		if (!Data.HFI.Framework.empty()) {
▲ Show 20 Lines • Show All 432 Lines • ▼ Show 20 Lines	if (Loc.isInvalid())
return true;		return true;
if (PP.getSourceManager().getFileID(Loc) == PP.getPredefinesFileID())		if (PP.getSourceManager().getFileID(Loc) == PP.getPredefinesFileID())
return true;		return true;
}		}

return false;		return false;
}		}

		void ASTWriter::writeIncludedFiles(raw_ostream &Out, const Preprocessor &PP) {
		using namespace llvm::support;

		const Preprocessor::IncludedFilesSet &IncludedFiles = PP.getIncludedFiles();

		std::vector<uint32_t> IncludedInputFileIDs;
		IncludedInputFileIDs.reserve(IncludedFiles.size());

		for (const FileEntry *File : IncludedFiles) {
		auto InputFileIt = InputFileIDs.find(File);
		if (InputFileIt == InputFileIDs.end())
		continue;
		IncludedInputFileIDs.push_back(InputFileIt->second);
		}

		llvm::sort(IncludedInputFileIDs);

		endian::Writer LE(Out, little);
		LE.write<uint32_t>(IncludedInputFileIDs.size());
		for (uint32_t ID : IncludedInputFileIDs)
		LE.write<uint32_t>(ID);
		}

/// Writes the block containing the serialized form of the		/// Writes the block containing the serialized form of the
/// preprocessor.		/// preprocessor.
void ASTWriter::WritePreprocessor(const Preprocessor &PP, bool IsModule) {		void ASTWriter::WritePreprocessor(const Preprocessor &PP, bool IsModule) {
uint64_t MacroOffsetsBase = Stream.GetCurrentBitNo();		uint64_t MacroOffsetsBase = Stream.GetCurrentBitNo();

PreprocessingRecord *PPRec = PP.getPreprocessingRecord();		PreprocessingRecord *PPRec = PP.getPreprocessingRecord();
if (PPRec)		if (PPRec)
WritePreprocessorDetail(*PPRec, MacroOffsetsBase);		WritePreprocessorDetail(*PPRec, MacroOffsetsBase);
▲ Show 20 Lines • Show All 192 Lines • ▼ Show 20 Lines	void ASTWriter::WritePreprocessor(const Preprocessor &PP, bool IsModule) {

unsigned MacroOffsetAbbrev = Stream.EmitAbbrev(std::move(Abbrev));		unsigned MacroOffsetAbbrev = Stream.EmitAbbrev(std::move(Abbrev));
{		{
RecordData::value_type Record[] = {MACRO_OFFSET, MacroOffsets.size(),		RecordData::value_type Record[] = {MACRO_OFFSET, MacroOffsets.size(),
FirstMacroID - NUM_PREDEF_MACRO_IDS,		FirstMacroID - NUM_PREDEF_MACRO_IDS,
MacroOffsetsBase - ASTBlockStartOffset};		MacroOffsetsBase - ASTBlockStartOffset};
Stream.EmitRecordWithBlob(MacroOffsetAbbrev, Record, bytes(MacroOffsets));		Stream.EmitRecordWithBlob(MacroOffsetAbbrev, Record, bytes(MacroOffsets));
}		}

		{
		auto Abbrev = std::make_shared<BitCodeAbbrev>();
		Abbrev->Add(BitCodeAbbrevOp(PP_INCLUDED_FILES));
		Abbrev->Add(BitCodeAbbrevOp(BitCodeAbbrevOp::Blob));
		unsigned IncludedFilesAbbrev = Stream.EmitAbbrev(std::move(Abbrev));

		SmallString<2048> Buffer;
		raw_svector_ostream Out(Buffer);
		writeIncludedFiles(Out, PP);
		RecordData::value_type Record[] = {PP_INCLUDED_FILES};
		Stream.EmitRecordWithBlob(IncludedFilesAbbrev, Record, Buffer.data(),
		Buffer.size());
		}
}		}

void ASTWriter::WritePreprocessorDetail(PreprocessingRecord &PPRec,		void ASTWriter::WritePreprocessorDetail(PreprocessingRecord &PPRec,
uint64_t MacroOffsetsBase) {		uint64_t MacroOffsetsBase) {
if (PPRec.local_begin() == PPRec.local_end())		if (PPRec.local_begin() == PPRec.local_end())
return;		return;

SmallVector<PPEntityOffset, 64> PreprocessedEntityOffsets;		SmallVector<PPEntityOffset, 64> PreprocessedEntityOffsets;

Revision Contents

Diff 403251

clang/include/clang/Lex/HeaderSearch.h

clang/include/clang/Lex/Preprocessor.h

clang/include/clang/Serialization/ASTBitCodes.h

clang/include/clang/Serialization/ASTReader.h

clang/include/clang/Serialization/ASTWriter.h

clang/lib/Lex/HeaderSearch.cpp

clang/lib/Lex/PPDirectives.cpp

clang/lib/Lex/Preprocessor.cpp

clang/lib/Serialization/ASTReader.cpp

clang/lib/Serialization/ASTWriter.cpp
